Home

Open RAM

Open-source AI on rented compute

Run open-source models on rented RAM, GPU and CPU machines, benchmark them on real hardware, and deploy the best setup as an API.

Open-source AI on rented compute

Deploy open-source AI on the right machine, every time.

Rent any machine. Run any model.

Rent CPU, GPU and high-RAM compute by the hour, then run, benchmark and serve open models — Open RAM matches each workload to a machine that actually fits.

16machines
12open models
10providers
11GPU machines

What you can do

Everything you need to run open-source AI on rented infrastructure.

How it works

Rent compute or grab an API key, then use it — in three steps.

1 · Rent a machine or create a key

On the Marketplace, rent a GPU/CPU/RAM machine with SOL — or create a universal API key and top it up in SOL.

2 · Open the Workspace

Your rented machines and your API key both live in the Workspace, ready to use in one place.

3 · Run it

Open a Jupyter notebook (or SSH in) to use a machine, or send prompts to any model with your API key. Stop anytime.

You send a request

A prompt, job, benchmark, or deploy.

Router picks compute

Matches RAM / GPU / CPU to the workload + your strategy.

Runs on a rented machine

A marketplace machine with enough resources.

Open model executes

Llama, Qwen, Mistral, SDXL, Whisper…

Result + cost back

Output, latency, RAM/GPU used, price.

Every action — a prompt, deploy, benchmark or job — is matched to a machine with enough RAM / GPU / CPU to run it.

$RAM token

Pay in $RAM for 50% off — and every token burns.

At checkout you choose SOL or $RAM. Pay in $RAM and it's 50% cheaper. Every $RAM spent goes to the Open RAM treasury and is burned automatically every 10 minutes — so supply only shrinks as the platform is used.

50% cheaper

Pay for compute, API credits and more in $RAM at half the SOL price.

Autonomous burn

Treasury $RAM is burned on-chain every 10 minutes. Deflationary by design — supply only goes down.

Transparent treasury

4unRTJ…Tmk5RM

Payments land in one public wallet, then burn — all verifiable on-chain.

$RAM launches soon — pay in SOL today, $RAM the moment it's live.

Two layers, one platform

A compute layer you rent, and an open-source AI layer that runs on it.

Compute layer

Rent the hardware

Raw infrastructure, billed by the hour.

  • Rent CPU, GPU and RAM across providers
  • Launch ready-to-use cloud computers
  • Run heavy jobs straight from the browser
Router
Open-source AI layer

Run the models

Open models, served and compared.

  • Deploy models as private API endpoints
  • Benchmark quality, latency and cost
  • Route requests through one smart API

When you deploy a model, run a benchmark, use the API router, or submit a job, Open RAM matches it to a machine with enough RAM / GPU / CPU — automatically.

Workload to hardware

How common workloads map to the right kind of machine.

Small chat model

Cheap CPU / RAM machine

Huge Llama / Qwen model

High-RAM or GPU machine

Stable Diffusion

GPU machine (≥16GB VRAM)

Whisper transcription

CPU / GPU machine

Long document prompt

High-RAM machine

Private business workload

Dedicated, high-trust machine

FAQ

Straight answers on renting compute, using it, and paying.

I rented a machine — how do I actually use it?
It shows up in the Workspace about a minute after you pay. Click Use it here to open a Jupyter notebook right in the page and run code on the machine's GPU, or copy its SSH command to connect from your own terminal.
Can I use the machine from my own computer?
Yes. Each machine shows an SSH command in the Workspace — copy it into your terminal (Windows Terminal, macOS Terminal, Linux) to log in and run anything, upload files, and more.
What's the difference between renting a machine and the API keys?
Renting a machine gives you a whole computer (GPU/CPU/RAM) you control via Jupyter or SSH. An API key lets you call ready-made models (Claude, Llama, Qwen…) over an OpenAI-compatible endpoint and pay per token — no machine to manage.
How do I use my API key?
Create a key under Marketplace → Model APIs (or in the Workspace), top it up in SOL, then either use the built-in prompt box or call the endpoint from your code — Authorization: Bearer <key>, OpenAI-compatible.
How do I pay, and when am I billed?
Everything is paid in SOL with your Solana wallet (Phantom). Machines bill per hour while active — click Stop to end charges. API keys are prepaid and charged per token as you call models; each top-up is verified on-chain.
Are the machines real?
Yes — the Marketplace lists live RunPod GPUs with real prices, and renting launches a real pod with Jupyter + SSH. If RunPod has nothing free, it falls back to a clearly-labelled demo catalog.
Can I pay with the $RAM token?
Yes — at checkout you choose SOL or $RAM, and paying in $RAM is 50% cheaper. (Launching soon — SOL works today.)
What happens to $RAM when I spend it?
It goes to the Open RAM treasury and is burned autonomously every 10 minutes on-chain — so $RAM supply only shrinks as the platform is used.

Run open-source AI on rented compute.

Pick a machine, deploy a model, benchmark the trade-offs, and serve it through one API — all from your dashboard.