Open RAM

Open-source AI on rented compute

Run open-source models on rented RAM, GPU and CPU machines, benchmark them on real hardware, and deploy the best setup as an API.

Open-source AI on rented compute

Deploy open-source AI on the right machine, every time.

Rent any machine. Run any model.

Rent CPU, GPU and high-RAM compute by the hour, then run, benchmark and serve open models — Open RAM matches each workload to a machine that actually fits.

Explore Marketplace Deploy Model Run Benchmark How it works

16machines

12open models

10providers

11GPU machines

What you can do

Everything you need to run open-source AI on rented infrastructure.

Rent RAM, GPU & CPU compute

Browse a live marketplace of machines across providers and regions. Pay by the hour.

Open

Launch cloud AI workstations

Spin up a configured Linux box with Ollama, vLLM, Jupyter and CUDA pre-installed.

Open

Run open models as APIs

Deploy Llama, Qwen, Mistral and more to a private endpoint with one click.

Open

Benchmark on real hardware

Compare quality, latency and cost for any model across CPU and GPU profiles.

Open

One API router for many models

Send a prompt; we pick the right model and machine by cost, speed or quality.

Open

How it works

Rent compute or grab an API key, then use it — in three steps.

1 · Rent a machine or create a key

On the Marketplace, rent a GPU/CPU/RAM machine with SOL — or create a universal API key and top it up in SOL.

2 · Open the Workspace

Your rented machines and your API key both live in the Workspace, ready to use in one place.

3 · Run it

Open a Jupyter notebook (or SSH in) to use a machine, or send prompts to any model with your API key. Stop anytime.

You send a request

A prompt, job, benchmark, or deploy.

Router picks compute

Matches RAM / GPU / CPU to the workload + your strategy.

Runs on a rented machine

A marketplace machine with enough resources.

Open model executes

Llama, Qwen, Mistral, SDXL, Whisper…

Result + cost back

Output, latency, RAM/GPU used, price.

Every action — a prompt, deploy, benchmark or job — is matched to a machine with enough RAM / GPU / CPU to run it.

Read the full guide →

$RAM token

Pay in $RAM for 50% off — and every token burns.

At checkout you choose SOL or $RAM. Pay in $RAM and it's 50% cheaper. Every $RAM spent goes to the Open RAM treasury and is burned automatically every 10 minutes — so supply only shrinks as the platform is used.

50% cheaper

Pay for compute, API credits and more in $RAM at half the SOL price.

Autonomous burn

Treasury $RAM is burned on-chain every 10 minutes. Deflationary by design — supply only goes down.

Transparent treasury

4unRTJ…Tmk5RM

Payments land in one public wallet, then burn — all verifiable on-chain.

$RAM launches soon — pay in SOL today, $RAM the moment it's live.

Two layers, one platform

A compute layer you rent, and an open-source AI layer that runs on it.

Compute layer

Rent the hardware

Raw infrastructure, billed by the hour.

Rent CPU, GPU and RAM across providers
Launch ready-to-use cloud computers
Run heavy jobs straight from the browser

Router

Open-source AI layer

Run the models

Open models, served and compared.

Deploy models as private API endpoints
Benchmark quality, latency and cost
Route requests through one smart API

When you deploy a model, run a benchmark, use the API router, or submit a job, Open RAM matches it to a machine with enough RAM / GPU / CPU — automatically.

Workload to hardware

How common workloads map to the right kind of machine.

Small chat model

Cheap CPU / RAM machine

Huge Llama / Qwen model

High-RAM or GPU machine

Stable Diffusion

GPU machine (≥16GB VRAM)

Whisper transcription

CPU / GPU machine

Long document prompt

High-RAM machine

Private business workload

Dedicated, high-trust machine

FAQ

Straight answers on renting compute, using it, and paying.

I rented a machine — how do I actually use it?

It shows up in the Workspace about a minute after you pay. Click Use it here to open a Jupyter notebook right in the page and run code on the machine's GPU, or copy its SSH command to connect from your own terminal.

Can I use the machine from my own computer?

Yes. Each machine shows an SSH command in the Workspace — copy it into your terminal (Windows Terminal, macOS Terminal, Linux) to log in and run anything, upload files, and more.

What's the difference between renting a machine and the API keys?

Renting a machine gives you a whole computer (GPU/CPU/RAM) you control via Jupyter or SSH. An API key lets you call ready-made models (Claude, Llama, Qwen…) over an OpenAI-compatible endpoint and pay per token — no machine to manage.

How do I use my API key?

Create a key under Marketplace → Model APIs (or in the Workspace), top it up in SOL, then either use the built-in prompt box or call the endpoint from your code — Authorization: Bearer <key>, OpenAI-compatible.

How do I pay, and when am I billed?

Everything is paid in SOL with your Solana wallet (Phantom). Machines bill per hour while active — click Stop to end charges. API keys are prepaid and charged per token as you call models; each top-up is verified on-chain.

Are the machines real?

Yes — the Marketplace lists live RunPod GPUs with real prices, and renting launches a real pod with Jupyter + SSH. If RunPod has nothing free, it falls back to a clearly-labelled demo catalog.

Can I pay with the $RAM token?

Yes — at checkout you choose SOL or $RAM, and paying in $RAM is 50% cheaper. (Launching soon — SOL works today.)

What happens to $RAM when I spend it?

It goes to the Open RAM treasury and is burned autonomously every 10 minutes on-chain — so $RAM supply only shrinks as the platform is used.

Run open-source AI on rented compute.

Pick a machine, deploy a model, benchmark the trade-offs, and serve it through one API — all from your dashboard.

Explore Marketplace Deploy Model Open Dashboard