TutorialStefan VaskevichStefan Vaskevich

큰 GPU 없이 로컬 3D AI 모델을 테스트하는 방법 (클라우드 GPU 워크플로)

CubePart는 24GB 이상의 VRAM을 요구하지만 제겐 없습니다. Roblox의 오픈소스 부위 분할 모델을 처음부터 설치하고 테스트할 때 사용한 클라우드 GPU 워크플로 전체를 정리했습니다 —— SSH, Cursor Remote-SSH, AI 코딩 에이전트, Gradio 공유 링크, 그리고 고려할 만한 RunPod 대안까지.

이 기사는 현재 영어로만 제공됩니다.

Every few weeks a new open-source 3D AI model drops, and half of them ship with the same catch — they want 24 GB of VRAM or more. My card has 14 GB. But I still want to test these models properly before I write about them or add them to the Arena. So I do not wait for a quantized build or a hosted demo — I rent a cloud GPU for an hour, let an AI coding agent install the thing from scratch, and run it.

Below is the exact process I used to get Roblox's CubePart running — every command, in order. The model is just the example; this same flow works for any heavy local 3D AI repo. CubePart itself is already live on top3d.ai if you only want to see its output — this guide is about getting it running.

What you will get out of this
A repeatable recipe: rent a GPU, SSH in from your editor, create a safe user, install an AI coding agent, and let it clone + install + launch the model — with the UI piped back to your own browser. Shut it down and you stop paying.
About to test Roblox CubePart
The real case that drove this: testing Roblox's CubePart, now available on top3d.ai.

The problem: it wants 24 GB+ of VRAM

The example throughout is Roblox CubePart, a new open-source local model — but the model barely matters here. What matters is its requirement: Roblox recommends 24 GB of VRAM for a normal run, and my card has 14 GB. That is the wall, and it is the same wall most heavy local 3D AI models put in front of you. Push quality higher and even 48 GB runs out.

CubePart VRAM requirement vs my 14 GB card
CubePart wants ~24 GB for a normal run. My card has 14 GB — this is the wall.
License: research-only, check before commercial use
Roblox calls Cube/CubePart open-source and the code and weights are public, but the license is Research-Only RAIL-MS — academic and research use only. Fine for testing, demos and learning; not cleared for commercial products or SaaS without extra permission from Roblox.

The fix: rent a cloud GPU for an hour

Instead of buying hardware, I rent a GPU with SSH access, install everything from scratch, run my tests, and shut it down. For this I used RunPod with an A40 (48 GB) — overkill, but I wanted headroom. It cost about $0.50 per hour.

RunPod is just my example — not a sponsor
I am not sponsored by RunPod and this is not an ad. It is simply what I use. Any cloud GPU with SSH works identically — the steps below are provider-agnostic. A few alternatives:
  • Vast.ai — cheapest marketplace pricing
  • Lambda Cloud — clean ML-focused instances
  • Paperspace (DigitalOcean) — simple hourly GPUs
  • Modal / Replicate — code-first, serverless GPU
An underrated advantage of cloud GPUs
You install everything from scratch on a clean machine — the right CUDA wheels, all the dependencies, no leftover conflicts from your local setup. Heavy 3D AI repos are notorious for environment hell; a fresh instance sidesteps most of it.

The full workflow, command by command

1

Spin up the pod with enough room

On RunPod I picked an A40 (48 GB) — 24 GB is enough for standard CubePart runs. The part people miss is storage: these projects pull a lot of weights, so set a ~50 GB container and up to 100 GB persistent storage.

Choosing an A40 48 GB pod on RunPod
A40 (48 GB) for headroom — 24 GB works, and you can go cheaper. ~$0.50/hr.
2

Add an SSH key and test the connection

An SSH key is just a pair of files — a private one that stays on your machine, a public one you give the provider — so it can let you in without a password. Generate it once with ssh-keygen -t ed25519, paste the contents of id_ed25519.pub into RunPod under Settings → SSH Public Keys, then test the connection from Windows PowerShell (use the host/port your pod shows):

ssh root@<POD-IP> -p <PORT> -i ~/.ssh/id_ed25519

If you land inside the pod's shell, it works.

RunPod's official step-by-step (recommended)
RunPod has a clear, screenshot-by-screenshot guide that covers the SSH key, adding it to your account, and connecting your IDE — follow it directly: docs.runpod.io — Connect to your Pod from an IDE.
Testing the SSH connection to the pod in PowerShell
Testing the pod's SSH command in PowerShell first — it connects.
3

Connect Cursor / VS Code over Remote-SSH

In Cursor (or VS Code), press Ctrl+Shift+P Remote-SSH: Open SSH Configuration File, and add a host block (fill in your pod's HostName and Port):

Host runpod-cubepart
    HostName <POD-IP>
    User root
    Port <PORT>
    IdentityFile ~/.ssh/id_ed25519

Then Ctrl+Shift+P Remote-SSH: Connect to Hostrunpod-cubepart → choose Linux, and open the /workspace folder. Everything you keep needs to live here — only persistent storage survives a shutdown.

4

Create a safe non-root user

You connect as root. Before letting an AI agent run freely, I make a normal user so it is not operating as root. In the Cursor terminal, still as root:

apt update && apt install -y curl git sudo

useradd -m -s /bin/bash stefan
mkdir -p /workspace/stefan
chmod -R a+rwX /workspace/stefan
ls -ld /workspace/stefan
5

Switch to that user and install Claude Code

Drop into the new user (no need to reconnect), then install the agent. From here on, the agent runs as stefan, not root:

su - stefan
cd /workspace/stefan

curl -fsSL https://claude.ai/install.sh | bash
source ~/.bashrc
claude --version

On first launch claude asks you to log in — open the link in your browser and paste the code back.

Claude Code installed and ready inside the pod
In through Remote-SSH, running as a non-root user, agent installed and logged in.
6

Run the agent in bypass mode and hand it the job

Start the agent with permission prompts skipped — safe here because it is an isolated, throwaway cloud box (Anthropic only recommends this mode inside containers/VMs, which is exactly what this is):

claude --dangerously-skip-permissions

From here I do not type install commands myself. I just tell Claude, in plain English, what I want: clone the CubePart repo, install it with all dependencies, download the model weights, and run a quick test. It works out the right CUDA wheels and commands for this exact GPU and does the whole thing. The one instruction I always make explicit: install into /workspace, since anything outside persistent storage is wiped when the pod shuts down.

Be patient on the PyTorch + CUDA step
The dependency install can take 10–15 minutes. The agent may look idle — it is not. It will tell you when everything is ready.
First CubePart test running successfully on the cloud GPU
Dependencies in, weights downloaded, first run passing — all driven by the agent on the rented GPU.
7

Ask for the Gradio demo and open it in your browser

CubePart ships a Gradio app (no ComfyUI build). I simply ask Claude to launch the Gradio demo and expose it with a public share link. It starts the app and hands back a https://….gradio.live URL — I open that on my own laptop and get the full UI, while the model keeps running on the cloud GPU. From there it is the usual Gradio flow: drop in a GLB, type the part names, run.

CubePart Gradio interface running through a share link in a local browser
The gradio.live link running in my local browser — model executes on the cloud GPU.

That is it — and where to see the output

That is the whole method: a model I cannot run locally went from a GitHub link to a working UI in my own browser, for about a dollar. The same flow works for the next heavy model, and the one after that.

Want to see what CubePart actually produces?
I will not rehash the results here — CubePart is already live on the site, so you can look at it directly instead of taking my word for it. Open it in the Segmentation Arena on top3d.ai.

Cost, cleanup, and when this is worth it

~$0.50 / hr
A40 48 GB. A whole test session is a dollar or two.
Use /workspace
Persistent storage survives shutdown; everything else does not.
24 GB target
Enough for standard CubePart; go higher only to push resolution.

Shut the pod down the moment you are done. Billing is hourly and it runs until you stop it. If you only need a model occasionally, this beats buying a $2,000+ GPU by a mile — you pay for the exact minutes you use. If you are testing models every single day, that maths flips and local hardware starts to make sense.

The pattern, in one line
Rent a clean GPU box → SSH in from your editor → create a safe user → install an AI agent → let it clone + install into persistent storage → expose the UI with a share link → test → shut it down. Works for CubePart and basically any heavy local 3D AI model.

That is how I test the heavy stuff without the hardware. New models land constantly — whenever one does, it goes straight into the Arena so you can compare it without installing anything at all.

이 도구들을 직접 비교해 보시겠습니까? 3D AI 아레나를 확인하세요.

큰 GPU 없이 로컬 3D AI 모델을 테스트하는 방법 (클라우드 GPU 워크플로) | Top 3D AI