
BYOLLM: what it means and why it matters

Bodega One · 7 min read

Quick answer

BYOLLM (Bring Your Own LLM) means you choose which AI model powers your tools, not the company that built the app. Bodega One supports 15+ LLM providers: Ollama, LM Studio, OpenAI, Anthropic, Groq, and more. Switch in seconds. No model lock-in.

There's a phrase circulating in developer communities that felt like a buzzword when I first heard it: Bring Your Own LLM.

It's not. It's a design choice, and once you understand what it means in practice, you'll never want to build an AI workflow any other way.

So what does BYOLLM actually mean?

BYOLLM stands for "Bring Your Own LLM." The idea is simple: instead of a product hardwiring you into one specific AI model (usually the one that makes the company the most money), you choose which language model powers your tools.

You're not stuck with GPT-4o because OpenAI negotiated a good deal. You're not forced onto Claude because that's what the developer picked. You decide. You bring whatever model you trust, can afford, or that fits your specific use case.

That sounds obvious. But almost nothing in the current AI tooling ecosystem actually works this way.

The current model: locked in by default

Open any popular AI coding assistant and count how many models you can actually swap. Most give you one provider with a couple of model tiers. Some add a "powered by [Big Tech]" badge like it's a feature.

Cursor lets you choose between Claude and GPT-4o, both cloud, both metered per request. GitHub Copilot runs on OpenAI infrastructure. Claude Code is Anthropic's model on Anthropic's servers. These are great tools. They're also walled gardens.

When that provider changes their pricing (and they will), you pay whatever they charge. When they update their terms of service to allow training on your inputs (and some have), you find out after the fact. When they go down, you stop working.

That's a dependency, not a vendor relationship.

Why the local LLM community went all-in on BYOLLM

r/LocalLLaMA has been arguing about this for two years. But the conversation has shifted from "can you run a useful model locally?" to "why wouldn't you?"

The privacy argument is no longer abstract. In 2025, Kong's enterprise AI report found that 44% of organizations cite data privacy as their top barrier to adopting cloud-based LLMs. That's not IT paranoia. That's legal, compliance, and engineering leads reading the fine print on what happens to code they paste into a chat window.

When you paste a proprietary function into ChatGPT, you're sending it to a server you don't control, governed by terms you didn't write, stored for a period you didn't agree to. For most side projects, that's fine. For production code, customer data, or anything under an NDA, it's a real risk.

Running locally removes this entirely. Your inputs never leave your machine. For environments where that needs to be guaranteed, not just assumed, see how air-gap mode enforces it across 9 separate layers.

The other thing BYOLLM gives you: flexibility

The model landscape in 2026 is highly competitive. Mistral and Llama 3.3 punch well above their weight. Qwen QwQ is doing things with reasoning that surprised even the people building it. Gemma 3 runs fast on a laptop. DeepSeek dropped costs through the floor.

If you're locked into one provider, you can't benefit from any of this without switching tools entirely. With BYOLLM, you try the new hotness in five minutes, decide it's better, and keep going.

Speed is another underrated advantage. Calling Ollama on localhost has no network latency. For short responses and autocomplete, the difference between local inference and an API round-trip is 200-500ms per request. Over a full coding session, that adds up.

The trade-offs

There are real tradeoffs.

Running a capable local model requires hardware. A 7B model runs fine on a modern laptop with 8-12GB of VRAM. To run a 70B model well, you need a 24GB GPU at minimum. Quantization helps significantly (INT4 shrinks the model footprint to roughly a quarter of FP16), but you still need the machine.
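The arithmetic behind those numbers is simple enough to sketch. This is a back-of-envelope estimate for the weights alone; real usage adds KV cache and activation overhead on top:

```python
def model_memory_gb(params_billion: float, bits_per_weight: int) -> float:
    """Rough memory needed to hold the weights (excludes KV cache and activations)."""
    bytes_total = params_billion * 1e9 * bits_per_weight / 8
    return bytes_total / 1e9

# A 7B model: FP16 vs. INT4 quantization (4x smaller)
print(model_memory_gb(7, 16))   # → 14.0 GB
print(model_memory_gb(7, 4))    # → 3.5 GB

# A 70B model at INT4 still wants a large GPU
print(model_memory_gb(70, 4))   # → 35.0 GB
```

That 3.5GB figure is why a quantized 7B model fits comfortably in 8-12GB of VRAM with room for context, while a 70B model does not.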

Cloud models have also gotten significantly better. GPT-4o and Claude Sonnet are excellent at reasoning, instruction-following, and long context. For customer-facing features where quality matters most, some teams still route those specific tasks to frontier APIs while keeping internal dev work local.

Most developers in 2026 end up going hybrid. Local for sensitive work, high-volume tasks, and everyday development. Cloud for the tasks that actually need maximum capability.
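The hybrid setup usually boils down to a small routing rule. A minimal sketch of the idea, with hypothetical model names and task labels (this is not Bodega One's actual API):

```python
def pick_provider(task: str, sensitive: bool) -> str:
    """Route sensitive or high-volume work to a local model; hard reasoning to a cloud API.
    Model names here are illustrative placeholders."""
    if sensitive:
        return "ollama/llama3.3"        # never leaves the machine
    if task in {"autocomplete", "rename", "summarize-diff"}:
        return "ollama/gemma3"          # fast, cheap, high-volume
    return "anthropic/claude-sonnet"    # frontier capability for hard problems

print(pick_provider("autocomplete", sensitive=False))  # → ollama/gemma3
print(pick_provider("refactor-plan", sensitive=True))  # → ollama/llama3.3
```

The key property: the sensitivity check comes first, so confidential work can never fall through to a cloud provider regardless of the task type.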

Why we built BYOLLM into Bodega One

We ship with 15+ provider presets: Ollama, LM Studio, OpenAI, Anthropic, Groq, Together AI, DeepSeek, and more. Switching takes about three seconds. No config files, no API wrangling.
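Fast switching is possible because most of these providers expose OpenAI-compatible endpoints, so changing models is often just a base-URL change. A minimal sketch, using the providers' documented default endpoints (verify against your own setup; local ports can differ):

```python
# OpenAI-compatible base URLs for a few common providers.
# Local entries assume the default ports for Ollama and LM Studio.
PRESETS = {
    "ollama":   "http://localhost:11434/v1",
    "lmstudio": "http://localhost:1234/v1",
    "openai":   "https://api.openai.com/v1",
    "groq":     "https://api.groq.com/openai/v1",
    "deepseek": "https://api.deepseek.com/v1",
}

def base_url(provider: str) -> str:
    """Look up the endpoint for a provider; unknown names fail loudly."""
    if provider not in PRESETS:
        raise ValueError(f"unknown provider: {provider}")
    return PRESETS[provider]

print(base_url("ollama"))  # → http://localhost:11434/v1
```

Because the wire format is shared, the same client code can talk to any of these; only the URL (and, for cloud providers, an API key) changes.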

We built it this way because we don't know which model will be best next month. Neither do you. The model landscape moves too fast to bet on one horse.

More importantly, we think the decision about which AI to trust with your code should be yours, not a VC-backed cloud provider's. If you want to run everything local on Ollama, the whole product works that way. If you want Claude for hard reasoning and Gemma 3 for routine tasks, that works too.

That's what BYOLLM actually means day to day: you own the stack. Bodega One is available at a one-time price. No subscription required.

Ready to own your tools?

Beta opens May 2026. Complete 14 days and earn a $30 promo code.