Configuring LLM providers¶
Connect marimo to an LLM via the notebook Settings panel (recommended) or by editing marimo.toml directly. The Settings panel is preferred because it avoids introducing errors into the config file by hand.
To locate your config file, print your current configuration; the path to marimo.toml appears at the top of the output.
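One way to do this, assuming the marimo CLI's config show subcommand:
marimo config show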
Model configuration¶
Model roles and routing¶
marimo uses three roles:
- chat_model: chat panel
- edit_model: refactor cells (Cmd/Ctrl-Shift-E) and Generate with AI
- autocomplete_model: inline code completion
Models are written as provider/model, and the provider prefix routes to the matching config section:
[ai.models]
chat_model = "openai/gpt-4o-mini" # Routes to OpenAI config
edit_model = "anthropic/claude-3-sonnet" # Routes to Anthropic config
autocomplete_model = "ollama/codellama" # Routes to Ollama config
autocomplete_model = "some_other_provider/some_model" # Routes to OpenAI compatible config
Custom models¶
Add custom entries to the model dropdown:
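A minimal sketch, assuming marimo reads extra dropdown entries from a custom_models list under [ai.models] (check the Settings panel for the exact key name):
[ai.models]
custom_models = [
  "ollama/my-local-model",           # hypothetical local model
  "some_other_provider/some-model",  # routed to the OpenAI-compatible config
]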
Supported providers¶

You can configure the following providers:
- Anthropic
- AWS Bedrock
- GitHub
- Google AI
- DeepSeek
- xAI
- LM Studio
- Mistral
- Ollama
- OpenAI
- OpenRouter
- Together AI
- Vercel v0
- and any OpenAI-compatible provider
Below we describe how to connect marimo to your AI provider.
OpenAI¶
Requirements
- pip install openai or uv add openai
Configuration
[ai.models]
chat_model = "openai/gpt-4o-mini"
edit_model = "openai/gpt-4o"
# See https://platform.openai.com/docs/models for the latest list
[ai.open_ai]
# Get an API key at https://platform.openai.com/account/api-keys
api_key = "sk-proj-..."
OpenAI-compatible providers
If your model does not start with openai/, it is not routed to the OpenAI config; unrecognized provider prefixes are routed to the OpenAI-compatible config instead.
Reasoning models (o1, o3, etc.)
These models can incur higher costs because they bill separate reasoning tokens. Prefer smaller, faster models for refactors and autocompletion, and review your provider's limits.
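For example, you might pair a reasoning model for chat with a smaller model for completion (model names here are illustrative):
[ai.models]
chat_model = "openai/o3-mini"              # reasoning model for the chat panel
autocomplete_model = "openai/gpt-4o-mini"  # smaller model for inline completion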
Anthropic¶
Requirements
- Create an account and key: Anthropic Console
Configuration
[ai.models]
chat_model = "anthropic/claude-3-7-sonnet-latest" # other options: claude-3-haiku, claude-3-opus
# See Anthropic model list: https://docs.anthropic.com/en/docs/about-claude/models
[ai.anthropic]
api_key = "sk-ant-..."
AWS Bedrock¶
AWS Bedrock exposes multiple foundation models via a unified AWS API.
Requirements
- pip install boto3
- Enable model access in the Bedrock console
- AWS credentials via aws configure, env vars, or ~/.aws/credentials
Configuration
[ai.models]
chat_model = "bedrock/anthropic.claude-3-sonnet-latest"
# Example model families include Anthropic Claude, Meta Llama, Cohere Command
[ai.bedrock]
region_name = "us-east-1" # AWS region where Bedrock is available
# Optional AWS profile name (from ~/.aws/credentials)
profile_name = "my-profile"
Use profile_name
for a non-default named profile, or rely on env vars/standard AWS resolution.
Required AWS Bedrock permissions
Ensure your IAM policy allows bedrock:InvokeModel
and bedrock:InvokeModelWithResponseStream
for the models you plan to use.
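To sanity-check credentials and model access from the terminal, one option (assuming the AWS CLI v2 with the bedrock subcommand) is:
# Confirm which identity your credentials resolve to
aws sts get-caller-identity
# List the foundation models visible in your region
aws bedrock list-foundation-models --region us-east-1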
Google AI¶
Requirements
- pip install google-genai
You can use Google AI via two backends: Google AI Studio (API key) or Google Vertex AI (no API key required).
Using Google AI Studio (API key)¶
- Sign up at Google AI Studio and obtain your API key.
- Configure marimo.toml (or set these in the editor Settings):
[ai.models]
chat_model = "google/gemini-2.5-pro"
# or any model from https://ai.google.dev/gemini-api/docs/models/gemini
[ai.google]
api_key = "AI..."
Using Google Vertex AI (no API key required)¶
- Ensure you have access to a Google Cloud project with Vertex AI enabled.
- Set the following environment variables before starting marimo:
export GOOGLE_GENAI_USE_VERTEXAI=true
export GOOGLE_CLOUD_PROJECT='your-project-id'
export GOOGLE_CLOUD_LOCATION='us-central1'
- GOOGLE_GENAI_USE_VERTEXAI=true tells the client to use Vertex AI.
- GOOGLE_CLOUD_PROJECT is your GCP project ID.
- GOOGLE_CLOUD_LOCATION is your region (e.g., us-central1).
- No API key is needed in your marimo.toml for Vertex AI.
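Vertex AI authenticates via Application Default Credentials rather than an API key; one common way to provide them locally (assuming the gcloud CLI is installed) is:
gcloud auth application-default login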
For details and advanced configuration, see the google-genai Python client docs: https://googleapis.github.io/python-genai/#create-a-client
GitHub Copilot¶
Use Copilot for code refactoring or the chat panel (Copilot subscription required).
Requirements
- Install the gh CLI
- Get a token: gh auth token
Configuration
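A sketch, assuming the github/ prefix routes to an [ai.github] section like the other providers (confirm the exact keys and model names in the Settings panel):
[ai.models]
chat_model = "github/gpt-4o"  # hypothetical Copilot model name
[ai.github]
api_key = "gho_..."  # OAuth token from gh auth token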
My token starts with ghp_ instead of gho_?
This usually happens when you previously authenticated gh by pasting a personal access token (ghp_...). However, GitHub Copilot is not available through ghp_... tokens, and you will encounter errors such as:
bad request: Personal Access Tokens are not supported for this endpoint
To resolve this issue, switch to an OAuth access token (gho_...):
- Re-authenticate by running gh auth login.
- Choose Login with a web browser (instead of Paste an authentication token) this time, as in the sketch below.
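One way to check and fix this from the terminal, using standard gh commands:
# Inspect the token prefix: gho_ is an OAuth token, ghp_ is a personal access token
gh auth token | cut -c1-4
# Re-authenticate through the browser to obtain an OAuth (gho_) token
gh auth login --web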
OpenRouter¶
Route to many providers through OpenRouter with a single API.
Requirements
- Create an API key: OpenRouter Dashboard
- pip install openai or uv add openai (OpenRouter is OpenAI-compatible)
Configuration
[ai.models]
# Use OpenRouter's model slugs (vendor/model). Examples:
chat_model = "openrouter/openai/gpt-4o-mini"
edit_model = "openrouter/anthropic/claude-3-7-sonnet"
[ai.openrouter]
api_key = "sk-or-..."
base_url = "https://openrouter.ai/api/v1/"
# Optional but recommended per OpenRouter best practices
# extra_headers = { "HTTP-Referer" = "https://your-app.example", "X-Title" = "Your App Name" }
See available models at https://openrouter.ai/models. Make sure to prepend openrouter/ to the model slug (e.g., openrouter/deepseek/deepseek-chat, openrouter/meta-llama/llama-3.1-8b-instruct).
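If you prefer the terminal, the model list is also exposed by the API (endpoint assumed from OpenRouter's OpenAI-compatible API):
curl https://openrouter.ai/api/v1/models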
Local models with Ollama¶
Run open-source LLMs locally and connect via an OpenAI‑compatible API.
Requirements
- Install Ollama
- pip install openai or uv add openai
Setup
- Pull a model
# View available models at https://ollama.com/library
ollama pull llama3.1
ollama pull codellama # recommended for code generation
# View your installed models
ollama ls
- Start the Ollama server by running ollama serve in a terminal.
- Visit http://127.0.0.1:11434 to confirm that the server is running.
Port already in use
If you get a "port already in use" error, you may need to close an existing Ollama instance. On Windows, click the up arrow in the taskbar, find the Ollama icon, and select "Quit". This is a known issue (see Ollama Issue #3575). Once you've closed the existing Ollama instance, you should be able to run ollama serve
successfully.
Configuration
[ai.models]
chat_model = "ollama/llama3.1:latest"
edit_model = "ollama/codellama"
autocomplete_model = "ollama/codellama" # or another model from `ollama ls`
Important: Use the /v1 endpoint
marimo uses Ollama's OpenAI-compatible API. Ensure your base_url includes /v1:
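A minimal sketch, assuming the ollama/ prefix routes to an [ai.ollama] section per the routing rules above:
[ai.ollama]
base_url = "http://127.0.0.1:11434/v1"  # note the /v1 suffix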
Common mistake:
If you see 404s, verify the model is installed with ollama ls and test the endpoint:
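For example, assuming the default Ollama port:
curl http://127.0.0.1:11434/v1/models  # lists models exposed by the OpenAI-compatible endpoint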
OpenAI-compatible providers¶
Many providers expose OpenAI-compatible endpoints. Point base_url
at the provider and use their models.
Common examples include GROQ, DeepSeek, xAI, Together AI, and LM Studio.
Requirements
- Provider API key
- Provider's OpenAI-compatible base_url
- pip install openai or uv add openai
Configuration
[ai.models]
chat_model = "provider-x/some-model"
[ai.open_ai_compatible]
api_key = "..."
base_url = "https://api.provider-x.com/"
Use the /v1 path if required by your provider
Some OpenAI-compatible providers expose their API under /v1 (e.g., https://host/v1). If you see 404s, add /v1 to your base_url.
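A quick check that the base URL and key work (provider-x is this page's placeholder; substitute your provider's host and key):
curl -H "Authorization: Bearer $PROVIDER_API_KEY" https://api.provider-x.com/v1/models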
Using OpenAI-compatible providers
- Open marimo's Settings panel
- Navigate to the AI section
- Enter your provider's API key in the "OpenAI-Compatible" section
- Set Base URL to your provider (e.g., https://api.provider-x.com)
- Set Model to your provider's model slug
DeepSeek¶
Use DeepSeek via its OpenAI‑compatible API.
Requirements
- DeepSeek API key
- pip install openai or uv add openai
Configuration
[ai.models]
chat_model = "deepseek/deepseek-chat" # or "deepseek-reasoner"
[ai.open_ai_compatible]
api_key = "dsk-..."
base_url = "https://api.deepseek.com/"
xAI¶
Use Grok models via xAI's OpenAI‑compatible API.
Requirements
- xAI API key
- pip install openai or uv add openai
Configuration
[ai.models]
chat_model = "xai/grok-2-latest"
[ai.open_ai_compatible]
api_key = "xai-..."
base_url = "https://api.x.ai/v1/"
LM Studio¶
Connect to a local model served by LM Studio's OpenAI‑compatible endpoint.
Requirements
- Install LM Studio and start its server
- pip install openai or uv add openai
Configuration
[ai.models]
chat_model = "lmstudio/qwen2.5-coder-7b"
[ai.open_ai_compatible]
base_url = "http://127.0.0.1:1234/v1" # LM Studio server
Mistral¶
Use Mistral via its OpenAI‑compatible API.
Requirements
- Mistral API key
- pip install openai or uv add openai
Configuration
[ai.models]
chat_model = "mistral/mistral-small-latest" # e.g., codestral-latest, mistral-large-latest
[ai.open_ai_compatible]
api_key = "mistral-..."
base_url = "https://api.mistral.ai/v1/"
Together AI¶
Access multiple hosted models via Together AI's OpenAI‑compatible API.
Requirements
- Together AI API key
- pip install openai or uv add openai
Configuration
[ai.models]
chat_model = "together/mistralai/Mixtral-8x7B-Instruct-v0.1"
[ai.open_ai_compatible]
api_key = "tg-..."
base_url = "https://api.together.xyz/v1/"
Vercel v0¶
Use Vercel's v0 OpenAI‑compatible models for app-oriented generation.
Requirements
- v0 API key
- pip install openai or uv add openai
Configuration
[ai.models]
chat_model = "v0/v0-1.5-md"
[ai.open_ai_compatible]
api_key = "v0-..."
base_url = "https://api.v0.dev/" # Verify the endpoint in v0 docs
See the LiteLLM provider list for more options. For non‑compatible APIs, submit a feature request.