Continue.dev + Ollama: Free Local AI Coding Assistant
Continue + Ollama Quick Start
# 1. Install Ollama and models
curl -fsSL https://ollama.ai/install.sh | sh
ollama pull qwen2.5-coder:1.5b
ollama pull llama3.1:8b
# 2. Install Continue extension
# VS Code: Search "Continue" in Extensions
# JetBrains: Settings → Plugins → "Continue"
# 3. Start coding with local AI!
What is Continue.dev?
Continue.dev is the leading open-source AI coding assistant, offering a free alternative to GitHub Copilot that runs entirely on your machine with local models.
Key Statistics
| Metric | Value |
|---|---|
| GitHub Stars | 31,300+ |
| Contributors | 450+ |
| License | Apache 2.0 |
| IDE Support | VS Code, JetBrains |
| Backing | Y Combinator (W23) |
Why Continue + Ollama?
- Free forever - No $10-20/month subscription
- 100% private - Code never leaves your machine
- Fully customizable - Any model, any workflow
- Open source - Audit, modify, contribute
- Enterprise-ready - Used by Siemens, Morningstar
Installation
Step 1: Install Ollama
macOS:
brew install ollama
Linux:
curl -fsSL https://ollama.ai/install.sh | sh
Windows: Download from ollama.ai and run the installer.
Step 2: Pull Required Models
# Start Ollama
ollama serve
# Autocomplete model (fast, small)
ollama pull qwen2.5-coder:1.5b
# Chat model (quality, reasoning)
ollama pull llama3.1:8b
# Embeddings for codebase search
ollama pull nomic-embed-text
# Verify
ollama list
Step 3: Install Continue Extension
VS Code:
- Open Extensions (Cmd/Ctrl + Shift + X)
- Search "Continue"
- Click Install
JetBrains:
- Settings → Plugins → Marketplace
- Search "Continue"
- Install and restart
Step 4: Configure Continue
Continue reads configuration from:
- macOS/Linux: ~/.continue/config.yaml
- Windows: %USERPROFILE%\.continue\config.yaml
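If you script against this file, the location can be resolved cross-platform in one expression; a minimal Python sketch (the path convention is the one above, the helper name is ours):

```python
from pathlib import Path

def continue_config_path() -> Path:
    """Resolve the Continue config file location (~/.continue/config.yaml).

    Path.home() maps to $HOME on macOS/Linux and %USERPROFILE% on Windows,
    so this single expression covers both cases listed above.
    """
    return Path.home() / ".continue" / "config.yaml"

print(continue_config_path())
```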
Complete Configuration Guide
Basic config.yaml
name: Local AI Assistant
version: 1.0.0
schema: v1

models:
  # Chat and reasoning (quality model)
  - name: Llama 3.1 8B
    provider: ollama
    model: llama3.1:8b
    apiBase: http://localhost:11434
    roles:
      - chat
      - edit
      - apply
    defaultCompletionOptions:
      temperature: 0.7
      contextLength: 8192

  # Tab autocomplete (fast model)
  - name: Qwen Coder 1.5B
    provider: ollama
    model: qwen2.5-coder:1.5b
    roles:
      - autocomplete
    autocompleteOptions:
      debounceDelay: 250
      maxPromptTokens: 1024
      multilineCompletions: auto

  # Embeddings for @codebase
  - name: Nomic Embed
    provider: ollama
    model: nomic-embed-text
    roles:
      - embed

# Context providers
context:
  - provider: code
  - provider: docs
  - provider: diff
  - provider: terminal
  - provider: folder
  - provider: codebase

# Coding rules
rules:
  - Give concise, focused responses
  - Follow existing code style
  - Prefer TypeScript over JavaScript
Advanced Configuration (24GB+ VRAM)
name: Power User Config
version: 1.0.0
schema: v1

models:
  # Primary reasoning model
  - name: DeepSeek R1 32B
    provider: ollama
    model: deepseek-r1:32b
    apiBase: http://localhost:11434
    roles:
      - chat
      - edit
      - apply
    capabilities:
      - tool_use  # Enable agent mode
    defaultCompletionOptions:
      temperature: 0.7
      contextLength: 16384
      topP: 0.9

  # Fast autocomplete
  - name: StarCoder 3B
    provider: ollama
    model: starcoder2:3b
    roles:
      - autocomplete
    autocompleteOptions:
      debounceDelay: 200
      maxPromptTokens: 2048
      multilineCompletions: auto

  # Embeddings
  - name: Nomic Embed
    provider: ollama
    model: nomic-embed-text
    roles:
      - embed

# Custom slash commands
prompts:
  - name: test
    description: Generate unit tests
    prompt: |
      Write comprehensive unit tests for this code.
      Use Jest/Vitest. Cover edge cases.
  - name: refactor
    description: Refactor for readability
    prompt: |
      Refactor this code for better readability.
      Explain your changes.
  - name: review
    description: Code review
    prompt: |
      Review this code for:
      - Bugs and edge cases
      - Performance issues
      - Security concerns
      - Code style
      Provide actionable feedback.

# MCP servers for extended functionality
mcpServers:
  - name: filesystem
    command: npx
    args:
      - "-y"
      - "@modelcontextprotocol/server-filesystem"
      - "/path/to/allowed/directory"
Autodetect Models (Simplest)
name: Simple Config
schema: v1

models:
  - name: Autodetect
    provider: ollama
    model: AUTODETECT
    roles:
      - chat
      - edit
      - autocomplete
Best Models by Hardware
4-8GB VRAM (RTX 3060, M1/M2 8GB)
| Role | Model | VRAM |
|---|---|---|
| Autocomplete | qwen2.5-coder:1.5b | ~2GB |
| Chat | llama3.1:8b | ~6GB |
| Embeddings | nomic-embed-text | ~1GB |
12-16GB VRAM (RTX 4070, M2 Pro)
| Role | Model | VRAM |
|---|---|---|
| Autocomplete | starcoder2:3b | ~4GB |
| Chat | codellama:13b | ~10GB |
| Embeddings | nomic-embed-text | ~1GB |
24GB+ VRAM (RTX 4090, M3 Max)
| Role | Model | VRAM |
|---|---|---|
| Autocomplete | starcoder2:3b | ~4GB |
| Chat | deepseek-r1:32b | ~20GB |
| Embeddings | nomic-embed-text | ~1GB |
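The VRAM figures above follow a simple rule of thumb: at 4-bit quantization (Ollama's default for most tags), model weights need roughly half a byte per parameter, plus headroom for the KV cache and runtime buffers. A rough estimator as a Python sketch (the formula and the ~20% overhead constant are our approximation, not from Ollama):

```python
def estimate_vram_gb(params_billion: float, bits_per_weight: int = 4,
                     overhead: float = 0.2) -> float:
    """Rough VRAM estimate: quantized weights plus a flat fraction
    of overhead for KV cache and runtime buffers."""
    weight_bytes = params_billion * 1e9 * bits_per_weight / 8
    return weight_bytes * (1 + overhead) / 1e9

# 8B model at 4-bit: ~4.8 GB, close to the 4.7 GB `ollama ps` reports for llama3.1:8b
print(f"{estimate_vram_gb(8):.1f} GB")
# 32B model at 4-bit: ~19.2 GB, matching the ~20 GB listed for deepseek-r1:32b
print(f"{estimate_vram_gb(32):.1f} GB")
```

Actual usage grows with context length, so budget above the estimate when you raise contextLength.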
Key Features
Tab Autocomplete
Press Tab to accept inline suggestions. Works in any file type.
Config options:
autocompleteOptions:
  debounceDelay: 250          # ms of idle time before triggering
  maxPromptTokens: 1024       # context given to the model
  multilineCompletions: auto
  onlyMyCode: true            # ignore node_modules etc.
Tip: Small fill-in-the-middle code models (qwen2.5-coder, starcoder2) are purpose-trained for completion, and for autocomplete they typically beat much larger general chat models on both latency and suggestion quality.
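To see what debounceDelay buys you, here is a small simulation (our own sketch, not Continue's code): a completion request fires only once the configured idle time follows a keystroke, so a burst of typing produces one request instead of one per keypress.

```python
def completion_requests(keystroke_times_ms, debounce_ms=250):
    """Return the times at which a debounced autocomplete request fires.

    A request fires debounce_ms after a keystroke, but only if no further
    keystroke arrives within that window.
    """
    fires = []
    for i, t in enumerate(keystroke_times_ms):
        next_t = keystroke_times_ms[i + 1] if i + 1 < len(keystroke_times_ms) else float("inf")
        if next_t - t >= debounce_ms:
            fires.append(t + debounce_ms)
    return fires

# Four quick keystrokes, then a pause: a single request fires at t=630ms
print(completion_requests([0, 120, 250, 380]))  # → [630]
```

Raising debounceDelay trades a little responsiveness for fewer model invocations, which is why it appears again under Performance Optimization below.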
Chat Interface
- VS Code: Cmd/Ctrl + L
- JetBrains: Cmd/Ctrl + J
Select code and ask questions. Add context with @ mentions:
- @codebase - Search the entire project
- @file - Reference a specific file
- @folder - Include a directory
- @docs - Documentation context
Edit Mode
Select code → Press Cmd/Ctrl + I → Describe changes → Apply
Continue modifies code while preserving formatting and style.
Agent Mode
Enable with capabilities: [tool_use] for autonomous multi-step tasks:
- Read and write files
- Run terminal commands
- Search codebase
- Make multiple changes
models:
  - name: Agent Model
    provider: ollama
    model: llama3.1:8b
    capabilities:
      - tool_use
Custom Slash Commands
Create shortcuts for common tasks:
prompts:
  - name: doc
    description: Add documentation
    prompt: Add comprehensive JSDoc/docstring to this code.
  - name: optimize
    description: Optimize performance
    prompt: Suggest performance optimizations for this code.
Use with /doc or /optimize in chat.
Performance Optimization
Reduce Autocomplete Latency
- Use small models (1.5B-3B parameters)
- Increase the debounce delay:
  autocompleteOptions:
    debounceDelay: 350
- Disable thinking for Qwen3:
  requestOptions:
    extraBodyProperties:
      think: false
Verify GPU Acceleration
# Check GPU usage
ollama ps
# Should show GPU layers loaded
NAME SIZE PROCESSOR
llama3.1:8b 4.7GB 100% GPU
Debug Issues
# Restart Ollama with debug logging
pkill ollama
OLLAMA_DEBUG=1 ollama serve
# Check Continue logs
cat ~/.continue/logs/core.log
Continue vs Alternatives
| Feature | Continue | GitHub Copilot | Cursor | Cline |
|---|---|---|---|---|
| Price | Free | $10-19/mo | $20-200/mo | Free |
| Open Source | Yes | No | No | Yes |
| Local Models | Full | No | Limited | Yes |
| IDE Support | VS Code, JetBrains | Many | VS Code fork | VS Code |
| Agent Mode | Yes | Yes | Yes | Yes |
| Customization | Excellent | Limited | Good | Good |
When to Choose Continue
- Privacy-critical projects - No code leaves your machine
- Cost-conscious teams - Save $120-240/year per developer
- Custom workflows - Build exactly what your team needs
- Open source preference - Full transparency and control
When to Choose Copilot
- Quick setup priority - Works out of the box
- Enterprise requirements - SSO, compliance features
- Multi-IDE teams - Wider IDE support
- Training data quality - GitHub's massive codebase
Troubleshooting
"Cannot connect to Ollama"
# 1. Ensure Ollama is running
ollama serve
# 2. Verify port
curl http://localhost:11434/api/version
# 3. Check config.yaml apiBase
apiBase: http://localhost:11434
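The same connectivity check can be scripted; a minimal Python sketch using only the standard library (the /api/version endpoint is part of Ollama's HTTP API, the helper name is ours):

```python
import urllib.request
import urllib.error

def ollama_reachable(base: str = "http://localhost:11434", timeout: float = 2.0) -> bool:
    """Return True if an Ollama server answers on its version endpoint."""
    try:
        with urllib.request.urlopen(f"{base}/api/version", timeout=timeout) as resp:
            return resp.status == 200
    except (urllib.error.URLError, OSError):
        return False

if not ollama_reachable():
    print("Ollama is not reachable - run `ollama serve` and check apiBase")
```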
Models Not Loading
# Pull models first
ollama pull qwen2.5-coder:1.5b
ollama list # Verify installed
# Restart Continue
# VS Code: Cmd/Ctrl + Shift + P → "Continue: Reload"
Slow Performance
- Switch to smaller autocomplete model
- Increase debounceDelay
- Reduce contextLength
- Check GPU usage:
ollama ps
Config Not Applied
# Validate YAML syntax (expand ~ explicitly; open() does not)
python3 -c "import yaml, os; yaml.safe_load(open(os.path.expanduser('~/.continue/config.yaml')))"
# Restart VS Code after changes
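Beyond syntax, a quick structural sanity check catches the most common misconfiguration: no model assigned to a role. A sketch that validates the parsed config dict (the required keys follow the config format shown earlier; the checks themselves are ours, so extend them to taste):

```python
def validate_continue_config(config: dict) -> list[str]:
    """Return a list of problems found in a parsed Continue config dict."""
    problems = []
    models = config.get("models")
    if not models:
        problems.append("no models defined")
        return problems
    seen_roles = set()
    for m in models:
        if "provider" not in m or "model" not in m:
            problems.append(f"model {m.get('name', '?')} missing provider/model")
        seen_roles.update(m.get("roles", []))
    for role in ("chat", "autocomplete"):
        if role not in seen_roles:
            problems.append(f"no model assigned the '{role}' role")
    return problems

# Mirrors the basic config above: one chat model, one autocomplete model
sample = {"models": [
    {"name": "Llama 3.1 8B", "provider": "ollama", "model": "llama3.1:8b",
     "roles": ["chat", "edit", "apply"]},
    {"name": "Qwen Coder 1.5B", "provider": "ollama",
     "model": "qwen2.5-coder:1.5b", "roles": ["autocomplete"]},
]}
print(validate_continue_config(sample))  # → []
```

In practice you would feed it `yaml.safe_load(...)` of your config.yaml, as in the one-liner above.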
MCP Integration
Extend Continue with Model Context Protocol servers:
mcpServers:
  # Database access
  - name: sqlite
    command: npx
    args: ["-y", "mcp-sqlite", "/path/to/db.sqlite"]

  # GitHub integration
  - name: github
    command: uvx
    args: [mcp-server-github]
    env:
      GITHUB_TOKEN: ${{ secrets.GITHUB_TOKEN }}

  # File system access
  - name: filesystem
    command: npx
    args: ["-y", "@modelcontextprotocol/server-filesystem", "/allowed/path"]
Use in chat: Type @ → Select "MCP" → Choose resource.
Key Takeaways
- Continue + Ollama = Free Copilot alternative with full privacy
- Use small models for autocomplete (1.5B-3B) and large for chat (8B-32B)
- nomic-embed-text enables powerful codebase search
- Agent mode requires capabilities: [tool_use] and 8B+ models
- Custom slash commands automate your team's workflows
- MCP servers extend Continue with external tools and data sources
- 31,300+ GitHub stars reflect broad community trust
Next Steps
- Compare local AI tools for model management
- Explore AI coding agents for autonomous development
- Compare Cursor vs Copilot vs Claude Code for alternatives
- Check VRAM requirements for model sizing
- Learn about MCP servers for tool integration
Continue.dev with Ollama delivers a professional AI coding experience without monthly subscriptions or privacy compromises. Whether you're a solo developer seeking GitHub Copilot features for free, or an enterprise team requiring local deployment for compliance, Continue provides the flexibility and performance to transform your coding workflow.