60 days of Pro — freeNo credit card required63% fewer tokens per sessionLocal-first · code stays homeShip today60 days of Pro — freeNo credit card required63% fewer tokens per sessionLocal-first · code stays homeShip today
✦ benchmarked on Cal.com · 40k stars

2.6× cheaper
AI coding sessions.

Ragtoolina gives your AI assistant semantic context about your codebase. Same answers — 63% fewer tokens.

or $ brew install ragtoolina
Your AI assistantclaude · cursor · copilot
Ragtoolinalocal MCP server · on-device
Your repo~/Dev/app
Embeddingson-device only
the problem

A napkin sketch of why your
AI bill is fat.

Most agents read like they're being paid by the token. Turns out, they are.

Without RAG

  • 01agent greps randomly
  • 02reads 12 irrelevant files
  • 03burns 8k tokens
  • 04still asks clarifying q's
$7.81/ session

With Ragtoolina

  • 01semantic retrieval
  • 02reads 2 relevant chunks
  • 032.8k tokens — done
  • 04one-shot answer
$3.01/ session
same answer · $4.80 less · every single session ✨
how it works

Three steps. That's it.

No infra changes. No new workflow. ~3 minutes.

01

Download & launch

Install the macOS app. Runs locally — your code never leaves your machine.

R
Ragtoolina.appmacOS 12+ · Apple Silicon
local processingMCP server baked inauto-updates
02

Add your projects

Point at any folder. Ragtoolina indexes symbols + semantic meaning in seconds.

Project folder
~/Projects/my-saas-app
● indexing · 87%
03

Connect & code

Any MCP-compatible tool. Add one line of config. Done.

// .cursor/mcp.json
{
  "mcpServers": {
    "ragtoolina": {
      "command": "ragtoolina",
      "args": ["mcp"]
    }
  }
}
numbers don't lie

Measured, not marketed.

Real benchmark on Cal.com (40k⭐) with Claude Sonnet 4. No cherry-picking.

63%
Token savings
avg reduction per coding session
52%
Fewer tool calls
less back-and-forth with your AI
2.6×
Cost reduction
$3.01 vs $7.81 per session
honest benchmark

We publish the bad row too.

Real queries run on Cal.com. Q1 shows overhead — we're not hiding it.

QueryWithoutWith RagtoolinaSavings
Find a specific exported functionsimple lookup — slight overhead1,2401,510+22% overhead
Explain the authentication flow8,5202,81067% saved
Refactor a complex component12,3404,15066% saved
Debug an API endpoint9,7803,24067% saved
Add a new feature across files7,1702,52065% saved
Average across all queries7,8102,84663% saved
calculate your savings

Drag the dial. Do the math.

Based on average token use per request, 22 workdays / month.

🤔 curious😎 enjoyer🚀 vibe coder
Provider
you'd save, monthly
$333
$528$195 / mo
yearly: $3,996
that's a whole team offsite ✈
integrations

Works with your tools.

Any MCP-compatible AI tool. Plug in once, save everywhere.

C

Cursor

AI-native IDE

Claude Code

CLI agent
W

Windsurf

AI code editor
V

VS Code

via Cline / Roo

Claude Desktop

desktop app
G

GitLab

git platform
soon

supports anything that speaks the Model Context Protocol

enterprise

Deploy on your infra.

For teams whose code has opinions about leaving the building.

Your infrastructure

On-prem or private cloud. No data leaves your network.

Full control

Manage access, storage, and updates on your terms.

Compliance-ready

SOC 2, HIPAA, internal security requirements — handled.

deployment diagram ↓
Your VPCaws · gcp · bare metal
Your reposgit · gitlab · github
Ragtoolina (self-hosted)your keys · your embeddings
Your devs' IDEsprivate MCP endpoint
pricing

Simple. Four tiers.

Start free. Upgrade when it hurts not to.

Free

For personal projects.

$0forever
  • 1 project
  • 10,000 chunks
  • Semantic search
  • MCP server
  • Community support

Team

Shared context for a squad.

$15/ seat / mo
  • 10 projects
  • 1M chunks
  • Team workspace
  • Shared indexes
  • Admin controls

Enterprise

Big fish. Big compliance.

Custom
  • Unlimited everything
  • Self-hosted deploy
  • SSO / SAML
  • Dedicated support
  • Custom SLA
questions, answered

Probably the ones you're thinking.

Can't find yours? Email us — one of us will reply.

Does my code leave my machine?

Never. Embeddings and indexing happen locally. We literally can't read your code.

Which languages are supported?

TypeScript, JavaScript, Python, Go, Rust, Swift, Kotlin, Java, C++, Ruby, PHP. More monthly.

Does it slow down my IDE?

No. Ragtoolina runs as a separate local service. Your IDE just asks it questions.

How is this different from Cursor's @codebase?

Tool-agnostic. The same index powers Cursor, Claude Code, Windsurf — no re-embedding per tool.

Start saving tokens today.

Download the app. Add your project. That's it.

⬇ Download for MacSign up free →
takes 3 minutes · pinky promise
© 2026 Ragtoolina · built in a terminalv 0.9.3 · build 207 · 63% lighter