codegraph: The MCP-Native Code Knowledge Graph for AI Agents

codegraph is not another code visualization dashboard. It’s a local-first semantic knowledge graph built specifically for AI coding agents to consume — and for enterprise teams running AI-assisted development at scale, the efficiency numbers are hard to ignore: roughly 35% lower cost and ~70% fewer tool calls on large real-world codebases.

What codegraph Does

The codegraph project by Colby McHenry parses your codebase using tree-sitter, indexes symbol relationships, call graphs, import chains, and web routes into a SQLite database with full-text search, and exposes that structured graph through a Model Context Protocol (MCP) server.

The core insight is straightforward: AI coding agents like Claude Code, Cursor, and Codex spend a significant portion of their token budget in a “discovery phase” — grepping files, reading imports, tracing function calls to understand what a codebase does before they can answer a question or make a change. codegraph front-loads that discovery into an offline indexing step. At query time, the agent asks the graph instead of the filesystem.

The result is fewer round-trips, lower token spend, and faster responses — especially on large or legacy codebases where the discovery cost is highest.

Technical Architecture

codegraph ships as three things in one package: a local code intelligence library, a CLI, and an MCP server.

The indexing pipeline uses tree-sitter for language-aware parsing across 19+ programming languages and detects routes for 13–14 web frameworks. The SQLite backend supports full-text search and stores the complete symbol graph locally — no cloud sync, no API keys, no data leaving the machine.

Integration targets as of v0.9.5 include Claude Code, Cursor, Codex CLI, OpenCode, Gemini CLI, AntiGravity IDE, and Hermes Agent. The MCP server interface means any agent that speaks MCP can query the graph without custom integration work.

Where It Fits in the Landscape

The code graph space has been underserved since Sourcetrail was discontinued in December 2021 — “Sourcetrail alternative” remains one of the most-searched terms in the category four years later. CodeSee, the other notable player focused on developer onboarding, was acquired by GitKraken in 2024 with unclear post-acquisition direction.

Existing tools like dependency-cruiser and Madge generate useful graphs for JavaScript/TypeScript but are built for human visualization, not agent consumption. Mermaid diagrams are manually maintained. Cursor added dependency graph awareness in March 2026, but it’s embedded in the IDE — not a standalone queryable graph that agents can interrogate via structured API.

codegraph occupies a distinct niche: a structured, queryable, local semantic graph purpose-built for AI agent consumption. No currently active tool directly fills the same role.

Enterprise Relevance

The efficiency case is clearest on large codebases where AI agents currently spend the most tokens just finding their footing. For enterprise teams paying per-token API costs across dozens or hundreds of developers, a 70% reduction in tool calls compounds quickly.

Beyond cost, several enterprise use cases stand out:

Legacy codebase onboarding. New engineers on underdocumented monorepos can orient themselves through graph queries — “what calls this function?”, “what does this module depend on?” — without waiting for a senior engineer to walk them through it.
Refactoring impact analysis. Symbol-level dependency mapping answers “what breaks if I change this?” before the change is made, reducing regression risk in complex systems.
Cross-language repos. Support for 19 languages and 13+ web frameworks covers most enterprise polyglot environments, including mixed mobile/web stacks after the recent iOS/React Native/Expo bridge support.
Java enterprise. Spring/MyBatis enterprise flow tracing landed on May 26, 2026 — a direct signal that Java enterprise environments are becoming a first-class target. This matters for organizations still running significant Spring Boot workloads.
Compliance and audit. Structured call graphs can surface undocumented dependencies or unauthorized service interactions that are otherwise invisible in large codebases.

What to Watch

v1.0 milestone. The project shipped v0.9.5 approximately four months after its January 2026 launch. A stable v1.0 release is likely soon and would be the trigger for broader enterprise adoption consideration.
Java/Spring investment. The Spring/MyBatis tracing work is a leading indicator. Organizations running Java enterprise stacks should watch this closely.
MCP as a standard. Anthropic, Microsoft, and OpenAI are converging on MCP as the standard interface for AI agent tool use. codegraph’s MCP-native design becomes more valuable as that standardization hardens — it will work with any compliant agent without modification.
Community visibility. The project’s web footprint is still small relative to its GitHub activity. No major HackerNews or Reddit discussions were captured in this research pass. That gap may close quickly once v1.0 ships.

Bottom Line

codegraph is an early-stage open-source project, but it’s moving fast and targeting a real problem: AI coding agents are expensive to run on large codebases, and the discovery overhead is a significant part of that cost. The MCP-native design, 100% local operation, and broad language support make it worth evaluating now — particularly for teams already running Claude Code, Cursor, or Codex at scale.

Big Hat Group helps enterprise engineering teams evaluate and adopt developer tooling that fits their environment and security posture. If you’re assessing AI coding agent infrastructure or trying to reduce the cost of AI-assisted development on legacy systems, reach out — we can help you run a structured evaluation.