Scribe - Intelligent Code Context for AI Agents

Scribe is a code analysis tool designed for AI agents and LLM-powered development workflows. Unlike simple file bundlers, Scribe understands code structure and dependencies—giving agents exactly the context they need without wasting tokens on irrelevant code.

The Problem: Context Retrieval is Expensive

When an AI agent needs to understand a function and its dependencies, the traditional approach is painful:

Traditional approach (4-10+ tool calls, ~30 seconds, wastes tokens):

grep "authenticate_user" --include="*.rs" — Find the function
read auth.rs — Read ENTIRE 800-line file
grep "use crate::" auth.rs — Find imports manually
read session.rs — Read ENTIRE dependency
grep "use crate::" session.rs — Find transitive imports
read crypto.rs — Another full file read
read config.rs — Keep going...

Result: Agent reads 4000+ lines, but only ~200 are relevant. Multiple round-trips, 95% wasted tokens.

Scribe approach (1 tool call, ~0.7 seconds, precise context):

scribe --covering-set "auth.rs:authenticate_user" --stdout

Returns authenticate_user + only the functions/types it uses:

auth.rs:authenticate_user (target)
session.rs:create_session (direct dependency)
crypto.rs:verify_password (direct dependency)
config.rs:AuthConfig (type dependency)

Single call, ~200 lines of precisely relevant code.

Key Differentiator: Surgical Code Retrieval

Unlike tools like repomix that bundle entire repositories, Scribe provides surgical precision:

Approach	Tool Calls	Tokens Used	Relevance
Manual grep + read	4-10+	~15,000	~5% relevant
Repomix (full bundle)	1	~500,000	~1% relevant
Scribe covering set	1	~2,000	95%+ relevant

This matters because:

Faster iteration: Single call vs. multiple round-trips
Lower cost: 10-100x fewer tokens per context retrieval
Better results: LLMs perform better with focused, relevant context
Automatic dependency resolution: No manual import tracing

Quick Start

For AI Agents (CLI)

# Get a function and all its dependencies
scribe --covering-set "src/auth.rs:authenticate_user" --stdout

# Get file-level dependencies (faster, less precise)
scribe --covering-set "src/auth.rs" --granularity file --stdout

# Analyze what code is affected by your current changes
scribe --covering-set-diff --stdout

# Limit depth for focused context
scribe --covering-set "src/lib.rs:Config" --max-depth 2 --stdout

Output Formats

# Text output (default)
scribe --covering-set "module.py:MyClass" --stdout

# XML output (structured, includes metadata)
scribe --covering-set "module.py:MyClass" --stdout --output-format xml

# JSON output (for programmatic use)
scribe --covering-set "module.py:MyClass" --stdout --output-format json

Example Output

Scribe Report
============
Total files: 3
Total tokens: 847
Algorithm: covering-set

=== src/auth.rs (312 tokens)
pub fn authenticate_user(credentials: &Credentials) -> Result<Session> {
    let user = lookup_user(&credentials.username)?;
    verify_password(&credentials.password, &user.password_hash)?;
    create_session(user.id)
}

=== src/session.rs (245 tokens)
pub fn create_session(user_id: UserId) -> Result<Session> {
    // ... only the relevant function, not the whole file
}

=== src/crypto.rs (290 tokens)
pub fn verify_password(input: &str, hash: &PasswordHash) -> Result<()> {
    // ...
}

Features

Covering Set Analysis

Entity-level granularity: Get specific functions/classes, not entire files
Automatic dependency resolution: Follows imports across your codebase
Multi-language support: Rust, Python, JavaScript/TypeScript, Go
Configurable depth: Control how deep to traverse dependencies
Diff-based analysis: Get context for your current git changes

Repository Bundling

Intelligent file selection: PageRank-based importance scoring
Token budget management: Stay within LLM context limits
Multiple output formats: HTML, XML, JSON, Markdown, Repomix-compatible

Code Analysis

Dependency graph construction: Understand code relationships
Heuristic scoring: Identify important files automatically
Git integration: Incorporate change history into analysis

Installation

npm (Recommended)

# Install globally
npm install -g @sibyllinesoft/scribe

# Or use directly with npx
npx @sibyllinesoft/scribe --help

Cargo

# From crates.io
cargo install scribe-cli

# From source
git clone https://github.com/sibyllinesoft/scribe
cd scribe/scribe-rs
cargo install --path .

AI Agent Integration

Scribe can automatically configure AI coding agents to use scribe for code context retrieval:

# Install integration for Claude Code (recommended)
scribe --install claude

# Install integration for OpenCode
scribe --install opencode

# Install for all detected agents
scribe --install

# Use warn mode instead of blocking (shows reminder but allows operation)
scribe --install claude --install-mode warn

This installs:

SCRIBE.md — Instructions for the agent on when/how to use scribe
Hooks — Pre-tool hooks that redirect Read/Grep on code files to scribe

After installation, restart your AI agent session for hooks to take effect.

What the hooks do

Block mode (default): Blocks Read/Grep on code files, tells agent to use scribe --covering-set
Warn mode: Shows a reminder but allows the operation

The hooks only affect code files (.rs, .py, .ts, etc.). Config files, docs, and other non-code files are unaffected.

Supported Languages

Import resolution and dependency tracking works for:

Language	Import Styles Supported
Rust	`use`, `mod`, grouped imports `use mod::{a, b}`
Python	`import`, `from...import`, relative imports
JavaScript/TypeScript	ES6 `import`, `require()`, type imports
Go	Single imports, block imports, aliased imports

CLI Reference

Covering Set Options

--covering-set <TARGET>     Find covering set for file or entity
                            Examples: "src/lib.rs", "src/auth.rs:login"

--covering-set-diff         Compute covering set for current git diff

--granularity <MODE>        file (whole files) or entity (functions/classes)
                            Default: file

--include-dependents        Include files that depend on target (impact analysis)

--max-depth <N>             Maximum dependency traversal depth

--max-files <N>             Maximum files in result

--stdout                    Output to stdout (for piping to other tools)

--output-format <FMT>       text (default), xml, json, markdown

Repository Bundling Options

--token-target <N>          Target token count for selection (default: 128000)

--include <PATTERNS>        Include only matching files

--exclude <PATTERNS>        Exclude matching files

--output-format <FMT>       text (default), html, xml, json, markdown, repomix

Agent Integration Options

--install [AGENT]           Install scribe integration for AI agents
                            AGENT: claude, opencode, or all (default: all)

--install-mode <MODE>       Hook behavior: block (default) or warn

Library Usage

Scribe can also be used as a Rust library:

use scribe::prelude::*;

#[tokio::main]
async fn main() -> Result<()> {
    // Analyze a repository
    let config = Config::default();
    let analysis = analyze_repository(".", &config).await?;

    // Get most important files
    for (file, score) in analysis.top_files(10) {
        println!("{}: {:.3}", file, score);
    }

    Ok(())
}

Feature Flags

[dependencies]
# Full installation (default)
scribe = "0.5"

# Minimal - core types only
scribe = { version = "0.5", default-features = false, features = ["core"] }

# Analysis without graph features
scribe = { version = "0.5", default-features = false, features = ["core", "analysis", "scanner"] }

Architecture

The CLI is built on a modular Rust workspace:

scribe-core — Shared types and configuration
scribe-scanner — File system traversal and filtering
scribe-patterns — Glob and gitignore pattern matching
scribe-analysis — Heuristics and scoring
scribe-graph — PageRank and dependency graph construction
scribe-selection — Covering sets and token budgeting

Performance

Covering set computation: ~0.7s for 140-file codebase
Full repository analysis: ~100ms for small repos, ~1-10s for large repos
Memory usage: ~2MB per 1000 files

Comparison with Other Tools

Feature	Scribe	Repomix	Manual
Dependency-aware selection	✅	❌	❌
Entity-level granularity	✅	❌	❌
Single-command context	✅	✅	❌
Token-efficient output	✅	❌	❌
Multi-language support	✅	✅	N/A
Git diff analysis	✅	❌	❌

License

Licensed under either of Apache License 2.0 or MIT license at your option.

Contributing

We welcome contributions! Please see the contributing guide for guidelines.

Name		Name	Last commit message	Last commit date
Latest commit History 153 Commits
.github/workflows		.github/workflows
docs		docs
scribe-rs		scribe-rs
spec		spec
templates/themes		templates/themes
tests		tests
tools		tools
.gitignore		.gitignore
.scribeignore		.scribeignore
LICENSE		LICENSE
NOTICE		NOTICE
README.md		README.md
mkdocs.yml		mkdocs.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Scribe - Intelligent Code Context for AI Agents

The Problem: Context Retrieval is Expensive

Key Differentiator: Surgical Code Retrieval

Quick Start

For AI Agents (CLI)

Output Formats

Example Output

Features

Covering Set Analysis

Repository Bundling

Code Analysis

Installation

npm (Recommended)

Cargo

AI Agent Integration

What the hooks do

Supported Languages

CLI Reference

Covering Set Options

Repository Bundling Options

Agent Integration Options

Library Usage

Feature Flags

Architecture

Performance

Comparison with Other Tools

License

Contributing

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

License

sibyllinesoft/scribe

Folders and files

Latest commit

History

Repository files navigation

Scribe - Intelligent Code Context for AI Agents

The Problem: Context Retrieval is Expensive

Key Differentiator: Surgical Code Retrieval

Quick Start

For AI Agents (CLI)

Output Formats

Example Output

Features

Covering Set Analysis

Repository Bundling

Code Analysis

Installation

npm (Recommended)

Cargo

AI Agent Integration

What the hooks do

Supported Languages

CLI Reference

Covering Set Options

Repository Bundling Options

Agent Integration Options

Library Usage

Feature Flags

Architecture

Performance

Comparison with Other Tools

License

Contributing

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages