trace-claude-code
About
This skill automatically traces Claude Code conversations to Braintrust for observability. It captures entire sessions, individual conversation turns, and every tool call as hierarchical traces. Use it to gain full visibility into AI-assisted coding workflows for debugging and analysis.
Quick Install
Claude Code
Recommendednpx skills add parcadei/Continuous-Claude-v3/plugin add https://github.com/parcadei/Continuous-Claude-v3git clone https://github.com/parcadei/Continuous-Claude-v3.git ~/.claude/skills/trace-claude-codeCopy and paste this command in Claude Code to install this skill
Documentation
Trace Claude Code to Braintrust
Automatically send Claude Code conversations to Braintrust for tracing and observability. Get full visibility into your AI coding sessions with hierarchical traces showing sessions, turns, and every tool call.
What you get
Claude Code Session (root trace)
├── Turn 1: "Add error handling"
│ ├── Read: src/app.ts
│ ├── Edit: src/app.ts
│ └── Response: "I've added try-catch..."
├── Turn 2: "Now run the tests"
│ ├── Terminal: npm test
│ └── Response: "All tests pass..."
└── Turn 3: "Great, commit this"
├── Terminal: git add .
├── Terminal: git commit -m "..."
└── Response: "Changes committed..."
How it works
Four hooks capture the complete workflow:
| Hook | What it captures |
|---|---|
| SessionStart | Creates root trace when you start Claude Code |
| PostToolUse | Captures every tool call (file reads, edits, terminal commands) |
| Stop | Captures conversation turns (your message + Claude's response) |
| SessionEnd | Logs session summary when you exit |
Quick setup
Run the setup script in any project directory where you want tracing:
bash /path/to/skills/trace-claude-code/setup.sh
The script prompts for your API key and project name, then configures all hooks automatically.
Manual setup
Prerequisites
- Claude Code CLI installed
- Braintrust API key
jqcommand-line tool (brew install jqon macOS)
Configuration
Create .claude/settings.local.json in your project directory:
{
"hooks": {
"SessionStart": [
{
"hooks": [
{
"type": "command",
"command": "bash /path/to/hooks/session_start.sh"
}
]
}
],
"PostToolUse": [
{
"matcher": "*",
"hooks": [
{
"type": "command",
"command": "bash /path/to/hooks/post_tool_use.sh"
}
]
}
],
"Stop": [
{
"hooks": [
{
"type": "command",
"command": "bash /path/to/hooks/stop_hook.sh"
}
]
}
],
"SessionEnd": [
{
"hooks": [
{
"type": "command",
"command": "bash /path/to/hooks/session_end.sh"
}
]
}
]
},
"env": {
"TRACE_TO_BRAINTRUST": "true",
"BRAINTRUST_API_KEY": "sk-...",
"BRAINTRUST_CC_PROJECT": "my-project"
}
}
Replace /path/to/hooks/ with the actual path to this skill's hooks directory.
Environment variables
| Variable | Required | Description |
|---|---|---|
TRACE_TO_BRAINTRUST | Yes | Set to "true" to enable tracing |
BRAINTRUST_API_KEY | Yes | Your Braintrust API key |
BRAINTRUST_CC_PROJECT | No | Project name (default: claude-code) |
BRAINTRUST_CC_DEBUG | No | Set to "true" for verbose logging |
Viewing traces
After running Claude Code with tracing enabled:
- Go to braintrust.dev
- Navigate to your project (e.g.,
claude-code) - Click Logs to see all traced sessions
Each trace shows:
- Session root: The overall Claude Code session
- Turns: Each conversation exchange (user input → assistant response)
- Tool calls: Individual operations (file reads, edits, terminal commands)
Trace structure
Traces are hierarchical:
-
Session (root span)
span_attributes.type:"task"metadata.session_id: Unique session identifiermetadata.workspace: Project directory
-
Turn (child of session)
span_attributes.type:"llm"input: User messageoutput: Assistant responsemetadata.turn_number: Sequential turn number
-
Tool call (child of turn or session)
span_attributes.type:"tool"input: Tool input (file path, command, etc.)output: Tool resultmetadata.tool_name: Name of the tool used
Troubleshooting
No traces appearing
-
Check hooks are running:
tail -f ~/.claude/state/braintrust_hook.log -
Verify environment variables in
.claude/settings.local.json:TRACE_TO_BRAINTRUSTmust be"true"BRAINTRUST_API_KEYmust be valid
-
Enable debug mode:
{ "env": { "BRAINTRUST_CC_DEBUG": "true" } }
Permission errors
Make hook scripts executable:
chmod +x /path/to/hooks/*.sh
Missing jq command
Install jq:
- macOS:
brew install jq - Ubuntu/Debian:
sudo apt-get install jq
State issues
Reset the tracing state:
rm ~/.claude/state/braintrust_state.json
Hook logs
View detailed hook execution logs:
# Follow logs in real-time
tail -f ~/.claude/state/braintrust_hook.log
# View last 50 lines
tail -50 ~/.claude/state/braintrust_hook.log
# Clear logs
> ~/.claude/state/braintrust_hook.log
File structure
hooks/
├── common.sh # Shared utilities (logging, API, state)
├── session_start.sh # Creates root trace span
├── post_tool_use.sh # Captures tool calls
├── stop_hook.sh # Captures conversation turns
└── session_end.sh # Finalizes trace
Alternative: SDK integration
For programmatic use with the Claude Agent SDK, use the native Braintrust integration:
import { initLogger, wrapClaudeAgentSDK } from "braintrust";
import * as claudeSDK from "@anthropic-ai/claude-agent-sdk";
initLogger({
projectName: "my-project",
apiKey: process.env.BRAINTRUST_API_KEY,
});
const { query, tool } = wrapClaudeAgentSDK(claudeSDK);
See Braintrust Claude Agent SDK docs for details.
GitHub Repository
Related Skills
content-collections
MetaThis skill provides a production-tested setup for Content Collections, a TypeScript-first tool that transforms Markdown/MDX files into type-safe data collections with Zod validation. Use it when building blogs, documentation sites, or content-heavy Vite + React applications to ensure type safety and automatic content validation. It covers everything from Vite plugin configuration and MDX compilation to deployment optimization and schema validation.
himalaya-email-manager
CommunicationThis Claude Skill enables email management through the Himalaya CLI tool using IMAP. It allows developers to search, summarize, and delete emails from an IMAP account with natural language queries. Use it for automated email workflows like getting daily summaries or performing batch operations directly from Claude.
sglang
MetaSGLang is a high-performance LLM serving framework that specializes in fast, structured generation for JSON, regex, and agentic workflows using its RadixAttention prefix caching. It delivers significantly faster inference, especially for tasks with repeated prefixes, making it ideal for complex, structured outputs and multi-turn conversations. Choose SGLang over alternatives like vLLM when you need constrained decoding or are building applications with extensive prefix sharing.
evaluating-llms-harness
TestingThis Claude Skill runs the lm-evaluation-harness to benchmark LLMs across 60+ standardized academic tasks like MMLU and GSM8K. It's designed for developers to compare model quality, track training progress, or report academic results. The tool supports various backends including HuggingFace and vLLM models.
