SKILL·48A487

open-notebook

Name: open-notebook
Author: K-Dense-AI

K-Dense-AI

Updated 1 month ago

4 views

31,025

3,113

31,025

View on GitHub

Metapdfwordaidata

About

Open Notebook is a self-hosted, open-source research assistant for organizing materials and performing AI-powered document analysis. It ingests diverse content like PDFs and videos, enabling features like generating summaries, creating podcasts, and chatting with documents using context-aware AI. Developers should use it to build private research workflows with support for multiple AI providers and local data control.

Quick Install

Claude Code

Recommended

Primary

npx skills add K-Dense-AI/claude-scientific-skills -a claude-code

Plugin CommandAlternative

/plugin add https://github.com/K-Dense-AI/claude-scientific-skills

Git CloneAlternative

git clone https://github.com/K-Dense-AI/claude-scientific-skills.git ~/.claude/skills/open-notebook

Copy and paste this command in Claude Code to install this skill

Documentation

Open Notebook

Overview

Open Notebook is an open-source, self-hosted alternative to Google's NotebookLM that enables researchers to organize materials, generate AI-powered insights, create podcasts, and have context-aware conversations with their documents — all while maintaining complete data privacy.

Unlike Google's Notebook LM, which has no publicly available API outside of the Enterprise version, Open Notebook provides a comprehensive REST API, supports 16+ AI providers, and runs entirely on your own infrastructure.

Key advantages over NotebookLM:

Full REST API for programmatic access and automation
Choice of 16+ AI providers (not locked to Google models)
Multi-speaker podcast generation with 1-4 customizable speakers (vs. 2-speaker limit)
Complete data sovereignty through self-hosting
Open source and fully extensible (MIT license)

Repository: https://github.com/lfnovo/open-notebook

Quick Start

Prerequisites

Docker Desktop installed
API key for at least one AI provider (or local Ollama for free local inference)

Installation

Deploy Open Notebook using Docker Compose:

# Download the docker-compose file
curl -o docker-compose.yml https://raw.githubusercontent.com/lfnovo/open-notebook/main/docker-compose.yml

# Set the required encryption key
export OPEN_NOTEBOOK_ENCRYPTION_KEY="your-secret-key-here"

# Launch the services
docker-compose up -d

Access the application:

Frontend UI: http://localhost:8502
REST API: http://localhost:5055
API Documentation: http://localhost:5055/docs

Configure AI Provider

After startup, configure at least one AI provider:

Navigate to Settings > API Keys in the UI
Add credentials for your preferred provider (OpenAI, Anthropic, etc.)
Test the connection and discover available models
Register models for use across the platform

Or configure via the REST API:

import requests

BASE_URL = "http://localhost:5055/api"

# Add a credential for an AI provider
response = requests.post(f"{BASE_URL}/credentials", json={
    "provider": "openai",
    "name": "My OpenAI Key",
    "api_key": "sk-..."
})
credential = response.json()

# Discover available models
response = requests.post(
    f"{BASE_URL}/credentials/{credential['id']}/discover"
)
discovered = response.json()

# Register discovered models
requests.post(
    f"{BASE_URL}/credentials/{credential['id']}/register-models",
    json={"model_ids": [m["id"] for m in discovered["models"]]}
)

Core Features

Notebooks

Organize research into separate notebooks, each containing sources, notes, and chat sessions.

import requests

BASE_URL = "http://localhost:5055/api"

# Create a notebook
response = requests.post(f"{BASE_URL}/notebooks", json={
    "name": "Cancer Genomics Research",
    "description": "Literature review on tumor mutational burden"
})
notebook = response.json()
notebook_id = notebook["id"]

Sources

Ingest diverse content types including PDFs, videos, audio files, web pages, and Office documents. Sources are processed for full-text and vector search.

# Add a web URL source
response = requests.post(f"{BASE_URL}/sources", data={
    "url": "https://arxiv.org/abs/2301.00001",
    "notebook_id": notebook_id,
    "process_async": "true"
})
source = response.json()

# Upload a PDF file
with open("paper.pdf", "rb") as f:
    response = requests.post(
        f"{BASE_URL}/sources",
        data={"notebook_id": notebook_id},
        files={"file": ("paper.pdf", f, "application/pdf")}
    )

Notes

Create and manage notes (human or AI-generated) associated with notebooks.

# Create a human note
response = requests.post(f"{BASE_URL}/notes", json={
    "title": "Key Findings",
    "content": "TMB correlates with immunotherapy response in NSCLC...",
    "note_type": "human",
    "notebook_id": notebook_id
})

Context-Aware Chat

Chat with your research materials using AI that cites sources.

# Create a chat session
session = requests.post(f"{BASE_URL}/chat/sessions", json={
    "notebook_id": notebook_id,
    "title": "TMB Discussion"
}).json()

# Send a message with context from sources
response = requests.post(f"{BASE_URL}/chat/execute", json={
    "session_id": session["id"],
    "message": "What are the key biomarkers for immunotherapy response?",
    "context": {"include_sources": True, "include_notes": True}
})

Search

Search across all materials using full-text or vector (semantic) search.

# Vector search across the knowledge base
results = requests.post(f"{BASE_URL}/search", json={
    "query": "tumor mutational burden immunotherapy",
    "search_type": "vector",
    "limit": 10
}).json()

# Ask a question with AI-powered answer
answer = requests.post(f"{BASE_URL}/search/ask/simple", json={
    "query": "How does TMB predict checkpoint inhibitor response?"
}).json()

Podcast Generation

Generate professional multi-speaker podcasts from research materials with 1-4 customizable speakers.

# Generate a podcast episode
job = requests.post(f"{BASE_URL}/podcasts/generate", json={
    "notebook_id": notebook_id,
    "episode_profile_id": episode_profile_id,
    "speaker_profile_ids": [speaker1_id, speaker2_id]
}).json()

# Check generation status
status = requests.get(f"{BASE_URL}/podcasts/jobs/{job['job_id']}").json()

# Download audio when ready
audio = requests.get(
    f"{BASE_URL}/podcasts/episodes/{status['episode_id']}/audio"
)

Content Transformations

Apply custom AI-powered transformations to content for summarization, extraction, and analysis.

# Create a custom transformation
transform = requests.post(f"{BASE_URL}/transformations", json={
    "name": "extract_methods",
    "title": "Extract Methods",
    "description": "Extract methodology details from papers",
    "prompt": "Extract and summarize the methodology section...",
    "apply_default": False
}).json()

# Execute transformation on text
result = requests.post(f"{BASE_URL}/transformations/execute", json={
    "transformation_id": transform["id"],
    "input_text": "...",
    "model_id": "model_id_here"
}).json()

Supported AI Providers

Open Notebook supports 16+ AI providers through the Esperanto library:

Provider	LLM	Embedding	Speech-to-Text	Text-to-Speech
OpenAI	Yes	Yes	Yes	Yes
Anthropic	Yes	No	No	No
Google GenAI	Yes	Yes	No	Yes
Vertex AI	Yes	Yes	No	Yes
Ollama	Yes	Yes	No	No
Groq	Yes	No	Yes	No
Mistral	Yes	Yes	No	No
Azure OpenAI	Yes	Yes	No	No
DeepSeek	Yes	No	No	No
xAI	Yes	No	No	No
OpenRouter	Yes	No	No	No
ElevenLabs	No	No	Yes	Yes
Perplexity	Yes	No	No	No
Voyage	No	Yes	No	No

Environment Variables

Key configuration variables for Docker deployment:

Variable	Description	Default
`OPEN_NOTEBOOK_ENCRYPTION_KEY`	Required. Secret key for encrypting stored credentials	None
`SURREAL_URL`	SurrealDB connection URL	`ws://surrealdb:8000/rpc`
`SURREAL_NAMESPACE`	Database namespace	`open_notebook`
`SURREAL_DATABASE`	Database name	`open_notebook`
`OPEN_NOTEBOOK_PASSWORD`	Optional password protection for the UI	None

API Reference

The REST API is available at http://localhost:5055/api with interactive documentation at /docs.

Core endpoint groups:

/api/notebooks - Notebook CRUD and source association
/api/sources - Source ingestion, processing, and retrieval
/api/notes - Note management
/api/chat/sessions - Chat session management
/api/chat/execute - Chat message execution
/api/search - Full-text and vector search
/api/podcasts - Podcast generation and management
/api/transformations - Content transformation pipelines
/api/models - AI model configuration and discovery
/api/credentials - Provider credential management

For complete API reference with all endpoints and request/response formats, see references/api_reference.md.

Architecture

Open Notebook uses a modern stack:

Backend: Python with FastAPI
Database: SurrealDB (document + relational)
AI Integration: LangChain with the Esperanto multi-provider library
Frontend: Next.js with React
Deployment: Docker Compose with persistent volumes

Important Notes

Open Notebook requires Docker for deployment
At least one AI provider must be configured for AI features to work
For free local inference without API costs, use Ollama
The OPEN_NOTEBOOK_ENCRYPTION_KEY must be set before first launch and kept consistent across restarts
All data is stored locally in Docker volumes for complete data sovereignty

GitHub Repository

K-Dense-AI/claude-scientific-skills

Path: skills/open-notebook

agent-skillsai-scientistbioinformaticschemoinformaticsclaudeclaude-skills

FAQ

Frequently asked questions

What is the open-notebook skill?

open-notebook is a Claude Skill by K-Dense-AI. Skills package instructions and resources that Claude loads on demand, so Claude can perform open-notebook-related tasks without extra prompting.

How do I install open-notebook?

Use the install commands on this page: add open-notebook to Claude Code as a plugin, or clone its repository into your skills directory, then restart Claude so it picks up the skill.

What category does open-notebook belong to?

open-notebook is in the Meta category, tagged pdf, word, ai and data.

Is open-notebook free to use?

Yes. open-notebook is listed on AIMCP and free to install. It runs inside Claude, so no separate service account is required to use the skill itself.

Related Skills

content-collections

Meta

This skill provides a production-tested setup for Content Collections, a TypeScript-first tool that transforms Markdown/MDX files into type-safe data collections with Zod validation. Use it when building blogs, documentation sites, or content-heavy Vite + React applications to ensure type safety and automatic content validation. It covers everything from Vite plugin configuration and MDX compilation to deployment optimization and schema validation.

View skill

polymarket

Meta

This skill enables developers to build applications with the Polymarket prediction markets platform, including API integration for trading and market data. It also provides real-time data streaming via WebSocket to monitor live trades and market activity. Use it for implementing trading strategies or creating tools that process live market updates.

View skill

creating-opencode-plugins

Meta

This skill helps developers create OpenCode plugins that hook into 25+ event types like commands, files, and LSP operations. It provides the plugin structure, event API specifications, and implementation patterns for JavaScript/TypeScript modules. Use it when you need to intercept, monitor, or extend the OpenCode AI assistant's lifecycle with custom event-driven logic.

View skill

sglang

Meta

SGLang is a high-performance LLM serving framework that specializes in fast, structured generation for JSON, regex, and agentic workflows using its RadixAttention prefix caching. It delivers significantly faster inference, especially for tasks with repeated prefixes, making it ideal for complex, structured outputs and multi-turn conversations. Choose SGLang over alternatives like vLLM when you need constrained decoding or are building applications with extensive prefix sharing.

View skill