ManagePrompt

Local LLM call debugger. Captures every LLM API call during development with full request/response details, token usage, cost, and latency.

ManagePrompt has two parts:

  • CLI (Go binary) — runs the local server and web UI for viewing captured requests
  • npm package — instruments your app to capture and send LLM call data to the server

Step 1: Install the CLI

Homebrew

brew install techulus/tap/manageprompt

Go

go install github.com/techulus/manage-prompt/cmd/manageprompt@latest

Build from Source

go build -o bin/manageprompt ./cmd/manageprompt

Step 2: Start the Server

manageprompt start

Once the server is running, open the web UI at http://localhost:54321 to view captured requests.

CLI Commands

manageprompt start            # Start the server (default port 54321)
manageprompt start -p 8080    # Custom port
manageprompt clear            # Clear all stored requests
manageprompt version          # Print version

Step 3: Add the npm Package to Your App

pnpm add manageprompt
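
The package installs with any Node package manager; if you use npm or yarn instead of pnpm, the equivalent commands are:

npm install manageprompt
yarn add manageprompt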

Vercel AI SDK (Recommended)

import { generateText, wrapLanguageModel } from "ai";
import { openai } from "@ai-sdk/openai";
import { devToolsMiddleware } from "manageprompt";

const model = wrapLanguageModel({
  model: openai("gpt-4o"),
  middleware: devToolsMiddleware(),
});

const { text } = await generateText({ model, prompt: "Hello" });

Works with any AI SDK provider — OpenAI, Anthropic, Google, Mistral, etc.
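
For example, a minimal sketch swapping in the Anthropic provider (assumes the @ai-sdk/anthropic package is installed; the model ID is illustrative):

import { generateText, wrapLanguageModel } from "ai";
import { anthropic } from "@ai-sdk/anthropic";
import { devToolsMiddleware } from "manageprompt";

// Same middleware, wrapped around a different provider.
const model = wrapLanguageModel({
  model: anthropic("claude-3-5-sonnet-latest"),
  middleware: devToolsMiddleware(),
});

const { text } = await generateText({ model, prompt: "Hello" });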

capture()

Wraps any SDK call. Auto-detects provider, extracts tokens, cost, and latency.

import OpenAI from "openai";
import { capture } from "manageprompt";

const openai = new OpenAI();

const response = await capture(
  { model: "gpt-4o-mini", messages: [{ role: "user" as const, content: "Hello" }] },
  (input) => openai.chat.completions.create(input),
);

Works with OpenAI, Anthropic, and any SDK that returns a standard response object.
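
A sketch of the same pattern with the Anthropic SDK, assuming capture() forwards the input object unchanged to the callback (the model ID and max_tokens value are illustrative):

import Anthropic from "@anthropic-ai/sdk";
import { capture } from "manageprompt";

const anthropic = new Anthropic();

// max_tokens is required by the Anthropic Messages API.
const response = await capture(
  { model: "claude-3-5-haiku-latest", max_tokens: 256, messages: [{ role: "user" as const, content: "Hello" }] },
  (input) => anthropic.messages.create(input),
);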

log()

Manual logging for full control over what gets sent.

import { log } from "manageprompt";

log({
  model: "gpt-4o",
  provider: "openai",
  prompt: messages,
  response_text: "Hello!",
  tokens_input: 10,
  tokens_output: 5,
  latency_ms: 230,
});
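
As a rough sketch of filling those fields from a real SDK call (the field names come from the example above; how you pull values out of the response is up to you):

import OpenAI from "openai";
import { log } from "manageprompt";

const openai = new OpenAI();
const messages = [{ role: "user" as const, content: "Hello" }];

const start = Date.now();
const completion = await openai.chat.completions.create({ model: "gpt-4o", messages });

// Map the SDK response onto the fields log() expects.
log({
  model: "gpt-4o",
  provider: "openai",
  prompt: messages,
  response_text: completion.choices[0]?.message?.content ?? "",
  tokens_input: completion.usage?.prompt_tokens ?? 0,
  tokens_output: completion.usage?.completion_tokens ?? 0,
  latency_ms: Date.now() - start,
});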

What Gets Captured

  • Full request and response bodies
  • Visual request flow, including tool calls
  • Latency
  • Token usage (input, output, cache read, cache write)
  • Cost estimation (via models.dev pricing)

How It Works

  1. Your app makes an LLM call wrapped with the manageprompt npm package
  2. The wrapper captures the full request, response, tokens, cost, and latency
  3. Data is sent to the local ManagePrompt server (POST /api/ingest; see the sketch after this list)
  4. Everything is stored in SQLite (.manageprompt/requests.db in the current directory)
  5. The web UI updates in real-time via WebSocket
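
For reference, a hypothetical sketch of posting a record to the server directly, without the npm package. It assumes POST /api/ingest accepts the same JSON fields that log() takes; the actual schema may differ:

// Hypothetical: send one captured call straight to the local server.
// Assumes the ingest endpoint accepts the same fields as log(); check the package source for the real schema.
await fetch("http://localhost:54321/api/ingest", {
  method: "POST",
  headers: { "Content-Type": "application/json" },
  body: JSON.stringify({
    model: "gpt-4o",
    provider: "openai",
    prompt: [{ role: "user", content: "Hello" }],
    response_text: "Hello!",
    tokens_input: 10,
    tokens_output: 5,
    latency_ms: 230,
  }),
});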

License

MIT
