KISS overzealous agents goodbye. Plug & play agent oversight tool.
Based on research:
In our study agents calling Vibe Check improved success +27% and halved harmful actions -41%
Featured on PulseMCP “Most Popular (This Week)” • 5k+ monthly calls on Smithery.ai • research-backed oversight • STDIO + streamable HTTP transport
Plug-and-play mentor layer that stops agents from over-engineering and keeps them on the minimal viable path — research-backed MCP server keeping LLMs aligned, reflective and safe.
Run the server directly from npm without a local installation. Requires Node >=20.
# Start the server with stdio transport
npx -y @pv-bhat/vibe-check-mcp start --stdioFor client integrations (like Claude Desktop or Cursor), add this entry to your MCP configuration:
{
"mcpServers": {
"vibe-check-mcp": {
"command": "npx",
"args": ["-y", "@pv-bhat/vibe-check-mcp", "start", "--stdio"]
}
}
}The server will be downloaded and run on-demand. For detailed client setup and other commands like install and doctor, see the full documentation below.
- Featured on PulseMCP “Most Popular (This Week)” front page (week of 13 Oct 2025) 🔗
- Listed in Anthropic’s official Model Context Protocol repo 🔗
- Discoverable in the official MCP Registry 🔗
- Featured on Sean Kochel's Top 9 MCP servers for vibe coders 🔗
- Quickstart (npx)
- What is Vibe Check MCP?
- Overview
- The Problem: Pattern Inertia & Reasoning Lock-In
- Key Features
- What's New
- Development Setup
- Release
- Usage Examples
- Adaptive Metacognitive Interrupts (CPI)
- Agent Prompting Essentials
- When to Use Each Tool
- Documentation
- Research & Philosophy
- Security
- Roadmap
- Contributors & Community
- FAQ
- Listed on
- Credits & License
Vibe Check MCP keeps agents on the minimal viable path and escalates complexity only when evidence demands it. Vibe Check MCP is a lightweight server implementing Anthropic's Model Context Protocol. It acts as an AI meta-mentor for your agents, interrupting pattern inertia with Chain-Pattern Interrupts (CPI) to prevent Reasoning Lock-In (RLI). Think of it as a rubber-duck debugger for LLMs – a quick sanity check before your agent goes down the wrong path.
Vibe Check MCP pairs a metacognitive signal layer with CPI so agents can pause when risk spikes. Vibe Check surfaces traits, uncertainty, and risk scores; CPI consumes those triggers and enforces an intervention policy before the agent resumes. See the CPI integration guide and the CPI repo at https://github.com/PV-Bhat/cpi for wiring details.
Vibe Check invokes a second LLM to give meta-cognitive feedback to your main agent. Integrating vibe_check calls into agent system prompts and instructing tool calls before irreversible actions significantly improves agent alignment and common-sense. The high-level component map: docs/architecture.md, while the CPI handoff diagram and example shim are captured in docs/integrations/cpi.md.
Large language models can confidently follow flawed plans. Without an external nudge they may spiral into overengineering or misalignment. Vibe Check provides that nudge through short reflective pauses, improving reliability and safety.
| Feature | Description | Benefits |
|---|---|---|
| CPI Adaptive Interrupts | Phase-aware prompts that challenge assumptions | alignment, robustness |
| Multi-provider LLM | Gemini, OpenAI, Anthropic, and OpenRouter support | flexibility |
| History Continuity | Summarizes prior advice when sessionId is supplied |
context retention |
| Optional vibe_learn | Log mistakes and fixes for future reflection | self-improvement |
install --clientnow supports Cursor, Windsurf, and Visual Studio Code with idempotent merges, atomic writes, and.bakrollbacks.- HTTP-aware installers preserve
serverUrlentries for Windsurf and emit VS Code workspace snippets plus avscode:mcp/installlink when no config is provided. - Documentation now consolidates provider keys, transport selection, uninstall guidance, and dedicated client docs at docs/clients.md.
Use a lightweight “constitution” to enforce rules per sessionId that CPI will honor. Eg. constitution rules: “no external network calls,” “prefer unit tests before refactors,” “never write secrets to disk.”
API (tools):
update_constitution({ sessionId, rules })→ merges/sets rule set for the sessionreset_constitution({ sessionId })→ clears session rulescheck_constitution({ sessionId })→ returns effective rules for the session
# Clone and install
git clone https://github.com/PV-Bhat/vibe-check-mcp-server.git
cd vibe-check-mcp-server
npm ci
npm run build
npm testUse npm for all workflows (npm ci, npm run build, npm test). This project targets Node >=20.
Create a .env file with the API keys you plan to use:
# Gemini (default)
GEMINI_API_KEY=your_gemini_api_key
# Optional providers / Anthropic-compatible endpoints
OPENAI_API_KEY=your_openai_api_key
OPENROUTER_API_KEY=your_openrouter_api_key
ANTHROPIC_API_KEY=your_anthropic_api_key
ANTHROPIC_AUTH_TOKEN=your_proxy_bearer_token
ANTHROPIC_BASE_URL=https://api.anthropic.com
ANTHROPIC_VERSION=2023-06-01
# Optional overrides
# DEFAULT_LLM_PROVIDER accepts gemini | openai | openrouter | anthropic
DEFAULT_LLM_PROVIDER=gemini
DEFAULT_MODEL=gemini-2.5-proSee docs/TESTING.md for instructions on how to run tests.
The repository includes a helper script for one-command setup.
bash scripts/docker-setup.shSee Automatic Docker Setup for full details.
See API Keys & Secret Management for supported providers, resolution order, storage locations, and security guidance.
The CLI supports stdio and HTTP transports. Transport resolution follows this order: explicit flags (--stdio/--http) → MCP_TRANSPORT → default stdio. When using HTTP, specify --port (or set MCP_HTTP_PORT); the default port is 2091. The generated entries add --stdio or --http --port <n> accordingly, and HTTP-capable clients also receive a http://127.0.0.1:<port> endpoint.
Each installer is idempotent and tags entries with "managedBy": "vibe-check-mcp-cli". Backups are written once per run before changes are applied, and merges are atomic (*.bak files make rollback easy). See docs/clients.md for deeper client-specific references.
- Config path:
claude_desktop_config.json(auto-discovered per platform). - Default transport: stdio (
npx … start --stdio). - Restart Claude Desktop after installation to load the new MCP server.
- If an unmanaged entry already exists for
vibe-check-mcp, the CLI leaves it untouched and prints a warning.
- Config path:
~/.cursor/mcp.json(provide--configif you store it elsewhere). - Schema mirrors Claude’s
mcpServerslayout. - If the file is missing, the CLI prints a ready-to-paste JSON block for Cursor’s settings panel instead of failing.
- Config path: legacy
~/.codeium/windsurf/mcp_config.json, new builds use~/.codeium/mcp_config.json. - Pass
--httpto emit an entry withserverUrlfor Windsurf’s HTTP client. - Existing sentinel-managed
serverUrlentries are preserved and updated in place.
- Workspace config lives at
.vscode/mcp.json; profiles also storemcp.jsonin your VS Code user data directory. - Provide
--config <path>to target a workspace file. Without--config, the CLI prints a JSON snippet and avscode:mcp/install?...link you can open directly from the terminal. - VS Code supports optional dev fields; pass
--dev-watchand/or--dev-debug <value>to populatedev.watch/dev.debug.
- Restore the backup generated during installation (the newest
*.baknext to your config) to revert immediately. - To remove the server manually, delete the
vibe-check-mcpentry undermcpServers(Claude/Windsurf/Cursor) orservers(VS Code) as long as it is still tagged with"managedBy": "vibe-check-mcp-cli".
CPI (Chain-Pattern Interrupt) is the research-backed oversight method behind Vibe Check. It injects brief, well-timed “pause points” at risk inflection moments to re-align the agent to the user’s true priority, preventing destructive cascades and reasoning lock-in (RLI). In pooled evaluation across 153 runs, CPI nearly doubles success (~27%→54%) and roughly halves harmful actions (~83%→42%). Optimal interrupt dosage is ~10–20% of steps. Vibe Check MCP implements CPI as an external mentor layer at test time.
Links:
- 📄 CPI Paper (ResearchGate) — http://dx.doi.org/10.13140/RG.2.2.18237.93922
- 📘 CPI Reference Implementation (GitHub): https://github.com/PV-Bhat/cpi
- 📚 MURST Zenodo DOI (RSRC archival): https://doi.org/10.5281/zenodo.14851363
flowchart TD
A[Agent Phase] --> B{Monitor Progress}
B -- high risk --> C[CPI Interrupt]
C --> D[Reflect & Adjust]
B -- smooth --> E[Continue]
In your agent's system prompt, make it clear that vibe_check is a mandatory tool for reflection. Always pass the full user request and other relevant context. After correcting a mistake, you can optionally log it with vibe_learn to build a history for future analysis.
Example snippet:
As an autonomous agent you will:
1. Call vibe_check after planning and before major actions.
2. Provide the full user request and your current plan.
3. Optionally, record resolved issues with vibe_learn.
| Tool | Purpose |
|---|---|
| 🛑 vibe_check | Challenge assumptions and prevent tunnel vision |
| 🔄 vibe_learn | Capture mistakes, preferences, and successes |
| 🧰 update_constitution | Set/merge session rules the CPI layer will enforce |
| 🧹 reset_constitution | Clear rules for a session |
| 🔎 check_constitution | Inspect effective rules for a session |
- Agent Prompting Strategies
- CPI Integration
- Advanced Integration
- Technical Reference
- Automatic Docker Setup
- Philosophy
- Case Studies
- Changelog
This repository includes a CI-based security scan that runs on every pull request. It checks dependencies with npm audit and scans the source for risky patterns. See SECURITY.md for details and how to report issues.
- Structured output for
vibe_check: Return a JSON envelope such as{ advice, riskScore, traits }so downstream agents can reason deterministically while preserving readable reflections. - Agent prompt starter kit: Publish a plug-and-play system prompt snippet that teaches the CPI dosage principle (10–20% of steps), calls out risk inflection points, and reminds agents to include the last 5–10 tool calls in
taskContext. - Documentation refresh: Highlight the new prompt template and context requirements throughout the README and integration guides.
- LLM resilience: Wrap
generateResponseinsrc/utils/llm.tswith retries and exponential backoff, with a follow-up circuit breaker once the basics land. - Input sanitization: Validate and cleanse tool arguments in
src/index.tsto mitigate prompt-injection vectors. - State stewardship: Add TTL-based cleanup in
src/utils/state.tsand switchsrc/utils/storage.tsfile writes tofs.promisesto avoid blocking the event loop.
These initiatives are tracked as community-facing GitHub issues so contributors can grab them and see progress in the open.
- Telemetry sanity checks: Add a lint-style CI step that verifies
docs/examples compile (e.g., TypeScript snippet type-check) to catch drift between docs and code. - CLI help polish: Ensure every CLI subcommand prints a concise
--helpexample aligned with the refreshed prompt guidance. - Docs navigation cleanup: Cross-link
docs/agent-prompting.mdanddocs/technical-reference.mdfrom the README section headers to reduce context switching for new contributors.
Contributions are welcome! See CONTRIBUTING.md.
Vibe Check MCP is released under the MIT License. Built for reliable, enterprise-ready AI agents.
Vibe Check MCP created by: Pruthvi Bhat, Initiative - https://murst.org/