What is Chart Library?

Chart Library is a chart-pattern intelligence engine for humans and AI agents. Anchor any (symbol, date, timeframe) and it returns the cohort of historical analogs — from 25M+ indexed patterns across 19,000+ symbols and 10 years — what those analogs did next, and a calibrated forward-return distribution, over web, REST API, and MCP.

Does Chart Library predict stock prices?

No. Chart Library never forecasts. It returns historical distributions — what actually happened after setups like yours (median, p10/p90 band, up rate) — plus the receipts to audit them. Direction and decisions stay with you or your agent.

How calibrated are Chart Library's bands?

The nominal 80% forward-return band held 80.8% across 300,000+ live, audited cases under symbol-disjoint evaluation. The coverage record is public and recomputed continuously at chartlibrary.io/calibration.

For DevelopersAI AgentsMCPConditional Distribution

How to Build a Stock-Research Agent That Doesn't Hallucinate

Chart Library Team·April 13, 2026·7 min read

The problem every stock-research agent has

If you've built an AI agent that answers questions like 'what usually happens after a breakout like this in NVDA,' you've hit the same wall everyone does: the model confidently narrates a number that has no historical backing. The base rate is either invented or pulled from the model's training cut-off, not from real data conditioned on the actual setup.

The fix is structural, not prompt-engineered. You need a tool the agent calls that returns real conditional base rates — not 'on average, NVDA goes up X%' but 'given this chart shape, filtered by current regime and sector, in a corpus of historical analogs that includes delisted names, here's the distribution of forward returns.' One call, one number the agent can reason about, one sample size so it knows when to hedge.

The primitive: POST /api/v1/cohort

Chart Library's Conditional Distribution endpoint is the smallest composable unit for this pattern. You send an anchor (symbol + date) and optional filters, you get back a cohort of historical matches plus the distribution of outcomes at 1/5/10 day horizons:

POST /api/v1/cohort body: {"anchor": {"symbol": "NVDA", "date": "2024-06-18"}, "horizons": [1, 5, 10], "top_k": 500}

Response (abbreviated): cohort_id: "coh_...", distributions: {"5": {"n": 492, "return_pct": {"p10": -5.17, "p50": +0.50, "p90": +5.59}, "hit_rate": {"above_entry": 0.541}}}, survivorship: {"included_delisted": 54, "total_matches": 500}

Every response includes a 15-minute cohort_id you can refine progressively, and a survivorship flag so the agent knows whether delisted names are part of the base rate.

Three filter dimensions that matter

The reason shape-only matching doesn't produce alpha on its own is that outcomes are conditional on context. The cohort API takes three filter dimensions that meaningfully shift the distribution:

filters.sector: "same_as_anchor" restricts to the same GICS sector (or SIC code for delisted names)
filters.regime.same_vix_bucket = true keeps only matches whose VIX regime is within ±15 percentile of today's
filters.regime.same_trend = true matches the sign of the SPY 20d trend at the match date

Real example: NVDA 2024-06-18 unfiltered shows 54% up at 5 days across 492 analogs. Apply same_sector + same_vix_bucket and 1d drops to 48.6% up while 10d rises to 55.2% — a meaningful conditional pattern (short-term mean reversion, medium-term continuation) that's invisible in the unconditional stats.

The edge-mining loop (where it gets powerful)

Single calls are fine. The real leverage is the loop: start broad, ask which filter matters, narrow, repeat. Three tools:

POST /api/v1/cohort — the initial cohort. Returns cohort_id.
GET /api/v1/cohort/{id}/explain — ranks candidate filters (VIX regime, trend, recent-5-years) by how much each one shifts the above-entry hit rate. Tells the agent which dimension is actually moving the distribution for this specific setup.
POST /api/v1/cohort/{id}/filter — narrows the stored cohort with whichever filter was most informative. No kNN re-run (sub-second) and returns a new cohort_id so agents can branch.

This is how agents (and humans) discover conditional structure rather than pattern-match to a canned base rate. The cohort_id keeps the expensive embedding search cached, so refinement is free. Fork, compare, keep the branch with the highest-confidence distribution.

MCP: one tool call in any agent framework

The Chart Library MCP server (pip install chartlibrary-mcp) exposes this primitive as a single tool agents call:

get_cohort_distribution(symbol="NVDA", date="2024-06-18", same_sector=True, same_vix_bucket=True)
explain_cohort_filters(cohort_id="coh_...", horizon=5)
refine_cohort_with_filters(cohort_id="coh_...", same_trend=True)

Drop the MCP server into your CrewAI, LangGraph, AutoGen, or Claude function-calling setup. The agent discovers the tool, calls it, and returns a number grounded in real historical base rates instead of a number it made up.

Why this matters

The next wave of AI agents in finance will be judged on whether their answers are wrong in ways users can't detect. A hallucinated base rate is indistinguishable from a real one at the language-output level. The only structural defense is to ground every claim in a retrieval call backed by real data — conditional, explicit, sample-sized, and survivorship-aware.

Chart Library's cohort primitive is built for exactly that pattern. Free sandbox tier, $29 Builder, $299 Agent (with burst + session handles + 1K req/min), and the MCP server is one pip install away.

Ready to build? Grab an API key at chartlibrary.io/developers and the MCP server on PyPI (chartlibrary-mcp). The conditional distribution primitive is live on the Free tier.

Ready to try Chart Library?

Anchor any ticker + date — see what history says about your setup, with cohort statistics, feature attribution, and AI narrative.

Try it free

Learn the methodology

Chart Library is built on four canonical concepts. Read the pillars to understand what backs the numbers in this post:

Cohort intelligence →

What it is and why it beats point forecasts.

Calibrated stock forecasting →

Why distributions beat point estimates.

Symbol-disjoint evaluation →

The eval discipline that prevents leakage.

Conformal prediction in finance →

The math behind calibrated bands.

How to Add a Stock Base-Rate MCP Node to LangGraph, the OpenAI Agents SDK, and the Claude Agent SDK

The same calibrated historical-base-rate node, wired into three agent frameworks unchanged. How to drop a 'what usually happens next' stock node into LangGraph, the OpenAI Agents SDK, and the Claude Agent SDK — with a boundary, provenance, and a blind-judge receipt — runnable offline for free.

How to Build a Market-Research Agent Crew in 2026: Frameworks, Data Costs, and the Missing Primitive

A practical 2026 guide to building a multi-agent market-research crew — the specialist roles, what the data actually costs ($0 to ~$250/mo), the frameworks that wire it together, and the one calibrated-base-rate node most crews are missing.

What Does It Cost to Build an AI Trading Agent in 2026? A Data-Stack Breakdown

The honest 2026 line-item cost of feeding a multi-agent trading crew real market data — which lanes are free (SEC EDGAR, FRED), which actually cost money (price, options, news), and the two realistic budgets: a $0–30/mo one-day-lagged crew vs a ~$180–270/mo live-everything crew.

Try It Yourself

AAPL Patterns NVDA Patterns TSLA Patterns SPY Patterns AMD Patterns

← All articles