subsets.io

Connect your LLM with the world's statistics

Simple primitives. Thousands of datasets

1
search_datasets// Find relevant statistical data
2
get_dataset_details// Understand data structure
3
execute_sql_query// Extract specific insights
Clone datasets locally. Or query remotely

Loading datasets...

Ground your LLM with structured data

Web search scrapes fragments from hundreds of pages with no lineage. Subsets queries authoritative sources directly. Even for World Development Indicators—one of the most common datasets online—LLMs still hallucinate. Over 80% of our warehouse isn't even on the web.

Accuracy on 20 questions

GPT 5.2
Without Subsets
75%
With Subsets
95%
Gemini 3.0 Flash
Without Subsets
75%
With Subsets
95%

Evaluated on World Development Indicators

Even basic facts get hallucinated

"What are the top 3 European countries by GDP per capita in 2022?"

Without Subsets
Luxembourg$126,000
Switzerland$92,000
Norway$89,000
With Subsets
Luxembourg$125,006
Ireland$103,983
Switzerland$92,101

Dataset: wdi_gdp_per_capita

Setup

Bash
claude mcp add --transport sse subsets https://mcp.subsets.io/sse

Then try it out

Bash
claude -p "Use the subsets data warehouse to find the correlation between GDP per capita and life expectancy"