Data Collection

Curated datasets from various domains, regularly updated and freely available. Each set includes metadata, download options, and mirror links for major platforms.

La Corniche near Monaco. Claude Monet, 1884

FOMC Meeting Statements & Minutes

GitHub Actions
Automatically scraped Federal Reserve FOMC meeting statements and minutes since 2000. Tracks US monetary policy changes over time, with metadata including meeting dates, statement types, and full text content. Updated weekly to capture the latest policy decisions and economic outlook.

Download options

# Download the CSV with curl
curl -o fomc_statements.csv https://raw.githubusercontent.com/vtasca/fed-statement-scraping/main/data/fomc_statements.csv
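
For readers who would rather load the file programmatically, here is a minimal Python sketch using only the standard library. It assumes the file is UTF-8 encoded and has a header row; the function names `parse_csv` and `fetch_csv` are illustrative and not part of the dataset's tooling. No column names are assumed, so the sketch only returns the rows as parsed.

```python
import csv
import io
import urllib.request

FOMC_URL = (
    "https://raw.githubusercontent.com/vtasca/"
    "fed-statement-scraping/main/data/fomc_statements.csv"
)

# Full-text fields can be long; raise the csv module's default
# per-field limit (131072 characters) so large cells do not raise an error.
csv.field_size_limit(10_000_000)


def parse_csv(text):
    """Parse CSV text into a list of dicts keyed by the header row."""
    # newline="" leaves line endings untouched so the csv module can
    # handle quoted fields that span several lines.
    return list(csv.DictReader(io.StringIO(text, newline="")))


def fetch_csv(url, timeout=30):
    """Download a CSV file and return its parsed rows (hypothetical helper)."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        # Assumption: the file is UTF-8 encoded.
        text = response.read().decode("utf-8")
    return parse_csv(text)
```

Calling `fetch_csv(FOMC_URL)` returns one dict per row; `rows[0].keys()` then shows the actual column names, which is safer than guessing them. A proper CSV parser matters here because statement texts are likely to contain commas and line breaks, so line-based tools such as `wc -l` would miscount records.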

Wikipedia Article Pageviews

GitHub Actions
Daily aggregation of the 100 most popular Wikipedia articles by pageviews since 2016. Enables tracking of trending topics and public interest patterns over time. Includes article titles, pageview counts, dates, and language information, so it can serve as a proxy for web search intent.

Download options

# Download the CSV with curl
curl -o pageviews.csv https://raw.githubusercontent.com/vtasca/wikipedia-pageviews/main/data/pageviews.csv
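
As a rough illustration of the trend tracking described above, the following Python sketch sums pageviews per article across all days and returns the largest totals. The column names are not documented here, so they are passed in as parameters; `top_articles`, `title_col`, and `views_col` are hypothetical names chosen for this example.

```python
from collections import Counter


def top_articles(rows, title_col, views_col, n=10):
    """Sum pageviews per article across all rows and return the n largest.

    rows      : iterable of dicts, e.g. from csv.DictReader
    title_col : name of the column holding the article title (assumption)
    views_col : name of the column holding the pageview count (assumption)
    """
    totals = Counter()
    for row in rows:
        # CSV values arrive as strings, so convert the count before adding.
        totals[row[title_col]] += int(row[views_col])
    # most_common returns (title, total) pairs sorted by total, descending.
    return totals.most_common(n)
```

To use it, open the downloaded `pageviews.csv` with `csv.DictReader`, check the header for the real column names, and pass those in. Because the dataset carries language information, filtering rows to a single language first avoids merging same-titled articles from different editions.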