Government Open Data

Curated directory of government open data portals, key datasets, and tools for analysis.

Updated April 2026

What Is Open Data?

Open data is government information published in formats that anyone can access, use, and redistribute without restrictions. It's the raw material of accountability — budget spreadsheets, procurement records, legislative votes, environmental monitoring, crime statistics, and much more.

The open data movement has transformed civic engagement. Journalists use procurement data to uncover corruption. Researchers use health data to identify disparities. Developers build apps that help citizens navigate government services. The possibilities are limited only by what governments publish and in what format.

Open Data Portals by Country

62 countries now operate national open data portals. Here are the most comprehensive and well-maintained ones:

Top-Rated Government Data Portals

CountryPortalDatasetsFormatsAPI AccessQuality
United Statesdata.gov300,000+CSV, JSON, XML, APIYes93
United Kingdomdata.gov.uk55,000+CSV, JSON, RDF, APIYes91
Francedata.gouv.fr45,000+CSV, JSON, APIYes89
Canadaopen.canada.ca95,000+CSV, JSON, XML, GeoJSONYes88
Australiadata.gov.au100,000+CSV, JSON, APIYes86
South Koreadata.go.kr70,000+CSV, JSON, XML, APIYes87
Germanygovdata.de60,000+CSV, JSON, RDFPartial82
Estoniaavaandmed.eesti.ee12,000+CSV, JSON, APIYes85
Indiadata.gov.in320,000+CSV, JSON, XML, APIYes74
Brazildados.gov.br12,000+CSV, JSONPartial71

International Open Data Sources

Beyond national portals, international organizations publish some of the most valuable governance datasets:

Essential International Datasets

  • World Bank Open Data: Economic indicators, development data, and financial statistics for 217 countries. Excellent API with consistent historical time series. The gold standard for comparative economic data.
  • IMF Data: Government finance statistics, fiscal data, exchange rates, and economic forecasts. Essential for understanding public finance across countries.
  • UN Data: Population, health, education, environment, and human development indicators. Covers nearly every country with standardized methodology.
  • OECD.Stat: Detailed governance, economic, and social statistics for 38 member countries. Excellent for comparing policies among developed nations.
  • EU Open Data Portal: Datasets from all EU institutions covering legislation, budgets, trade, agriculture, and regional development.
  • OpenAid / IATI: International aid flows — who gives what to whom, for what purpose. Critical for tracking development spending.

Key Dataset Categories

Budget & Financial Data

The most impactful category for accountability. Look for:

  • Budget documents: Revenue projections, spending allocations, actual expenditures
  • Procurement records: Government contracts — who won, for how much, through what process
  • Aid flows: International development aid received and distributed
  • Public debt: Government borrowing, bond issuance, and debt service costs
  • Tax revenue: Revenue by source, collection rates, and tax expenditures (deductions/exemptions)

Legislative Data

  • Bills and laws: Full text of proposed and enacted legislation
  • Voting records: How each legislator voted on each bill
  • Committee proceedings: Meeting minutes, hearing transcripts, and reports
  • Lobbying registries: Who is lobbying whom, on what issues, and for how much
  • Campaign finance: Political donations, spending, and donor information

Public Services & Performance

  • Healthcare: Hospital performance, wait times, health outcomes, disease surveillance
  • Education: School performance, graduation rates, spending per student
  • Crime: Reported crimes, clearance rates, police activity
  • Infrastructure: Road conditions, public transit performance, utility reliability
  • Environmental: Air/water quality, emissions data, permit compliance

Data Formats Explained

FormatUse CaseMachine-ReadableDifficulty
CSVTabular data, spreadsheetsYesEasy — opens in Excel/Sheets
JSONStructured data, APIsYesModerate — best with code
XMLStructured documentsYesModerate — verbose format
PDFReports, documentsNoEasy to read, hard to analyze
GeoJSON/ShapefileGeographic/spatial dataYesNeeds GIS software
APIReal-time/dynamic dataYesRequires programming

Beware: PDF Is Not Open Data

Many governments publish data as PDF documents and call it "open data." PDFs are designed for reading, not analysis. Extracting data from PDFs requires special tools and often introduces errors. True open data is published in machine-readable formats (CSV, JSON, XML) that can be directly imported into analysis tools. If your government publishes budgets only as PDFs, that's a red flag for transparency.

How to Work With Open Data

For Non-Technical Users

  1. Download CSV files and open them in Microsoft Excel, Google Sheets, or LibreOffice Calc
  2. Use pivot tables to summarize and explore large datasets
  3. Create charts to visualize trends and patterns
  4. Use Google Data Studio (free) to create interactive dashboards
  5. Try Datawrapper (free tier) to create publication-ready charts and maps

For Technical Users

  1. Python (pandas): The most popular tool for data analysis. pandas library handles CSV, JSON, Excel, and SQL. Combine with matplotlib/seaborn for visualization.
  2. R: Excellent for statistical analysis and visualization. The tidyverse ecosystem (dplyr, ggplot2) makes data wrangling intuitive.
  3. SQL: For working with large datasets. Load data into PostgreSQL or SQLite and query efficiently.
  4. Jupyter Notebooks: Combine code, analysis, and narrative in shareable documents. Great for reproducible research.
  5. APIs: Use Python's requests library or R's httr to pull real-time data directly from government APIs.

Open Data Quality Checklist

Not all open data is created equal. Evaluate datasets against these criteria:

  • Timeliness: Is the data current? Budget data from 3 years ago has limited accountability value.
  • Completeness: Are there gaps, missing fields, or unexplained null values?
  • Granularity: Is the data detailed enough? Aggregate spending by department is less useful than line-item expenditures.
  • Machine-readability: Can you import it directly into analysis tools without manual cleanup?
  • Documentation: Is there a data dictionary explaining each field, its format, and any codes used?
  • Licensing: Is the data truly open? Check for restrictive licenses that limit redistribution or commercial use.
  • Consistency: Does the government use consistent formats and identifiers across datasets and over time?

Start Small

Don't try to analyze everything at once. Pick one question you want to answer — "How much does my city spend on police overtime?" or "Which contractors receive the most government business?" — and find the specific dataset that answers it. Starting with a focused question is far more productive than browsing data portals aimlessly.