Data & Visualization Resources
Visualization Tools
- Vega-Lite API, which has API documentation and lots of examples.
- Arquero — a JavaScript-based data transformation library. See the Introducing Arquero Observable notebook to get started.
- SQL, a SQL interface for data manipulation. SQL is the standard for data manipulation in industry, so it is always worth learning.
- JavaScript data utilities
- Pandas - Data table and manipulation utilities for Python.
Datasets
- Data is Plural, a spreadsheet archive of useful/curious public interest datasets curated by Jeremy Singer-Vine (with an accompanying newsletter).
- dataCommons.org
- Western Pennsylvania Regional Data Center
- An archive of datasets distributed with the R statistical language
- 30 Places to Find Open Data on the Web – Visual.ly
- Office for National Statistics (UK) – a repository of detailed statistics about Great Britain and Northern Ireland
- World Bank Data Catalog
- CDC NCHS Data – CDC’s National Center for Health Statistics Data Access
- Machine Learning Repository – large variety of maintained data sets
- NOAA National Centers for Environmental Information
- EIA U.S. Energy Information Administration
- data.wprdc.org - Western Pennsylvania Regional Data Center
- PlumePGH - Pittsburgh Air Quality Data
- data.gov - U.S. Government Open Datasets
- U.S. Census Bureau - Census Datasets
- IPUMS.org - Integrated Census & Survey Data from around the World
- registry.opendata.aws - Open Data on AWS
- Federal Election Commission - Campaign Finance & Expenditures
- Federal Aviation Administration - FAA Data & Research
- fivethirtyeight.com - Data and Code behind the Stories and Interactives
- BuzzFeed News
- Socrata Open Data
- 17 places to find datasets for data science projects