Finding Data

A comprehensive list of data sources is not the goal of this page. Instead, we present a few high-level resources to help you start finding the data you need. For any data you use, remember to cite it according to data citation best practices.

COVID-19 Dataset Collections & Resources
University of Arizona Sources
NAME | DESCRIPTION
ReDATA | The University of Arizona Research Data Repository
UA Campus Repository | The Campus Repository mainly holds document-based information (open access articles, monographs, theses and dissertations, reports, etc.) but it also contains a limited amount of data
University Libraries Special Collections | Materials of local or regional significance and unique materials from the University of Arizona's history
Open Science Framework @ UA | The portal for all public OSF projects at UA
Clinical Data Warehouse | HIPAA-compliant electronic health record (EHR) information from the Banner University Medical Center - Tucson
University Analytics & Institutional Research | Data about the University of Arizona itself
University-Licensed Data
NAME | DESCRIPTION
Library Databases: Data Sets | The Libraries subscribe to many databases, which are free to access for UA students, faculty, and staff. The data include business, demographic, and geographic data, among others. To browse the rest of the subscription content, click on the Databases Home tab in the link. (Access requires the campus network or VPN.)
Data Axle ReferenceUSA | Contains establishment-level data about US businesses in annual snapshots from 1997-2021. It can help users create marketing plans and conduct competitive analyses.
ProQuest Historical Newspapers | Includes The New York Times (1851-1936) and The Washington Post (1877-1934). These files may be downloaded and used for text and data mining.
Planet Labs | Near-daily stream of Earth-observation satellite data
Annotated English Gigaword | Contains the nearly ten million documents (over four billion words) of the original English Gigaword Fifth Edition from seven news sources. The goal of the annotation is to provide a standardized corpus for knowledge extraction and distributional semantics.
Data Indexers

In addition to the list below, see the Data Repositories page for more places to find datasets.

NAME | DESCRIPTION
Re3Data | Registry of Research Data Repositories. A worldwide index of data repositories.
FAIRsharing | A database of data repositories and related metadata standards and policies. Also useful for identifying metadata standards when writing a data management plan (DMP).
Google Dataset Search | Search for data across many data repositories and government websites
DataCite Commons | Search across all public data repositories that use DataCite DOIs
   
Other Community Resources

These are useful compilations, which include government data sources and interesting or unique datasets.

  • Awesome Datasets: A collection of community-contributed datasets
  • Public APIs: A collection of publicly available APIs serving up ready-to-use data
  • Data is Plural: A curated mailing list of interesting datasets by Jeremy Singer-Vine. The list is also available on Data.world