Finding Data
A comprehensive list of data sources is not the goal of this page. Instead, we present a few high-level resources to get you started on finding the data you need. For any data you use, remember to cite it according to data citation best practices.
COVID-19 Dataset Collections & Resources
- FAIRSharing.org COVID-19 Knowledgebases & Repositories
- Google COVID-19 Public Datasets
- COVID-19 Research Database
- Awesome Coronavirus
- Arizona State University List of COVID-19 Data Repositories (Archived)
University of Arizona Sources
NAME | DESCRIPTION |
---|---|
ReDATA | The University of Arizona Research Data Repository |
UA Campus Repository | The Campus Repository mainly holds document-based information (open access articles, monographs, theses and dissertations, reports, etc.) but it also contains a limited amount of data |
University Libraries Special Collections | Materials of local or regional significance and unique materials from the University of Arizona's history |
Open Science Framework @ UA | The portal for all public OSF projects at UA |
CB2 Health Data Services | Electronic health record (EHR) information from the Banner University Medical Center - Tucson, data from AHCCCS, ADHS |
University Analytics & Institutional Research | Data about the University of Arizona itself. |
University-Licensed Data
NAME | DESCRIPTION |
---|---|
Library Databases: Data Sets | The Libraries subscribes to many databases which are free to access for UA students, faculty, and staff. Data includes business, demographic, and geographic data among others. To browse the rest of the subscription content, click on the Databases Home tab in the link. (Must access from campus network or via VPN) |
Data Axle ReferenceUSA | Contains establishment-level data about US businesses in annual snapshots from 1997-2021 and can help users create marketing plans and conduct competitive analyses. |
ProQuest Historical Newspapers | The New York Times (1851-1936) and The Washington Post (1877-1934). These files may be downloaded and used for text and data mining. |
Planet Labs | Near daily stream of Earth-observation satellite data |
Annotated English Gigaword | Annotated English Gigaword contains the nearly ten million documents (over four billion words) of the original English Gigaword Fifth Edition from seven news sources. The goal of the annotation is to provide a standardized corpus for knowledge extraction and distributional semantics |
Data Indexers
In addition to the list below, see the Data Repositories page for more places to find datasets.
NAME | DESCRIPTION |
---|---|
Re3Data | Registry of Research Data Repositories. A worldwide index of data repositories. |
Fairsharing | A database of data repositories and related metadata standards and policies. Also useful for identifying metadata standards for writing a DMP. |
Google Dataset Search | Search for data across many data repositories and government websites |
DataCite Commons | Search across all public data repositories that use DataCite DOIs |
Other Community Resources
These are useful compilations of which include government data sources and interesting or unique datasets
- Awesome Datasets: A collection of community contributed datasets
- Public APIs: A collection of publicly available APIs serving up ready-to-use data
- Data is Plural: a curated mailing list of interesting datasets by Jeremy Singer Vine. A searchable archive is available.