The NC State University Libraries provides access to datasets for use in teaching, learning, and research. Sage Research Methods Datasets, Data Planet, and Linguistics Data Consortium corpora are only available to NC State faculty, students, and staff. All other resources are public.
Datasets for Teaching and Practicing
- CORGIS Datasets Project - Real-world datasets for subjects such as politics, education, literature, and construction.
- Sage Research Methods Datasets - This collection of practice datasets contains over 120 datasets using data from real research. The collection is designed to support the teaching and learning of data analysis techniques and research methods.
- Tableau Sample Data Sets - A changing sample of datasets for use in teaching and learning.
- Kaggle Datasets - A collection of datasets for predictive modeling and machine learning.
- UC Irvine Machine Learning Repository - A maintained repository of over 590 machine learning datasets
- Linguistics Data Consortium (LDC) corpora - Speech and text data for non-commercial use that may be especially appealing to those doing natural language processing and linguistics research. The Libraries has access to the TIDIGITS, OntoNotes, Penn Discourse Treebank Version 3.0, and CSR-I (WSJ0) Complete and CSR-II (WSJ1) Complete corpora.
Collections of Datasets
- Data-Planet - A collection of statistical datasets from public, private, and commercial sources. The Data-Planet Search Guide provides information on how to use the collection.
- DataHub.io - A collection of datasets that includes lists of countries, populations, geographic boundaries, economic data, and more.
- re3data.org - A registry of research data repositories.
- Web of Science Data Citation Index - Access research data from multiple disciplines worldwide.
Open Data
National/International
- Awesome Public Datasets - This curated list of datasets is arranged by discipline; the majority of the datasets are free.
- Dryad - Access datasets from a curated general-purpose repository that makes data discoverable, freely reusable, and citable. NC State University Libraries is also a member enabling free deposits for NC State researchers.
- Data.gov - The home of the U.S. Government’s open data.
- FedStats - This site provides access to the full range of official statistical information produced by the U.S. Government without having to know in advance which Federal agency produces which particular statistic.
- UNdata - A portal for the United Nations' statistical datasets.
North Carolina
For more information or assistance, meet with a librarian or Ask Us.