Allen Institute for AI (AI2)
NLP, commonsense reasoning, and question answering datasets (e.g., SciFact and ARC, the AI2 Reasoning Challenge).
|
Academic Torrents
A distributed system for sharing enormous datasets, fostering collaboration, and facilitating access to academic data and research materials.
|
BMIC Home
A collection of repositories supported by NIH, aimed at promoting data sharing and advancing biomedical research.
|
Data Asset eXchange
A platform providing access to curated datasets designed to help developers and data scientists build AI models and applications.
|
Data Excellence. Research Impact.
Provides access to a vast archive of social science data for research and instruction, supporting data preservation and sharing.
|
Datasets
A collection of ready-to-use datasets for machine learning and data science, covering a wide range of applications.
|
Dataset Search
A tool that enables users to find datasets stored across the web, making data discovery easy and comprehensive.
|
DBpedia
A crowd-sourced community effort to extract structured content from the information created in various Wikimedia projects.
|
Dryad
An open-source repository for research data, providing a platform for researchers to publish and share datasets across various scientific disciplines.
|
European Data
Provides access to a wide range of data from EU institutions and bodies, supporting transparency and enabling data reuse.
|
Figshare
A repository where users can make all their research outputs available in a citable, shareable, and discoverable manner.
|
A Global Clinical Research Data Sharing Platform
An organization dedicated to sharing clinical research data globally, promoting transparency and collaboration in medical research.
|
The Global Health Observatory
Provides access to health-related data, supporting global health research and policymaking.
|
Google Public Data Explorer
Allows users to explore large public-interest datasets, visualize the data, and generate interactive charts and maps.
|
Harvard Dataverse
An open-source repository for sharing, citing, and preserving research data across all scientific disciplines.
|
Hugging Face Datasets
NLP, vision, audio, and multimodal datasets, including benchmarks such as SQuAD, IMDB, and Common Voice.
|
IEEE DataPort
A repository for datasets in a variety of technical fields, supporting data sharing and collaboration among researchers.
|
List of Datasets for Machine-Learning Research
A Wikipedia page listing datasets widely used in machine learning research, with descriptions and links, across domains such as computer vision and natural language processing.
|
Nasdaq Data Link
Provides financial and economic data, offering a comprehensive resource for market researchers and financial analysts.
|
NIH-Supported Data Sharing Resources: Domain Specific Repositories
Lists repositories specific to certain domains supported by NIH, facilitating data sharing and preservation within specialized fields.
|
NIH-Supported Data Sharing Resources: Generalist Repositories
Provides a list of generalist repositories supported by NIH, designed for broad data sharing and accessibility across disciplines.
|
NAIRR Pilot
The National Artificial Intelligence Research Resource (NAIRR) Pilot aims to connect U.S. researchers and educators to the computational, data, and training resources needed to advance AI research and research that employs AI. Federal agencies are collaborating with government-supported and non-governmental partners to implement the Pilot as a preparatory step toward an eventual full NAIRR implementation.
|
NASA Open Data portal
|
OSF
A platform to support researchers in managing their projects, sharing data, and collaborating openly with the global research community.
|
Our Data
Provides access to datasets used in FiveThirtyEight's data journalism articles, covering a wide range of topics including politics, sports, and science.
|
The Qualitative Data Repository
Provides a repository for storing and sharing qualitative data, supporting researchers in the social sciences.
|
Recent Uploads
An open-access repository developed by CERN, enabling researchers to share and preserve data and publications.
|
Registry of Open Data on AWS
Hosts a variety of public datasets, making it easier to find, access, and use open data in the AWS cloud.
|
Research Process: Datasets
Provides a curated collection of datasets available through the National University Library, supporting research across various disciplines with access to high-quality, reliable data sources.
|
Share Your Research Data
An open-access data repository that enables researchers to make their data discoverable, shareable, and citable.
|
United States Census Bureau
Provides a vast array of demographic, economic, and social data about the United States, supporting research and policymaking across multiple sectors.
|
World Bank Open Data
Economic, health, and development indicators for over 200 countries.
|