Open AI & Machine Learning Datasets

Datasets from different sectors, cleaned and ready to be used

Filter by sector
Datasets (76)
Learn More
dataset banner image
Silicon Valley Extreme Weather .csv A collection of datasets conveying climatic regions, seasons and years useful for EDA and prediction of extreme weather
Learn More
dataset banner image
Flood Dataset (Malaysia) .csv rainfall data for different states and districts in Malaysia over the period of 2000 to 2010
Learn More
dataset banner image
2007-2022 Homeless Populations by State (USA) .csv
Learn More
dataset banner image
North Carolina NPC Gaming dataset .csv Dataset for prompts and embeddings based off the Project Gutenberg dataset for NPC Gaming
Learn More
dataset banner image
Egypt Fresh Water .csv This dataset is a collection of water related information in Egypt as well as the statitistics
Learn More
dataset banner image
Homelessness in the United States (2007-2022) .csv
Learn More
dataset banner image
Global Wheat Head Detection (GWHD) .jpg dataset of high resolution RGB labelled images to develop and benchmark wheat head detection methods
Learn More
dataset banner image
COVID Radiology Images png chest X-ray images in PNG format that are divided into two categories - COVID positive and normal.
Learn More
dataset banner image
Yearly Economics and Unemployment (Pakistan) .csv yearly economic and unemployment data for Pakistan from 1991 to 2020.
Learn More
dataset banner image
Twitter Data with PQ Scores .csv The dataset consists of PQ scores of Twitter leaders' usernames derived from the metric score, language score, and sentiment score, the whole formula can be found in the report.
Learn More
dataset banner image
Twitter Data on Disaster-Related Tweets .csv Twitter data related to disaster events. Each record in the dataset represents a tweet, and the data includes various attributes associated with the tweets, such as the text of the tweet, a binary indicator of whether it's related to a disaster, the type of disaster if applicable, and hashtags used in the tweet.
Learn More
dataset banner image
Health Conditions and Treatments .csv Information related to various health conditions and their treatment options.

Why Use
Omdena Datasets

Explore a curated dataset library built around quality, community, diversity, ethical sourcing, and open collaboration.

Learn more on the right about the principles that shape the Omdena dataset collection and what makes it useful across different domains and use cases.

Learn more about Omdena Datasets

High Quality

Omdena's library offers a carefully curated selection of high-quality datasets from various domains.

Community-driven

The Omdena dataset library is built on the foundation of a strong and supportive community. By using our library, users become part of an engaged network of data professionals, fostering collaboration, knowledge sharing, and innovation.

Diverse and Inclusive

We prioritize diversity and inclusivity, ensuring that our datasets cater to a wide range of use cases, industries, and global perspectives. By using Omdena's library, users have access to unique and underrepresented datasets that may not be available in other libraries

Ethical Data Practices

Omdena is committed to promoting ethical data practices, ensuring that our datasets adhere to privacy and data protection guidelines. Users can trust that the data in our library is responsibly sourced and managed.