Open AI & Machine Learning Datasets
Datasets from different sectors, cleaned and ready to be used
Filter by sector
Datasets (76)
Silicon Valley Extreme Weather .csv A collection of datasets conveying climatic regions, seasons and years useful for EDA and prediction of extreme weather
Flood Dataset (Malaysia) .csv rainfall data for different states and districts in Malaysia over the period of 2000 to 2010
2007-2022 Homeless Populations by State (USA) .csv
North Carolina NPC Gaming dataset .csv Dataset for prompts and embeddings based off the Project Gutenberg dataset for NPC Gaming
Egypt Fresh Water .csv This dataset is a collection of water related information in Egypt as well as the statitistics
Homelessness in the United States (2007-2022) .csv
Global Wheat Head Detection (GWHD) .jpg dataset of high resolution RGB labelled images to develop and benchmark wheat head detection methods
COVID Radiology Images png chest X-ray images in PNG format that are divided into two categories - COVID positive and normal.
Yearly Economics and Unemployment (Pakistan) .csv yearly economic and unemployment data for Pakistan from 1991 to 2020.
Twitter Data with PQ Scores .csv The dataset consists of PQ scores of Twitter leaders' usernames derived from the metric score, language score, and sentiment score, the whole formula can be found in the report.
Twitter Data on Disaster-Related Tweets .csv Twitter data related to disaster events. Each record in the dataset represents a tweet, and the data includes various attributes associated with the tweets, such as the text of the tweet, a binary indicator of whether it's related to a disaster, the type of disaster if applicable, and hashtags used in the tweet.
Health Conditions and Treatments .csv Information related to various health conditions and their treatment options.
