Omdena Datasets
Beta Version
Explore and download the datasets created from Omdena AI Innovation challenges and
Local Chapter Projects. Built by Omdena Community members.
Datasets (77)
Silicon Valley Extreme Weather .csv A collection of datasets conveying climatic regions, seasons and years useful for EDA and prediction of extreme weather
Flood Dataset (Malaysia) .csv Rainfall data for different states and districts in Malaysia from 2000 to 2010
2007-2022 Homeless Populations by State (USA) .csv
North Carolina NPC Gaming dataset .csv Dataset for prompts and embeddings based off the Project Gutenberg dataset for NPC Gaming
Egypt Fresh Water .csv This dataset is a collection of water-related information in Egypt as well as the statistics
Homelessness in the United States (2007-2022) .csv
Global Wheat Head Detection (GWHD) .jpg Dataset of high-resolution, labelled RGB images for developing and benchmarking wheat head detection methods
COVID Radiology Images .png Chest X-ray images in PNG format, divided into two categories: COVID positive and normal.
Yearly Economics and Unemployment (Pakistan) .csv Yearly economic and unemployment data for Pakistan from 1991 to 2020.
Twitter Data with PQ Scores .csv PQ scores for the usernames of Twitter leaders, derived from the metric score, language score, and sentiment score. The full formula can be found in the report.
Twitter Data on Disaster-Related Tweets .csv Twitter data related to disaster events. Each record in the dataset represents a tweet, and the data includes various attributes associated with the tweets, such as the text of the tweet, a binary indicator of whether it's related to a disaster, the type of disaster if applicable, and hashtags used in the tweet.
Health Conditions and Treatments .csv Information related to various health conditions and their treatment options.
Mapping Seagrass Trieste labels A collection of satellite imagery of seagrass ecosystem in Trieste
India Covid Economy .csv A collection of economic factors attributed to the Indian economy during the COVID-19 pandemic
Arabic Sign Language Detection Assistance Dataset YOLOv8 Collection of images specifically curated for training and testing a language detection model. This dataset is designed to aid in the development of a system capable of recognizing Arabic sign language gestures and translating them into written or spoken language.
Local Government Areas Classification for Water Demand (Lagos, Nigeria) .csv
CSSE COVID19 Daily Reports (Pakistan) .csv COVID-19 daily reports of different provinces of Pakistan from the year 2020 and 2021
Water Points (Lagos, Nigeria) .csv
Road Accidents (Dhaka) .csv Road accidents that resulted in deaths or injuries in Dhaka
ECG Images of Cardiac and COVID-19 Patients .jpg ECG images of cardiac and COVID-19 patients, created under the auspices of Ch. Pervaiz Elahi Institute of Cardiology Multan, Pakistan, to help the scientific community conduct research on COVID-19 and cardiovascular diseases.
CNG and Scooter Detection (Labeled) .jpg annotated images that can help detect CNG scooters in an image.
Weather (Tunisia) .csv
Kenya Constituencies .geojson Dataset contains the geographical coordinates of the constituencies within Kenya.
Economic Damage from Natural Disasters (USA 1900-2018) .csv A comprehensive collection of information on the economic losses suffered by the United States due to natural disasters from 1900 to 2018.
Machakos Unemployment Rate .csv A collection of engineered data containing the unemployment rates and the various factors
Climate Risk Prediction (Malaysia) .csv Climate data for various locations in Malaysia
Arabic Scientific Technical Terms .csv List of Arabic scientific and technical terms with their corresponding English translations
Wind Speed (Tunisia) .tiff Wind speed data for the country of Tunisia.
AFCON News .csv News about the Africa Cup of Nations (AFCON) tournament from different sources.
Railway Lines (Tunisia) .shp Information about railway lines and stations in Tunisia
CHIRPS Seasonal Rainfall Accumulation Anomaly (East Africa) .csv Seasonal Rainfall Accumulation Anomaly dataset
Water Levels (Venezia, Italia) .csv Venezia water levels from 1983 to 2015
2020 CPS Food Security .csv Supplement of the Current Population Survey (CPS), conducted by the U.S. Census Bureau.
Arabic-English Machine Learning Terminology .json Collection of Arabic terms and their English translations related to machine learning
São Paulo City - Subway - 2018 to 2023 Dataset .csv Information about passenger entries and passengers transported on the subway lines in São Paulo City from 2018 to 2023
Water Supply and Energy Usage (Africa) .csv Water supply sources and energy usage in different regions of Africa
ECG Images of Cardiac Patients .jpg ECG images of cardiac patients, created under the auspices of Ch. Pervaiz Elahi Institute of Cardiology Multan, Pakistan, to help the scientific community conduct research on cardiovascular diseases.
Benin Red Blood cells labels A collection of image and labels along with their descriptions for red blood cells
Crop Detection (Nakuru) .shp Agriculture band combination (B11, B8, B2), scaled at 10 with a maximum pixel value of 1e13.
Crop Price Prediction (Senegal) .csv This dataset contains information about crop prices in Senegal.
Rwanda Job Postings .csv Information on job postings in Rwanda
Displacement and Gentrification Recommendation Inventory .csv Information on the recommendations and resolutions identified by the Office of the City Auditor.
Humanitarian Operational Presence (Tropical Cyclone Freddy), Malawi .csv Information on the humanitarian organizations supporting the response to Tropical Cyclone Freddy in the Southern Region of Malawi.
Philippines Renewable energy .csv This dataset is a collection of renewable energy initiatives in the Philippines.
Crop Diseases Classification .tiff Images of plants, stored in the train_images folder, and a json file containing labels for the images.
Videos of Pure and Water Adulterated Milk .mp4 Videos in .mp4 format, ranging in size from 7 KB to 351 KB, that can be played using any media player.
Crop Prices (Global) .csv A collection of crop price records from various markets in different countries and regions, intended for crop price prediction.
Liberia Malaria Prevention .xlsx The Liberia Malaria Prevention dataset is a collection of various factors contributing to the spread of malaria in Liberia, e.g. precipitation
ONFIRE Dataset .mp4 The Fire Detection Video Dataset is a collection of 322 videos specifically curated to address the challenges associated with fire detection in diverse conditions. This dataset is unparalleled in its heterogeneity, encompassing variations in image resolution, illumination, distance from fire or smoke, pixel size of flame or smoke, background activity, and the scenario (urban or wildfire). Moreover, it stands out as the most extensive and diverse fire detection video dataset, with annotations that include the fire ignition time – a unique feature not commonly found in public datasets.
Healthcare Related Tweets for Sentiment Analysis .csv
Cracow Poland Rural Farmers labels A collection of satellite imagery data depicting the agricultural landscape of Cracow
Ghana Job Applicants .csv Information on job applicants in Ghana
Machakos Tree cover .shp A collection of satellite images showing the tree cover in Machakos, Kenya, useful for GIS analysis.
LATAM News Websites .csv
Solar Panel Performance and Sunlight Availability .csv Information about regions, their population, and their solar panel performance index
DataScience CoData Arabic Translated Articles .csv Articles from DataScience CoData website, which have been translated to Arabic.
AFCON 2022 Facebook Posts .csv Information about Facebook posts related to AFCON 2022.
Case Law .csv A collection of 3k cases covering various areas of case law, e.g. employment
Labor Stats 2005-2021 (Texas, USA) .csv Labor statistics for the state of Texas from 2005 to 2021
Leaves for Tree Species Classification .png
Liberia News Corpus .csv The Liberia News Corpus dataset is a valuable collection of news articles related to Liberia. It encompasses 33,673 records, each providing a glimpse into various topics and events occurring in the region. This dataset is a valuable resource for researchers and analysts interested in studying news trends, information dissemination, and the distinction between real and fake news.
Crop Yield Prediction .csv Information about the crop yield of different crops, along with various environmental factors that affect the yield
Art on Leaves Dataset Documentation .csv The Art on Leaves dataset is a collection of articles and corresponding summaries. It features a wide range of topics, including obituaries, human interest stories, and artistic endeavors. This dataset serves as a valuable resource for natural language processing and summarization tasks.
Coursera Lecture Arabic Transcripts of Deep Learning Course .xlsx Arabic and English transcripts for lectures from deep learning courses offered on Coursera
PropertyAI Real Estate Dataset .csv This dataset contains information about various properties available for sale in Dhaka, Bangladesh, with details such as area, building type, building nature, image URL
Arab language datasets .csv It is a collection of linguistic data for common Arabic vocabulary
Illegal Dumpsites (Several Regions, Globally) .csv TrashOut Dataset to map illegal dumps around the world
Low-temperature Thermochronology data from Patagonia labels A collection of GeoJSON data of various temperatures from lower earth orbit
Ghana Job Market .csv Information about job seekers in Ghana, their job category, industry preference, professional experience, skills, education, languages spoken, availability, residence, geographical flexibility, and employment type.
Satellite Images of the Constituency (Budalangi, Kenya) .tiff Satellite Imagery of Budalangi.
Number of Returns to Homelessness .csv Information on individuals who exit homelessness to permanent housing destinations
Water Mopup and Baseline MIS Facility Data .csv Information on water mopup and baseline Management Information System (MIS) facilities in Nigeria.
Toronto Alzheimer's labels Labels and collection of brain scan images
Coursera Lecture Arabic Transcripts of Applied Data Science Course .xlsx Arabic and English transcripts for lectures from various courses offered on Coursera
Natural Disasters Emergency Events Database - Country Profiles .csv Aggregated figures for natural disasters, by country
Crop and Livestock Production Statistics .csv
Global Temperatures .csv The dataset "Global Land Temperatures By Country" includes temperature measurements for various countries around the world, with readings taken at regular intervals over a period of time.