Machine Learning Datasets
Iris Flower Dataset
A classic dataset for classification tasks, featuring measurements of iris flowers.
Type: Tabular
Size: 150 samples, 4 features
Use Case: Classification, Clustering
Size: 150 samples, 4 features
Use Case: Classification, Clustering
MNIST Handwritten Digits
A large database of handwritten digits, widely used for image recognition.
Type: Image
Size: 70,000 images (60k train, 10k test)
Use Case: Image Classification, Digit Recognition
Size: 70,000 images (60k train, 10k test)
Use Case: Image Classification, Digit Recognition
Titanic Survival
Predict survival on the Titanic based on passenger characteristics.
Type: Tabular
Size: 891 samples, 12 features
Use Case: Binary Classification, Feature Engineering
Size: 891 samples, 12 features
Use Case: Binary Classification, Feature Engineering
CIFAR-10 Image Dataset
A collection of 60,000 color images in 10 classes, used for image classification.
Type: Image
Size: 60,000 images (50k train, 10k test)
Use Case: Image Classification, Deep Learning
Size: 60,000 images (50k train, 10k test)
Use Case: Image Classification, Deep Learning
Boston Housing Dataset
Predicting median house values in Boston suburbs based on various features.
Type: Tabular
Size: 506 samples, 13 features
Use Case: Regression, Feature Importance
Size: 506 samples, 13 features
Use Case: Regression, Feature Importance
IMDB Movie Reviews
A dataset of 50,000 movie reviews for sentiment analysis.
Type: Text
Size: 50,000 reviews (25k train, 25k test)
Use Case: Sentiment Analysis, NLP
Size: 50,000 reviews (25k train, 25k test)
Use Case: Sentiment Analysis, NLP