8 results · AI-generated index
H
https://www.kaggle.com
tool

Large Scale Machine Learning Dataset

Kaggle offers a wide range of large-scale datasets for machine learning model training, including image, text, and audio datasets.

H
https://commoncrawl.org
article

Common Crawl Dataset

Common Crawl is a non-profit organization that provides a large corpus of web pages for machine learning model training and research.

H
https://archive.ics.uci.edu
research

Machine Learning Repository

The University of California, Irvine's Machine Learning Repository provides a collection of datasets for machine learning model training and research.

H
https://datasetsearch.research.google.com
tool

Google Dataset Search

Google Dataset Search is a search engine for datasets, providing access to a wide range of datasets for machine learning model training and research.

H
https://huggingface.co
article

Hugging Face Datasets

Hugging Face provides a wide range of pre-trained models and datasets for natural language processing and machine learning model training.

H
https://nlp.stanford.edu
research

Stanford Natural Language Inference Corpus

The Stanford Natural Language Inference Corpus is a dataset for natural language processing and machine learning model training.

H
https://www.data.gov
official

US Government Dataset

The US Government's data.gov website provides a wide range of datasets for machine learning model training and research, including datasets from various government agencies.

H
https://research.google.com
video

YouTube-8M Dataset

The YouTube-8M dataset is a large-scale video dataset for machine learning model training and research, provided by Google Research.