Natural Language Processing Datasets
Explore a wide range of datasets for natural language processing model training, including text classification, sentiment analysis, and language translation.
Explore a wide range of datasets for natural language processing model training, including text classification, sentiment analysis, and language translation.
A new open-source dataset for natural language processing has been released, featuring over 45,000 hours of audio and 1.4 million text samples.
The University of California, San Diego, provides a collection of natural language processing datasets for research and model training purposes.
Google's natural language processing dataset is a large-scale collection of text data designed for training and evaluating NLP models.
Kaggle offers a variety of natural language processing datasets for machine learning competitions and model training, including text classification and sentiment analysis.
Researchers have released a large-scale dataset for natural language processing in low-resource languages, aiming to improve NLP model performance in these languages.
The National Institutes of Health provides a dataset for natural language processing in the healthcare domain, featuring clinical text and medical terminology.
Stanford University offers a collection of natural language processing datasets for social media analysis, including Twitter and Facebook data.