8 results · AI-generated index
huggingface.co
tool

Large Text Datasets for NLP Model Training

Discover a wide range of large text datasets for training and fine-tuning your NLP models, from Wikipedia to BookCorpus, available on the Hugging Face Hub.

kaggle.com
tool

NLP Datasets for Machine Learning

Explore and download NLP datasets on Kaggle, including large text datasets, to train and evaluate your machine learning models.

nlp.stanford.edu
research

The Stanford Natural Language Processing Group

Learn about the latest research and datasets in NLP from the Stanford Natural Language Processing Group, including large-scale text datasets for model training.

arxiv.org
research

Large Scale Text Dataset for NLP

Read this research paper on creating and utilizing large-scale text datasets for NLP model training, highlighting the importance of diverse and extensive datasets.

github.com
tool

Text Dataset for NLP Model Training

Find open-source text datasets and NLP model training code on GitHub, including large-scale datasets for various languages and tasks.

data.gov
official

NLP Training Data

Access government-provided text datasets for NLP model training, covering a range of topics and formats, on the US Government's data portal.

aclweb.org
article

Large Text Datasets for NLP: A Survey

Read this survey of large text datasets for NLP, which discusses their applications, challenges, and future directions, published by the Association for Computational Linguistics.

ucsd.edu
edu

NLP Dataset Collection

Explore the University of California, San Diego's collection of NLP datasets, including large text datasets for model training, covering various domains and languages.