8 results · AI-generated index
H
huggingface.io
tool

Natural Language Processing (NLP) Datasets

Explore a wide range of NLP datasets, including large corpora for text classification, sentiment analysis, and language modeling.

C
commoncrawl.org
article

The Common Crawl Corpus

A large corpus of web pages for NLP research, updated regularly with new data from the internet.

I
ieee.org
research

Natural Language Processing: A Review

This article reviews recent advances in NLP, including the use of large corpora for training and evaluating NLP models.

N
nltk.org
official

NLTK Data: Corpora and Lexicons

Access to a wide range of NLP corpora and lexicons, including large datasets for text processing and analysis.

G
github.com
tool

Large-Scale NLP Dataset Repository

A collection of large-scale NLP datasets for tasks such as language modeling, text classification, and machine translation.

S
stanford.edu
research

Stanford Natural Language Processing Group

Research group focused on NLP, with a wide range of projects and datasets available for use, including large corpora for NLP research.

C
cloud.google.com
official

Google's Natural Language API

A cloud-based API for NLP tasks, including text analysis and language modeling, using large corpora and machine learning models.

O
oreilly.com
article

Natural Language Processing with Python

Book chapter on using Python for NLP tasks, including working with large corpora and using popular NLP libraries.