HyperAIHyperAI

Command Palette

Search for a command to run...

Reuters-21578 Text Classification Dataset

Date

3 years ago

Size

7.78 MB

Organization

AT&T Labs Research

Reuters – 21578 Dataset is a test collection for text classification research. It is a multi-class, multi-label dataset that is expected to be replaced by RCV1 in the next few years. The dataset has 90 classes, 7769 training files and 3019 test files. It is a ModApte subdirectory of the Reuters – 21578 benchmark.

Reuters – 21578 The dataset was originally collected and labeled by Carnegie Group and Reuters in 1987 during the development of the CONSTRUE text classification system. It was later released by AT&T Labs Research in September 1997. The main publisher was David D. Lewis. The related papers are:

"Automated Learning of Decision Rules for Text Categorization"

"Toward Language Independent Automated Learning of Text Categorization Models"

"TCS: A Shell for Content-Based Text Categorization"

"CONSTRUE/TIS: A System for Content-Based Indexing of a Database of News Stories"

reuters21578.torrent
Seeding 3Downloading 0Completed 948Total Downloads 2,447
  • reuters21578/
    • README.md
      1.46 KB
    • README.txt
      2.92 KB
      • data/
        • reuters21578.tar.gz
          7.78 MB

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Reuters-21578 Text Classification Dataset | Datasets | HyperAI