THUCNews News Dataset
Date
Size
Publish URL
License
Other
The THUCNews dataset is generated by filtering the historical data of Sina News from 2005 to 2011, and contains 740,000 news documents, all in UTF-8 plain text format. Based on the original Sina News classification system, this dataset is re-integrated into 14 candidate classification categories: finance, lottery, real estate, stocks, home, education, technology, society, fashion, current affairs, sports, constellations, games, and entertainment.
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.