Gutenberg E-book Dataset
The Gutenberg dataset contains 3036 English books by 142 authors. It is a small part of the Project Gutenberg corpus and is mainly used for language modeling.
This dataset was released by Mycroft AI in April 2014, with Matthew D. Scholefield as the main publisher. The related paper is "Complexity of Word Collocation Networks: A Preliminary Structural Analysis".