Harvard University Releases Open-Source AI Training Dataset: Institutional Books 1.0
2 day ago / Read about 0 minute
Author:小编   

Harvard University has recently made available an AI training dataset titled Institutional Books 1.0 on the HuggingFace platform. This comprehensive dataset encompasses 983,000 books from the public domain, spanning 245 languages. The majority of these books were digitized from contributions by the Harvard Library to the Google Books project, and further refined by the Institutional Data Initiative. Users of this dataset must adhere to the specified terms and conditions of usage.

  • C114 Communication Network
  • Communication Home