The HathiTrust Research Center (HTRC) enables computational analysis of the HathiTrust corpus. It is a collaborative research center launched jointly by Indiana University and the University of Illinois, along with HathiTrust, to help meet the technical challenges researchers face when dealing with massive amounts of digital text. It develops cutting-edge software tools and cyberinfrastructure to enable advanced computational access to the growing digital record of human knowledge.
Leveraging data storage and computational infrastructure at Indiana University and the University of Illinois at Urbana-Champaign, the HTRC builds tools and services for scholars to perform research using data from the HathiTrust Digital Library. The Center is breaking new ground in the areas of text mining and non-consumptive research, allowing scholars to fully utilize HathiTrust content while preventing intellectual property misuse within the confines of current U.S. copyright law.
HTRC tools and services
HTRC has developed a suite of tools and services for text data mining including web-based algorithms, freely-accessible datasets, and secure computing capsules. Access them from HTRC Analytics.
For more information, you can:
- Read a brief overview of the HTRC's Collections and Tools
- Find tutorials and detailed documentation in the HTRC Documentation
- Review the code that makes it all run on the HTRC GitHub
Learn More About
- Governance
- Architecture and Organization
- Access and Use
- Timeline and Deliverables
- Partnering in Research
- Non-Consumptive Use Policy
Stay in touch
- Contact Us: htrc-help@hathitrust.org.
- Join the User Group email list (htrc-usergroup-l @ list.indiana.edu) or Announcement email list (htrc-announce-l @ list.indiana.edu).
News and Updates
Papers and Presentations
-
Unlocking the Secrets of 3 Billion Pages: Introducing the HathiTrust Research Center
-
HathiTrust Research Center: The Workset Creation for Scholarly Analysis (WCSA) Prototyping Project
- 1 of 3
- next ›