Science and Research Content

HTRC unveils data mining and analytics tools for HathiTrust Digital Library -

The HathiTrust Research Center (HTRC) has announced the availability of data mining and analytics tools for the HathiTrust Digital Library, a collection of digital texts from over 70 research libraries around the world. The new tools seek to provide an entry point to large-scale analysis of HathiTrust's contents.

Indiana University and the University of Illinois are the founding partners of the HTRC. The new infrastructure release follows an aggressive development path set forth by the HTRC Executive Management Team at the 2012 HTRC UnCamp, a gathering of HTRC developers, researchers and librarians. Users can now expect to apply sophisticated computational research methodology across the large-scale collection, leveraging metadata crafted over time by libraries.

In phase two of the HTRC (September 2012-March 2013), the HTRC Technical Working Group created production versions of the beta services previewed at the 2012 UnCamp event. They are now working to open the resources to community testers who are part of the HTRC User Group Community.

The HTRC service stack, which provides the analytical entry point, is based on a completely new technical architecture. This framework leverages existing analytics tools such as SEASR (seasr.org), digital library software such as Blacklight, and a services-oriented architecture application interface. The current production phase includes a HTRC Sandbox that is open to scholars for evaluation of the HTRC services stack as part of their experiments.

Click here to read the original press release.

STORY TOOLS

  • |
  • |

sponsor links

For banner adsĀ click here