Educational publisher Gale, part of Cengage Learning, US, and 18thConnect have announced a partnership to share scholarly content and improve the searchability of documents within Gale’s Eighteenth Century Collections Online (ECCO) archive. 18thConnect is a scholarly organisation dedicated to forging links between eighteenth-century archives and today’s digital research environment.
Gale’s ECCO archive, one of the largest academic research collections of its kind, contains more than 180,000 key English and foreign language titles published primarily in the UK. Despite Gale’s use of the best in Optical Character Recognition (OCR) technology, eighteenth-century typefaces can still be challenging to capture with perfect accuracy, which may impact results when searching or data-mining.
Recently, 18thConnect was awarded National Endowment for the Humanities (NEH) sponsored supercomputer time to re-run page images from the ECCO archive through an open-source OCR programme that will generate cleaner texts. The resulting text will be incorporated into ECCO, making the resource easier to search. In addition, registered 18thConnect users will then have the opportunity to review the improved texts and correct them using a tool housed on the 18thConnect website. The correction tool will be built as the result of a grant awarded to Miami University of Ohio from the Mellon Foundation.
Using this crowd-sourced correction tool, users can fix errors not caught by the OCR process, and in exchange they will have the option to submit the revised text as a scholarly edition. 18thConnect will provide unlimited access to the corrected plain text or encoded text of the document submitted, depending on the researcher’s needs. Accepted scholarly editions will be filtered back into the ECCO archive on a periodic basis, and acceptance letters will be sent on behalf of researchers to the Promotion and Tenure Committees at their respective institutions.
The bibliographic information for ECCO is now freely searchable via the 18thConnect.org site. In January 2011, registered 18thConnect users who are interested in improving these documents will have the option to correct texts returned in their search results.