Nov. 11: Savvy Researcher Workshop—ABBYY FineReader: An Introduction to OCR

All are welcome to attend the Savvy Researcher Workshop on ABBYY FineReader on Wednesday, November 11th in the Main Library. Please register on the Savvy Researcher Workshops website before attending.

ABBYY FineReader: An Introduction to OCR

Victoria Henry

November 11th, 1-1:50pm

314 Main Library

ABBYY FineReader is optical character recognition (OCR) software that converts images and scanned documents into editable, searchable formats. OCR helps your computer recognize letter shapes in a scanned document and turn them into text you can copy and edit as needed. This allows researchers to extract information from documents quickly and easily. OCR also makes these texts usable in data and text mining projects. This workshop will give attendees a basic understanding of how they could use optical character recognition software in their research, as well as a chance for some hands-on experience using ABBYY FineReader in the Scholarly Commons.

Nov 3: Savvy Researcher Workshop—Introduction to the HathiTrust Research Center Portal for Text Mining Research

All are welcome to attend the Savvy Researcher Workshop on the HathiTrust Research Center and its uses in text mining on Tuesday, November 3rd in the Main Library. Please register on the Savvy Researcher Workshops website before attending.

Introduction to the HathiTrust Research Center Portal for Text Mining Research

Eleanor Dickson

November 3rd, 11-11:50am

314 Main Library

Students and researchers today have access to massive amounts of digitized text from the world’s research libraries. Access to this growing digital record of human knowledge provides researchers with an unprecedented opportunity, but working with such material requires new tools to analyze digitized text effectively at so large a scale. This workshop will introduce cutting-edge software tools and cyberinfrastructure that are being developed at the HathiTrust Research Center (HTRC) to meet these needs in the context of the digitized text collection of the HathiTrust Digital Library, which currently comprises more than 11 million digitized volumes.

TEI Workshop at GSLIS

Spend a weekend learning about the Text Encoding Initiative (TEI) markup language, an important tool for digital humanities research! Take the “Introduction to Text Encoding with TEI” workshop at the Graduate School of Library and Information Science (GSLIS) and learn the fundamentals of using XML for research, teaching, electronic publishing, and management of digital text collections. This hands-on workshop will be taught by Julia Flanders and Syd Bauman, experts known for their work on the Women Writers Project. During the two-and-a-half-day course, participants will learn how to work with XML technologies to develop digital representations of texts using the TEI standard. The workshop will take place in the GSLIS building, beginning Friday, April 25 and ending Sunday, April 27, 2014.


Schedule, Cost, and Registration

Participants will meet in the GSLIS Learning Resource Lab for an introductory session on Friday evening and two full-day sessions on Saturday and Sunday. Participation is currently limited to 30 people seated at desktop workstations. Because space at this year’s workshop is limited, personal laptops will not be permitted. Participants without prior markup experience will be asked to familiarize themselves with TEI and XML by working through a short reading list, provided after registration.
Per person, the cost of the workshop is:
$30 for current UIUC students;
$125 for UIUC faculty, staff, and alumni;
$300 for non-UIUC affiliates.

You must sign up and pay in advance to attend. To reserve your spot and begin the registration process, please email with the following information:

phone number
University of Illinois ID number
(if you are an alumnus or alumna, your year of graduation)
(if you aren’t affiliated with the U of I, the name of the institution with which you are affiliated)

Those interested in attending the workshop are encouraged to register as soon as possible, as space is limited and the course fills up quickly. A registration waitlist will be kept once capacity is reached. Because this TEI Workshop has been rescheduled on an extremely short timeline, no refunds will be available, and registration payments must be made as soon as possible.

This year’s workshop is co-organized by Artie Berns, Ashley M. Clark, and Megan Senseney. If you have any questions, please contact us at (217) 244-5574.


About the Text Encoding Initiative


A seminal effort in the digital humanities community, the TEI is “an international and interdisciplinary standard that helps libraries, museums, publishers, and individual scholars represent all kinds of literary and linguistic texts for online research and teaching, using an encoding scheme that is maximally expressive and minimally obsolescent.” Allen Renear, GSLIS professor and interim dean, and John Unsworth, former GSLIS dean, have long been involved with the TEI community, and use of TEI markup is growing steadily. More information on the TEI can be found at the TEI Consortium website.


About the Instructors


Julia Flanders and Syd Bauman are active participants within the TEI and the Association for Computers and the Humanities. They have led numerous workshops, teaching the TEI standard to diverse groups at all levels of technical accomplishment. Julia and Syd work on the Women Writers Project, a major text encoding effort of Northeastern University’s Digital Scholarship Group. Julia is Director of the Women Writers Project, as well as Professor of the Practice of English at Northeastern University. Syd is Senior Analyst for the Women Writers Project and former North American Editor of the TEI Guidelines.

HathiTrust Research Center UnCamp: Sept. 8-9


HathiTrust Research Center (HTRC) UnCamp
A 1.5 Day Event
Sept 8-9, 2013
University of Illinois at Urbana-Champaign
I Hotel and Conference Center


The HathiTrust Research Center (HTRC) is a unique collaborative research center launched jointly by Indiana University and the University of Illinois, along with the HathiTrust Digital Library. It helps researchers meet the technical challenges of working with massive amounts of digital text by developing cutting-edge software tools and cyberinfrastructure that enable advanced computational access to the growing digital record of human knowledge.

The second annual HTRC UnCamp will be held on September 8-9, 2013 at the University of Illinois at Urbana-Champaign. The UnCamp is different: it is part hands-on coding and demonstration, part inspirational use cases, part community building, and part informational, all structured in the dynamic setting of an un-conference programming format. UnCamp will feature stellar keynote speakers including Matt Wilkens, who specializes in contemporary American fiction and in digital and computational literary studies at Notre Dame, and Christopher Warren, a specialist at Carnegie Mellon in Renaissance literature as it relates to politics, law, international political thought, and intellectual history. New this year are Scholarly Communication Office Hours, a pilot for user services: participants will have the option to sign up for individual consultation sessions with members of the UIUC library.

Who should attend?

The HTRC UnCamp is intended for digital humanities and informatics tool developers, researchers, librarians, and graduate students.


To keep UnCamp as affordable as possible, we have set registration at $100.00. Please visit to register. Registration is due by August 31, 2013.

For more information:

If you have questions regarding the HTRC UnCamp please contact Megan Senseney, HTRC Project or 217-244-5574.

Looking forward to seeing you in Champaign!