Free, Open Source Optical Character Recognition with gImageReader

Optical Character Recognition (OCR) is a powerful tool to transform scanned, static images of text into machine-readable data, making it possible to search, edit, and analyze text. If you’re using OCR, chances are you’re working with either ABBYY FineReader or Adobe Acrobat Pro. However, both ABBYY and Acrobat are propriety software with a steep price […]

Choosing an OCR Software: ABBYY FineReader vs. Adobe Acrobat Pro

What is OCR? OCR stands for Optical Character Recognition. This is the electronic identification and digital encoding of typed or printed text by means of an optical scanner or a specialized software. Performing OCR allows computers to read static images of text to convert them to readable, editable, and searchable data on a page. There […]

Cool Text Data – Music, Law, and News!

Computational text analysis can be done in virtually any field, from biology to literature. You may use topic modeling to determine which areas are the most heavily researched in your field, or attempt to determine the author of an orphan work. Where can you find text to analyze? So many places! Read on for sources […]

Lightning Review: Optical Character Recognition: An Illustrated Guide to the Frontier

Lightning Review: Optical Character Recognition: An Illustrated Guide to the Frontier Stephen V. Rice, George Nagy, and Thomas A. Nartaker’s work on OCR, though written in 1999, is still a remarkably valuable bedrock text for diving into the technology. Though OCR systems have, and continue to, evolve with each passing day, the study presented within […]

What To Do When OCR Software Doesn’t Seem To Be Working

While optical character recognition (OCR) is a powerful tool, it’s not a perfect one. Inputting a document into an OCR software doesn’t necessarily mean that the software will actually output something useful 100% of the time. Though most documents come out without a hitch, we have a few tips on what to do if your […]

Learning to Make Documents Accessible with OCR Software

Accessibility in the digital age can be difficult for people to understand, especially given the sheer amount of ways to present information on the computer. However, creating content that is accessible to all individuals should be a priority for researchers. Creating accessible documents is an easy process, and the Scholarly Commons has the software you […]