Unreadable: Challenges and Critical Pedagogy to Optical Character Recognition Software

Posted on April 14, 2023 by Ryan Yoakum

In the 21^st century, Optical Character Recognition (OCR) software has fundamentally changed how we search for information. OCR is the process of taking images with text and making them searchable. The implications of OCR vary from allowing searchability on massive databases to promoting accessibility by making screen readers a possibility. While this is all incredibly helpful, it is not without fault, as there are still many challenges to the OCR process that create barriers for certain projects. There are also some natural limitations to using this software that especially have consequences for time-sensitive projects, but other factors within human control have negatively influenced the development of OCR technology in general. This blog post will explore two issues: the amount of human labor required on an OCR project and the Western biases of this kind of software.

Some text in ABBYY FineReader. Not all of the appropriate text is contained within a box, indicating the human labor that needs to go in to correct this. — _{Public Domain Image}

Human Labor Requirements

While OCR can save an incredible amount of time, it is not a completely automated system. For printed documents from the 20^th-21^st century, most programs can guarantee a 95-99% accuracy rate. The same is not true, however, for older documents. OCR software works by recognizing pre-built characters the software was initially programmed to recognize. When a document does not follow that same pattern, the software cannot recognize it. Handwritten documents are a good example of this, in which the same letter may appear differently to the software, depending on how it was written. Some programs, such as ABBYY FineReader, have attempted to resolve this problem by incorporating a training program, which allows users to train the system to read specific types of handwriting. Even still, that training process requires human input, and there is still much work for individuals to put into ensuring that the processed document is accurate. As a result, OCR can be a time-consuming process that still requires plenty of human labor for a project.

Western Biases

Another key issue with the OCR process is the Western biases that went into the creation of the software. Many common OCR programs were designed to handle projects with Latinized scripts. While helpful for some projects, this left barriers to documents with non-Latinized scripts, particularly from languages commonly used outside the West. While advances have been made on this front, the advancements are still far behind that of Latinized scripts. For example, ABBYY FineReader is one of the few software programs that will scan in non-western languages, but it cannot incorporate its training program when those scripts aren’t Latinized. Adobe Acrobat can also scan documents with languages that use non-Latinized scripts, but its precision is less consistent than with those languages that do.

An old version of ABBYY FineReader. The text scanned on the left is a language with a non-Latinized script. The right side shows a variety of errors due to the system's lack of knowledge of that language. — _{Photo Credit: Paul Tafford}

Addressing the Issues with OCR

Although OCR has performed many amazing tasks, there is still much development needed when it comes to projects related to this aspect of scholarly research. One crucial component when considering taking on an OCR project is to recognize the limitations of the software and to account for that when determining the scope of your project. At this stage, OCR technology is certainly a time-saver and fundamentally changing the possibilities of scholarship, but without human input, these projects fail to make an impact. Likewise, recognizing the inequality of processing for non-western languages in some of the more prevalent OCR software (which several developers have looked to offset by creating OCR programs specifically catered to specific non-Latinized languages). Acknowledging these issues can help us consider the scope of various projects and also allow us to address these issues to make OCR a more accessible field.

Drawing People: Practicing the Human Figure with Open Resources

Posted on October 28, 2022 by Nora Davies

"Person in Brown Shirt Writing on White Paper" by cottonbro from Pexels

It’s Open Access Week! Every year this international event brings the academic community together to discuss the benefits of free and immediate access to information, especially scholarly resources.

This week, I’ll be sharing open (and semi-open) resources for artists. When I’m not at the library desk, I like to draw, and I’m always on the hunt for high quality reference images. When learning how to draw people, you’ll often have to figure out a pose without the help of live models. References, however, are not always free or easy to find. Here some of the resources that I’ve found helpful over the years.

Practice and Reference

Line of Action

Provides both nude and clothed photos for study. Artists can start a drawing session by choosing the kinds of models, and the time intervals between photos. There are also posts here that give advice for improving your technique.

Bodies in Motion

This collection of motion images provides rapid sequence photographs of athletes and dancers. These images are a good way to study how the human body moves. Most of this content is only available with a subscription, but there are some free sequences. When browsing a section, click the “free” tab on the right-hand side of the page.

AdorkaStock

This stock photo collection has models with plenty of different body types. There are some fun poses in here: from fantasy to action, to sci-fi settings. All models are wearing clothing or flesh-tone bodysuits, so no need to worry about using it in a public space.

Sketch Daily

Provides a variety of photos in timed study sessions. You can choose to practice bodies, hands, feet, heads, or animals and structures. It’s a good tool for warm-up drawing with no fuss.

The Book of a Hundred Hands by George B. Bridgman

This book depicts musculature and examples of drawn hands in different positions. It can help you to focus-in on your hand drawing skills.

Figure Drawing for All It’s Worth by Andrew Loomis

Okay, so this one is from the 40’s and it shows; the majority of nude female figures are still sporting high heels. However, Loomis still offers many helpful tips. It contains an exhaustive instruction of perspective, musculature, the mechanics of motion, shading and lighting as well as exercises for practice.

"Study of Hands (recto); Sketch of a Hand (verso)" by Pierre Lenfant from Clevland Museum of Art CCO Images

Gesture Drawing

Gesture Drawing – The Ultimate Guide for Beginners

Practicing with the gesture technique can help you break out of “stiff” poses and figure out how to imbue your figures with character and expression. This guide contains an overview of gesture, videos of instruction, and a list of books on gesture.

Clothing

We Wear Culture

A good fashion reference site that showcases clothing through time and around the world. The information here gives context for clothing, bios of fashion icons, overviews of fashion movements, and the history of clothing items. It’s a good tool to inspire clothing design for the people and characters you draw.

History of Costume

You’ll have to create a free account on the Internet Archive to view this one. It’s a collection of costume plates from the 19th century. There are later editions of this book available, but this edition still contains original clothing pattern drafts.

"Fashion Designer Sketching Design" by Ron Lach from Pexels

Instruction

Love, Life, Drawing

This website provides free tutorials and podcasts on drawing topics with a focus on human figures. Sign up for the free “fresh eyes” drawing challenge, a ten-day course that teaches students to identify gesture and structure of the form.

FZD School

This resource isn’t human-figure specific but these videos are great resources for learning how to draw and design. Try “EP 30: Character Silhouettes” to buff up your character illustration skills. This channel is especially good for creatives interested in comics or illustration.

Muddy Colors

Muddy Colors posts helpful tips on all kinds of art topics from over 20 practicing artists. The site hosts paid classes from their contributing artists, but there is plenty of free advice here too.

Additional Resources

Character Design References

An independent website that showcases concept art from animation, games, and comics. There’s a little bit of everything here. I’d recommend checking out their visual library. There are anatomical references, character/creature design references, vehicles, props, and lighting/color tutorials.

Met Publications

The New York Met Gallery offers 609 publications of art, photography, sculpture and more, all free for download. This is an excellent place to find inspiration.

Happy Drawing!

"Art Material Pencils" by sketchify from Canva

Open Education Week 2022

Posted on March 7, 2022 by Zhaneille Green

Open Education Week

Open Education Week brings awareness to the Open Educational Resources (OER) movement and to the how OER transforms teaching and learning for instructors and students alike.

What is OER?

OER refers to open access, openly licensed instructional materials that are used for teaching, learning or research.

Why is OER Important?

OER provides free resources to institutions, teachers, and students. When incorporated into the classroom, OER can:

Lower the cost of education for students
Reinforce open pedagogy
- Allow educators to update and adapt materials to fit their needs
- Encourages students’ interaction with, and creation of, educational materials
Encourage open knowledge dissemination

OER Incentive Grant

The University is giving faculty an incentive to adopt, adapt, or create OER for their courses instead of using expensive materials. The OER Incentive Grants will fund faculty teaching undergraduate courses. Instructors can submit applications in three tiers:

Tier 1: Adopt – incorporate an existing open textbook into a course
Tier 2 Adapt – incorporate portions of multiple existing open textbooks, along with other freely available educational resources, modifications of existing open education materials/textbooks, or development of new open education materials
Tier 3: Create – write new openly licensed textbooks

The preferred deadline to submit a proposal is March 11^th. If you are interested in submitting a grant but cannot make this deadline, please reach out to Sara Benson at srbenson@illinois.edu. To learn more about this program see the webpage on the Faculty OER Incentive Program.

Upcoming OER Publication

In conjunction with Sara Benson, copyright librarian at UIUC, and the Illinois Open Publishing Network (IOPN), co-authors Christy Bazan, Brandi Barnes, Ryan Santens, and Emily Verone will publish an OER textbook, titled Drug Use and Misuse: A Community Health Perspective. This book explores drug use and abuse through the lens of community health and the impact of drug use and abuse on community health. Drug Use and Misuse is the third publication in IOPN’s Windsor & Downs Press OPN Textbook series. See the video below to learn more about the process of creating this textbook.

Introducing Drop-In Consultation Hours at the Scholarly Commons!

Posted on February 14, 2022 by Ryan Yoakum

Do you have a burning question about data management, copyright, or even how to work Adobe Photoshop but do not have the time to set up an appointment? This semester, the Scholarly Commons is happy to introduce our new drop-in consultation hours! Each weekday, we will have an expert from a different scholarly subject have an open hour or two where you can bring any question you have about that’s expert’s specialty. These will all take place in room 220 in the Main Library in Group Room A (right next to the Scholarly Commons help desk). Here is more about each session:

Mondays 11 AM – 1 PM: Data Management with Sandi Caldrone

Starting us off, we have Sandi Caldrone from Research Data Services offering consultation hours on data management. Sandi can help with topics such as creating a data management plan, organizing/storing your data, data curation, and more. She can also help with questions around the Illinois Data Bank and the Dryad Repository.

Tuesdays 11 AM – 1 PM: GIS with Wenjie Wang

Next up, we have Wenjie Wang from the Scholarly Commons to offer consultation about Geographic Information Systems (GIS). Have a question about geocoding, geospatial analysis, or even where to locate GIS data? Wenjie can help! He can also answer any questions related to using ArcGIS or QGIS.

Wednesdays 11 AM – 12 PM: Copyright with Sara Benson

Do you have questions relating to copyright and your dissertation, negotiating an author’s agreement, or seeking permission to include an image in your own work? Feel free to drop in during Copyright Librarian Sara Benson’s open copyright hours to discuss any copyright questions you may have.

Thursdays 1-3 PM: Qualitative Data Analysis with Jess Hagman

Jess Hagman from the Social Science, Health, and Education Library is here to help with questions related to performing qualitative data analysis (QDA). She can walk you through any stage of the qualitative data analysis process regardless of data or methodology. She can also assist in operating QDA software including NVivo, Atlas.ti, MAXQDA, Taguette, and many more! For more information, you can also visit the qualitative data analysis LibGuide.

Fridays 10 AM – 12 PM: Graphic Design and Multimedia with JP Goguen

To end the week, we have JP Goguen from the Scholarly/Media Commons with consultation hours related to graphic design and multimedia. Come to JP with any questions you may have about design or photo/video editing. You can also bring JP any questions related to software found on the Adobe Creative Cloud (such as Photoshop, InDesign, Premiere Pro, etc.).

Have another Scholarly Inquiry?

If there is another service you need help with, you are always welcome to stop by the Scholarly Commons help desk in room 220 of the Main Library between 10 AM – 6 PM Monday-Friday. From here, we can get you in contact with another specialist to guide you through your research inquiry. Whatever your question may be, we are happy to help you!

It Matters How We Open Knowledge: Open Access Week 2021

Posted on October 26, 2021 by Michael Steffen

It’s that time of year again! Open Access Week is October 25-31, and the University of Illinois Library is excited to participate. Open Access Week is an international event where the academic and research community come together to learn about Open Access and to share that knowledge with others. The theme guiding this year’s discussion of open access will be “It Matters How We Open Knowledge: Building Structural Equity.”

These discussions will build on last year’s theme of “Open with Purpose: Taking Action to Build Structural Equity and Inclusion.” While last year’s theme was intended to get people thinking about the ways our current information systems marginalize and exclude, this year’s theme is focused on information equity as it relates to governance.

OA Week digital banner with theme name and date

Specifically, this year’s theme intentionally aligns with the recently released United Nations Educational, Scientific and Cultural Organization (UNESCO) recommendation on Open Science, which encompasses practices such as publishing open research, campaigning for open access, and generally making it easier to publish and communicate scientific knowledge.

Circulated in draft form following discussion by representatives of UNESCO’s 193 member countries, the recommendation powerfully articulates and centers the importance of equity in pursuing a future for scholarship that is open by default. As the first global standard-setting framework on Open Science, the UNESCO Recommendation will provide an important guide for governments around the world as they move from aspiration to the implementation of open research practices.

UNESCO Icon

While the University of Illinois is not hosting any formal events for open access, the Library encourages students, staff, and faculty to familiarize themselves with existing open access resources, including:

IDEALS: The Illinois Digital Environment for Access to Learning and Scholarship, collects, disseminates, and provides persistent and reliable access to the research and scholarship of faculty, staff, and students at Illinois. Once an article is deposited in IDEALS, it may be efficiently and effectively accessed by researchers around the world, free of charge.
Copyright: Scholarly Communication and Publishing offers workshops and consultation services on issues related to copyright. While the Library cannot offer legal advice, we can help you to identify information and issues you may want to consider in addressing your copyright question.
Illinois Open Publishing Network: The Illinois Open Publishing Network (IOPN) is a set of digital publishing initiatives that are hosted and coordinated at the University of Illinois at Urbana-Champaign Library. IOPN offers a suite of publishing services to members of the University of Illinois at Urbana-Champaign community and aims to facilitate the dissemination of high-quality, open access scholarly publications. IOPN services include infrastructure and support for publishing open access journals, monographs, born-digital projects that integrate multimedia and interactive content.

IOPN logo

For more information on how to support access at the University of Illinois, please reach out to the Scholarly Commons or the Scholarly Communication and Publishing unit. For more information about International Open Access Week, please visit www.openaccessweek.org. Get the latest updates on Open Access events on twitter using the hashtag #OAWeek.

Commons Knowledge

Insights from the Scholarly Commons

Category Archives: Digital Scholarship