Using Reddit’s API to Gather Text Data

I initially started my research with an eye to using digital techniques to analyze an encyclopedia that collects a number of conspiracy theories in order to determine what constitute typical features of conspiracy theories. At this point, I realize there were two flaws in my original plan. First, as discussed in a previous blog post, […]

Endangered Data Week is Coming

Did you know that Endangered Data Week is happening from February 26-March 2? Endangered Data Week is a collaborative effort to help highlight on public datasets that are in danger of being deleted, repressed, mishandled, or lost. Inspired by recent events that have shown how fragile publicly administered data is, Endangered Data Week hopes to […]

Spotlight: Unexpected Surprises in the Internet Archive

For most of us, our introduction to the Internet Archive was the Wayback Machine, a search engine that can show you snapshots of websites from the past. It’s always fun to watch a popular webpage like Google evolve from November 1998 to July 2004 to today, but there is so much more that the Internet Archive […]

Got Bad Data? Check Out The Quartz Guide

If you’re working with data, chances are, there will be at least a few times where you encounter the “nightmare scenario”. Things go awry — values are missing, your sample is biased, there are inexplicable outliers, or the sample wasn’t as random as you thought. Some issues you can solve, other issues are less clear. […]

Open Source Tools for Social Media Analysis

This post was guest authored by Kayla Abner. Interested in social media analytics, but don’t want to shell out the bucks to get started? There are a few open source tools you can use to dabble in this field, and some even integrate data visualization. Recently, we at the Scholarly Commons tested a few of […]

Spotlight: PastPin

Who? What? Where? When? and Why? While these make up a catchy song from Spy Kids, they’re also questions that can get lost when looking at digital images, especially when metadata is missing. PastPin wants to help answer these questions, by tagging the location and time of vintage images on Flickr Commons, with the hope that […]

What To Do When OCR Software Doesn’t Seem To Be Working

While optical character recognition (OCR) is a powerful tool, it’s not a perfect one. Inputting a document into an OCR software doesn’t necessarily mean that the software will actually output something useful 100% of the time. Though most documents come out without a hitch, we have a few tips on what to do if your […]

The Library’s Eighth Data Purchase Program Round is Accepting Applications!

We’re starting a bit earlier than in past years to help researchers acquire data they need for their research! Through the Library’s Data Purchase Program, the University Library accepts applications from campus researchers to purchase data. All applications will be reviewed by the Library’s Data Discovery and Support committee, which looks for requests which meet […]

Register Today for ICPSR’s Summer Program in Quantitative Methods of Social Research

The Inter-university Consortium for Political and Social Research (ICPSR) is once again offering its summer workshops for researchers! Workshops range from Rational Choice Theories of Politics and Society to Survival Analysis, Event History Modeling, and Duration Analysis. There are so many fantastic choices across the country that we can hardly decide which we’d want to […]