What is web-scraping?
Put simply, web-scraping is the act of taking information from a website and placing it into a file so that you can analyze it. More practically and less formally, it’s basically copying text from a website into either a text or CSV file. This page is dedicated to maintaining resources about web-scraping. If you think I should include something, please email me at firstname.lastname@example.org. Thanks!
Readings and theory
E.J.T. Weltevrede’s Repurposing digital methods: The research affordances of platforms and engines
Basic workshop on web-scraping using XPath:
- Sample Google sheet (Youtube)
- Sample Google sheet (Game of Thrones Wikipedia)
- Game of Thrones characters
Useful summaries of web-scraping
Free non-coding resources