Sign up for our weekly newsletter!
Benjamin Bartlett is a Ph.D. Candidate in the Department of Political Science at UC Berkeley.
Web scraping is a powerful and increasingly important tool for social science researchers. It allows users to automatically download the content of websites as plain text files for use in data and text analysis. This workshop series is a thorough introduction to web scraping using the python package scrapy. By the end of the series, users will be able to automatically scrape the contents of an entire website. This workshop assumes some familiarity with python, which can be acquired by doing the first few lessons of codecademy. A laptop will be required.
We will send detailed instructions to participants prior to the workshop if there are any specific installation requirements.