Log in

Sign up for our weekly newsletter!

When & Where
Date: 
Thu, August 17, 2017 - 12:00 PM to 4:00 PM
Location: 
Barrows 356: Convening Room
Description
Type: 

Part 1: Methods and Approaches (12-2pm)

This non-technical workshop provides an overview of computational text analysis methods and tools. No experience in this area is expected or required. The goal is to provide an orientation for those wishing to go further with text analysis and interpret results of these methods.

This workshop is one of a four-part series that will prepare participants to move forward with text analysis research, with a special focus on humanities and social science applications. Please register for each day separately. 

 

Part 2: Basic Tools and Techniques (2-4pm)

This hands on workshop goes through the common “preprocessing recipe” that is used as the foundation for a variety of other applications as well as some basic natural language processing techniques.  These include: a) digitization (utf 8), b) removal of stopwords, numbers, punctuation, c) tokenization, d) calculation of word frequencies / proportions, e) part of speech tagging, and f) concordances.

Prior knowledge: We will be using the NLTK Python package, so basic familiarity with Python is required if you wish to follow along with the tutorial. Completion of D-Lab's Python FUN!damentals workshop series will be sufficient.

This workshop is one of a four-part series that will prepare participants to move forward with text analysis research, with a special focus on humanities and social science applications. 

Details
Training Host: 
D-lab Facilitator: 
Ben Gebre-Medhin
Format Detail: 
Interactive, hands-on
Log in to register for this training.