Log in

Sign up for our weekly newsletter!

When & Where
Date: 
Tue, December 6, 2016 - 4:00 PM to 6:00 PM
Location: 
D-Lab: Convening Room (356 Barrows Hall)
Description
Type: 

This hands on workshop goes through the common “preprocessing recipe” that is used as the foundation for a variety of other applications as well as some basic natural language processing techniques.  These include: a) digitization (utf 8), b) removal of stopwords, numbers, punctuation, c) tokenization, d) calculation of word frequencies / proportions, e) part of speech tagging, and f) concordances.  This will be done using the NLTK Python package, so basic familiarity with Python is required if you wish to follow along with the tutorial.

This workshop is one of a three part series that will prepare participants to move forward with text analysis research, with a special focus on humanities and social science applications. The other two workshops are:

Text Analysis Fundamentals: Methods and Approaches

Text Analysis Fundamentals: Unsupervised Approaches

Details
Training Host: 
D-lab Facilitator: 
Patty Frontiera
Log in to register for this training.