When & Where
Date: Tue, August 23, 2016 - 9:30 AM to 12:30 PM
Location: Barrows 356: D-Lab Convening Room
Description

Students will learn the basics of cleaning, transforming, and formatting text data. They will build corpora, extract specific elements from text strings, and compute simple metrics from text data, such as word counts and sentiment polarity. Students will also be introduced to document classification. This workshop introduces the basics of NLTK and gensim.
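As a rough illustration of the cleaning and word-count steps described above, here is a minimal sketch that uses only the Python standard library. It is not the workshop's own material: the workshop uses NLTK, gensim, and textblob, and the function names `clean_and_tokenize` and `word_counts` are hypothetical names chosen for this example.

```python
import re
from collections import Counter


def clean_and_tokenize(text):
    """Lowercase the text and return its word tokens as a list."""
    # Keep runs of letters, allowing one internal apostrophe (e.g. "didn't");
    # digits and punctuation are dropped.
    return re.findall(r"[a-z]+(?:'[a-z]+)?", text.lower())


def word_counts(text):
    """Count how often each cleaned token appears in the text."""
    return Counter(clean_and_tokenize(text))


sample = "The cat sat on the mat. The mat didn't mind!"
counts = word_counts(sample)

# Show the two most frequent tokens.
print(counts.most_common(2))  # [('the', 3), ('mat', 2)]
```

A dedicated tokenizer handles many cases this regular expression does not (hyphenated words, non-English letters, abbreviations), which is one reason to reach for a library once the basics are clear.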

Knowledge requirements: Python for Everything series

Technology Requirements: Laptop required; please install the Anaconda distribution of Python 3 or its equivalent. The workshop will use the Jupyter Notebook, but IDEs are also acceptable.

Please install the Python packages “gensim”, “textblob”, and “nltk”:

  • pip install gensim
  • pip install nltk
  • pip install textblob

Or, if you have Anaconda:

  • conda install gensim
  • conda install nltk
  • conda install textblob
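To confirm before the workshop that the three packages are visible to your Python installation, a short check along these lines may help. This is a sketch, not an official setup script; the helper name `check_packages` is hypothetical, and the check only reports whether each package can be found, not whether its version is suitable.

```python
import importlib.util

# Packages named in the installation instructions above.
REQUIRED = ["gensim", "nltk", "textblob"]


def check_packages(names):
    """Return a dict mapping each package name to True if Python can locate it."""
    return {name: importlib.util.find_spec(name) is not None for name in names}


status = check_packages(REQUIRED)
for name, found in status.items():
    print(f"{name}: {'installed' if found else 'MISSING'}")
```

Run it in the same environment (for example, the same Jupyter kernel) that you plan to use during the workshop, since packages installed into one environment are not visible from another.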

Details
Training Host: 
D-Lab Facilitator: Jon Stiles
Format Detail: Interactive