Log in

Sign up for our weekly newsletter!

When & Where
Date: 
Fri, April 22, 2016 - 12:00 PM to 2:00 PM
Location: 
D-Lab: Convening Room (356 Barrows Hall)
Description
Type: 

This workshop addresses various topics in Natural Language Processing, primarily through the use of NLTK. We first scrape and clean a long text; create a corpus in NLTK; explore tagged corpora; build basic machine learning POS taggers; finally, we construct a grammar for chunking and tree building. This workshop concentrates of linguistic issues in NLP and text analysis.

Prerequisites: Attendees should either already have a thorough knowledge of Python, or have attended the Python for Everything series. Some familiarity with NLTK would be helpful.

Please install the following packages ahead of the workshop: 

Python 3 (https://www.continuum.io/downloads)

Packages:

  • NLTK ( $ pip install nltk) 
  • Brown corpus from NLTK ( >>> nltk.download() ) 
  • BeautifulSoup ( $ pip install beautifulsoup4)
Details
Training Host: 
D-lab Facilitator: 
Zawadi Rucks-Ahidiana
Format Detail: 
Hands-on, Follow-along, Interactive tutorial