This five part, interactive workshop series is your complete introduction to the capabilities of the Python language. By the end of the series, you will be able to apply your knowledge of basic principles of programming and data manipulation to collect data, process unstructured data, analyze tabular data, and automate the entire process.
subtitle : Intro to text analysis in Python
description : Students will learn the basics of cleaning, transforming, and formatting text data. They will build corpora, pull specific elements out of text strings, and pull simple metrics from text data like word counts and sentiment polarity. Students will be introcued to document classification. This workshop intoduces the basics of NLTK and gensim.
knowledge requirements : Python for Everything Part 3, or equivalent prior knowledge
tech requirements : Laptop required; please install the Anaconda distribution of Python 3 or its equivalent. The PyCharm IDE, is recommended, but not required