Geospatial Data in R: Part 1

Geospatial data are an important component of social science and humanities data visualization and analysis. The R programming language is a great platform for exploring these data and integrating them into your research. 

Geospatial Data in R, part I: Getting started with spatial data objects

Log in to register for this training.

The available spaces and the waitlist for this event are both full.

Text Analysis Fundamentals: Supervised Methods

In this workshop we will cover the most common CTA task: supervised classification. Using the Python library scikit-learn, we will implement Logistic Regression and Random Forest methods to perform sentiment analysis. Optional: introduction to word vector representations with Word2Vec.

Prior knowledge: Basic familiarity with Python is required if you wish to follow along with the tutorial. Completion of D-Lab's Python FUN!damentals workshop series will be sufficient.

Log in to register for this training.

Text Analysis Fundamentals: Unsupervised Approaches

This hands on workshop builds on part 1 by introducing the basics of Python's scikit-learn package to implement unsupervised text analysis methods. This workshop will cover a) vectorization and Document Term Matrices, b) weighting (tf-idf), and c) uncovering patterns using topic modeling.

Prior knowledge: Basic familiarity with Python is required if you wish to follow along with the tutorial. Completion of D-Lab's Python FUN!damentals workshop series will be sufficient.

Log in to register for this training.

Text Analysis Fundamentals: Basic Tools and Techniques

This hands-on workshop presents a broad overview of the existing methods to use text as data, with a focus on applications in social sciences and humanities. After a brief theoretical discussion, we will go through the common “preprocessing recipe” that is used as the foundation for a variety of other applications as well as some basic natural language processing techniques.  These include: a) tokenization b) removal of stopwords, numbers, punctuation, c) calculation of word frequencies / proportions, d) part of speech tagging.  

Log in to register for this training.

Machine Learning in Python

This workshop introduces students to scikit-learn, the popular machine learning library in Python, as well as the auto-ML library built on top of scikit-learn, TPOT. The focus will be on scikit-learn syntax and available tools to apply machine learning algorithms to datasets.

Prior knowledge: We will assume a basic knowledge of Python and a basic understanding of machine learning techniques. No theory instruction will be provided.

Log in to register for this training.

The available spaces and the waitlist for this event are both full.

Introduction to Data Visualization in Python

For this workshop, we'll provide an introduction to visualization with Python. We'll cover visualization theory and plotting with Matplotlib and Seaborn, working through examples in a Jupyter (formerly IPython) notebook. The following plot types will be covered:

  • line
  • bar
  • scatter
  • boxplot

We'll also learn about styles and customizing plots.

Throughout the workshop, we'll discuss the plot types best suited for particular kinds of data.

Log in to register for this training.

The available spaces and the waitlist for this event are both full.

Introduction to Pandas

Pandas is a Python package providing fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It enables doing practical, real world data analysis in Python.

In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.

We plan to cover:

Log in to register for this training.

Python Fundamentals: Part 4

This four-part, interactive workshop series is your complete introduction to programming Python for people with little or no previous programming experience. By the end of the series, you will be able to apply your knowledge of basic principles of programming and data manipulation to a real-world social science application.

Part 4 Topics: We will applying the skills learned during previous sessions to a real world social science example.

Log in to register for this training.

Python Fundamentals: Part 3

This four-part, interactive workshop series is your complete introduction to programming Python for people with little or no previous programming experience. By the end of the series, you will be able to apply your knowledge of basic principles of programming and data manipulation to a real-world social science application.

Part 3 Topics:

Log in to register for this training.

The available spaces and the waitlist for this event are both full.

Python Fundamentals: Part 2

Part 2 Topics:

  • Lists
  • Loops
  • Conditionals
  • Functions
  • Scope

Knowledge requirements: Python Fundamentals: Part 1 or equivalent prior knowledge

Registration note: To participate in multiple parts of this series, please be sure to register for each day separately.

Log in to register for this training.

Pages