This three-part series will cover the following materials:
Part 1: Introduction
Sign up for our weekly newsletter!
This is an archive of our past training offerings. We are looking to include workshops on topics not yet covered here. Is there something not currently on the list? Send us a proposal.
This three-part series will cover the following materials:
Part 1: Introduction
This four-part, interactive workshop series is your complete introduction to programming Python for people with little or no previous programming experience. By the end of the series, you will be able to apply your knowledge of basic principles of programming and data manipulation to a real-world social science application.
Part 1 Topics:
Add data visualization to your communication toolbox without learning to code. Tableau cuts down the time you need to spend creating visualizations through an intuitive graphical user interface. Learn the basics in this hands-on, two-part workshop.
Knowledge Requirements: No prerequisites
An intro to the basics that instructors often assume you know, but that you probably never had good instruction on! After this course, you should be able to more easily start learning to program (e.g., in our R or Python Fundamentals series), follow instructions and documentation online (e.g., StackExchange), and communicate better with your collaborators who are programming.
This workshop introduces students to scikit-learn, the popular machine learning library in Python, as well as the auto-ML library built on top of scikit-learn, TPOT. The focus will be on scikit-learn syntax and available tools to apply machine learning algorithms to datasets.
This workshop addresses various topics in Natural Language Processing, primarily through the use of NLTK. We'll work with a corpus of documents and learn how to identify different types of linguistic structure in the text, which can help in classifying the documents or extracting useful information from them.
R FUN!damentals Part 4: For-loops and Functions
Students will learn how to write for-loops and functions in R. You will learn how to personalize functions via control structures such as ‘if’ and ‘else’. These learning objectives will be exemplified through introduction to the construction and graphical representation of Monte Carlo resampling simulation.
This four-part, interactive workshop series is your complete introduction to programming Python for people with little or no previous programming experience. By the end of the series, you will be able to apply your knowledge of basic principles of programming and data manipulation to a real-world social science application.
Pandas is a Python package providing fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It enables doing practical, real world data analysis in Python.
In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.
We plan to cover:
Participants in this workshop will learn about some of the issues surrounding the collection of health statistics, and will also learn about authoritative sources of health statistics and data. We will look at tools that let you create custom tables of vital statistics (birth, death, etc.), disease statistics, health behavior statistics, and more. The focus will be on U.S.
Building on the "Georectifying Historical Maps" workshop, the next step in creating useable GIS data from historical map sources often involves digitizing features to create new vector layers. This workshop will introduce the basic concepts and best-practices of feature digitization. We'll explore the tools and processes through a hands-on exercise.
R FUN!damentals Part 3: Data Exploration and Analysis
Students will be introduced to data exploration and analysis in R. You will learn how to summarize data and explore it with histograms, scatterplots, and boxplots. You will also be introduced to coding statistical data analysis via t-tests, analyses of variance, correlation, and linear regression.
This three-part series will cover the following materials:
Part 1: Introduction
This workshop will introduce students to the basics of designing a survey instrument using the Qualtrics platform, such as randomization and survey flow. We will also cover more advanced topics like implementing embedded data and using javascript, as well as tips and tricks on how to use your design to maximize the number of quality responses you get.
In this workshop we will cover two main supervised text analysis methods, the dictionary method, and supervised classification. We will use list comprehension to implement the dictionary method, using sentiment analysis as our example.
Join Kari Peterson, Associate Director of Data Analysis at Haas Development & Alumni Relations and founder of the Cal Data Visualization Network, for this workshop on best practices for communicating your data visually. We'll be covering terminology, color theory, how to pick a viz, tips for how to make sure your viz says what you want it to say, and more.
This four-part, interactive workshop series is your complete introduction to programming Python for people with little or no previous programming experience. By the end of the series, you will be able to apply your knowledge of basic principles of programming and data manipulation to a real-world social science application.
Part 3 Topics:
This class will cover the basics of Excel, from simple formulas (SUM, COUNTIF) to more complex Excel features like Macros and the Data Analysis ToolPak. By the end of both sections, students will be able to employ Excel skills to open source policy data sets. These skills are transferrable to any sector.
Topics Covered Will Include:
R FUN!damentals Part 2: Subsetting and Reshaping
This four-part, interactive workshop series is your complete introduction to programming Python for people with little or no previous programming experience. By the end of the series, you will be able to apply your knowledge of basic principles of programming and data manipulation to a real-world social science application.
Part 2 Topics: