Coleridge Initiative

Training

Overview

The Applied Data Analytics programs are targeted at government agency staff and public policy analysts. Over 500 people from over 100 agencies have been trained over the past three years.

For more detailed information about the vision behind our training programs, check out Change Through Data: A Data Analytics Training Program for Government Employees in the Harvard Data Science Review.

Course Curriculum

The Applied Data Analytics curriculum has been developed in conjunction with federal, state and local agencies.   The curriculum is designed to train government employees and public policy analysts how to tackle important policy problems.  The project-focused learning approach shows participants how to apply modern computational and data analysis methods and tools to actual confidential data.  Much of the hands-on learning involves working on pre-defined projects that are built around pre-built Jupyter notebooks which provide project “recipes” that can be customized for specific use cases as well as applied to later projects in participants’ agencies.

All work takes place in a secure environment – the Coleridge Initiative’s Administrative Data Research Facility. 

The class is taught using the second edition of the textbook Big Data and Social Science: A Practical Guide to Methods and Tools, edited by three of the Coleridge Initiative directors (Rayid Ghani, Frauke Kreuter and Julia Lane).

Module-Based Course Delivery

Source: Chapter 1.7 of the Second Edition of the textbook Big Data and Social Science: A Practical Guide to Methods and Tools

Module 1 – Introduction to SQL & R: Asynchronous self-paced learning (videos) weekly for 4 weeks; weekly 1 hour chat with instructor

Modules 2 and 3: Partially asynchronous self-paced learning (videos); partially synchronous online meeting 4 days a week 11am-1pm Eastern for 2 weeks per module; 1 hour lecture and discussion followed by 1 hour group project work with instructors. The time commitment for each module is equivalent to a 3-day in-person course.

Final presentations: Approximately 4 weeks after the end of Module 3

Upcoming Training Programs

Applied Data Analytics - New Jersey 2021

There will be a 2021 Applied Data Analytics workshop in New Jersey. Participants will work in teams with higher education and employment agency data. The specific focus of the workshop will be on connecting post-secondary and workforce outcomes. The program will provide instruction on using big data tools including SQL and R. Participants will receive training on core data tools such as record linkage and data visualization as well as cutting-edge training in machine learning.

Schedule

All sessions are virtual and will be held online.

  • Module 1 - Introduction to SQL & R:
  • April 1, 8, 15, 22; 11am - 12pm EST
  • Module 2:
  • April 28, 29, and May 3, 4, 6, 7, 10, 11; 11am - 1pm EST
  • Module 3:
  • June 14, 15, 17, 18; 11am - 1pm EST
  • Final Presentations:
  • July 21; 11am - 1pm EST

Department of Labor, Employment, and Training Administration - Winter 2021 (Class 1)

The Department of Labor’s Employment and Training Administration is facilitating an Applied Data Analytics training program for state teams. The particular focus of this training series will be on using example data from the state of Illinois, training provider, and wage record data to examine the unemployment to reemployment trajectories of UI benefit claimants and visualize them in a dashboard.   The results will be to provide participants with (i) an understanding of how to work with similar data in their own state and (ii) the code necessary to produce a portal similar to the Illinois portal highlighted here. There will be an informational Zoom webinar describing the class on Wednesday, October 21 from 12-1pm Eastern. Registration for the webinar is available on the right-hand panel.

Schedule

All sessions are virtual and will be held online.

  • Module 1 - Introduction to SQL & R:
  • Dec 17, and Jan 6, 13, 22; 11am - 12pm EST
  • Module 2:
  • Jan 25, 26, 28, 29, and Feb 1, 2, 4, 5; 11am - 1pm EST
  • Module 3:
  • March 1, 2, 4, 5; 11am - 1pm EST
  • Final Presentations:
  • April 2; 11am - 1pm EST

Department of Labor, Employment, and Training Administration - Winter 2021 (Class 2)

The Department of Labor’s Employment and Training Administration is facilitating an Applied Data Analytics training program for state teams. The particular focus of this training series will be on using example data from the state of Illinois, training provider, and wage record data to examine the unemployment to reemployment trajectories of UI benefit claimants and visualize them in a dashboard.   The results will be to provide participants with (i) an understanding of how to work with similar data in their own state and (ii) the code necessary to produce a portal similar to the Illinois portal highlighted here. There will be an informational Zoom webinar describing the class on Wednesday, October 21 from 12-1pm Eastern. Registration for the webinar is available on the right-hand panel.

Schedule

All sessions are virtual and will be held online.

  • Module 1 - Introduction to SQL & R:
  • Dec 17 and Jan 6, 13, 22; 3pm - 4pm EST
  • Module 2:
  • Jan 25, 26, 28, 29, and Feb 1, 2, 4, 5; 3pm - 5pm EST
  • Module 3:
  • March 1, 2, 4, 5; 3pm - 5pm EST
  • Final Presentations:
  • April 2; 3pm - 5pm EST

Department of Labor, Employment, and Training Administration - Winter 2021 (Class 3)

The Department of Labor’s Employment and Training Administration is facilitating an Applied Data Analytics training program for state teams. The particular focus of this training series will be on using example data from the state of Illinois, training provider, and wage record data to examine the unemployment to reemployment trajectories of UI benefit claimants and visualize them in a dashboard.   The results will be to provide participants with (i) an understanding of how to work with similar data in their own state and (ii) the code necessary to produce a portal similar to the Illinois portal highlighted here. There will be an informational Zoom webinar describing the class on Wednesday, October 21 from 12-1pm Eastern. Registration for the webinar is available on the right-hand panel.

Schedule

All sessions are virtual and will be held online.

  • Module 1 - Introduction to SQL & R:
  • Jan 27, and Feb 3, 10, 16; 11am - 12pm EST
  • Module 2:
  • Feb 11, 12, 17, 18, 22, 23, 25, 26; 11am - 1pm EST
  • Module 3:
  • March 22, 23, 25, 26; 11am - 1pm EST
  • Final Presentations:
  • April 23; 11am - 1pm EST

Department of Labor, Employment, and Training Administration - Winter 2021 (Class 4)

The Department of Labor’s Employment and Training Administration is facilitating an Applied Data Analytics training program for state teams. The particular focus of this training series will be on using example data from the state of Illinois, training provider, and wage record data to examine the unemployment to reemployment trajectories of UI benefit claimants and visualize them in a dashboard.   The results will be to provide participants with (i) an understanding of how to work with similar data in their own state and (ii) the code necessary to produce a portal similar to the Illinois portal highlighted here. There will be an informational Zoom webinar describing the class on Wednesday, October 21 from 12-1pm Eastern. Registration for the webinar is available on the right-hand panel.

Schedule

All sessions are virtual and will be held online.

  • Module 1 - Introduction to SQL & R:
  • Jan 27, and Feb 3, 10, 16; 3pm - 4pm EST
  • Module 2:
  • Feb 11, 12, 17, 18, 22, 23, 25, 26; 3pm - 5pm EST
  • Module 3:
  • March 22, 23, 25, 26; 3 - 5pm EST
  • Final Presentations:
  • April 23; 3pm - 5pm EST

California - Spring 2021

Online

There will be a 2020 Applied Data Analytics workshop in California. Participants will work in teams with state child welfare, education, and employment agency data. The specific focus of the workshop will be on connecting child welfare data with education, housing, and workforce outcomes. The program will provide instruction on using big data tools including SQL and R. Participants will receive training on core data tools such as record linkage and data visualization as well as cutting-edge training in machine learning.

Schedule

All sessions are virtual and will be held online.

  • Module 1 - Introduction to SQL & R:
  • Feb 1 (start date)
  • Module 2:
  • March 1, 2, 4, 5, 8, 9, 11, 12
  • Module 3:
  • April 21 - 23
  • Final Presentations:
  • TBD

Current Training Programs

NCSES - Alexandria, VA - Fall 2020

Online

In this Applied Data Analytics training program, participants will work in teams to define and complete a project related to career pathways for doctoral recipients. The program will provide up-to-date perspectives and hands-on instruction on using micro data in SQL and R for tasks such as data management, record linkage, data visualization, and machine learning.

Schedule

All sessions are virtual and will be held online.

  • Module 1 - Introduction to SQL & R:
  • Sept 14 (start date)
  • Module 2:
  • Oct 14, 16, 19, 20, 22, 23
  • Module 3:
  • Nov 16, 17, 19, 20, 30
  • Final Presentations:
  • Dec 16

Highlights From Previous Programs

The first six programs were partially sponsored by the US Census Bureau, the Laura and John Arnold Foundation, the Overdeck Family Foundation, and the Ewing Marion Kauffman Foundation. The thematic focus of each program was: Criminal Justice, Welfare programs, High Need Populations, and Economic Development. Over 250 government agency staff and researchers participated in these four training programs. Here is what they had to say:

“…knowing what’s possible has helped tremendously in influencing the reality of our projects”

“I also co-authored a journal article on predicting farms that would need to apply for new loans which wouldn’t have happened without the machine learning skills I learned in the class.”

“My colleague and I have found several ways to use Python to make our work processes more efficient.”

“…hearing the topics in the class taught by experts felt as though a veil was lifted.”

“I’ve been able to use tools to help streamline my work and to identify opportunities to utilize machine learning in the City.”

Sample Projects

Addressing Recidivism: Technical Violations
Mommy Don't Go: Recidivism of Mothers
From Prosecuted to Job Recruited: Employment after Prison

Previous Programs

Kentucky - Summer 2020

The Coleridge Initiative’s Applied Data Analytics workshop in Kentucky focused on connecting education and job training data with workforce outcomes using state education and employment agency data and core data tools such as record linkage and data visualization as well as cutting edge training in machine learning.

TDC - Washington, DC - Spring 2020

The Coleridge Initiative’s TDC training program focused on core data analysis techniques such as data exploration, record linkage and data visualization using confidential TANF recipient and wage record data. Virtual program.

Introduction to Big Data for Social Science - Short Course 2020

The Coleridge Initiative’s Introduction to Big Data for Social Science training program focused on presenting key big data tools to social and data scientists. Virtual program.

Applied Data Analytics - OSU 2020

The Coleridge Initiative’s Applied Data Analytics training program focused on connecting education and job training data with workforce outcomes. Held in Columbus, OH.

Applied Data Analytics - NCSES 2019

The Coleridge Initiative’s Applied Data Analytics training program focused on employment outcomes for doctoral recipients. Held in Washington, DC.

Applied Data Analytics - USDA 2019

The Coleridge Initiative’s Applied Data Analytics training program focused on food purchasing patterns of households participating in the WIC program. Held in Washington, DC.

Applied Data Analytics - TANF Data Collaborative 2019

The Coleridge Initiative’s Applied Data Analytics training program focused on employment outcomes of TANF recipients. Held in College Park, MD.

See all our material on GitHub