Event box

Parsing Data with Python In-Person

Develop techniques for wrangling messy data with Python. Data retrieved online often needs to be transformed or otherwise parsed before it can become usable for your research. In this workshop, we’ll walk through using Beautiful Soup, a Python library for extracting data from HTML and XML files.

This workshop is designed for participants who have taken the "First Steps with Python" workshop or who otherwise have a general understanding of Python's syntax and data types. If you missed or want to review what was covered in "First Steps with Python," you can find the tutorial on the DHLab's GitHub repository.

Registration & Requirements

Registration is open to all Yale students, faculty, and staff, but space is limited. If you have registered, you will be sent a Zoom link the day before the workshop. If you don't receive the email or lose the link, please contact the Digital Humanities Lab.

Participants are asked to come to the workshop with Anaconda Python (version 3.7 or higher) already installed. For help with the installation, stop by the DHLab’s virtual Office Hours. Sample data will be provided.

Instructors: Catherine DeRose (DHLab) and Douglas Duhaime (DHLab)

Thursday, October 22, 2020
3:00pm - 4:30pm
Time Zone:
Eastern Time - US & Canada (change)
Zoom Session - DHLab (online only_1)
  Digital Humanities  
Registration has closed.

Event Organizer

Profile photo of Catherine DeRose
Catherine DeRose