Parsing Data with Python
Develop techniques for wrangling messy data with Python. Data retrieved online often needs to be transformed or otherwise parsed before it can become usable for your research. In this workshop, we’ll walk through using Beautiful Soup, a Python library for extracting data from HTML and XML files.
This workshop is designed for participants who have taken the "First Steps with Python" workshop or who otherwise have a general understanding of Python's syntax and data types. If you missed or want to review what was covered in "First Steps with Python," you can find the tutorial on the DHLab's GitHub repository.
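For a sense of the kind of parsing described above, here is a minimal sketch of extracting structured records from HTML with Beautiful Soup. It is not taken from the workshop materials: the HTML snippet, its class names, and the `records` variable are all invented for illustration, and it assumes the `beautifulsoup4` package is installed (it ships with Anaconda).

```python
from bs4 import BeautifulSoup

# Hypothetical HTML, invented for illustration (not the workshop's sample data).
html = """
<ul id="letters">
  <li class="letter"><span class="author">A. Writer</span> <span class="year">1850</span></li>
  <li class="letter"><span class="author">B. Reader</span> <span class="year">1851</span></li>
</ul>
"""

# "html.parser" is the parser bundled with Python, so no extra parser is needed.
soup = BeautifulSoup(html, "html.parser")

records = []
for item in soup.find_all("li", class_="letter"):
    # Pull the text out of each tagged field and strip surrounding whitespace.
    author = item.find("span", class_="author").get_text(strip=True)
    year = int(item.find("span", class_="year").get_text(strip=True))
    records.append({"author": author, "year": year})

print(records)
```

Each list item becomes a dictionary, which is a convenient shape for writing to CSV or loading into a data frame later.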
Registration & Requirements
Registration is open to all Yale students, faculty, and staff, but space is limited. If you have registered, you will be sent a Zoom link the day before the workshop. If you don't receive the email or lose the link, please contact the Digital Humanities Lab.
Participants are asked to come to the workshop with Anaconda Python (version 3.7 or higher) already installed. For help with the installation, stop by the DHLab’s virtual Office Hours. Sample data will be provided.
Instructors: Catherine DeRose (DHLab) and Douglas Duhaime (DHLab)
- Thursday, October 22, 2020
- 3:00pm - 4:30pm
- Time Zone: Eastern Time - US & Canada
- Zoom Session - DHLab (online only)