Web Scraping with Python
Web Scraping with Python In-Person
Websites can be full of useful data that are not always downloadable or easily accessible. Rather than manually copying and pasting from a site, Python lets you access the raw HTML behind every webpage and automate the process of retrieving, structuring, and outputting data from pages across a domain.
This workshop will cover:
- identifying websites for web scraping
- automating scraping with Python
- scraping HTML tables
- scraping paginated search results
- exporting results
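As a rough illustration of the "scraping HTML tables" and "exporting results" topics above, the sketch below pulls the rows out of an HTML table and writes them as CSV. It is not the workshop's own material: it uses only the Python standard library (`html.parser` and `csv`) as a stand-in for BeautifulSoup and requests so that it runs without installs or network access. The names `TableParser`, `table_to_csv`, and `SAMPLE_HTML` are hypothetical, and the table contents are made up for the example.

```python
import csv
import io
from html.parser import HTMLParser


class TableParser(HTMLParser):
    """Collect the text of every <td>/<th> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []     # finished rows, each a list of cell strings
        self._row = None   # row currently being built, or None
        self._cell = None  # text fragments of the current cell, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Only keep text that appears inside a table cell.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._cell = None


def table_to_csv(html):
    """Return (rows, csv_text) for the table rows found in an HTML string."""
    parser = TableParser()
    parser.feed(html)
    parser.close()
    out = io.StringIO()
    csv.writer(out, lineterminator="\n").writerows(parser.rows)
    return parser.rows, out.getvalue()


# Made-up sample page; a real script would download the HTML first.
SAMPLE_HTML = """
<table>
  <tr><th>Library</th><th>Campus</th></tr>
  <tr><td>Example Library A</td><td>North</td></tr>
  <tr><td>Example Library B</td><td>South</td></tr>
</table>
"""

rows, csv_text = table_to_csv(SAMPLE_HTML)
print(csv_text)
```

In a real script the HTML string would come from a downloaded page rather than a literal, and the CSV text would be written to a file instead of printed.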
Before you come:
This workshop is designed for attendees who have the following:
- familiarity with HTML structure
- working knowledge of Python (running scripts, installing libraries, etc.)
- if using a lab computer:
  - a Yale NetID
  - experience with Windows OS
- if using a personal laptop:
  - Python 3 installed (Anaconda recommended)
  - 'BeautifulSoup' & 'requests' for Python installed
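One possible way to confirm the laptop requirements above before attending is a short check script. This is only a sketch: the helper name `check_setup` is hypothetical, and it assumes BeautifulSoup is installed under its usual import name `bs4`.

```python
import importlib.util
import sys


def check_setup(modules=("bs4", "requests")):
    """Report whether Python 3 is running and each module can be found."""
    report = {"python3": sys.version_info.major >= 3}
    for name in modules:
        # find_spec returns None when a top-level module is not installed.
        report[name] = importlib.util.find_spec(name) is not None
    return report


for item, ok in check_setup().items():
    print(f"{item}: {'ok' if ok else 'missing'}")
```

Any line reported as missing points to a library that still needs to be installed with the package manager of your Python distribution.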
- Date: Friday, September 29, 2017
- Time: 10:00am - 12:00pm
- Time Zone: Eastern Time - US & Canada
- Campus: Science Hill
- Categories: Marx Science and Social Science Library, Miscellaneous, Digital Humanities, Computer Programming
Registration has closed.
Event Organizer
Joshua Dull