Web Scraping with Python

Websites can be full of useful data that are not always downloadable or easily accessible. Rather than manually copying and pasting from a site, you can use Python to access the raw HTML behind a webpage and automate retrieving, structuring, and outputting data from pages across a domain.

This workshop will cover:

  • identifying websites for web scraping
  • automating scraping with Python
  • scraping HTML tables
  • scraping paginated search results
  • exporting results
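
As a rough illustration of two of the topics above (scraping an HTML table and exporting the results), here is a minimal sketch that uses only Python's standard library. It is not the workshop's own material: the names TableParser, table_to_csv, and SAMPLE_HTML are hypothetical, and the sample table is made up so the example runs without fetching a live page.

```python
import csv
import io
from html.parser import HTMLParser


class TableParser(HTMLParser):
    """Collect the text of every <td>/<th> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []     # finished rows, each a list of cell strings
        self._row = None   # row being built, or None when outside a <tr>
        self._cell = None  # text pieces of the current cell, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Only keep text that appears inside a table cell.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._cell = None


def table_to_csv(html):
    """Parse the table rows in `html` and return them as CSV text."""
    parser = TableParser()
    parser.feed(html)
    parser.close()
    out = io.StringIO()
    csv.writer(out, lineterminator="\n").writerows(parser.rows)
    return out.getvalue()


# Made-up sample page fragment (an assumption, not real data).
SAMPLE_HTML = """
<table>
  <tr><th>Name</th><th>Count</th></tr>
  <tr><td>alpha</td><td>1</td></tr>
  <tr><td>beta</td><td>2</td></tr>
</table>
"""

if __name__ == "__main__":
    print(table_to_csv(SAMPLE_HTML))
```

In practice the HTML string would come from a downloaded page rather than a constant, and a dedicated parsing library may handle messy markup more gracefully than this sketch does.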

Before you come:

This workshop is designed for attendees who have the following:

  • familiarity with HTML structure
  • working knowledge of Python (running scripts, installing libraries, etc.)
  • if using a lab computer:
    • a Yale NetID
    • experience with Windows OS
  • if using a personal laptop:

Date:
Friday, September 29, 2017
Time:
10:00am - 12:00pm
Time Zone:
Eastern Time - US & Canada
Campus:
Science Hill
Categories:
Marx Science and Social Science Library, Miscellaneous, Digital Humanities, Computer Programming
Registration has closed.

Event Organizer

Joshua Dull