The following list provides links to the slides and code used in the guided coding sessions. The solutions to the challenges will be made available through RStudio Server.
SLIDES: Introduction to the course
Introduction to RMarkdown and Setup (.Rmd file)
SLIDES: Web scraping
Scraping tables (.Rmd file)
Coding challenge 1: Scraping Campaign Speeches (Part I) (.Rmd file)
Lists, Functions, and Loops (.Rmd file)
Scraping unstructured data from the web (.Rmd file)
Coding challenge 2: Scraping Campaign Speeches (Part II) (.Rmd file)
SLIDES: Advanced web scraping
Basics of text analysis (.Rmd file)
Scraping websites using Selenium (.Rmd file)
Coding challenge 3: Automating online searches (.Rmd file)
Scraping newspaper websites (.Rmd file)
Coding challenge 4: Scraping The Guardian’s homepage (.Rmd file)
SLIDES: Application Programing Interfaces
Scraping data from APIs (.Rmd file)
Interacting with the Clarifai API (.Rmd file)
Coding challenge 5: Working with the NYTimes API (.Rmd file)
SLIDES: Social media data
Scraping data from Twitter’s Streaming API (.Rmd file)
Coding challenge 6: Streaming API (.Rmd file)
Scraping data from Twitter’s REST API (.Rmd file)
Coding challenge 7: REST API (.Rmd file)
Extracting data from PDF files (.Rmd file)
Coding challenge 8: Working with PDF files (.Rmd file)
SLIDES: Text encoding
Dealing with encoding issues (.Rmd file)
Exception handling. Merging and reshaping dataset. (.Rmd file)