The following list provides links to the slides and code used in the guided coding sessions. The solutions to the challenges will be made available through RStudio Server.
SLIDES: Introduction to the course. Good coding practices in R.
Introduction to RMarkdown and Setup (.Rmd file)
SLIDES: Parallel computing
Efficient programming in R (.Rmd file)
Coding challenge 1: Writing more efficient code (.Rmd file)
Parallel computing in R (.Rmd file)
Coding challenge 2: Parallel computing (.Rmd file)
SLIDES: Introduction to SQL.
Querying an SQL database (.Rmd file)
Coding challenge 3 (.Rmd file)
Querying multiple SQL tables (.Rmd file)
Coding challenge 4 (.Rmd file)
Querying a massive-scale Google BigQuery database (.Rmd file)
Coding challenge 5 (.Rmd file)
SLIDES: Large-scale discovery in networks.
Basics of network analysis (.Rmd file)
Community detection in large-scale networks (.Rmd file)
Coding challenge 6 (.Rmd file)
Latent space network models (.Rmd file)
Coding challenge 7 (.Rmd file)
SLIDES: Text analysis. Dictionary methods.
Dictionary methods (.Rmd file)
Coding challenge 8 (.Rmd file)
SLIDES: Supervised machine learning methods.
Regularized regression (.Rmd file)
Other classifiers (.Rmd file)
Xgboost (.Rmd file)
Coding challenge 9 (.Rmd file)
SLIDES: Unsupervised analysis of text.
Exploratory analysis of text (.Rmd file)
Topic models (.Rmd file)
Coding challenge 10 (.Rmd file)
Word embeddings (.Rmd file)