This introductory 8-week R-bootcamp is aimed at providing the basics of data analysis in R. After a brief introduction to coding in R, using bioconductor and tidyverse, we will cover the basics of data analysis and inference, including (and not restricted to, time allowing): data visualization, the basics of statistical modeling, mixtures, clustering, testing (including permutation tests) with R. The objective of this bootcamp is to provide you with the tools necessary to perform data analysis on your own datasets.
No prior knowledge of R or of statistics is required. This is not a graded class, and the estimated workload is roughly 15h per week. The course will run using a flipped classroom format: the students will watch the lecture videos and work on the data analysis labs in their own time. The class will meet three times a week via Zoom (MWTh from 10am to 11am, starting June 29th) to discuss the material and go over the labs. Additional office hours will be held in the afternoon to provide extra individual support to the students if needed.
Instructor: Claire Donnat, PhD (cdonnat at stanford dot edu)
Instructional Advisor: Susan Holmes (susan at stat dot stanford dot edu)
Instruction Assistant: Zelin (James) Li (jameszli at stanford dot edu)
Office hours:
Communication: Please direct your questions on class materials and assignments to Claire (cdonnat at stanford dot edu) and cc James (jameszli at stanford dot edu)
Week 1 & Week 2: Introduction to R Objectives: getting familiar with R, Rstudio, R markdown, and basic statistics.
Week 3: Simulation & Graphics
Week 4: Testing
Week 5: Linear Models
Week 6: Linear Models & Permutations
Week 7: Clustering
Week 8: Multivariate Analysis
“Data Analysis for the Life Sciences” by Rafael A Irizarry and Michael I Love.
“Modern Statistics for Modern Biology” by Susan Holmes and Wolfgang Huber.
“R for Data Science” by Garrett Grolemund and Hadley Wickham.
Here are a couple of cheatsheets with basic R commands:
Here are a couple of cheatsheets with data import, transformation, visualization, and rmarkdown: