Bio-X R BootCamp Summer 2020

Logo

Home

Material

Assignments

Installation

Resources

Welcome to Bio-X R Bootcamp!

This introductory 8-week R-bootcamp is aimed at providing the basics of data analysis in R. After a brief introduction to coding in R, using bioconductor and tidyverse, we will cover the basics of data analysis and inference, including (and not restricted to, time allowing): data visualization, the basics of statistical modeling, mixtures, clustering, testing (including permutation tests) with R. The objective of this bootcamp is to provide you with the tools necessary to perform data analysis on your own datasets.

No prior knowledge of R or of statistics is required. This is not a graded class, and the estimated workload is roughly 15h per week. The course will run using a flipped classroom format: the students will watch the lecture videos and work on the data analysis labs in their own time. The class will meet three times a week via Zoom (MWTh from 10am to 11am, starting June 29th) to discuss the material and go over the labs. Additional office hours will be held in the afternoon to provide extra individual support to the students if needed.

Announcements

Logistics

Instructor: Claire Donnat, PhD (cdonnat at stanford dot edu)

Instructional Advisor: Susan Holmes (susan at stat dot stanford dot edu)

Instruction Assistant: Zelin (James) Li (jameszli at stanford dot edu)

Office hours:

Communication: Please direct your questions on class materials and assignments to Claire (cdonnat at stanford dot edu) and cc James (jameszli at stanford dot edu)

Syllabus (tentative)

Week 1 & Week 2: Introduction to R Objectives: getting familiar with R, Rstudio, R markdown, and basic statistics.

Week 3: Simulation & Graphics

Week 4: Testing

Week 5: Linear Models

Week 6: Linear Models & Permutations

Week 7: Clustering

Week 8: Multivariate Analysis

Optional textbooks

“Data Analysis for the Life Sciences” by Rafael A Irizarry and Michael I Love.

“Modern Statistics for Modern Biology” by Susan Holmes and Wolfgang Huber.

“R for Data Science” by Garrett Grolemund and Hadley Wickham.

R-cheatsheets

Here are a couple of cheatsheets with basic R commands:

Base R Vector

Short R-reference card

R cheatsheet

Here are a couple of cheatsheets with data import, transformation, visualization, and rmarkdown:

Data Import

Data Transformation (dplyr)

Data Visualization (ggplot2)

Rmarkdown