April 29-May 1 : Intermediate R and data visualization
Description
This course is the next step after the introductory course. It has several objectives. First, it will discuss effective programming with R, including data manipulation with tidyverse, functional programming and object-oriented programming, debugging, scalability, and reproducible workflows. Second, it will explore key principles of creating and designing data visualizations. Topics include effective composition and layout, effective use of color, improving figure clarity, and techniques for visualizing multidimensional data. The discussion will be supported by examples with ggplot2 graphing library in R. Finally, a lecture on principles of statistical design and data analysis will prepare the participants to the topics of the following week.
The course will combine lectures and practical hands-on exercises. The discussion of programming with R is based on the following textbooks:
- Grolemund & Wickham. ‘R for Data Science’, O’Reilly, 2017
- Wickham. ‘Advanced R’, Chapman and Hall/CRC, 2014
- Wickham. ‘R Packages’, O’Reilly, 2015
Target audience
- Target audience are experimental scientists, bioinformaticians, computer scientists, data scientists, statisticians or engineers, with a minimal prior exposure to R (e.g., at the level of the course ‘Beginner’s statistics in R’) is expected.
Speakers
-
- Kylie Bemis, Steven Braun, Laurent Gatto, Olga Vitek
Tentative schedule
Wednesday, April 29, 2020
- 12:30 p.m. Registration
- 1:30 p.m. Hands-on: Intro to ggplot2, Laurent Gatto
- 3:00 p.m. Refreshments
- 3:30 p.m. Lecture : Communicating scientific data through information design, Steven Braun
- 5:00 p.m. Hands-on: Visualization critiques, Steven Braun
- 6:00 p.m. Q&A and adjourn
Thursday, April 30, 2020
- 8:00 a.m. Q&A
- 9:00 a.m. Lecture/hands-on: Data exploration with dplyr, Kylie Bemis
- 10:30 a.m. Refreshments
- 11:00 a.m. Lecture/hands-on: Functions and functional programming, Kylie Bemis
- 12:30 p.m. Lunch
- 1:30 p.m. Lecture/hands-on: Intro to object-oriented programming, Kylie Bemis
- 3:00 p.m. Refreshments
- 3:30 p.m. Lecture/hands-on: Debugging and profiling, Kylie Bemis
- 5:00 p.m. Q&A and adjourn
Friday, May 1, 2020
- 8:00 a.m. Q&A
- 9:00 a.m. Lecture/hands-on: Reproducible workflows, Kylie Bemis
- 10:30 a.m. Refreshments
- 11:00 a.m. Lecture/hands-on: Visualization in practice – transformations, correlation, volcano plots, Laurent Gatto
- 12:30 p.m. Lunch
- 1:30 p.m. Lecture/hands-on: More visualization – upSetR, PCA, heatmap, Laurent Gatto
- 3:00 p.m. Refreshments
- 3:30 p.m. Lecture: Experimental design and statistics for mass spectrometry, Olga Vitek
- 5:00 p.m. Q&A and adjourn