Introduction to bulk RNAseq analysisΒΆ
Updated: 17/01/2023
This workshop material includes a tutorial on how to approach RNAseq data, starting from your sequencing reads (fastq files). Thus, the workshop only briefly touches upon laboratory protocols, library preparation, and experimental design of RNA sequencing experiments, mainly for the purpose of outlining considerations in the downstream bioinformatic analysis. This workshop is based on the materials developed by members of the teaching team at the Harvard Chan Bioinformatics Core (HBC), a collection of modified tutorials from the DESeq2, R language vignettes and the nf-core rnaseq pipeline.
Authors
Overview
Syllabus:
- Course introduction
- Experimental planning
- Data explanation
- Read reprocessing and preprocessing pipelines
- Analysing RNAseq data
- RNAseq counts
- Exploratory analysis
- Differential Expression Analysis
- Functional analysis
- Summarized workflow
Total Time Estimation: 8 hours
Supporting Materials: Workshop slides with theory on bulk RNAseq can be found in this zenodo repository.
Target Audience: PhD, MsC, etc.
Level: Beginner/Intermediate/Advanced
License: Creative Commons Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Funding: This project was funded by the Novo Nordisk Fonden (NNF20OC0063268).
Course Requirements
- Knowledge of R, Rstudio and Rmarkdown. It is recommended that you have at least followed our workshop R basics
- Basic knowledge of RNAseq technology
- Basic knowledge of data science and statistics such as PCA, clustering and statistical testing
This workshop material includes a tutorial on how to approach RNAseq data, starting from your sequencing reads (fastq files). Thus, the workshop only briefly touches upon laboratory protocols, library preparation, and experimental design of RNA sequencing experiments, mainly for the purpose of outlining considerations in the downstream bioinformatic analysis. This workshop is based on the materials developed by members of the teaching team at the Harvard Chan Bioinformatics Core (HBC), a collection of modified tutorials from the DESeq2, R language vignettes and the nf-core rnaseq pipeline.
The aim of this repository is to run a comprehensive but introductory workshop on bulk-RNAseq bioinformatic analyses. Each of the modules of this workshop is accompanied by a powerpoint slideshow explaining the steps and the theory behind a typical bioinformatics analysis (ideally with a teacher). Many of the slides are annotated with extra information and/or point to original sources for extra reading material.
Goals
By the end of this workshop, you should be able to analyse your own bulk RNAseq data:
- Preprocess your reads into a count matrix.
- Normalize your data.
- Explore your samples with PCAs and heatmaps.
- Perform Differential Expression Analysis.
- Annotate your results.
AcknowledgementsΒΆ
- Center for Health Data Science, University of Copenhagen.
- Hugo Tavares, Bioinformatics Training Facility, University of Cambridge.
- Silvia Raineri, Center for Stem Cell Medicine (reNew), University of Copenhagen.
- Harvard Chan Bioinformatics Core (HBC), check out their github repo
- nf-core community