Skip to content

Introduction to bulk RNAseq analysisΒΆ

This workshop material includes a tutorial on how to approach RNAseq data, starting from your sequencing reads (fastq files). Thus, the workshop only briefly touches upon laboratory protocols, library preparation, and experimental design of RNA sequencing experiments, mainly for the purpose of outlining considerations in the downstream bioinformatic analysis. This workshop is based on the materials developed by members of the teaching team at the Harvard Chan Bioinformatics Core (HBC), a collection of modified tutorials from the DESeq2, R language vignettes and the nf-core rnaseq pipeline.


Authors

Jose Alejandro
Romero Herrera


Data Scientist

Diana
Andrejeva


Data Scientist

Henrike
Zschach


Data Scientist

Adrija
Kalvisa


Data Scientist

Samuele
Soraggi


Data Scientist

Jennifer
Bartell


Project Coordinator and Data Scientist


Overview

πŸ“– Syllabus:

  1. Course introduction
  2. Experimental planning
  3. Data explanation
  4. Read reprocessing and preprocessing pipelines
  5. Analysing RNAseq data
    1. RNAseq counts
    2. Exploratory analysis
    3. Differential Expression Analysis
    4. Functional analysis
  6. Summarized workflow

πŸ•° Total Time Estimation: 8 hours

πŸ“ Supporting Materials: Workshop slides with theory on bulk RNAseq can be found in this zenodo repository.

πŸ‘¨β€πŸ’» Target Audience: PhD, MsC, etc.

πŸ‘©β€πŸŽ“ Level: Beginner

πŸ”’ License: Creative Commons Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)

πŸͺ™ Funding: This project was funded by the Novo Nordisk Fonden (NNF20OC0063268).

Course Requirements

  • Knowledge of R, Rstudio and Rmarkdown. It is recommended that you have at least followed our workshop R basics
  • Basic knowledge of RNAseq technology
  • Basic knowledge of data science and statistics such as PCA, clustering and statistical testing

This workshop material includes a tutorial on how to approach RNAseq data, starting from your sequencing reads (fastq files). Thus, the workshop only briefly touches upon laboratory protocols, library preparation, and experimental design of RNA sequencing experiments, mainly for the purpose of outlining considerations in the downstream bioinformatic analysis. This workshop is based on the materials developed by members of the teaching team at the Harvard Chan Bioinformatics Core (HBC), a collection of modified tutorials from the DESeq2, R language vignettes and the nf-core rnaseq pipeline.

The aim of this repository is to run a comprehensive but introductory workshop on bulk-RNAseq bioinformatic analyses. Each of the modules of this workshop is accompanied by a powerpoint slideshow explaining the steps and the theory behind a typical bioinformatics analysis (ideally with a teacher). Many of the slides are annotated with extra information and/or point to original sources for extra reading material.

Goals

By the end of this workshop, you should be able to analyse your own bulk RNAseq data:

  • Preprocess your reads into a count matrix.
  • Normalize your data.
  • Explore your samples with PCAs and heatmaps.
  • Perform Differential Expression Analysis.
  • Annotate your results.

AcknowledgementsΒΆ


Last update: November 28, 2023
Created: November 28, 2023