Introduction

Modified

September 17, 2024

Course Overview

The course β€œHPC pipes” is designed to provide participants with foundational knowledge and practical skills in writing reproducible pipelines. As part of effective data management, it is crucial that researchers create reproducible analyses that enable others to validate and build upon their work. We will explore essential elements of reproducibility and efficiency in computational research, highlighting techniques and tools for creating robust and transparent coding and workflows. By prioritizing reproducibility and replicability, researchers can enhance the credibility and impact of their findings while fostering collaboration and knowledge dissemination within the scientific community. T, ensuring that you can handle your projects with ease. For more details on practical RDM strategies, explore our on practical RDM for biodata for more details.

HPC-pipes is divided into two main sections:

Course Requirements

It is important to be familiar with unix and python. Follow the tutorials in the links below if you need a refresher.

Module Goals
  • Understand the rol of scientific pipelines
  • Run existing pipelines
  • Implement and modify pipelines
  • Specify software and computational resource needs
  • Customise your pipeline to accept user-defined configurations (params)
  • Create reproducible analyses that can be adapted to new data with little effort
  • Integrate workflows with software environments

Acknowledgements