Home

Welcome¶

These pages are a tutorial workshop for the Nextflow pipeline nf-core/sarek.

In this workshop, we will recap the application of next generation sequencing to identify genetic variations in a genome. You will learn how to use the pipeline sarek to carry out this data-intensive workflow efficiently. We will cover topics such as experimental design, configuration of the pipeline and code execution.

Please note that this is not an introductory workshop, and we will assume some basic familiarity with Nextflow.

By the end of this workshop, you will be able to:

understand the key concepts behind variant calling, as adopted in this pipeline
analyse simple NGS datasets with the sarek workflow
customise some of its features for your own variant calling analyses
integrate different sources of information to identify candidate variants
make a hypothesis about variant interpretation using the output of sarek

Let's get started!

Running with Gitpod¶

In order to run this using GitPod, please make sure:

You have a GitHub account: if not, create one here
Once you have a GitHub account, sign up for GitPod using your GitHub user here choosing "continue with GitHub".

Now you're all set and can use the following button to launch the service:

Additional documentation¶

You can find detailed documentation on Nextflow here
You can find additional training on these pages

Credits & Copyright¶

This training material has been written by Francesco Lescai during the nf-core Hackathon in Barcelona, 2023. It was originally meant as a contribution for the nf-core community, and aimed at anyone who is interested in using nf-core pipelines for their studies or research activities.

The Docker image and Gitpod environment used in this repository have been created by Seqera but have been made open-source (CC BY-NC-ND) for the community.

All examples and descriptions are licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.