Skip to content
Snippets Groups Projects
VIENNE MAINA's avatar
VIENNE MAINA authored
Add coassembly

See merge request !24
2e61a9ee
History

metagWGS: Documentation

Introduction

metagWGS is a Nextflow bioinformatics analysis pipeline used for metagenomic Whole Genome Shotgun sequencing data (Illumina HiSeq3000 or NovaSeq, paired, 2*150bp ; PacBio HiFi reads, single-end).

Pipeline graphical representation

The workflow processes raw data from .fastq/.fastq.gz input and/or assemblies (contigs) .fa/.fasta and uses the modules represented in this figure:

metagWGS steps

metagWGS is split into different steps that correspond to different parts of the bioinformatics analysis. Many of these steps are optional and their necessity depends on the desired analysis.

All steps are launched one after another by default. Use --stop_at_[STEP] and --skip_[STEP] parameters to tweak execution to your will.

A report html file is generated at the end of the workflow with MultiQC.

The pipeline is built using Nextflow, a bioinformatics workflow tool to run tasks across multiple compute infrastructures in a very portable manner.

Three Singularity containers are available making installation trivial and results highly reproducible.

Documentation

The metagWGS documentation can be found in the following pages:

  • Installation
    • The pipeline installation procedure.
  • Usage
    • An overview of how the pipeline works, how to run it and a description of all of the different command-line flags.
  • Output
    • An overview of the different output files and directories produced by the pipeline.
  • Use case (WARNING: not up-to-date, needs to be updated)
    • A tutorial to learn how to launch the pipeline on a test dataset on genologin cluster.
  • Functional tests
    • (for developers) A tool to launch a new version of the pipeline on curated input data and compare its results with known output.

Contact us

If you have any questions or suggestions for improvement, please contact us to claire.hoede[@]inrae.fr.

Cite us

For the moment if you use metagWGS for your research, please cite : Joanna Fourquet, Jean Mainguy, Maïna Vienne, Céline Noirot, Pierre Martin, et al.. metagWGS: a workflow to analyse short and long HiFi metagenomic reads Taxonomic profile HiFi vs Short reads assembly. JOBIM 2022, Jul 2022, Rennes, France. ⟨10.15454/1.5572369328961167E12⟩. ⟨hal-03771202⟩