nf-core/bacass
Simple bacterial assembly and annotation pipeline
1.1.1
). The latest
stable release is
2.4.0
.
Introduction
Short Read Assembly
This pipeline is primarily for bacterial assembly of next-generation sequencing reads. It can be used to quality trim your reads using Skewer and performs basic sequencing QC using FastQC. Afterwards, the pipeline performs read assembly using Unicycler. Contamination of the assembly is checked using Kraken2 to verify sample purity.
Long Read Assembly
For users that only have Nanopore data, the pipeline quality trims these using PoreChop and assesses basic sequencing QC utilizing NanoPlot and PycoQC. The pipeline can then perform long read assembly utilizing Unicycler, Miniasm in combination with Racon or Canu. Long reads can be polished using specified Fast5 files with NanoPolish.
Hybrid Assembly
For users specifying both short read and long read (NanoPore) data, the pipeline can perform a hybrid assembly approach utilizing Unicycler, taking the full set of information from short reads and long reads into account.
Shared QC across all forms of assembly
In all cases, the assembly is assessed using QUAST. The resulting bacterial assembly is furthermore annotated using Prokka.
In addition, the pipeline creates various reports in the results
directory specified, including a MultiQC report summarizing some of the findings and software versions.
The pipeline is built using Nextflow, a workflow tool to run tasks across multiple computing infrastructures in a portable manner. It comes with docker or singularity containers as well as conda environments, making installation trivial and results highly reproducible.
Documentation
The nf-core/bacass pipeline comes with documentation about the pipeline, found in the docs/
directory:
The nf-core/bacass pipeline comes with documentation about the pipeline which you can read at https://nf-core/bacass/docs or find in the docs/
directory.
Credits
nf-core/bacass was originally written by Andreas Wilm, Alexander Peltzer.
Contributions and Support
If you would like to contribute to this pipeline, please see the contributing guidelines.
For further information or help, don’t hesitate to get in touch on the Slack #bacass
channel (you can join with this invite).
Citation
If you use nf-core/bacass for your analysis, please cite it using the following doi: 10.5281/zenodo.3574476
You can cite the nf-core
publication as follows:
The nf-core framework for community-curated bioinformatics pipelines.
Philip Ewels, Alexander Peltzer, Sven Fillinger, Harshil Patel, Johannes Alneberg, Andreas Wilm, Maxime Ulysse Garcia, Paolo Di Tommaso & Sven Nahnsen.
Nat Biotechnol. 2020 Feb 13. doi: 10.1038/s41587-020-0439-x. ReadCube: Full Access Link