asm4pg
An automatic and reproducible genome assembly workflow for pangenomic applications using PacBio HiFi data.
This workflow uses Snakemake to quickly assemble genomes with a HTML report summarizing obtained assembly stats.
A first script (prejob.sh
) prepares the data until fasta.gz files are obtained. A second script (job.sh
) runs the genome assembly and stats.
doc: Gitlab pages
Table of contents
[TOC]
Repo directory structure
├── README.md
├── job.sh
├── prejob.sh
├── workflow
│ ├── rules
│ ├── scripts
│ ├── pre-job_snakefiles
| └── Snakefile
└── .config
├── snakemake_profile
| └── slurm
| ├── cluster_config.yml
| ├── config.yaml
| ├── CookieCutter.py
| ├── settings.json
| ├── slurm_utils.py
| ├── slurm-jobscript.sh
| ├── slurm-status.py
| └── slurm-submit.py
└── masterconfig.yaml
Requirements
- snakemake >= 6.5.1
- singularity
How to run the workflow
How to cite asm4pg?
We are currently writing a publication about asm4pg. Meanwhile, if you use the pipeline, please cite it using the address of this repository.
License
The content of this repository is licensed under (GNU GPLv3)
Contacts
For any troubleshouting, issue or feature suggestion, please use the issue tab of this repository. For any other question or if you want to help in developing asm4pg, please contact Ludovic Duvaux at ludovic.duvaux@inrae.fr