Pipeline for resistome analysis and AMR prediction from metagenomic sequencing data.
Snakemake is best installed via the Mamba package manager (a drop-in replacement for conda). If you have neither Conda nor Mamba, it can be installed via Mambaforge. For other options see here.
Given that Mamba is installed, run
mamba create -c conda-forge -c bioconda --name snakemake snakemaketo install Snakemake in an isolated environment. For all following commands ensure that this environment is activated via
conda activate snakemakeFirst, create an appropriate project working directory on your system and enter it:
WORKDIR=path/to/project_workdir
mkdir -p ${WORKDIR}
cd ${WORKDIR}In all following steps, we will assume that you are inside of that directory.
Second, to clone the full workflow run:
git clone https://github.com/IKIM-Essen/MetaResistInstall sra-tools for downloading SRA samples:
mamba install -c bioconda -c conda-forge sra-toolsInstall peppy for handling PEP-based sample metadata:
mamba install -c conda-forge -c bioconda peppyExample SRA runs used for initial testing:
- SRR26321896 (Klebsiella pneumoniae)
Download the sample using prefetch and convert it to FASTQ format with fasterq-dump :
prefetch SRR26321896
fasterq-dump SRR26321896 --split-files --threads 8For single-end data:
gzip SRR26321896.fastqEdit the config/pep/samples.csv file to add your sample:
sample_name,fastq
SRR26321896,data/raw/SRR26321896.fastq.gzPerform a dry-run first:
snakemake -nRun the workflow:
snakemake --cores all --use-conda