Skip to content
Snippets Groups Projects
Select Git revision
  • 43daea49b4d06200094ecd8441acf25e961841f9
  • master default protected
  • TEcount
  • v1.0
4 results

rnaflow

  • Clone with SSH
  • Clone with HTTPS
  • RNAflow

    Snakemake Python 3.6

    Authors

    What is RNAflow ?

    RNAflow is a snakemake pipeline dedicated to standard transcriptomic analysis @Hub of Bioinformatics & Biostatistics.

    Be careful, RNAflow was principally conceived to run on the HPC of Institut Pasteur, and not on other HPC.

    How to install RNAflow ?

    You need to install:

    • snakemake >= 5.10
    • python >= 3.6

    In addition to above tools, you need to install pipeline-related tools:

    • cutadapt
    • alienTrimmer
    • fastqc
    • sortmeRNA
    • samtools
    • bowtie2
    • bedtools
    • star
    • minimap2
    • kallisto
    • subread
    • deeptools
    • multiqc

    For Institut Pasteur HPC users, these tools are not required, they will be automatically loaded via module during pipeline running via the snakemake --use-envmodules instruction.

    How to run RNAflow ?

    Usage

    • Step 1: Install workflow

    If you simply want to use this workflow, download and extract the latest release.

    git clone https://gitlab.pasteur.fr/hub/rnaflow.git

    • Step 2: Configure workflow

    Configure the workflow according to your needs via editing the config.yaml and multiqc_config.yaml files in the config/ directory.

    • Step 3: Execute workflow on Maestro (Institut Pasteur HPC)

    Load your conda environment. See runme.sh to create the complete conda env.

    conda activate snakemake

    Test your configuration by performing a dry-run via

    snakemake -n

    run it in a Maestro using environment modules via

    sbatch -q hubbioit -p hubbioit snakemake --cluster-config config/cluster_config.json --use-envmodules --cluster "sbatch --mem={cluster.ram} --cpus-per-task={threads} -q hubbioit -p hubbioit" -j 300 --nolock

    Visualize how the rules are connected via

    snakemake -s Snakefile --rulegraph --nolock | dot -Tsvg > rulegraph.svg

    or how the files are processed via

    snakemake -s Snakefile -j 10 --dag --nolock | dot -Tsvg > dag.svg

    • Step 3bis: Execute workflow on NNCR (IFB HPC)

    Load all needed tools via module

    module load snakemake/6.5.0
    module load cutadapt/3.1
    module load fastqc/0.11.9
    module load singularity sortmerna/4.2.0
    module load samtools/1.9
    module load bowtie2/2.4.1
    module load bedtools/2.29.2
    module load star/2.6 
    module load minimap2/2.18
    module load kallisto/0.46.2
    module load deeptools/3.5.0
    module load subread/2.0.1
    module load multiqc/1.11

    Note that alienTrimmer is not installed in NNCR HPC yet.

    Test your configuration by performing a dry-run via

    snakemake -n -c 1

    run it in NNCR using environment modules via

    sbatch -c 8 snakemake --cluster-config config/cluster_config.json --cluster "sbatch --mem={cluster.ram} --cpus-per-task={threads} " -j 100 --nolock --cores 8