Skip to content

Latest commit

 

History

History
50 lines (33 loc) · 1.64 KB

README.md

File metadata and controls

50 lines (33 loc) · 1.64 KB

ARuS (Automated RNAseq analysis using Snakemake)

This is a fully automated Snakemake pipeline for streamlining end-to-end RNAseq analysis, from fastq reads to DE analysis.

To use as a Docker container, run:

  1. git clone https://github.com/JasonCharamis/ARuS.git
  2. cd ARuS/workflow/ && sudo docker build -t automated_rnaseq_analysis:latest .
  3. sudo docker run -it -v $(pwd):/mnt/workdir -w /mnt/workdir automated_rnaseq_analysis:latest snakemake --snakefile ARuS/workflow/Snakefile --cores 1 --use-conda --conda-frontend mamba

Usage: Wildcard for sample identification is "{sample}_1.fastq.gz" and "{sample}_2.fastq.gz".

The pipeline is designed for 150-bp paired-end Illumina reads and it includes:

  1. Read quality control (QC) and adapter-trimming
  2. Mapping of reads against provided genome sequence
  3. Assign mapped reads to genes - this step also computes TPM values and uses them to produce a PCA plot
  4. Differential expression (DE) analysis using edgeR
  5. Post-DE annotation of DE genes and optionally combine with orthology results

Dependencies:

  1. FASTQC https://github.com/s-andrews/FastQC

  2. Trimommatic https://github.com/usadellab/Trimmomatic

  3. STAR https://github.com/alexdobin/STAR

  4. featureCounts https://github.com/torkian/subread-1.6.1

  5. edgeR https://bioconductor.org/packages/release/bioc/html/edgeR.html

  6. Trinity-bundled Perl scripts for DE analysis using edgeR https://github.com/trinityrnaseq/trinityrnaseq

If you use it on hisat2-mapping mode, you will also need:

  1. hisat2 https://github.com/DaehwanKimLab/hisat2

  2. samtools https://github.com/samtools/samtools

Every dependency is automatically installed through conda.