GitHub - Dingersrun/Copy-number-normalization

Copy-number-aware differential analysis pipeline featuring copy number normalization

This concept can be applied to but not limited to ATAC-seq, ChIP-seq, Cut&Tag

Here are the major steps and the example corresponding bioinformatic tools for differential analysis incorporating the step of copy number normalization for ATACseq data and ChIPseq data in samples with different karyotype/copy numbers. Users are free to choose alternative tools.

Tools required for the workflow of differential analysis:

Raw reads to fastq: bcl2fastq

Read alignment: bwa

Read filtering: picard, samtools, bedtools

Peak calling: macs2

Signal quantification: htseq

Data normalization and differential analysis: DESeq2, DiffBind

Tools for the step of copy number normalization:

These steps can be run separately and integrated into differential analysis pipelines for other count-based functional genomic assays

Calling local relative copy number ratio: CNVkit Input data: genomic sequencing data or ChIP-seq input data copy number ratio (CNR) = Copy_number_perturbed_sample/Copy_number_control_sample

e.g. in Down Syndrome (trisomy 21), compared to a euploid sample, the CNR for chr21 is 3/2=1.5; the CNR for other regions is 2/2=1; if there is a relative copy number loss, the CNR will be <1.

Assigning peaks to DNA segments: bedtools closest and modify the read/fragment count in peaks matrix using the CNR as a scaling factor For peaks with CNR>1: divide the read/fragment count in perturbed_sample by CNR; otherwise multiply the read/fragment count in perturbed_sample by CNR. This is to avoid inflating the statistical power of detecting differential signals

Cite our bioRxiv preprint

https://doi.org/10.1101/2024.04.11.588815

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
ATACseq_pipeline.md		ATACseq_pipeline.md
Assigning_scaling_factor.sh		Assigning_scaling_factor.sh
CNnormalization_DESeq2_DA_ATACseq.r		CNnormalization_DESeq2_DA_ATACseq.r
ChIPseq_CNV_DiffBind.md		ChIPseq_CNV_DiffBind.md
Codex2.md		Codex2.md
Copy number normalization.md		Copy number normalization.md
CopyWriterR.md		CopyWriterR.md
HtseqCount_union.cluster.sh		HtseqCount_union.cluster.sh
Plot_Copy_number_ratio.r		Plot_Copy_number_ratio.r
QDNAseq.md		QDNAseq.md
README.md		README.md
hg38.centromers.UCSC.bed		hg38.centromers.UCSC.bed
hg38.chrM_end.accum		hg38.chrM_end.accum
narrowPeak2gtf.sh		narrowPeak2gtf.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Copy-number-aware differential analysis pipeline featuring copy number normalization

Tools required for the workflow of differential analysis:

Tools for the step of copy number normalization:

Cite our bioRxiv preprint

About

Releases

Packages

Languages

Dingersrun/Copy-number-normalization

Folders and files

Latest commit

History

Repository files navigation

Copy-number-aware differential analysis pipeline featuring copy number normalization

Tools required for the workflow of differential analysis:

Tools for the step of copy number normalization:

Cite our bioRxiv preprint

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages