Releases: maxsonBraunLab/atac_seq
Releases · maxsonBraunLab/atac_seq
Version 3.2
Added features
GitHub Actions testing scheme
- Download and subset test datasets from ENCODE. Test data is ATAC-Seq of adrenal glands of people aged 59 and 16, 2 replicates each condition. Subset the FASTQs to 4 M reads per read per biological replicate.
- Git clone the repo
- Install miniconda and custom snakemake env (5.32) with plotly
- Activate conda environment and gather basic intel
- Snakemake dry run + create pipeline environments
- Activate bwa env and make index of hg38 chr 1.
- Run SnakeMake.
Version 3.1
Version 3.1
Added features
- new consensus peak script. Take peaks that appear in n replicates per condition, then merge these peaks together.
scripts/make_differential_config.py
helps to generate DESeq2 and DiffBind config files.- homer submits commands to SLURM instead of a local run.
preseq
rules will now assess library complexity after all preprocessing steps. if lcextrap fails, the following restart attempts will be run in defect mode with a '-D' flag.- add functioning log files to deseq2 and diffbind.
- new scheme to make bigwig files. Average CPM signal using a 10 bp window all with bedtools and chrom size file. DeepTools bamCoverage usually takes between 1 - 2 hrs or more, this method takes 20 - 30 mins.
Modifications
- re-route output of fastp as the input for fastqc
- fragment length files are tab-separated and the length is capped at 1 kbp.
- combined mito removal + quality mapped reads rule into the
filter
rule. - reformatted all rules into
Snakefile
Deprecated features
- removed read shifting because we don't need base-pair resolution analyses (e.g. footprinting).
- removed intervene rule.
- removed tracking BAM entries before and after preprocessing steps.
need to do:
- sensible github actions testing.
Version 3.0
Major code refactoring
Quality Control
QC results summarized into multiqc report.
QC include fastqc, fastq_screen, fragment length, FRiP.
No preseq. Needs extra rule after markdup to sort + index.
Processes
pipe bwa mem to samtools sort
Blacklist removal is abstracted to its own rule originally from consensus peak calling.
Rules are written in order from top to bottom.
New features
MultiQC report summarizes your results.
Fastp is preferred read trimmer.
HOMER integration! DESeq2 exports contrast combinations table, which is parsed and fed to HOMER I/O. Only runs if a contrast gets >= 10 DE up/down peaks.
Version 2.0
Version 2
- No more down-sampling of data.
- No more playing with regex to determine input files. Just symlink and go.
- Moderate code formatting
essential_report.html
aggregates important metrics in alignment, peak calling, and differential peak stages.
Version 1
- Choose to down-sample or not down-sample reads before analysis
- Made the format of the analysis: read trimming, alignment, remove mito + low-qual + dup reads, shift reads, take consensus peaks, and DESeq2.