Workflow: Cut-n-Run pipeline paired-end
Experimental pipeline for Cut-n-Run analysis. Uses mapping results from the following experiment types: - `chipseq-pe.cwl` - `trim-chipseq-pe.cwl` - `trim-atacseq-pe.cwl` Note, the upstream analyses should not have duplicates removed
- Selected
- |
- Default Values
- Nested Workflows
- Tools
- Inputs/Outputs
Inputs
ID | Type | Title | Doc |
---|---|---|---|
threads | Integer (Optional) | Number of threads |
Number of threads for those steps that support multithreading |
rmdup_log | File [Textual format] | Remove duplicates log |
Remove duplicates log file from Samtools |
broad_peak | Boolean (Optional) | Call broad peaks |
Make MACS2 call broad peaks by linking nearby highly enriched regions |
bambai_pair | File [BAM] | ChIP-Seq paired-end experiment |
Coordinate sorted filtered BAM alignment and index BAI files |
genome_size | String | Effective genome size |
The length of the mappable genome (hs, mm, ce, dm or number, for example 2.7e9) |
chrom_length | File [Textual format] | Chromosome lengths file |
Chromosome lengths file in TSV format |
control_file | File (Optional) [BAM] | Control ChIP-Seq paired-end experiment |
Indexed BAM file from the ChIP-Seq paired-end experiment to be used as a control for MACS2 peak calling |
alignment_log | File [Textual format] | Read alignment log |
Read alignment log file from Bowtie |
promoter_dist | Integer (Optional) | Max distance from gene TSS (in both direction) overlapping which the peak will be assigned to the promoter region |
Max distance from gene TSS (in both direction) overlapping which the peak will be assigned to the promoter region |
upstream_dist | Integer (Optional) | Max distance from the promoter (only in upstream direction) overlapping which the peak will be assigned to the upstream region |
Max distance from the promoter (only in upstream direction) overlapping which the peak will be assigned to the upstream region |
annotation_file | File [TSV] | Genome annotation file |
Genome annotation file in TSV format |
max_fragment_size | Integer | Maximum fragment size |
The maximum fragment size needed for read/pair inclusion |
min_fragment_size | Integer | Minimum fragment size |
The minimum fragment size needed for read/pair inclusion |
Steps
ID | Runs | Label | Doc |
---|---|---|---|
filter_bam |
../tools/deeptools-alignmentsieve.cwl
(CommandLineTool)
|
AlignmentSieve - utility from deepTools for BAM/CRAM file filtering |
For BAM files only. Only selected parameters are implemented. |
bam_to_bigwig |
../tools/bam-bedgraph-bigwig.cwl
(Workflow)
|
Workflow converts input BAM file into bigWig and bedGraph files. |
|
get_statistics |
../tools/python-get-stat-chipseq.cwl
(CommandLineTool)
|
Tool processes and combines log files generated by Bowtie aligner and samtools rmdup. |
|
macs2_callpeak |
../tools/macs2-callpeak.cwl
(CommandLineTool)
|
Tool is used to perform peak calling using MACS2. Input Trigger (default: true) allows to skip all calculation and return all input files unchanged. To set files to be returned in case of Trigger == false, use the following inputs: peak_xls_file_staged: narrow_peak_file_staged: broad_peak_file_staged: gapped_peak_file_staged: peak_summits_file_staged: moder_r_file_staged: treat_pileup_bdg_file_staged: control_lambda_bdg_file_staged: |
|
island_intersect |
../tools/iaintersect.cwl
(CommandLineTool)
|
Tool assigns each peak obtained from MACS2 to a gene and region (upstream, promoter, exon, intron, intergenic) |
|
average_tag_density |
../tools/atdp.cwl
(CommandLineTool)
|
Tool calculates average tag density profile around all annotated TSS. |
|
samtools_sort_index |
../tools/samtools-sort-index.cwl
(CommandLineTool)
|
Tool to sort and index input BAM/SAM/CRAM.
If input `trigger` is set to `true` or isn't set at all (`true` is used by default), run `samtools sort` and
`samtools index`, return sorted BAM and BAI/CSI index file.
If input `trigger` is set to `false`, return unchanged `sort_input` (BAM/SAM/CRAM) and index (BAI/CSI, if provided in
`secondaryFiles`) files. |
Outputs
ID | Type | Label | Doc |
---|---|---|---|
bigwig | File [bigWig] | Genome coverage |
Genome coverage in bigWig format |
atdp_result | File [TSV] | Average Tag Density Plot |
Average Tag Density Plot file in TSV format |
bambai_pair | File [BAM] | ChIP-Seq paired-end experiment |
Coordinate sorted filtered BAM alignment and index BAI files |
macs2_broad_peaks | File (Optional) [ENCODE broad peak format] | Broad peaks |
Called peaks file in ENCODE broad peak format |
alignmentsieve_log | File [Textual format] | Alignment filtering log |
Alignment filtering log from deepTool's alignmentSieve |
iaintersect_result | File [TSV] | Gene annotated peaks |
MACS2 peak file annotated with nearby genes |
macs2_called_peaks | File [xls] | Called peaks |
Called peaks file with 1-based coordinates in XLS format |
macs2_narrow_peaks | File (Optional) [ENCODE narrow peak format] | Narrow peaks |
Called peaks file in ENCODE narrow peak format |
https://w3id.org/cwl/view/git/480e99a4bb3046e0565113d9dca294e0895d3b0c/workflows/trim-chipseq-pe-cut-n-run.cwl