Workflow: chipseq-se.cwl
ChIP-Seq basic analysis workflow for a single-read experiment.
- Selected
- |
- Default Values
- Nested Workflows
- Tools
- Inputs/Outputs
Inputs
ID | Type | Title | Doc |
---|---|---|---|
threads | Integer (Optional) | Number of threads |
Number of threads for those steps that support multithreading |
broad_peak | Boolean | Callpeak broad |
Set to call broad peak for MACS2 |
fastq_file | File [FASTQ] | FASTQ input file |
Reads data in a FASTQ format, received after single end sequencing |
clip_3p_end | Integer (Optional) | Clip from 3p end |
Number of bases to clip from the 3p end |
clip_5p_end | Integer (Optional) | Clip from 5p end |
Number of bases to clip from the 5p end |
genome_size | String | Effective genome size |
MACS2 effective genome size: hs, mm, ce, dm or number, for example 2.7e9 |
chrom_length | File [Textual format] | Chromosomes length file |
Chromosomes length file |
control_file | File (Optional) [BAM] | Use experiment as a control |
Use experiment as a control for MACS2 peak calling |
indices_folder | Directory | Indexed genome folder (bowtie) |
Path to indexed genome folder by **bowtie** |
annotation_file | File [TSV] | Annotation file |
Tab-separated annotation file |
exp_fragment_size | Integer (Optional) | Expected fragment size |
Expected fragment size for MACS2 |
remove_duplicates | Boolean (Optional) | Remove duplicates |
Calls samtools rmdup to remove duplicates from sortesd BAM file |
force_fragment_size | Boolean (Optional) | Force fragment size |
Force MACS2 to use exp_fragment_size |
Steps
ID | Runs | Label | Doc |
---|---|---|---|
get_stat |
../tools/python-get-stat-chipseq.cwl
(CommandLineTool)
|
Tool processes and combines log files generated by Bowtie aligner and samtools rmdup. |
|
bam_to_bigwig |
../subworkflows/bam-bedgraph-bigwig.cwl
(Workflow)
|
Workflow converts input BAM file into bigWig and bedGraph files |
|
extract_fastq |
../tools/extract-fastq.cwl
(CommandLineTool)
|
Tool to decompress input FASTQ file Bash script's logic: - disable case sensitive glob check - check if root name of input file already include '.fastq' or '.fq' extension. If yes, set DEFAULT_EXT to \"\" - check file type, decompress if needed - return 1, if file type is not recognized This script also works of input file doesn't have any extension at all |
|
bowtie_aligner |
../tools/bowtie-alignreads.cwl
(CommandLineTool)
|
Tool maps input raw reads files to reference genome using Bowtie. |
|
macs2_callpeak |
../tools/macs2-callpeak-biowardrobe-only.cwl
(CommandLineTool)
|
Tool runs peak calling using MACS2 within the custom script `run.py` |
|
samtools_rmdup |
../tools/samtools-rmdup.cwl
(CommandLineTool)
|
Tool to remove duplicates from coordinate sorted BAM file set as input `bam_file`.
If input `trigger` is set to `true` or isn't set at all (`true` is used by default), run `samtools rmdup`, return
newly generated BAM file without duplicates and log as outputs `rmdup_output` and `rmdup_log`.
If input `trigger` is set to `false`, return unchanged BAM and index files (if provided in secondaryFiles),
previously staged into output directory, and log. |
|
island_intersect |
../tools/iaintersect.cwl
(CommandLineTool)
|
Tool assigns each peak obtained from MACS2 to a gene and region (upstream, promoter, exon, intron, intergenic) |
|
average_tag_density |
../tools/atdp.cwl
(CommandLineTool)
|
Tool calculates average tag density profile around all annotated TSS. |
|
fastx_quality_stats |
../tools/fastx-quality-stats.cwl
(CommandLineTool)
|
Tool calculates statistics on the base of FASTQ file quality scores. If `output_filename` is not provided call function `default_output_filename` to return default output file name generated as `input_file` basename + `.fastxstat` extension. |
|
samtools_sort_index |
../tools/samtools-sort-index.cwl
(CommandLineTool)
|
Tool to sort and index input BAM/SAM/CRAM.
If input `trigger` is set to `true` or isn't set at all (`true` is used by default), run `samtools sort` and
`samtools index`, return sorted BAM and BAI/CSI index file.
If input `trigger` is set to `false`, return unchanged `sort_input` (BAM/SAM/CRAM) and index (BAI/CSI, if provided in
`secondaryFiles`) files, previously staged into output directory. |
|
samtools_sort_index_after_rmdup |
../tools/samtools-sort-index.cwl
(CommandLineTool)
|
Tool to sort and index input BAM/SAM/CRAM.
If input `trigger` is set to `true` or isn't set at all (`true` is used by default), run `samtools sort` and
`samtools index`, return sorted BAM and BAI/CSI index file.
If input `trigger` is set to `false`, return unchanged `sort_input` (BAM/SAM/CRAM) and index (BAI/CSI, if provided in
`secondaryFiles`) files, previously staged into output directory. |
Outputs
ID | Type | Label | Doc |
---|---|---|---|
bigwig | File [bigWig] | BigWig file |
Generated BigWig file |
atdp_log | File [TSV] | ATDP log |
Average Tag Density generated log |
macs2_log | File (Optional) [Textual format] | MACS2 log |
MACS2 output log |
bowtie_log | File [Textual format] | BOWTIE alignment log |
BOWTIE generated alignment log |
atdp_result | File [TSV] | ATDP results |
Average Tag Density generated results |
bambai_pair | File [BAM] | Coordinate sorted BAM alignment file (+index BAI) |
Coordinate sorted BAM file and BAI index file |
get_stat_log | File (Optional) [Textual format] | Bowtie & Samtools Rmdup combined log |
Processed and combined Bowtie aligner and Samtools rmdup log |
macs2_moder_r | File (Optional) [Textual format] | MACS2 generated R script |
R script to produce a PDF image about the model based on your data |
iaintersect_log | File [TSV] | Island intersect log |
Iaintersect generated log |
fastx_statistics | File [Textual format] | FASTQ statistics |
fastx_quality_stats generated FASTQ file quality statistics file |
macs2_broad_peaks | File (Optional) [ENCODE broad peak format] | Broad peaks |
Contains the peak locations together with peak summit, pvalue and qvalue |
macs2_gapped_peak | File (Optional) [bed12] | Gapped peak |
Contains both the broad region and narrow peaks |
iaintersect_result | File [TSV] | Island intersect results |
Iaintersect generated results |
macs2_called_peaks | File (Optional) [xls] | Called peaks |
XLS file to include information about called peaks |
macs2_narrow_peaks | File (Optional) [ENCODE narrow peak format] | Narrow peaks |
Contains the peak locations together with peak summit, pvalue and qvalue |
macs2_peak_summits | File (Optional) [BED] | Peak summits |
Contains the peak summits locations for every peaks |
samtools_rmdup_log | File [Textual format] | Remove duplicates log |
Samtools rmdup generated log |
macs2_fragment_stat | File (Optional) [Textual format] | FRAGMENT, FRAGMENTE, ISLANDS |
fragment, calculated fragment, islands count from MACS2 results |
https://w3id.org/cwl/view/git/a9551ece898f619167db58e4b74a6cae2d7f7d13/workflows/chipseq-se.cwl