Workflow: super-enhancer.cwl

Fetched 2023-01-12 18:10:12 GMT

Both `islands_file` and `islands_control_file` should be produced by the same cwl tool (iaintersect.cwl or macs2-callpeak-biowardrobe-only.cwl)

children parents
Workflow as SVG
  • Selected
  • Default Values
  • Nested Workflows
  • Tools
  • Inputs/Outputs

Inputs

ID Type Title Doc
bambai_pair File [BAM] Coordinate sorted BAM alignment file (+index BAI)

Coordinate sorted BAM file and BAI index file

promoter_bp Integer (Optional) Promoter distance

Promoter distance for gene names assignment

islands_file File [xls] XLS called peaks file

XLS file to include information about peaks returned by iaintersect.cwl or macs2-callpeak-biowardrobe-only.cwl

tss_distance Integer (Optional) TSS distance

Distance from TSS to exclude, 0 = no TSS exclusion

annotation_file File [TSV] Annotation file

Tab-separated input annotation file

stitch_distance Integer (Optional) Stitching distance

Linking distance for stitching

chrom_length_file File [Textual format] Chromosome length file

Chromosome length file

islands_control_file File (Optional) [xls] XLS called peaks file (control)

XLS file to include information about peaks returned by iaintersect.cwl or macs2-callpeak-biowardrobe-only.cwl for control

Steps

ID Runs Label Doc
make_gff
../tools/makegff.cwl (CommandLineTool)

Tool produces GFF output from the file generated by iaintersect.cwl or macs2-callpeak-biowardrobe-only.cwl tool Both islands_file and islands_control_file should be produced by the same tool

run_rose
../tools/rose.cwl (CommandLineTool)

-b and -c arguments are not supported

sort_bed
../tools/linux-sort.cwl (CommandLineTool)

Tool sorts data from `unsorted_file` by key

`default_output_filename` function returns file name identical to `unsorted_file`, if `output_filename` is not provided.

rename_png
../tools/rename.cwl (CommandLineTool)

Tool renames `source_file` to `target_filename`. Input `target_filename` should be set as string. If it's a full path, only basename will be used. If BAI file is present, it will be renamed too

bed_to_macs
../tools/custom-bash.cwl (CommandLineTool)

Tool to run custom script set as `script` input with arguments from `param`. Default script runs sed command over the input file and exports results to the file with the same name as input's basename

assign_genes
../tools/iaintersect.cwl (CommandLineTool)

Tool assigns each peak obtained from MACS2 to a gene and region (upstream, promoter, exon, intron, intergenic)

`default_output_filename` function returns output filename with sufix set as `ext` argument. Function is called when either `output_filename` or `log_filename` inputs are not provided.

bed_to_bigbed
../tools/ucsc-bedtobigbed.cwl (CommandLineTool)

Tool converts bed file to bigBed

Before running `baseCommand` the following files are created in Docker working directory (using `InitialWorkDirRequirement`): `narrowpeak.as` - default BED file structure template for ENCODE narrowPeak format `broadpeak.as` - default BED file structure template for ENCODE broadPeak format

`default_output_filename` function returns default output file name based on `input_bed` basename with `*.bb` extension if `output_filename` is not provided.

`get_bed_type` function returns default BED file type if `bed_type` is not provided. Depending on `input_bed` file extension the following values are returned: `*.narrowpeak` --> bed6+4 `*.broadpeak` --> bed6+3 else --> null (`bedToBigBed` will use its own default value)

`get_bed_template` function returns default BED file template if `bed_template` is not provided. Depending on `input_bed` file extension the following values are returned: `*.narrowpeak` --> narrowpeak.as (previously staged into Docker working directory) `*.broadpeak` --> broadpeak.as (previously staged into Docker working directory) else --> null (`bedToBigBed` will use its own default value)

add_island_names
../tools/custom-bash.cwl (CommandLineTool)

Tool to run custom script set as `script` input with arguments from `param`. Default script runs sed command over the input file and exports results to the file with the same name as input's basename

Outputs

ID Type Label Doc
png_file File [PNG] ROSE visualization plot

Generated by ROSE visualization plot

bigbed_file File [TSV] Gateway Super Enhancer bigBed file

Gateway Super Enhancer bigBed file

gene_names_file File [TSV] Gateway Super Enhancer results from ROSE with gene names

Gateway Super Enhancer results from ROSE with assigned gene names

Permalink: https://w3id.org/cwl/view/git/a7b031090f49ab52195a561c162b326998028a35/workflows/super-enhancer.cwl