Explore Workflows

View already parsed workflows here or click here to add your own

Graph Name Retrieved From View
workflow graph bam-bedgraph-bigwig.cwl

Workflow converts input BAM file into bigWig and bedGraph files. Input BAM file should be sorted by coordinates (required by `bam_to_bedgraph` step). If `split` input is not provided use true by default. Default logic is implemented in `valueFrom` field of `split` input inside `bam_to_bedgraph` step to avoid possible bug in cwltool with setting default values for workflow inputs. `scale` has higher priority over the `mapped_reads_number`. The last one is used to calculate `-scale` parameter for `bedtools genomecov` (step `bam_to_bedgraph`) only in a case when input `scale` is not provided. All logic is implemented inside `bedtools-genomecov.cwl`. `bigwig_filename` defines the output name only for generated bigWig file. `bedgraph_filename` defines the output name for generated bedGraph file and can influence on generated bigWig filename in case when `bigwig_filename` is not provided. All workflow inputs and outputs don't have `format` field to avoid format incompatibility errors when workflow is used as subworkflow.

https://github.com/datirium/workflows.git

Path: tools/bam-bedgraph-bigwig.cwl

Branch/Commit ID: 9d5cbdd3ea0bb417518115d8092584254598a440

workflow graph allele-vcf-alignreads-se-pe.cwl

Workflow maps FASTQ files from `fastq_files` input into reference genome `reference_star_indices_folder` and insilico generated `insilico_star_indices_folder` genome (concatenated genome for both `strain1` and `strain2` strains). For both genomes STAR is run with `outFilterMultimapNmax` parameter set to 1 to discard all of the multimapped reads. For insilico genome SAM file is generated. Then it's splitted into two SAM files based on strain names and then sorted by coordinates into the BAM format. For reference genome output BAM file from STAR slignment is also coordinate sorted.

https://github.com/Barski-lab/workflows.git

Path: subworkflows/allele-vcf-alignreads-se-pe.cwl

Branch/Commit ID: 99695ff7739364d02a1cb33cd6407c8693a7aaab

workflow graph scatter-valuefrom-wf5.cwl

https://github.com/common-workflow-language/cwltool.git

Path: cwltool/schemas/v1.0/v1.0/scatter-valuefrom-wf5.cwl

Branch/Commit ID: 1e3f5404b7d5af02e3dec0faea31352111ad7cd8

workflow graph taxonomy_check_16S

https://github.com/ncbi/pgap.git

Path: task_types/tt_taxonomy_check_16S.cwl

Branch/Commit ID: 4def84df33963fc9ac9d5c5f804b911d01a0d9ad

workflow graph GSEApy - Gene Set Enrichment Analysis in Python

GSEAPY: Gene Set Enrichment Analysis in Python ============================================== Gene Set Enrichment Analysis is a computational method that determines whether an a priori defined set of genes shows statistically significant, concordant differences between two biological states (e.g. phenotypes). GSEA requires as input an expression dataset, which contains expression profiles for multiple samples. While the software supports multiple input file formats for these datasets, the tab-delimited GCT format is the most common. The first column of the GCT file contains feature identifiers (gene ids or symbols in the case of data derived from RNA-Seq experiments). The second column contains a description of the feature; this column is ignored by GSEA and may be filled with “NA”s. Subsequent columns contain the expression values for each feature, with one sample's expression value per column. It is important to note that there are no hard and fast rules regarding how a GCT file's expression values are derived. The important point is that they are comparable to one another across features within a sample and comparable to one another across samples. Tools such as DESeq2 can be made to produce properly normalized data (normalized counts) which are compatible with GSEA.

https://github.com/datirium/workflows.git

Path: workflows/gseapy.cwl

Branch/Commit ID: 1131f82a53315cca217a6c84b3bd272aa62e4bca

workflow graph ROSE: rank ordering of super-enhancers

Super-enhancers, consist of clusters of enhancers that are densely occupied by the master regulators and Mediator. Super-enhancers differ from typical enhancers in size, transcription factor density and content, ability to activate transcription, and sensitivity to perturbation. Use to create stitched enhancers, and to separate super-enhancers from typical enhancers using sequencing data (.bam) given a file of previously identified constituent enhancers (.gff)

https://github.com/datirium/workflows.git

Path: workflows/super-enhancer.cwl

Branch/Commit ID: a1f6ca50fcb0881781b3ba0306dd61ebf555eaba

workflow graph schemadef-wf.cwl

https://github.com/common-workflow-language/cwltool.git

Path: cwltool/schemas/v1.0/v1.0/schemadef-wf.cwl

Branch/Commit ID: 1e3f5404b7d5af02e3dec0faea31352111ad7cd8

workflow graph Single-cell Multiome ATAC and RNA-Seq Aggregate

Single-cell Multiome ATAC and RNA-Seq Aggregate Aggregates data from multiple Single-cell Multiome ATAC and RNA-Seq Alignment experiments

https://github.com/barski-lab/sc-seq-analysis.git

Path: workflows/sc-multiome-aggregate-wf.cwl

Branch/Commit ID: 8614e5d20f5e81dce537216bd340cdbc1067bbc7

workflow graph count-lines7-wf.cwl

https://github.com/common-workflow-language/cwl-v1.2.git

Path: tests/count-lines7-wf.cwl

Branch/Commit ID: a5073143db4155e05df8d2e7eb59d9e62acd65a5

workflow graph workflow.cwl

https://github.com/NAL-i5K/Organism_Onboarding.git

Path: flow_dispatch/workflow.cwl

Branch/Commit ID: add45db6f08de518e224bdc3c04094fd69cad2d2