Explore Workflows

View already parsed workflows here or click here to add your own

Graph Name Retrieved From View
workflow graph tt_univec_wnode.cwl

https://github.com/ncbi/pgap.git

Path: task_types/tt_univec_wnode.cwl

Branch/Commit ID: cd97086739ae5988bab09b05e9259675c4b6bce6

workflow graph DiffBind - Differential Binding Analysis of ChIP-Seq Peak Data

Differential Binding Analysis of ChIP-Seq Peak Data --------------------------------------------------- DiffBind processes ChIP-Seq data enriched for genomic loci where specific protein/DNA binding occurs, including peak sets identified by ChIP-Seq peak callers and aligned sequence read datasets. It is designed to work with multiple peak sets simultaneously, representing different ChIP experiments (antibodies, transcription factor and/or histone marks, experimental conditions, replicates) as well as managing the results of multiple peak callers. For more information please refer to: ------------------------------------- Ross-Innes CS, Stark R, Teschendorff AE, Holmes KA, Ali HR, Dunning MJ, Brown GD, Gojis O, Ellis IO, Green AR, Ali S, Chin S, Palmieri C, Caldas C, Carroll JS (2012). “Differential oestrogen receptor binding is associated with clinical outcome in breast cancer.” Nature, 481, -4.

https://github.com/datirium/workflows.git

Path: workflows/diffbind.cwl

Branch/Commit ID: 9850a859de1f42d3d252c50e15701928856fe774

workflow graph LSU-from-tablehits.cwl

https://github.com/proteinswebteam/ebi-metagenomics-cwl.git

Path: tools/LSU-from-tablehits.cwl

Branch/Commit ID: b6d3aaf3fa6695061208c6cdca3d7881cc45400d

workflow graph strelka workflow

https://github.com/genome/analysis-workflows.git

Path: definitions/subworkflows/strelka_and_post_processing.cwl

Branch/Commit ID: 2e0562a5c3cd7aac24af4c622a5ae68a7fb23a71

workflow graph PCA - Principal Component Analysis

Principal Component Analysis --------------- Principal component analysis (PCA) is a statistical procedure that uses an orthogonal transformation to convert a set of observations of possibly correlated variables (entities each of which takes on various numerical values) into a set of values of linearly uncorrelated variables called principal components. The calculation is done by a singular value decomposition of the (centered and possibly scaled) data matrix, not by using eigen on the covariance matrix. This is generally the preferred method for numerical accuracy.

https://github.com/datirium/workflows.git

Path: workflows/pca.cwl

Branch/Commit ID: e0a30aa1ad516dd2ec0e9ce006428964b840daf4

workflow graph scatter GATK HaplotypeCaller over intervals

https://github.com/genome/analysis-workflows.git

Path: definitions/subworkflows/gatk_haplotypecaller_iterator.cwl

Branch/Commit ID: 2e0562a5c3cd7aac24af4c622a5ae68a7fb23a71

workflow graph Bismark Methylation - pipeline for BS-Seq data analysis

Sequence reads are first cleaned from adapters and transformed into fully bisulfite-converted forward (C->T) and reverse read (G->A conversion of the forward strand) versions, before they are aligned to similarly converted versions of the genome (also C->T and G->A converted). Sequence reads that produce a unique best alignment from the four alignment processes against the bisulfite genomes (which are running in parallel) are then compared to the normal genomic sequence and the methylation state of all cytosine positions in the read is inferred. A read is considered to align uniquely if an alignment has a unique best alignment score (as reported by the AS:i field). If a read produces several alignments with the same number of mismatches or with the same alignment score (AS:i field), a read (or a read-pair) is discarded altogether. On the next step we extract the methylation call for every single C analysed. The position of every single C will be written out to a new output file, depending on its context (CpG, CHG or CHH), whereby methylated Cs will be labelled as forward reads (+), non-methylated Cs as reverse reads (-). The output of the methylation extractor is then transformed into a bedGraph and coverage file. The bedGraph counts output is then used to generate a genome-wide cytosine report which reports the number on every single CpG (optionally every single cytosine) in the genome, irrespective of whether it was covered by any reads or not. As this type of report is informative for cytosines on both strands the output may be fairly large (~46mn CpG positions or >1.2bn total cytosine positions in the human genome).

https://github.com/datirium/workflows.git

Path: workflows/bismark-methylation-se.cwl

Branch/Commit ID: 9850a859de1f42d3d252c50e15701928856fe774

workflow graph Add snv and indel bam-readcount files to a vcf

https://github.com/genome/analysis-workflows.git

Path: definitions/subworkflows/vcf_readcount_annotator.cwl

Branch/Commit ID: 2e0562a5c3cd7aac24af4c622a5ae68a7fb23a71

workflow graph genomics-workspace.cwl

https://github.com/NAL-i5K/Organism_Onboarding.git

Path: flow_genomicsWorkspace/genomics-workspace.cwl

Branch/Commit ID: 7198756b4b1519d102178042924671bd677e9b17

workflow graph EMG QC workflow, (paired end version). Benchmarking with MG-RAST expt.

https://github.com/proteinswebteam/ebi-metagenomics-cwl.git

Path: workflows/emg-qc-paired.cwl

Branch/Commit ID: 25129f55226dee595ef941edc24d3c44414e0523