Explore Workflows

View already parsed workflows here or click here to add your own

Graph Name Retrieved From View
workflow graph workflow.cwl

https://github.com/NAL-i5K/Organism_Onboarding.git

Path: flow_create_genomics-workspace_yml/flow_create_yml/workflow.cwl

Branch/Commit ID: 0ecf492419ddaa015f14a163381948c53b3deea6

workflow graph count-lines10-wf.cwl

https://github.com/common-workflow-language/cwltool.git

Path: cwltool/schemas/v1.0/v1.0/count-lines10-wf.cwl

Branch/Commit ID: e59538cd9899a88d7e31e0f259bc56734f604383

workflow graph Bismark Methylation - pipeline for BS-Seq data analysis

Sequence reads are first cleaned from adapters and transformed into fully bisulfite-converted forward (C->T) and reverse read (G->A conversion of the forward strand) versions, before they are aligned to similarly converted versions of the genome (also C->T and G->A converted). Sequence reads that produce a unique best alignment from the four alignment processes against the bisulfite genomes (which are running in parallel) are then compared to the normal genomic sequence and the methylation state of all cytosine positions in the read is inferred. A read is considered to align uniquely if an alignment has a unique best alignment score (as reported by the AS:i field). If a read produces several alignments with the same number of mismatches or with the same alignment score (AS:i field), a read (or a read-pair) is discarded altogether. On the next step we extract the methylation call for every single C analysed. The position of every single C will be written out to a new output file, depending on its context (CpG, CHG or CHH), whereby methylated Cs will be labelled as forward reads (+), non-methylated Cs as reverse reads (-). The output of the methylation extractor is then transformed into a bedGraph and coverage file. The bedGraph counts output is then used to generate a genome-wide cytosine report which reports the number on every single CpG (optionally every single cytosine) in the genome, irrespective of whether it was covered by any reads or not. As this type of report is informative for cytosines on both strands the output may be fairly large (~46mn CpG positions or >1.2bn total cytosine positions in the human genome).

https://github.com/datirium/workflows.git

Path: workflows/bismark-methylation-se.cwl

Branch/Commit ID: 9bf0aa495735f8081bb5870cb32fc898b9e6eb22

workflow graph RNA-seq (VCF) alelle specific pipeline for paired-end data

Allele specific RNA-Seq (using vcf) paired-end workflow

https://github.com/datirium/workflows.git

Path: workflows/allele-vcf-rnaseq-pe.cwl

Branch/Commit ID: 9bf0aa495735f8081bb5870cb32fc898b9e6eb22

workflow graph LSU-from-tablehits.cwl

https://github.com/ebi-metagenomics/ebi-metagenomics-cwl.git

Path: tools/LSU-from-tablehits.cwl

Branch/Commit ID: c34db66a79cec3b66a0f1be5e499eef88db5a9ed

workflow graph exomeseq-gatk4-02-variantdiscovery.cwl

https://github.com/bespin-workflows/exomeseq-gatk4.git

Path: subworkflows/exomeseq-gatk4-02-variantdiscovery.cwl

Branch/Commit ID: a243d20e040b0b4b6ed875e68c39fcaee2cd9620

workflow graph revsort.cwl

Reverse the lines in a document, then sort those lines.

https://github.com/common-workflow-language/cwltool.git

Path: tests/wf/revsort.cwl

Branch/Commit ID: 639229b1159cf484e70e52da10194561b3fad719

workflow graph indices-header.cwl

https://github.com/datirium/workflows.git

Path: metadata/indices-header.cwl

Branch/Commit ID: 2f0db4b3c515f91c5cfda19c78cf90d339390986

workflow graph CLIP-Seq pipeline for single-read experiment NNNNG

Cross-Linking ImmunoPrecipitation ================================= `CLIP` (`cross-linking immunoprecipitation`) is a method used in molecular biology that combines UV cross-linking with immunoprecipitation in order to analyse protein interactions with RNA or to precisely locate RNA modifications (e.g. m6A). (Uhl|Houwaart|Corrado|Wright|Backofen|2017)(Ule|Jensen|Ruggiu|Mele|2003)(Sugimoto|König|Hussain|Zupan|2012)(Zhang|Darnell|2011) (Ke| Alemu| Mertens| Gantman|2015) CLIP-based techniques can be used to map RNA binding protein binding sites or RNA modification sites (Ke| Alemu| Mertens| Gantman|2015)(Ke| Pandya-Jones| Saito| Fak|2017) of interest on a genome-wide scale, thereby increasing the understanding of post-transcriptional regulatory networks. The identification of sites where RNA-binding proteins (RNABPs) interact with target RNAs opens the door to understanding the vast complexity of RNA regulation. UV cross-linking and immunoprecipitation (CLIP) is a transformative technology in which RNAs purified from _in vivo_ cross-linked RNA-protein complexes are sequenced to reveal footprints of RNABP:RNA contacts. CLIP combined with high-throughput sequencing (HITS-CLIP) is a generalizable strategy to produce transcriptome-wide maps of RNA binding with higher accuracy and resolution than standard RNA immunoprecipitation (RIP) profiling or purely computational approaches. The application of CLIP to Argonaute proteins has expanded the utility of this approach to mapping binding sites for microRNAs and other small regulatory RNAs. Finally, recent advances in data analysis take advantage of cross-link–induced mutation sites (CIMS) to refine RNA-binding maps to single-nucleotide resolution. Once IP conditions are established, HITS-CLIP takes ~8 d to prepare RNA for sequencing. Established pipelines for data analysis, including those for CIMS, take 3–4 d. Workflow -------- CLIP begins with the in-vivo cross-linking of RNA-protein complexes using ultraviolet light (UV). Upon UV exposure, covalent bonds are formed between proteins and nucleic acids that are in close proximity. (Darnell|2012) The cross-linked cells are then lysed, and the protein of interest is isolated via immunoprecipitation. In order to allow for sequence specific priming of reverse transcription, RNA adapters are ligated to the 3' ends, while radiolabeled phosphates are transferred to the 5' ends of the RNA fragments. The RNA-protein complexes are then separated from free RNA using gel electrophoresis and membrane transfer. Proteinase K digestion is then performed in order to remove protein from the RNA-protein complexes. This step leaves a peptide at the cross-link site, allowing for the identification of the cross-linked nucleotide. (König| McGlincy| Ule|2012) After ligating RNA linkers to the RNA 5' ends, cDNA is synthesized via RT-PCR. High-throughput sequencing is then used to generate reads containing distinct barcodes that identify the last cDNA nucleotide. Interaction sites can be identified by mapping the reads back to the transcriptome.

https://github.com/datirium/workflows.git

Path: workflows/clipseq-se.cwl

Branch/Commit ID: 9bf0aa495735f8081bb5870cb32fc898b9e6eb22

workflow graph samples_fillout_workflow.cwl

Workflow to run GetBaseCountsMultiSample fillout on a number of samples, each with their own bam and maf files

https://github.com/mskcc/pluto-cwl.git

Path: cwl/samples_fillout_workflow.cwl

Branch/Commit ID: ba3ff09328cc646d7254b2d2ee0fbe1abca3d4ad