Explore Workflows

Browse previously parsed workflows below, or add your own.

Graph	Name	Retrieved From	View
	Bacterial Annotation, pass 1, genemark training, by HMMs (first pass)	https://github.com/ncbi/pgap.git Path: bacterial_annot/wf_bacterial_annot_pass1.cwl Branch/Commit ID: 4ffbad9ffeab15ec8af5f6f91bd352ef96d1ef77
	Set Operations for Called Peaks (ChIP/ATAC/C&R/diffbind) # Set Operations for Peaks This workflow takes as input multiple peak list TSV files (the `iaintersect_result.tsv` output under the "Files" output tab) from the ChIP, ATAC, C&R, or diffbind workflows and performs the user-selected set operation on the group. Set operations include intersection, union, difference, and complement. See the tooltip for the `set_operator` input for more details.	https://github.com/datirium/workflows.git Path: workflows/filter-peaks-by-overlap.cwl Branch/Commit ID: d76110e0bfc40c874f82e37cef6451d74df4f908
	Generate genome indices for STAR & bowtie Creates indices for: * [STAR](https://github.com/alexdobin/STAR) v2.5.3a (03/17/2017) PMID: [23104886](https://www.ncbi.nlm.nih.gov/pubmed/23104886) * [bowtie](http://bowtie-bio.sourceforge.net/tutorial.shtml) v1.2.0 (12/30/2016) It performs the following steps: 1. `STAR --runMode genomeGenerate` to generate indices, based on [FASTA](http://zhanglab.ccmb.med.umich.edu/FASTA/) and [GTF](http://mblab.wustl.edu/GTF2.html) input files; returns results as an array of files 2. Outputs indices as [Directory](http://www.commonwl.org/v1.0/CommandLineTool.html#Directory) data type 3. Separates chrNameLength.txt file from Directory output 4. `bowtie-build` to generate indices; requires a genome [FASTA](http://zhanglab.ccmb.med.umich.edu/FASTA/) file as input and returns results as a group of main and secondary files	https://github.com/datirium/workflows.git Path: workflows/genome-indices.cwl Branch/Commit ID: d76110e0bfc40c874f82e37cef6451d74df4f908
	count-lines11-null-step-wf-noET.cwl	https://github.com/common-workflow-language/cwl-v1.1.git Path: tests/count-lines11-null-step-wf-noET.cwl Branch/Commit ID: 3e90671b25f7840ef2926ad2bacbf447772dda94
	fail-unconnected.cwl	https://github.com/common-workflow-language/cwl-v1.1.git Path: tests/fail-unconnected.cwl Branch/Commit ID: 3e90671b25f7840ef2926ad2bacbf447772dda94
	hmmsearch_wnode and gpx_qdump combined workflow to apply scatter/gather	https://github.com/ncbi/pgap.git Path: task_types/tt_hmmsearch_wnode_plus_qdump.cwl Branch/Commit ID: 1cfd46014be8d867044cb10d1ddde0cb3068ee84
	dynresreq-workflow.cwl	https://github.com/common-workflow-language/cwl-v1.2.git Path: tests/dynresreq-workflow.cwl Branch/Commit ID: 7d7986a6e852ca6e3239c96d3a05dd536c76c903
	kfdrc_RNAseq_workflow.cwl	https://github.com/kids-first/kf-rnaseq-workflow.git Path: workflow/kfdrc_RNAseq_workflow.cwl Branch/Commit ID: 23f866f01f36efd7feb8a62d2a6765495a999974
	pass-unconnected.cwl	https://github.com/common-workflow-language/cwl-v1.1.git Path: tests/pass-unconnected.cwl Branch/Commit ID: 3e90671b25f7840ef2926ad2bacbf447772dda94
	CLIP-Seq pipeline for single-read experiment NNNNG Cross-Linking ImmunoPrecipitation: `CLIP` (`cross-linking immunoprecipitation`) is a method used in molecular biology that combines UV cross-linking with immunoprecipitation in order to analyse protein interactions with RNA or to precisely locate RNA modifications (e.g. m6A). (Uhl, Houwaart, Corrado, Wright, Backofen, 2017) (Ule, Jensen, Ruggiu, Mele, 2003) (Sugimoto, König, Hussain, Zupan, 2012) (Zhang, Darnell, 2011) (Ke, Alemu, Mertens, Gantman, 2015) CLIP-based techniques can be used to map RNA binding protein binding sites or RNA modification sites (Ke, Alemu, Mertens, Gantman, 2015) (Ke, Pandya-Jones, Saito, Fak, 2017) of interest on a genome-wide scale, thereby increasing the understanding of post-transcriptional regulatory networks. The identification of sites where RNA-binding proteins (RNABPs) interact with target RNAs opens the door to understanding the vast complexity of RNA regulation. UV cross-linking and immunoprecipitation (CLIP) is a transformative technology in which RNAs purified from _in vivo_ cross-linked RNA-protein complexes are sequenced to reveal footprints of RNABP:RNA contacts. CLIP combined with high-throughput sequencing (HITS-CLIP) is a generalizable strategy to produce transcriptome-wide maps of RNA binding with higher accuracy and resolution than standard RNA immunoprecipitation (RIP) profiling or purely computational approaches. The application of CLIP to Argonaute proteins has expanded the utility of this approach to mapping binding sites for microRNAs and other small regulatory RNAs. Finally, recent advances in data analysis take advantage of cross-link–induced mutation sites (CIMS) to refine RNA-binding maps to single-nucleotide resolution. Once IP conditions are established, HITS-CLIP takes ~8 d to prepare RNA for sequencing. Established pipelines for data analysis, including those for CIMS, take 3–4 d. Workflow: CLIP begins with the _in vivo_ cross-linking of RNA-protein complexes using ultraviolet light (UV). Upon UV exposure, covalent bonds are formed between proteins and nucleic acids that are in close proximity. (Darnell, 2012) The cross-linked cells are then lysed, and the protein of interest is isolated via immunoprecipitation. In order to allow for sequence-specific priming of reverse transcription, RNA adapters are ligated to the 3' ends, while radiolabeled phosphates are transferred to the 5' ends of the RNA fragments. The RNA-protein complexes are then separated from free RNA using gel electrophoresis and membrane transfer. Proteinase K digestion is then performed in order to remove protein from the RNA-protein complexes. This step leaves a peptide at the cross-link site, allowing for the identification of the cross-linked nucleotide. (König, McGlincy, Ule, 2012) After ligating RNA linkers to the RNA 5' ends, cDNA is synthesized via RT-PCR. High-throughput sequencing is then used to generate reads containing distinct barcodes that identify the last cDNA nucleotide. Interaction sites can be identified by mapping the reads back to the transcriptome.	https://github.com/datirium/workflows.git Path: workflows/clipseq-se.cwl Branch/Commit ID: d76110e0bfc40c874f82e37cef6451d74df4f908