Explore Workflows

View already parsed workflows here or click here to add your own

Graph	Name	Retrieved From	View
	CLIP-Seq pipeline for single-read experiment NNNNG Cross-Linking ImmunoPrecipitation ================================= `CLIP` (`cross-linking immunoprecipitation`) is a method used in molecular biology that combines UV cross-linking with immunoprecipitation in order to analyse protein interactions with RNA or to precisely locate RNA modifications (e.g. m6A). (Uhl\|Houwaart\|Corrado\|Wright\|Backofen\|2017)(Ule\|Jensen\|Ruggiu\|Mele\|2003)(Sugimoto\|König\|Hussain\|Zupan\|2012)(Zhang\|Darnell\|2011) (Ke\| Alemu\| Mertens\| Gantman\|2015) CLIP-based techniques can be used to map RNA binding protein binding sites or RNA modification sites (Ke\| Alemu\| Mertens\| Gantman\|2015)(Ke\| Pandya-Jones\| Saito\| Fak\|2017) of interest on a genome-wide scale, thereby increasing the understanding of post-transcriptional regulatory networks. The identification of sites where RNA-binding proteins (RNABPs) interact with target RNAs opens the door to understanding the vast complexity of RNA regulation. UV cross-linking and immunoprecipitation (CLIP) is a transformative technology in which RNAs purified from _in vivo_ cross-linked RNA-protein complexes are sequenced to reveal footprints of RNABP:RNA contacts. CLIP combined with high-throughput sequencing (HITS-CLIP) is a generalizable strategy to produce transcriptome-wide maps of RNA binding with higher accuracy and resolution than standard RNA immunoprecipitation (RIP) profiling or purely computational approaches. The application of CLIP to Argonaute proteins has expanded the utility of this approach to mapping binding sites for microRNAs and other small regulatory RNAs. Finally, recent advances in data analysis take advantage of cross-link–induced mutation sites (CIMS) to refine RNA-binding maps to single-nucleotide resolution. Once IP conditions are established, HITS-CLIP takes ~8 d to prepare RNA for sequencing. Established pipelines for data analysis, including those for CIMS, take 3–4 d. Workflow -------- CLIP begins with the in-vivo cross-linking of RNA-protein complexes using ultraviolet light (UV). Upon UV exposure, covalent bonds are formed between proteins and nucleic acids that are in close proximity. (Darnell\|2012) The cross-linked cells are then lysed, and the protein of interest is isolated via immunoprecipitation. In order to allow for sequence specific priming of reverse transcription, RNA adapters are ligated to the 3' ends, while radiolabeled phosphates are transferred to the 5' ends of the RNA fragments. The RNA-protein complexes are then separated from free RNA using gel electrophoresis and membrane transfer. Proteinase K digestion is then performed in order to remove protein from the RNA-protein complexes. This step leaves a peptide at the cross-link site, allowing for the identification of the cross-linked nucleotide. (König\| McGlincy\| Ule\|2012) After ligating RNA linkers to the RNA 5' ends, cDNA is synthesized via RT-PCR. High-throughput sequencing is then used to generate reads containing distinct barcodes that identify the last cDNA nucleotide. Interaction sites can be identified by mapping the reads back to the transcriptome.	https://github.com/datirium/workflows.git Path: workflows/clipseq-se.cwl Branch/Commit ID: b141f7e73005227d6d02fa03a47151836dd4109b
	Bisulfite alignment and QC	https://github.com/genome/analysis-workflows.git Path: definitions/pipelines/bisulfite.cwl Branch/Commit ID: 174f3b239018328cec1d821947438b457552724c
	kfdrc_alignment_wf.cwl	https://github.com/kids-first/kf-alignment-workflow.git Path: workflows/kfdrc_alignment_wf.cwl Branch/Commit ID: af97e25cb213233a4923c881f7c6210b57960cb9
	Cell Ranger Count (RNA+ATAC) Cell Ranger Count (RNA+ATAC) Quantifies single-cell gene expression and chromatin accessibility of the sequencing data from a single 10x Genomics library in a combined manner. The results of this workflow are primarily used in either “Single-Cell Multiome ATAC and RNA-Seq Filtering Analysis” or “Cell Ranger Aggregate (RNA+ATAC)” pipelines.	https://github.com/datirium/workflows.git Path: workflows/cellranger-arc-count.cwl Branch/Commit ID: 261c0232a7a40880f2480b811ed2d7e89c463869
	Motif Finding with HOMER with random background regions Motif Finding with HOMER with random background regions --------------------------------------------------- HOMER contains a novel motif discovery algorithm that was designed for regulatory element analysis in genomics applications (DNA only, no protein). It is a differential motif discovery algorithm, which means that it takes two sets of sequences and tries to identify the regulatory elements that are specifically enriched in on set relative to the other. It uses ZOOPS scoring (zero or one occurrence per sequence) coupled with the hypergeometric enrichment calculations (or binomial) to determine motif enrichment. HOMER also tries its best to account for sequenced bias in the dataset. It was designed with ChIP-Seq and promoter analysis in mind, but can be applied to pretty much any nucleic acids motif finding problem. Here is how we generate background for Motifs Analysis ------------------------------------- 1. Take input file with regions in a form of “chr\" “start\" “end\" 2. Sort and remove duplicates from this regions file 3. Extend each region in 20Kb into both directions 4. Merge all overlapped extended regions 5. Subtract not extended regions from the extended ones 6. Randomly distribute not extended regions within the regions that we got as a result of the previous step 7. Get fasta file from these randomly distributed regions (from the previous step). Use it as background For more information please refer to: ------------------------------------- [Official documentation](http://homer.ucsd.edu/homer/motif/)	https://github.com/datirium/workflows.git Path: workflows/homer-motif-analysis.cwl Branch/Commit ID: 261c0232a7a40880f2480b811ed2d7e89c463869
	io-int-default-tool-and-wf.cwl	https://github.com/common-workflow-language/cwl-v1.2.git Path: tests/io-int-default-tool-and-wf.cwl Branch/Commit ID: 707ebcd2173889604459c5f4ffb55173c508abb3
	bulk scRNA-seq pipeline using Salmon	https://github.com/hubmapconsortium/salmon-rnaseq.git Path: bulk-pipeline.cwl Branch/Commit ID: b9c8e269cc846e2ac6d3cdc902c0e37ac58b0ff4
	uncollapsed_bam_generation.cwl	https://github.com/msk-access/uncollapsed_bam_generation.git Path: uncollapsed_bam_generation.cwl Branch/Commit ID: 6576bfceadd14adc0655c1cc7f82a0045c0fd09b
	Interval overlapping alignments counts Interval overlapping alignments counts ====================================== Reports the count of alignments from multiple samples that overlap specific intervals.	https://github.com/datirium/workflows.git Path: workflows/bedtools-multicov.cwl Branch/Commit ID: aebf2355539fdf81fd9082616f8b21440d2691c6
	mutect panel-of-normals workflow	https://github.com/genome/analysis-workflows.git Path: definitions/pipelines/panel_of_normals.cwl Branch/Commit ID: 49508a2757ff2f49f1c200774a38af1c12b531bf