Explore Workflows
View already parsed workflows here or click here to add your own
Graph | Name | Retrieved From | View |
---|---|---|---|
|
revsort.cwl
Reverse the lines in a document, then sort those lines. |
![]() Path: cwltool/schemas/v1.0/v1.0/revsort.cwl Branch/Commit ID: a3d565bf8e630101d25d31804cfbceb0a0ba28de |
|
|
DESeq - differential gene expression analysis
Differential gene expression analysis ===================================== Differential gene expression analysis based on the negative binomial distribution Estimate variance-mean dependence in count data from high-throughput sequencing assays and test for differential expression based on a model using the negative binomial distribution. DESeq1 ------ High-throughput sequencing assays such as RNA-Seq, ChIP-Seq or barcode counting provide quantitative readouts in the form of count data. To infer differential signal in such data correctly and with good statistical power, estimation of data variability throughout the dynamic range and a suitable error model are required. Simon Anders and Wolfgang Huber propose a method based on the negative binomial distribution, with variance and mean linked by local regression and present an implementation, [DESeq](http://bioconductor.org/packages/release/bioc/html/DESeq.html), as an R/Bioconductor package DESeq2 ------ In comparative high-throughput sequencing assays, a fundamental task is the analysis of count data, such as read counts per gene in RNA-seq, for evidence of systematic changes across experimental conditions. Small replicate numbers, discreteness, large dynamic range and the presence of outliers require a suitable statistical approach. [DESeq2](http://www.bioconductor.org/packages/release/bioc/html/DESeq2.html), a method for differential analysis of count data, using shrinkage estimation for dispersions and fold changes to improve stability and interpretability of estimates. This enables a more quantitative analysis focused on the strength rather than the mere presence of differential expression. |
![]() Path: workflows/deseq.cwl Branch/Commit ID: 799575ce58746813f066a665adeacdda252d8cab |
|
|
count-lines8-wf-noET.cwl
|
![]() Path: v1.0/v1.0/count-lines8-wf-noET.cwl Branch/Commit ID: 1f501e38ff692a408e16b246ac7d64d32f0822c2 |
|
|
trimmed_fastq
Quality Control (raw data), Raw Data trimming and Quality Control (pre-processed) |
![]() Path: structuralvariants/cwl/subworkflows/trimmed_fastq.cwl Branch/Commit ID: de9cb009f8fe0c8d5a94db5c882cf21ddf372452 |
|
|
trimmed_fastq
Quality Control (raw data), Raw Data trimming and Quality Control (pre-processed) |
![]() Path: structuralvariants/cwl/abstract_operations/subworkflows/trimmed_fastq.cwl Branch/Commit ID: de9cb009f8fe0c8d5a94db5c882cf21ddf372452 |
|
|
EMG assembly for paired end Illumina
|
![]() Path: workflows/emg-assembly.cwl Branch/Commit ID: 5dc7c5ca618a248a99bd4bf5f3042cdb21947193 |
|
|
count-lines15-wf.cwl
|
![]() Path: v1.0/v1.0/count-lines15-wf.cwl Branch/Commit ID: 1f501e38ff692a408e16b246ac7d64d32f0822c2 |
|
|
CLIP-Seq pipeline for single-read experiment NNNNG
Cross-Linking ImmunoPrecipitation ================================= `CLIP` (`cross-linking immunoprecipitation`) is a method used in molecular biology that combines UV cross-linking with immunoprecipitation in order to analyse protein interactions with RNA or to precisely locate RNA modifications (e.g. m6A). (Uhl|Houwaart|Corrado|Wright|Backofen|2017)(Ule|Jensen|Ruggiu|Mele|2003)(Sugimoto|König|Hussain|Zupan|2012)(Zhang|Darnell|2011) (Ke| Alemu| Mertens| Gantman|2015) CLIP-based techniques can be used to map RNA binding protein binding sites or RNA modification sites (Ke| Alemu| Mertens| Gantman|2015)(Ke| Pandya-Jones| Saito| Fak|2017) of interest on a genome-wide scale, thereby increasing the understanding of post-transcriptional regulatory networks. The identification of sites where RNA-binding proteins (RNABPs) interact with target RNAs opens the door to understanding the vast complexity of RNA regulation. UV cross-linking and immunoprecipitation (CLIP) is a transformative technology in which RNAs purified from _in vivo_ cross-linked RNA-protein complexes are sequenced to reveal footprints of RNABP:RNA contacts. CLIP combined with high-throughput sequencing (HITS-CLIP) is a generalizable strategy to produce transcriptome-wide maps of RNA binding with higher accuracy and resolution than standard RNA immunoprecipitation (RIP) profiling or purely computational approaches. The application of CLIP to Argonaute proteins has expanded the utility of this approach to mapping binding sites for microRNAs and other small regulatory RNAs. Finally, recent advances in data analysis take advantage of cross-link–induced mutation sites (CIMS) to refine RNA-binding maps to single-nucleotide resolution. Once IP conditions are established, HITS-CLIP takes ~8 d to prepare RNA for sequencing. Established pipelines for data analysis, including those for CIMS, take 3–4 d. Workflow -------- CLIP begins with the in-vivo cross-linking of RNA-protein complexes using ultraviolet light (UV). Upon UV exposure, covalent bonds are formed between proteins and nucleic acids that are in close proximity. (Darnell|2012) The cross-linked cells are then lysed, and the protein of interest is isolated via immunoprecipitation. In order to allow for sequence specific priming of reverse transcription, RNA adapters are ligated to the 3' ends, while radiolabeled phosphates are transferred to the 5' ends of the RNA fragments. The RNA-protein complexes are then separated from free RNA using gel electrophoresis and membrane transfer. Proteinase K digestion is then performed in order to remove protein from the RNA-protein complexes. This step leaves a peptide at the cross-link site, allowing for the identification of the cross-linked nucleotide. (König| McGlincy| Ule|2012) After ligating RNA linkers to the RNA 5' ends, cDNA is synthesized via RT-PCR. High-throughput sequencing is then used to generate reads containing distinct barcodes that identify the last cDNA nucleotide. Interaction sites can be identified by mapping the reads back to the transcriptome. |
![]() Path: workflows/clipseq-se.cwl Branch/Commit ID: 7518b100d8cbc80c8be32e9e939dfbb27d6b4361 |
|
|
Build STAR indices
Workflow runs [STAR](https://github.com/alexdobin/STAR) v2.5.3a (03/17/2017) PMID: [23104886](https://www.ncbi.nlm.nih.gov/pubmed/23104886) to build indices for reference genome provided in a single FASTA file as fasta_file input and GTF annotation file from annotation_gtf_file input. Generated indices are saved in a folder with the name that corresponds to the input genome. |
![]() Path: workflows/star-index.cwl Branch/Commit ID: 799575ce58746813f066a665adeacdda252d8cab |
|
|
metabarcode (gene amplicon) analysis for fastq files
protein - qc, preprocess, annotation, index, abundance |
![]() Path: CWL/Workflows/metabarcode-fasta.workflow.cwl Branch/Commit ID: 81feefc84ec0faecf1ade718001d5f07610e616e |