Explore Workflows
View already parsed workflows here or click here to add your own
Graph | Name | Retrieved From | View |
---|---|---|---|
|
FASTQ Vector Removal
This workflow convert fastq to multiple fasta files |
![]() Path: workflows/File-formats/fastq-to-splitted-fasta.cwl Branch/Commit ID: 11f70a71cb68b3960c2d410ba1fdcd3b8a7e1419 |
|
|
FastQC - a quality control tool for high throughput sequence data
FastQC - a quality control tool for high throughput sequence data ===================================== FastQC aims to provide a simple way to do some quality control checks on raw sequence data coming from high throughput sequencing pipelines. It provides a modular set of analyses which you can use to give a quick impression of whether your data has any problems of which you should be aware before doing any further analysis. The main functions of FastQC are: - Import of data from FastQ files (any variant) - Providing a quick overview to tell you in which areas there may be problems - Summary graphs and tables to quickly assess your data - Export of results to an HTML based permanent report - Offline operation to allow automated generation of reports without running the interactive application |
![]() Path: workflows/fastqc.cwl Branch/Commit ID: 935a78f1aff757f977de4e3672aefead3b23606b |
|
|
Generate genome indices for STAR & bowtie
Creates indices for: * [STAR](https://github.com/alexdobin/STAR) v2.5.3a (03/17/2017) PMID: [23104886](https://www.ncbi.nlm.nih.gov/pubmed/23104886) * [bowtie](http://bowtie-bio.sourceforge.net/tutorial.shtml) v1.2.0 (12/30/2016) It performs the following steps: 1. `STAR --runMode genomeGenerate` to generate indices, based on [FASTA](http://zhanglab.ccmb.med.umich.edu/FASTA/) and [GTF](http://mblab.wustl.edu/GTF2.html) input files, returns results as an array of files 2. Outputs indices as [Direcotry](http://www.commonwl.org/v1.0/CommandLineTool.html#Directory) data type 3. Separates *chrNameLength.txt* file from Directory output 4. `bowtie-build` to generate indices requires genome [FASTA](http://zhanglab.ccmb.med.umich.edu/FASTA/) file as input, returns results as a group of main and secondary files |
![]() Path: workflows/genome-indices.cwl Branch/Commit ID: e45ab1b9ac5c9b99fdf7b3b1be396dc42c2c9620 |
|
|
EMG assembly for paired end Illumina
|
![]() Path: workflows/emg-assembly.cwl Branch/Commit ID: 886df9de6713e06228d2560c40f451155a196383 |
|
|
kmer_ref_compare_wnode
|
![]() Path: task_types/tt_kmer_ref_compare_wnode.cwl Branch/Commit ID: add6b7724698694e0e72d972e2e85e1ae4e67902 |
|
|
Cell Ranger Count Gene Expression
Cell Ranger Count Gene Expression ================================= |
![]() Path: workflows/single-cell-preprocess-cellranger.cwl Branch/Commit ID: 1a46cb0e8f973481fe5ae3ae6188a41622c8532e |
|
|
Functional analyis of sequences that match the 16S SSU
|
![]() Path: workflows/16S_taxonomic_analysis.cwl Branch/Commit ID: 47d6375df6c6e4486e03d964b3abf1d71943e609 |
|
|
hmmsearch_wnode and gpx_qdump combined workflow to apply scatter/gather
|
![]() Path: task_types/tt_hmmsearch_wnode_plus_qdump.cwl Branch/Commit ID: d39017c63dd8e088f1ad3809d709529df602e05f |
|
|
FastQC - a quality control tool for high throughput sequence data
FastQC - a quality control tool for high throughput sequence data ===================================== FastQC aims to provide a simple way to do some quality control checks on raw sequence data coming from high throughput sequencing pipelines. It provides a modular set of analyses which you can use to give a quick impression of whether your data has any problems of which you should be aware before doing any further analysis. The main functions of FastQC are: - Import of data from FastQ files (any variant) - Providing a quick overview to tell you in which areas there may be problems - Summary graphs and tables to quickly assess your data - Export of results to an HTML based permanent report - Offline operation to allow automated generation of reports without running the interactive application |
![]() Path: workflows/fastqc.cwl Branch/Commit ID: e45ab1b9ac5c9b99fdf7b3b1be396dc42c2c9620 |
|
|
Trim Galore RNA-Seq pipeline paired-end
The original [BioWardrobe's](https://biowardrobe.com) [PubMed ID:26248465](https://www.ncbi.nlm.nih.gov/pubmed/26248465) **RNA-Seq** basic analysis for a **pair-end** experiment. A corresponded input [FASTQ](http://maq.sourceforge.net/fastq.shtml) file has to be provided. Current workflow should be used only with the single-end RNA-Seq data. It performs the following steps: 1. Trim adapters from input FASTQ files 2. Use STAR to align reads from input FASTQ files according to the predefined reference indices; generate unsorted BAM file and alignment statistics file 3. Use fastx_quality_stats to analyze input FASTQ files and generate quality statistics files 4. Use samtools sort to generate coordinate sorted BAM(+BAI) file pair from the unsorted BAM file obtained on the step 1 (after running STAR) 5. Generate BigWig file on the base of sorted BAM file 6. Map input FASTQ files to predefined rRNA reference indices using Bowtie to define the level of rRNA contamination; export resulted statistics to file 7. Calculate isoform expression level for the sorted BAM file and GTF/TAB annotation file using GEEP reads-counting utility; export results to file |
![]() Path: workflows/trim-rnaseq-pe.cwl Branch/Commit ID: 6bf56698c6fe6e781723dea32bc922b91ef49cf3 |