Explore Workflows
View already parsed workflows here or click here to add your own
| Graph | Name | Retrieved From | View |
|---|---|---|---|
|
|
main.cwl
|
Path: online/streamflow/cwl/main.cwl Branch/Commit ID: e2c8ee3c187cb951066909296ead46b784cd2dee |
|
|
|
wgs alignment and tumor-only variant detection
|
Path: definitions/pipelines/wgs.cwl Branch/Commit ID: 4bc0a4577d626b65a4b44683e5a1ab2f7d7faf4c |
|
|
|
cond-wf-003.1.cwl
|
Path: testdata/cond-wf-003.1.cwl Branch/Commit ID: c1875d54dedc41b1d2fa08634dcf1caa8f1bc631 |
|
|
|
step-valuefrom3-wf.cwl
|
Path: cwltool/schemas/v1.0/v1.0/step-valuefrom3-wf.cwl Branch/Commit ID: 4c905b830371eee45188a53510ba0ee9113fd4c8 |
|
|
|
umi duplex alignment workflow
|
Path: definitions/subworkflows/duplex_alignment.cwl Branch/Commit ID: ddb49a0951d9ad537269d7db3fe8f904495a8bf4 |
|
|
|
Detect Variants workflow for WGS pipeline
|
Path: definitions/pipelines/detect_variants_wgs.cwl Branch/Commit ID: 8dc462a7d9ba1479f764682af99c69d8574cb3dc |
|
|
|
env-wf2.cwl
|
Path: cwltool/schemas/v1.0/v1.0/env-wf2.cwl Branch/Commit ID: c6cced7a2e6389d2eb43342e702677ccb7c7497c |
|
|
|
1st-workflow.cwl
|
Path: tests/wf/1st-workflow.cwl Branch/Commit ID: 2dce710246e091f0189fab41b589ee062ee94500 |
|
|
|
Kraken2 Metagenomic pipeline paired-end
This workflow taxonomically classifies paired-end sequencing reads in FASTQ format, that have been optionally adapter trimmed with trimgalore, using Kraken2 and a user-selected pre-built database from a list of [genomic index files](https://benlangmead.github.io/aws-indexes/k2). ### __Inputs__ Kraken2 database for taxonomic classification: - [Viral (0.5 GB)](https://genome-idx.s3.amazonaws.com/kraken/k2_viral_20221209.tar.gz), all refseq viral genomes - [MinusB (8.7 GB)](https://genome-idx.s3.amazonaws.com/kraken/k2_minusb_20221209.tar.gz), standard minus bacteria (archaea, viral, plasmid, human1, UniVec_Core) - [PlusPFP-16 (15.0 GB)](https://genome-idx.s3.amazonaws.com/kraken/k2_pluspfp_16gb_20221209.tar.gz), standard (archaea, bacteria, viral, plasmid, human1, UniVec_Core) + (protozoa, fungi & plant) capped at 16 GB (shrunk via random kmer downselect) - [EuPathDB46 (34.1 GB)](https://genome-idx.s3.amazonaws.com/kraken/k2_eupathdb48_20201113.tar.gz), eukaryotic pathogen genomes with contaminants removed (https://veupathdb.org/veupathdb/app) - [16S_gg_13_5 (73 MB)](https://genome-idx.s3.amazonaws.com/kraken/16S_Greengenes13.5_20200326.tgz), Greengenes 16S rRNA database ([release 13.5](https://greengenes.secondgenome.com/?prefix=downloads/greengenes_database/gg_13_5/), 20200326)\n - [16S_silva_138 (112 MB)](https://genome-idx.s3.amazonaws.com/kraken/16S_Silva138_20200326.tgz), SILVA 16S rRNA database ([release 138.1](https://www.arb-silva.de/documentation/release-1381/), 20200827) Read 1 file: - FASTA/Q input R1 from a paired end library Read 2 file: - FASTA/Q input R2 from a paired end library Advanced Inputs Tab (Optional): - Number of bases to clip from the 3p end - Number of bases to clip from the 5p end ### __Outputs__ - k2db, an upstream database used by kraken2 classifier ### __Data Analysis Steps__ 1. Trimming the adapters with TrimGalore. - This step is particularly important when the reads are long and the fragments are short - resulting in sequencing adapters at the ends of reads. If adapter is not removed the read will not map. TrimGalore can recognize standard adapters, such as Illumina or Nextera/Tn5 adapters. 2. Generate quality control statistics of trimmed, unmapped sequence data 3. (Optional) Clipping of 5' and/or 3' end by the specified number of bases. 4. Mapping reads to primary genome index with Bowtie. ### __References__ - Wood, D.E., Lu, J. & Langmead, B. Improved metagenomic analysis with Kraken 2. Genome Biol 20, 257 (2019). https://doi.org/10.1186/s13059-019-1891-0 |
Path: workflows/kraken2-classify-pe.cwl Branch/Commit ID: 675a3ff982091faf304931e9261aacdbabf51702 |
|
|
|
count-lines2-wf.cwl
|
Path: tests/count-lines2-wf.cwl Branch/Commit ID: 31ec48a8d81ef7c1b2c5e9c0a19e7623efe4a1e2 |
