Explore Workflows
View already parsed workflows here or click here to add your own
| Graph | Name | Retrieved From | View |
|---|---|---|---|
|
|
GSEApy - Gene Set Enrichment Analysis in Python
GSEAPY: Gene Set Enrichment Analysis in Python ============================================== Gene Set Enrichment Analysis is a computational method that determines whether an a priori defined set of genes shows statistically significant, concordant differences between two biological states (e.g. phenotypes). GSEA requires as input an expression dataset, which contains expression profiles for multiple samples. While the software supports multiple input file formats for these datasets, the tab-delimited GCT format is the most common. The first column of the GCT file contains feature identifiers (gene ids or symbols in the case of data derived from RNA-Seq experiments). The second column contains a description of the feature; this column is ignored by GSEA and may be filled with “NA”s. Subsequent columns contain the expression values for each feature, with one sample's expression value per column. It is important to note that there are no hard and fast rules regarding how a GCT file's expression values are derived. The important point is that they are comparable to one another across features within a sample and comparable to one another across samples. Tools such as DESeq2 can be made to produce properly normalized data (normalized counts) which are compatible with GSEA. |
Path: workflows/gseapy.cwl Branch/Commit ID: f3e44d3b0f198cf5245c49011124dc3b6c2b06fd |
|
|
|
checkm_wnode
|
Path: task_types/tt_checkm_wnode.cwl Branch/Commit ID: 369e2b6c7f4db75099d258729dec1326f55d2cc5 |
|
|
|
checkm
|
Path: checkm/wf_checkm.cwl Branch/Commit ID: 369e2b6c7f4db75099d258729dec1326f55d2cc5 |
|
|
|
Kraken2 Database installation pipeline
This workflow downloads the user-selected pre-built kraken2 database from: https://benlangmead.github.io/aws-indexes/k2 ### __Inputs__ Select a pre-built Kraken2 database to download and use for metagenomic classification: - Available options comprised of various combinations of RefSeq reference genome sets: - [Viral (0.5 GB)](https://genome-idx.s3.amazonaws.com/kraken/k2_viral_20221209.tar.gz), all refseq viral genomes - [MinusB (8.7 GB)](https://genome-idx.s3.amazonaws.com/kraken/k2_minusb_20221209.tar.gz), standard minus bacteria (archaea, viral, plasmid, human1, UniVec_Core) - [PlusPFP-16 (15.0 GB)](https://genome-idx.s3.amazonaws.com/kraken/k2_pluspfp_16gb_20221209.tar.gz), standard (archaea, bacteria, viral, plasmid, human1, UniVec_Core) + (protozoa, fungi & plant) capped at 16 GB (shrunk via random kmer downselect) - [EuPathDB46 (34.1 GB)](https://genome-idx.s3.amazonaws.com/kraken/k2_eupathdb48_20201113.tar.gz), eukaryotic pathogen genomes with contaminants removed (https://veupathdb.org/veupathdb/app) - [16S_gg_13_5 (73 MB)](https://genome-idx.s3.amazonaws.com/kraken/16S_Greengenes13.5_20200326.tgz), Greengenes 16S rRNA database ([release 13.5](https://greengenes.secondgenome.com/?prefix=downloads/greengenes_database/gg_13_5/), 20200326)\n - [16S_silva_138 (112 MB)](https://genome-idx.s3.amazonaws.com/kraken/16S_Silva138_20200326.tgz), SILVA 16S rRNA database ([release 138.1](https://www.arb-silva.de/documentation/release-1381/), 20200827) ### __Outputs__ - k2db, an upstream database used by kraken2 classification tool ### __Data Analysis Steps__ 1. download selected pre-built kraken2 database. 2. make available as upstream source for kraken2 metagenomic taxonomic classification. ### __References__ - Wood, D.E., Lu, J. & Langmead, B. Improved metagenomic analysis with Kraken 2. Genome Biol 20, 257 (2019). https://doi.org/10.1186/s13059-019-1891-0 |
Path: workflows/kraken2-databases.cwl Branch/Commit ID: d76110e0bfc40c874f82e37cef6451d74df4f908 |
|
|
|
import_schema-def.cwl
|
Path: tests/import_schema-def.cwl Branch/Commit ID: e515226f8ac0f7985cd94dae4a301150adae3050 |
|
|
|
conflict-wf.cwl#collision
|
Path: cwltool/schemas/v1.0/v1.0/conflict-wf.cwl Branch/Commit ID: d7cd45f7072960d264962ecc5a04d7c219f65c06 Packed ID: collision |
|
|
|
mutect panel-of-normals workflow
|
Path: definitions/pipelines/panel_of_normals.cwl Branch/Commit ID: 9143dc4ebacb9e1df36a712b0be6fa5d982b0c4f |
|
|
|
default-wf5.cwl
|
Path: tests/wf/default-wf5.cwl Branch/Commit ID: 047e69bb169e79fad6a7285ee798c4ecec3b218b |
|
|
|
kmer_cache_retrieve
|
Path: task_types/tt_kmer_cache_retrieve.cwl Branch/Commit ID: ac387721a55fd91df3dcdf16e199354618b136d1 |
|
|
|
kmer_seq_entry_extract_wnode
|
Path: task_types/tt_kmer_seq_entry_extract_wnode.cwl Branch/Commit ID: 122aba2dafbb63241413c82b725b877c04523aaf |
