Explore Workflows

View already parsed workflows here or click here to add your own

Graph Name Retrieved From View
workflow graph PCA - Principal Component Analysis

Principal Component Analysis --------------- Principal component analysis (PCA) is a statistical procedure that uses an orthogonal transformation to convert a set of observations of possibly correlated variables (entities each of which takes on various numerical values) into a set of values of linearly uncorrelated variables called principal components. The calculation is done by a singular value decomposition of the (centered and possibly scaled) data matrix, not by using eigen on the covariance matrix. This is generally the preferred method for numerical accuracy.

https://github.com/datirium/workflows.git

Path: workflows/pca.cwl

Branch/Commit ID: e0a30aa1ad516dd2ec0e9ce006428964b840daf4

workflow graph scatter GATK HaplotypeCaller over intervals

https://github.com/genome/analysis-workflows.git

Path: definitions/subworkflows/gatk_haplotypecaller_iterator.cwl

Branch/Commit ID: 2e0562a5c3cd7aac24af4c622a5ae68a7fb23a71

workflow graph Bismark Methylation - pipeline for BS-Seq data analysis

Sequence reads are first cleaned from adapters and transformed into fully bisulfite-converted forward (C->T) and reverse read (G->A conversion of the forward strand) versions, before they are aligned to similarly converted versions of the genome (also C->T and G->A converted). Sequence reads that produce a unique best alignment from the four alignment processes against the bisulfite genomes (which are running in parallel) are then compared to the normal genomic sequence and the methylation state of all cytosine positions in the read is inferred. A read is considered to align uniquely if an alignment has a unique best alignment score (as reported by the AS:i field). If a read produces several alignments with the same number of mismatches or with the same alignment score (AS:i field), a read (or a read-pair) is discarded altogether. On the next step we extract the methylation call for every single C analysed. The position of every single C will be written out to a new output file, depending on its context (CpG, CHG or CHH), whereby methylated Cs will be labelled as forward reads (+), non-methylated Cs as reverse reads (-). The output of the methylation extractor is then transformed into a bedGraph and coverage file. The bedGraph counts output is then used to generate a genome-wide cytosine report which reports the number on every single CpG (optionally every single cytosine) in the genome, irrespective of whether it was covered by any reads or not. As this type of report is informative for cytosines on both strands the output may be fairly large (~46mn CpG positions or >1.2bn total cytosine positions in the human genome).

https://github.com/datirium/workflows.git

Path: workflows/bismark-methylation-se.cwl

Branch/Commit ID: 9850a859de1f42d3d252c50e15701928856fe774

workflow graph Add snv and indel bam-readcount files to a vcf

https://github.com/genome/analysis-workflows.git

Path: definitions/subworkflows/vcf_readcount_annotator.cwl

Branch/Commit ID: 2e0562a5c3cd7aac24af4c622a5ae68a7fb23a71

workflow graph genomics-workspace.cwl

https://github.com/NAL-i5K/Organism_Onboarding.git

Path: flow_genomicsWorkspace/genomics-workspace.cwl

Branch/Commit ID: 7198756b4b1519d102178042924671bd677e9b17

workflow graph EMG QC workflow, (paired end version). Benchmarking with MG-RAST expt.

https://github.com/proteinswebteam/ebi-metagenomics-cwl.git

Path: workflows/emg-qc-paired.cwl

Branch/Commit ID: 25129f55226dee595ef941edc24d3c44414e0523

workflow graph gcaccess_from_list

https://github.com/ncbi/pgap.git

Path: task_types/tt_gcaccess_from_list.cwl

Branch/Commit ID: 7cee09fb3e33c851e4e1dfc965c558b82290a785

workflow graph alignment for nonhuman with qc

https://github.com/genome/analysis-workflows.git

Path: definitions/pipelines/alignment_wgs_nonhuman.cwl

Branch/Commit ID: 389f6edccab082d947bee9c032f59dbdf9f7c325

workflow graph assm_assm_blastn_wnode

https://github.com/ncbi/pgap.git

Path: task_types/tt_assm_assm_blastn_wnode.cwl

Branch/Commit ID: 7cee09fb3e33c851e4e1dfc965c558b82290a785

workflow graph Unaligned BAM to BQSR and VCF

https://github.com/genome/analysis-workflows.git

Path: definitions/subworkflows/bam_to_bqsr_no_dup_marking.cwl

Branch/Commit ID: 77ec4f26eb14ed82481828bd9f6ef659cfd8b40f