Workflow: WGS processing workflow for single sample

Fetched 2023-01-11 14:51:00 GMT
children parents
Workflow as SVG
  • Selected
  • Default Values
  • Nested Workflows
  • Tools
  • Inputs/Outputs

Inputs

ID Type Title Doc
fastq1 File [FASTQ] One of set of pair-end FASTQs (R1)
fastq2 File [FASTQ] One of set of pair-end FASTQs (R2)
sample String Sample Name
headhtml File [HTML] Header for HTML report
tailhtml File [HTML] Footer for HTML report
reference File [FASTA] Reference genome
clinvarvcf File [VCF] Reference VCF for ClinVar
reportfunc File Function used to create HTML report
knownsites1 File [VCF] VCF of known SNPS sites for BQSR
knownsites2 File [VCF] VCF of known indel sites for BQSR
scattercount String Desired split for variant calling
fullintervallist File Full list of intervals to operate over

Steps

ID Runs Label Doc
fastqc
fastqc.cwl (CommandLineTool)
Quality check on FASTQ
samtools-sort
samtools-sort.cwl (CommandLineTool)
Sort BAM
samtools-index
samtools-index.cwl (CommandLineTool)
Index BAM
generate-report
report-wf.cwl (Workflow)
Report generation workflow
haplotypecaller Scattered variant calling workflow
mark-duplicates
mark-duplicates.cwl (CommandLineTool)
Mark duplicates using GATK (Picard)
bwamem-samtools-view
bwamem-samtools-view.cwl (CommandLineTool)
Align FASTQs with BWA

Outputs

ID Type Label Doc
gvcf File [VCF] GVCF generated from GATK Haplotype Caller
qc-zip File[] Zip files of FASTQ QC report and associated data
report File [HTML] ClinVar variant report
qc-html File[] [HTML] FASTQ QC reports
Permalink: https://w3id.org/cwl/view/git/7521f8770b81d944aad3c1e0fd7ef5a886600190/WGS-processing/cwl/helper/bwamem-gatk-report-wf.cwl