Workflow: allele-vcf-alignreads-se-pe.cwl
Workflow maps FASTQ files from `fastq_files` input into reference genome `reference_star_indices_folder` and insilico generated `insilico_star_indices_folder` genome (concatenated genome for both `strain1` and `strain2` strains). For both genomes STAR is run with `outFilterMultimapNmax` parameter set to 1 to discard all of the multimapped reads. For insilico genome SAM file is generated. Then it's splitted into two SAM files based on strain names and then sorted by coordinates into the BAM format. For reference genome output BAM file from STAR slignment is also coordinate sorted.
- Selected
- |
- Default Values
- Nested Workflows
- Tools
- Inputs/Outputs
Inputs
ID | Type | Title | Doc |
---|---|---|---|
strain1 | String | I strain name |
First strain name |
strain2 | String | II strain name |
Second strain name |
threads | Integer (Optional) | Number of threads |
Number of threads for those steps that support multithreading |
fastq_files | File[] | Input FASTQ file(s) |
Input FASTQ file or array of files |
strain1_chain_file | File | I strain chain file |
Chain file to project strain I to reference genome |
strain2_chain_file | File | II strain chain file |
Chain file to project strain II to reference genome |
reference_chrom_length_file | File | Chromosome length file for reference genome |
Chromosome length file for reference genome |
insilico_star_indices_folder | Directory | STAR indices folder for insilico genome |
Path to STAR generated indices folder for insilico genome |
reference_star_indices_folder | Directory | STAR indices folder for reference genome |
Path to STAR generated indices folder for reference genome |
Steps
ID | Runs | Label | Doc |
---|---|---|---|
strain1_project |
../tools/crossmap.cwl
(CommandLineTool)
|
Runs CrossMap.py script to project input BAM, BED, BIGWIG file based on input chain file.
Not supported input file types: SAM, GFF, VCF, WIG |
|
strain2_project |
../tools/crossmap.cwl
(CommandLineTool)
|
Runs CrossMap.py script to project input BAM, BED, BIGWIG file based on input chain file.
Not supported input file types: SAM, GFF, VCF, WIG |
|
strain1_sam_filter |
../tools/custom-bash.cwl
(CommandLineTool)
|
Tool to run custom script set as `script` input with arguments from `param`. Default script runs sed command over the input file and exports results to the file with the same name as input's basename |
|
strain2_sam_filter |
../tools/custom-bash.cwl
(CommandLineTool)
|
Tool to run custom script set as `script` input with arguments from `param`. Default script runs sed command over the input file and exports results to the file with the same name as input's basename |
|
insilico_star_aligner |
../tools/star-alignreads.cwl
(CommandLineTool)
|
Tool runs STAR alignReads. |
|
strain1_bam_to_bigwig |
bam-bedgraph-bigwig.cwl
(Workflow)
|
Workflow converts input BAM file into bigWig and bedGraph files |
|
strain2_bam_to_bigwig |
bam-bedgraph-bigwig.cwl
(Workflow)
|
Workflow converts input BAM file into bigWig and bedGraph files |
|
reference_star_aligner |
../tools/star-alignreads.cwl
(CommandLineTool)
|
Tool runs STAR alignReads. |
|
reference_bam_to_bigwig |
bam-bedgraph-bigwig.cwl
(Workflow)
|
Workflow converts input BAM file into bigWig and bedGraph files |
|
reference_samtools_sort |
../tools/samtools-sort.cwl
(CommandLineTool)
|
Tool to sort BAM/SAM file (set as input `sort_input`).
If input `trigger` is set to `true` or isn't set at all (`true` is used by default), run `samtools sort`, return
newly generated sorted BAM/SAM/CRAM file (the actual format of the output file depends on `out_format` value and
extension of a filename set in `sort_output_filename`).
If input `trigger` is set to `false`, return unchanged BAM/SAM file, previously staged into output directory. |
|
strain1_samtools_sort_index |
../tools/samtools-sort-index.cwl
(CommandLineTool)
|
Tool to sort and index input BAM/SAM/CRAM.
If input `trigger` is set to `true` or isn't set at all (`true` is used by default), run `samtools sort` and
`samtools index`, return sorted BAM and BAI/CSI index file.
If input `trigger` is set to `false`, return unchanged `sort_input` (BAM/SAM/CRAM) and index (BAI/CSI, if provided in
`secondaryFiles`) files, previously staged into output directory. |
|
strain2_samtools_sort_index |
../tools/samtools-sort-index.cwl
(CommandLineTool)
|
Tool to sort and index input BAM/SAM/CRAM.
If input `trigger` is set to `true` or isn't set at all (`true` is used by default), run `samtools sort` and
`samtools index`, return sorted BAM and BAI/CSI index file.
If input `trigger` is set to `false`, return unchanged `sort_input` (BAM/SAM/CRAM) and index (BAI/CSI, if provided in
`secondaryFiles`) files, previously staged into output directory. |
Outputs
ID | Type | Label | Doc |
---|---|---|---|
strain1_bigwig | File | I strain bigWig file |
Generated bigWig file for the first strain, projected to reference genome |
strain2_bigwig | File | II strain bigWig file |
Generated bigWig file for the second strain, projected to reference genome |
reference_bigwig | File | Reference bigWig file |
Generated BigWig file for the reference genome |
strain1_bambai_pair | File | I strain output BAM |
Coordinate sorted BAM file mapped to the first strain genome, projected to reference genome |
strain2_bambai_pair | File | II strain output BAM |
Coordinate sorted BAM file mapped to the second strain genome, projected to reference genome |
insilico_star_out_log | File (Optional) | STAR log out for insilico genome |
STAR Log.out for insilico genome |
reference_bambai_pair | File | Reference output BAM |
Coordinate sorted BAM file mapped to reference genome |
reference_star_out_log | File (Optional) | STAR log out for reference genome |
STAR Log.out for reference genome |
insilico_star_final_log | File | STAR final log for insilico genome |
STAR Log.final.out for insilico genome |
insilico_star_stdout_log | File (Optional) | STAR stdout log for insilico genome |
STAR Log.std.out for insilico genome |
reference_star_final_log | File | STAR final log for reference genome |
STAR Log.final.out for reference genome |
reference_star_stdout_log | File (Optional) | STAR stdout log for reference genome |
STAR Log.std.out for reference genome |
insilico_star_progress_log | File (Optional) | STAR progress log for insilico genome |
STAR Log.progress.out for insilico genome |
reference_star_progress_log | File (Optional) | STAR progress log for reference genome |
STAR Log.progress.out for reference genome |
https://w3id.org/cwl/view/git/378f693ebfb3edf9f589007e366fec1195ec1464/subworkflows/allele-vcf-alignreads-se-pe.cwl