Workflow: consensus_maf.cwl

Fetched 2023-01-08 23:49:46 GMT

Workflow to merge a large number of maf files into a single consensus maf file for use with GetBaseCountsMultiSample

children parents
Workflow as SVG
  • Selected
  • Default Values
  • Nested Workflows
  • Tools
  • Inputs/Outputs

Inputs

ID Type Title Doc
maf_files File[]

Steps

ID Runs Label Doc
merge_all_mafs
concat-mafs.cwl (CommandLineTool)

Special case of concat-tables.py to use with concatenating maf files for use with https://github.com/zengzheng123/GetBaseCountsMultiSample Note that these columns are required by GetBaseCountsMultiSample: ( as per https://github.com/zengzheng123/GetBaseCountsMultiSample/blob/master/GetBaseCountsMultiSample.cpp#L771-L801 ) Hugo_Symbol, Chromosome, Start_Position, End_Position, Reference_Allele, Tumor_Seq_Allele1, Tumor_Seq_Allele2, Tumor_Sample_Barcode, Matched_Norm_Sample_Barcode, t_ref_count, t_alt_count, n_ref_count, n_alt_count, Variant_Classification However, only these columns appear to influence the output: Hugo_Symbol Chromosome Start_Position End_Position Variant_Classification Reference_Allele Tumor_Seq_Allele1 Tumor_Seq_Allele2 usually in a column order matching this; grep -v '#' Proj_08390_G.muts.maf | cut -f1,5,6,7,9,11,12,13,16,17,41,42,44,45 > Proj_08390_G.cols.muts.maf also, if present these columns also appear in output but seem to be optional Center NCBI_Build see also; https://docs.gdc.cancer.gov/Data/File_Formats/MAF_Format/

deduplicate_maf
deduplicate-maf.cwl (CommandLineTool)

Remove duplicate lines from the maf file Expected columns are in this order; 1 Hugo_Symbol 2 Center 3 NCBI_Build 4 Chromosome 5 Start_Position 6 End_Position 7 Variant_Classification 8 Reference_Allele 9 Tumor_Seq_Allele1 10 Tumor_Seq_Allele2 11 n_alt_count 12 Matched_Norm_Sample_Barcode 13 t_alt_count 14 t_ref_count 15 n_ref_count 16 Tumor_Sample_Barcode NOTE: actually this order might be wrong but all that matters is that Chromosome and Start_Position are columns 4 and 5

Outputs

ID Type Label Doc
output_file File
Permalink: https://w3id.org/cwl/view/git/ba3ff09328cc646d7254b2d2ee0fbe1abca3d4ad/cwl/consensus_maf.cwl