Workflow: FragPipe: Convert - Identify - PeptideProphet

Fetched 2024-04-30 08:55:45 GMT

The first step of the workflow consists of converting the raw mass spectrometry data to the mzML format using msconvert, followed by the database search using MSFragger, and the peptide validation using PeptideProphet. #End-user license agreement (EULA) A central tool in this workflow is MSFragger which can only be used for academic research, non-commercial or educational purposes. To run this workflow you must accept the EULA below. ``` #7143 - MSFragger - Academic and Research Education Use License This license includes a digital file, available for download after the license agreement has been executed and approved (if necessary) which includes the University of Michigan PROGRAM. NOTICE: MSFragger utilizes the MSFTBX library to read mzXML and mzML files. The MSFTBX uses third-party software that is subject to open source and/or third-party license terms. These terms are attached in the file LICENSE-3RD-PARTY.txt. By accepting the terms and conditions of this Agreement, LICENSEE will also be subject to those terms and conditions. PROGRAM: MSFragger UM # 7143 - Philosopher Pipeline Distribution Permissions - Cancer Genomics Cloud This Agreement is made by and between The Regents of The University of Michigan, a constitutional corporation of the state of Michigan (hereinafter \"MICHIGAN\") and LICENSEE. BACKGROUND 1. Members of the University of Michigan’s Medical School Departments of Pathology and Computational Medicine and Bioinformatics have developed a proprietary software application and related documentation for peptide identification referred to as “MSFragger”, for use by trained individuals in bioinformatics and proteomics research (hereinafter referred to as \"PROGRAM\"); and 2. LICENSEE desires to obtain, and MICHIGAN, consistent with its mission of education and research, desires to grant, a license to use the PROGRAM subject to the terms and conditions set forth below; and The parties therefore agree as follows: I. LICENSE MICHIGAN hereby grants to LICENSEE a non-exclusive, non-transferable right to use the PROGRAM solely for academic research, non-commercial or educational purposes within the LICENSEE’s department and subject to the terms and conditions of this Agreement. Commercial use of any kind of the PROGRAM is strictly prohibited. This license is solely for deployment of the PROGRAM in conjunction with the Cancer Genomics Cloud project. No other license rights are granted. II. LIMITATION OF LICENSE AND RESTRICTIONS A. LICENSEE shall not use, print, copy, translate, reverse engineer, decompile, disassemble, modify, create derivative works of or publicly display the PROGRAM, in whole or in part, unless expressly authorized by this Agreement. B. LICENSEE agrees that it shall use the PROGRAM only for LICENSEE'S sole and exclusive use, and shall not disclose, sell, license, or otherwise distribute the PROGRAM, in whole or in part, to any third party without the prior written consent of MICHIGAN. LICENSEE shall not assign this Agreement, and any attempt by LICENSEE to assign it shall be void from the beginning. LICENSEE agrees to secure and protect the PROGRAM and any copies of the PROGRAM in a manner consistent with the maintenance of MICHIGAN'S rights in the PROGRAM and to take appropriate action by instruction or agreement with its employees who are permitted access to the PROGRAM in order to satisfy LICENSEE'S obligations under this Agreement. C. LICENSEE may copy the PROGRAM for the purpose of using the PROGRAM on other devices and for archival purposes. D. Such license shall be only for its intended use and in accordance with applicable Food and Drug Administration (FDA) regulations for pre-clinical and clinical research, including, but not limited to, applicable investigational device exemption (IDE) regulations, institutional review board (IRB) approval, required protocols and monitoring of the study, required records and reports, and applicable protection of human subjects regulations. E. LICENSEE agrees to acknowledge MICHIGAN in scientific publications resulting (in part) of the use of the PROGRAM, by making reference to the publication of the PROGRAM when it becomes available in the scientific literature. Copyright notice should be as follows, “MSFragger © 2016 The Regents of the University of Michigan.” III. CONSIDERATION A no-cost option for the license rights granted in this Agreement. IV. TITLE AND OWNERSHIP A. No ownership rights of MICHIGAN in the PROGRAM are conferred upon LICENSEE by this Agreement. B. LICENSEE acknowledges MICHIGAN'S proprietary rights in the PROGRAM and agrees to reproduce all copyright notices supplied by MICHIGAN on all copies of the PROGRAM. C. LICENSEE agrees this license is explicitly for academic research applications only and that it cannot be used for commercial purposes of any kind. LICENSEE further acknowledges that any use in a commercial setting will be subject to prosecution to the fullest extent available under the law and remedies may include full recovery of legal, operational, economic and other losses which the LICENSEE will be directly liable for. V. DISCLAIMER OF WARRANTY AND LIMITATION OF LIABILITY A. THE PROGRAM IS PROVIDED \"AS IS\" WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESS OR IMPLIED, INCLUDING WITHOUT LIMITATION THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE. MICHIGAN DOES NOT WARRANT THAT THE FUNCTIONS CONTAINED IN THE PROGRAM WILL MEET LICENSEE'S REQUIREMENTS OR THAT OPERATION WILL BE UNINTERRUPTED OR ERROR FREE. MICHIGAN shall not be liable for special, indirect, incidental, or consequential damages with respect to any claim on account of or arising from this Agreement or use of the PROGRAM, even if MICHIGAN has been or is hereafter advised of the possibility of such damages. Because some states do not allow certain exclusions or limitations on implied warranties or of liability for consequential or incidental damages, the above exclusions may not apply to LICENSEE. In no event, however, will MICHIGAN be liable to LICENSEE, under any theory of recovery, in an amount in excess of the license fee paid by LICENSEE under this Agreement. B. LICENSEE agrees that MICHIGAN has no obligation to provide to LICENSEE any maintenance, support, or update services. Should MICHIGAN provide any revised versions of the PROGRAM to LICENSEE, LICENSEE agrees that this license agreement shall apply to such revised versions. VI. WARRANTY OF LICENSEE LICENSEE warrants and represents that it will carefully review any documentation or instructional material provided by MICHIGAN. VIII. TERMINATION If LICENSEE at any time fails to abide by the terms of this Agreement, MICHIGAN shall have the right to immediately terminate the license granted herein, require the return or destruction of all copies of the PROGRAM from LICENSEE and certification in writing as to such return or destruction, and pursue any other legal or equitable remedies available. VIII. MISCELLANEOUS A. This Agreement shall be construed in accordance with the laws of the state of Michigan. Should LICENSEE for any reason bring a claim, demand, or other action against MICHIGAN, its agents or employees, arising out of this Agreement or the PROGRAM licensed herein, LICENSEE agrees to bring said claim only in the Michigan Court of Claims. B. THIS AGREEMENT REPRESENTS THE COMPLETE AND EXCLUSIVE STATEMENT OF THE AGREEMENT BETWEEN MICHIGAN AND LICENSEE AND SUPERSEDES ALL PRIOR AGREEMENTS, PROPOSALS, REPRESENTATIONS AND OTHER COMMUNICATIONS, VERBAL OR WRITTEN, BETWEEN THEM WITH RESPECT TO USE OF THE PROGRAM. THIS AGREEMENT MAY BE MODIFIED ONLY WITH THE MUTUAL WRITTEN APPROVAL OF AUTHORIZED REPRESENTATIVES OF THE PARTIES. C. The terms and conditions of this Agreement shall prevail notwithstanding any different, conflicting, or additional terms or conditions which may appear in any purchase order or other document submitted by LICENSEE. LICENSEE agrees that such additional or inconsistent terms are deemed rejected by MICHIGAN. D. Unless otherwise exempt therefrom, LICENSEE agrees that it will be responsible for any sales, use or excise taxes imposed by any governmental unit in this transaction except income taxes. E. LICENSEE acknowledges that the PROGRAM is of United States origin. LICENSEE agrees to comply with all applicable international and national laws that apply to the PROGRAM, including the United States Export Administration Regulations, as well as end-user, end-use, and destination restrictions issued by the United States. F. MICHIGAN and LICENSEE agree that any xerographically or electronically reproduced copy of this fully-executed agreement shall have the same legal force and effect as any copy bearing original signatures of the parties. G. MSFragger utilizes the MSFTBX library to read mzXML and mzML files. The MSFTBX uses third-party software that is subject to open source and/or third-party license terms. These terms are attached in the file LICENSE-3RD-PARTY.txt. By accepting the terms and conditions of this Agreement, LICENSEE will also be subject to those terms and conditions ```

children parents
Workflow as SVG
  • Selected
  • Default Values
  • Nested Workflows
  • Tools
  • Inputs/Outputs

Inputs

ID Type Title Doc
ppm Boolean (Optional)

use ppm mass error instead of Daltons for mass modeling

EULA Boolean Accept End Level User Agreement
decoy String (Optional)

semi-supervised mode, protein name prefix to identify decoy entries (default \"rev_\")

accmass Boolean (Optional)

use accurate mass model binning

combine Boolean (Optional)

combine the results from PeptideProphet into a single result file

nonparam Boolean (Optional)

use semi-parametric modeling, must be used in conjunction with --decoy option

deisotope Integer (Optional)

Perform deisotoping or not.

raw_files File[] RAW files
decoyprobs Boolean (Optional)

compute possible non-zero probabilities for decoy entries on the last iteration

expectscore Boolean (Optional)

use expectation value as the only contributor to the f-value for modeling

decoy_prefix String (Optional)

Prefix added to the decoy protein ID.

database_name File database file (fas)

Path to the protein database file in FASTA format.

isotope_error String (Optional)

Also search for MS/MS events triggered on specified isotopic peaks.

add_C_cysteine Float (Optional)
calibrate_mass Integer (Optional)

Perform mass calibration (0 for OFF, 1 for ON, 2 for ON and find optimal parameters).

variable_mod_01 String (Optional)
variable_mod_02 String (Optional)
variable_mod_03 String (Optional)
variable_mod_04 String (Optional)
variable_mod_05 String (Optional)
add_W_tryptophan Float (Optional)
precursor_charge String (Optional)

Assumed range of potential precursor charge states. Only relevant when override_charge is set to 1.

digest_max_length Integer (Optional)

Maximum length of peptides to be generated during in-silico digestion.

digest_min_length Integer (Optional)

Minimum length of peptides to be generated during in-silico digestion.

num_enzyme_termini Integer (Optional)

0 for non-enzymatic, 1 for semi-enzymatic, and 2 for fully-enzymatic.

output_report_topN Integer (Optional)

Reports top N PSMs per input spectrum.

search_enzyme_name String (Optional)

Name of enzyme to be written to the pepXML file.

fragment_mass_units Integer (Optional)

Fragment mass tolerance units (0 for Da, 1 for ppm).

max_fragment_charge Integer (Optional)

Maximum charge state for theoretical fragments to match (1-4).

precursor_mass_lower Integer (Optional)

Lower bound of the precursor mass window.

precursor_mass_units Integer (Optional)

Precursor mass tolerance units (0 for Da, 1 for ppm).

precursor_mass_upper Integer (Optional)

Upper bound of the precursor mass window.

precursor_true_units Integer (Optional)

True precursor mass tolerance units (0 for Da, 1 for ppm).

write_calibrated_mgf Integer (Optional)

Write calibrated MS2 scan to a MGF file (0 for No, 1 for Yes).

add_B_user_amino_acid Float (Optional)
add_J_user_amino_acid Float (Optional)
add_O_user_amino_acid Float (Optional)
add_U_user_amino_acid Float (Optional)
add_X_user_amino_acid Float (Optional)
add_Z_user_amino_acid Float (Optional)
search_enzyme_cutafter String (Optional)

Residues after which the enzyme cuts.

allowed_missed_cleavage Integer (Optional)

Allowed number of missed cleavages per peptide. Maximum value is 5.

fragment_mass_tolerance Integer (Optional)

Fragment mass tolerance (window is +/- this value).

precursor_true_tolerance Integer (Optional)

True precursor mass tolerance (window is +/- this value).

Steps

There are no steps in this workflow

Outputs

ID Type Label Doc
peptide_archive_with_mzml File (Optional)
peptide_prophet_folder_archive File (Optional)
Permalink: https://w3id.org/cwl/view/git/080063b7443c98bc378935e16f747f8c542c9521/FragPipe-Convert-Identify-PeptideProphet/fragpipe-convert-identify-peptideprophet.cwl