protein, saps

Syntax

The parameter can have the values yes or no.

Notes

  1. No error message is generated when an uninterpretable value is entered for this parameter.
  2. If no value is set, the default value is yes.
  3. This option was introduced in X! Tandem and P3 in version 2007.04.01.

Description

When this parameter is set to yes, a list of known potential single amino acid polymorphisms is checked for each peptide residue considered in the first round of modelling. The default lists are based on known coding non-synonymous single nucleotide polymorphisms derived from the SNP annotations in ENSEMBL: the amino acid consequence of a nsSNP is commonly referred to as a snAPs. The annotation file format is the same as the file format used to annotate mutations in GPM BIOML files, for example:

<?xml version="1.0"?>
<bioml label="human sap listing, 2007.04.01, for ENSEMBL human, version 43">
<protein id="ENSP00000286809">
	<aa id="rs686364" at="151" type="S" mut="P" />
	<aa id="rs685967" at="129" type="T" mut="A" />
	<aa id="rs1557294" at="25" type="T" mut="A" />
</protein>
<protein id="ENSP00000372288">
	<aa id="rs8127420" at="40" type="S" mut="P" />
</protein>
...
<protein id="ENSP00000373050">
	<aa id="rs17002888" at="1325" type="S" mut="N" />
	<aa id="rs17002890" at="1165" type="M" mut="I" />
	<aa id="rs5758651" at="722" type="S" mut="G" />
	<aa id="rs1064961" at="681" type="G" mut="V" />
	<aa id="rs6002656" at="485" type="T" mut="N" />
	<aa id="rs34030679" at="405" type="M" mut="V" />
	<aa id="rs4484130" at="10" type="Y" mut="F" />
</protein>
<note>Total SAPS: 138704</note>
<note>Created from: human_saps.txt</note>
<note>Original name: human_saps.xml</note>
</bioml>

You can check GPM SAP repository for latest versions of these annoation files for ENSEMBL species. Please be aware that these files can be quite large and they may cause problems when they are opened by HTML WWW browsers. If you wish to look at the details of these files, a text editor is usually the best choice.

The search engine requires that the protein id string matches the FASTA description line string, in order to make a one-to-one correspondence between a protein sequence and an SAP annoation. This may cause some problems when FASTA description lines that contain more than simply the protein accession number are used.

The association of an SAP annotation file with a taxonomy key word is made using a special entry in any taxonomy file. An example for a "human" entry is as follows:

	<taxon label="human">
		<file format="peptide" URL="../fasta/human.fasta.pro" />
		<file format="peptide" URL="../fasta/human_extra.fasta" />
		<file format="spectrum" URL="../fasta/human_fasta.hlf" />
		<file format="saps" URL="../fasta/human_saps.xml; />
	</taxon>

Any number of SAP annotation files may be associated with any taxon entry. If no SAP annotation file is specified or a specified file does not exist, the search will continue without failure. The X! series output file will contain a listing of the SAP files that were actually used.


see also: refine | point mutations | refinement snAPs | taxonomy | X! TANDEM point mutations

X! TANDEM API description project