Dear Gottfried,

I am the lead developer on this standardisation project so I will try to answer your questions:

1. To standardise the handling of protein modifications, all modifications are being grouped into categories based on their chemical structure and context. All modifications within each category are being handled in the same way, whereas the handling of modifications differs between modification categories. For example, phosphorylation is handled as a single chemical unit that includes both modification group and peptide residue (e.g. SEP). In contrast, all hemes are handled as being a separate chemical unit that is covalently linked to the peptide residue (e.g. HEC).

We now have a process in our annotation pipeline to identify protein modifications and categorise them. When a new modification enters the archive it will be assessed manually to decide the best way to consistenly handle it. This means that either way you deposit the new protein modification, an annotator will review and decide how best to handle it.

As Paul mentioned, a more detailed description of the project can be found here:
https://github.com/wwPDB/protein-modification-extension/tree/main

2-3. CAC already exists in the archive and has been observed covalently linked to Lysine. In this case it was handled as CAC covalently linked to LYS, so as two separate chemical units. The modification is described in the PCM category "Covalent chemical modification". If possible I would try to submit the structure maintaining this format (so splitting the modified residue into SER + CAC). If this is not possible then this can be handled by a PDB biocurator during the annotation process.

4-5. I don't know the simplest way to chop a residue into two groups. However, if you've already deposited a structure where SER + CAC are described as a single chemical unit then you don't need to separate them and resubmit the structure. It would be best to message the biocurators that are working on the structure and ask them to modify the entry.

I hope this helps answer your questions. Let me know if anything remains unclear.

Kind regards,
Marcus Bage


On 15/10/2024 16:19, Palm, Gottfried wrote:
Dear all,

  I have a couple of questions regarding deposition of a structure with a posttranslational modification.

1. I understand, that PCMs (protein chemical modifications) and PTMs are now handled in a standard way, but what is the standard way? Is there a link giving insight into these rules?

2. Specifically, I have a cacodylate (dimethylarsenic acid) modified serine. A very similar modification, phosphoserine, has its own three letter code, SEP. The cacodylate modification is handled as serine and cacodylate (CAC) with a link record (in entry 1LZK). What is the correct handling?

3. For SER-CAC I could not find an existing link.cif file for refinement. I rather created a DRG.cif file for the modified serine and used this for refmac and coot. This works, but how should I deposit this? The modified residue would clash with the existing combination of SER / CAC / SER-CAC link.

4. Can I exchange the modified serine by a serine plus cacodylate in the cif file without going through refmac (I don't have the link.cif file)? In pdb format I would simply exchange the ATOM lines, but in cif format?

5. Once I have a cif file with separate SER and CAC residues, I have to reupload the renamed coordinates again in the deposition process, thereby loosing all extra information I typed in already. Can I prevent this loss? (ok, that's a minor problem)

Best rergards
  Gottfried


Dr. Gottfried Palm
Universität Greifswald
Inst. für Biochemie (MNF)
Synthetische und Strukturelle Biochemie
Felix-Hausdorff-Straße 4
17489 Greifswald


------------------------------------------------------------------------

To unsubscribe from the CCP4BB list, click the following link:
https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1 <https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1>

--
-------------------------------------------------------------------
Marcus Bage, Ph.D.
Bioinformatician
PDBe - Protein Data Bank in Europe

European Bioinformatics Institute (EMBL-EBI)
European Molecular Biology Laboratory
Wellcome Trust Genome Campus
Hinxton
Cambridge CB10 1SD UK

http://www.PDBe.org
-------------------------------------------------------------------

########################################################################

To unsubscribe from the CCP4BB list, click the following link:
https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1

This message was issued to members of www.jiscmail.ac.uk/CCP4BB, a mailing list 
hosted by www.jiscmail.ac.uk, terms & conditions are available at 
https://www.jiscmail.ac.uk/policyandsecurity/

Reply via email to