Thank you, yes, threading style tools to assess the likelihood of having a 
given amino acid in a certain position in the fold would be a good approach. I 
have tried one but wasn't hugely informative, in my hands anyway. All 
suggestions very welcome but big database science is a bit outside my skill 
set. Of course, sequence conservation in the family helps a lot with the 
assignment but there are bigger ambiguities in less conserved surface regions 
e.g. a disordered Lys can refine well as a Ser, Ala or Gly even, but e.g. 
nearby conserved acidic groups might suggest the presence of the basic amino 
acid which could salt-bridge with them, but why then would it be so disordered? 
Tangles we weave...

Sent from ProtonMail mobile

-------- Original Message --------
On 29 Jul 2022, 14:17, Clemens Vonrhein wrote:

> Maybe a crazy idea, but couldn't one use various model/geometry validation 
> tools to figure out some of those ambiguities? As a test one could take a 
> very good 1.7A structure and do some random ASN->ASP, THR->VAL etc mutations 
> followed by refinement (including hydrogens). Wouldn't some validation tool 
> pick up unfavourable conformations, poor rotamers and/or hydrogen clashes and 
> poor H-networks (compared to the initial, correct sequence)? Maybe there is 
> some kind of "fingerprint" in validation results for such incorrect residue 
> assignments that can distinguish correct from incorrect sequences ... Or put 
> another way, if model validation can not pick up such sequence errors: should 
> we be worried about the reliability of our validation criteria? A large scale 
> re-refinement of deposited structures with (1) the current/correct sequence 
> and (2) those ASN/ASP, THR/VAL etc ambiguities artificially introduced, could 
> provide a clever algorithm (AI?) with the data basis to figure out those 
> "fingerprints". Maybe even for the ASN/GLN/HIS side-chain orientations when 
> the sequence is actually correct. Cheers Clemens On Fri, Jul 29, 2022 at 
> 12:08:58PM +0000, Jon Cooper wrote: > Thank you so much for your replies. I 
> apologise for being unclear. The protein is purified from a plant that hasn't 
> had its genome sequence determined. We know the enzyme family of the protein 
> and therefore the structure was originally solved by MR. The 'X-ray sequence' 
> we have is just determined from looking at the 1.7 Angstrom density, which is 
> good, over several refinement and rebuilding rounds. The resulting sequence 
> has been run through blast and it is up to 58% identical with other family 
> members. To me this seemed low but that degree of identity is typical of 
> other family members. The postgrad who did the work did obtain some peptide 
> sequences and prior to that about 20% of the sequence was determined by the 
> Edman method with the usual Asp/Asn and Glu/Gln ambiguity. However, there 
> isn't any prospect of us doing further experimental work, sorry, but that's 
> the way it is!! > > Best wishes, Jon.C. > > Sent from ProtonMail mobile > > 
> -------- Original Message -------- > On 29 Jul 2022, 12:23, Jan Dohnalek 
> wrote: > > > If you know at least something about your protein, organism, 
> type of molecule, ..., you could try mass spectrometry peptide mapping to 
> known sequences, this may give you some answers for the ambiguities you might 
> be seeing, if nothing else .. > > > > Jan > > > > On Fri, Jul 29, 2022 at 
> 12:15 PM Jon Cooper wrote: > > > >> Hello, I am looking for suggestions of 
> ways to check a 1.7 Angstrom X-ray sequence for a protein where it is 
> impractical to do experimental sequencing, protein or DNA. The structure 
> refines to publishable R/R-free and the main ambiguities seem to be Thr/Val, 
> Asp/Asn and Glu/Gln where alternative H-bonding networks are possible. 
> Running alpha-fold seems an interesting option? Any suggestions much 
> appreciated. > >> > >> Cheers, Jon.C. > >> > >> Sent from ProtonMail mobile > 
> >> > >> --------------------------------------------------------------- > >> 
> > >> To unsubscribe from the CCP4BB list, click the following link: > >> 
> https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1 > > > > -- > 
> > > > Jan Dohnalek, Ph.D > > Institute of Biotechnology > > > > Academy of 
> Sciences of the Czech Republic > > Biocev > > Prumyslova 595 > > 252 50 
> Vestec near Prague > > Czech Republic > > Tel. +420 325 873 758 > > 
> ######################################################################## > > 
> To unsubscribe from the CCP4BB list, click the following link: > 
> https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1 > > This 
> message was issued to members of www.jiscmail.ac.uk/CCP4BB, a mailing list 
> hosted by www.jiscmail.ac.uk, terms & conditions are available at 
> https://www.jiscmail.ac.uk/policyandsecurity/ -- 
> *-------------------------------------------------------------- * Clemens 
> Vonrhein, Ph.D. vonrhein AT GlobalPhasing DOT com * Global Phasing Ltd., 
> Sheraton House, Castle Park * Cambridge CB3 0AX, UK www.globalphasing.com 
> *-------------------------------------------------------------- 
> ######################################################################## To 
> unsubscribe from the CCP4BB list, click the following link: 
> https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1 This message 
> was issued to members of www.jiscmail.ac.uk/CCP4BB, a mailing list hosted by 
> www.jiscmail.ac.uk, terms & conditions are available at 
> https://www.jiscmail.ac.uk/policyandsecurity/

########################################################################

To unsubscribe from the CCP4BB list, click the following link:
https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1

This message was issued to members of www.jiscmail.ac.uk/CCP4BB, a mailing list 
hosted by www.jiscmail.ac.uk, terms & conditions are available at 
https://www.jiscmail.ac.uk/policyandsecurity/

Reply via email to