Re: [ccp4bb] Resolution Discrepancy in Data Set

Randy John Read Tue, 23 Jan 2024 12:24:30 -0800

Hi,

I think the point about an R-factor of 42% is a bit more subtle than it comes 
across in Martin’s reply. For random data without measurement errors (i.e. 
coming from a Wilson distribution of intensities), the expected R-factor for 
acentric data is something like 59%. The 42% in Evans & Murshudov comes about 
when the intensities have such large errors that the measurements don’t 
contribute any information to the amplitudes, so after running the 
French/Wilson algorithm in truncate, they all have the same value, i.e. the 
expected value of an amplitude before making a measurement. So 42% is the 
R-factor expected for data that are all equal to the average of the structure 
factor amplitudes they’re being compared to.


In this data set, with I/sig(I) dropping to about 0.1 in the high resolution 
shells, an R of 42% or so would be expected, because all the “measured” 
amplitudes (produced by truncate) will have almost the same value.

Best wishes,

Randy Read

> On 23 Jan 2024, at 14:49, Martin Malý <martin.maly...@email.cz> wrote:
>
> Dear all,
>
> I am sorry for a late reply. R-values should not exceed 0.42 which happened 
> in your case for shells 1.91-1.83 and 1.83-1.77. It is because theoretically 
> (under some assumptions), a perfect model gives an R value of 0.42 against 
> random data (Evans & Murshudov 2013 https://doi.org/10.1107/S0907444913000061 
> ). So as David wrote, your data are not better than 1.9 A. I would refine the 
> structure against data at resolution of 2.2 A (including modelling of water 
> molecules), then run paired refinement to add resolution shells 2.2-2.1, 
> 2.1-2.0 and 2.0-1.9A and choose an optimal high-resolution cutoff according 
> results. Feel free to ask.
>
> "the actual reflections used in calculations and reported statistics may be 
> different (in phenix.refine)"
> Thank for this comment, this was new for me, good to know! I can imagine it 
> can cause misleading situations/interpretations...
>
> Nowadays, I quite often hear an opinion "You do not have to cut your data at 
> high-resolution, refinement program will put low weights for noisy 
> high-resolution reflections so they will not harm your model." I just would 
> like to ask Pavel and other refinement experts - would you dis/agree? Such an 
> approach should be possible by principle but R-values etc. look quite ugly 
> then.
>
> Best wishes,
> Martin
>
> On Wed, 2024-01-17 at 18:15 -0800, Pavel Afonine wrote:
>>
>>
>> Hi Liliana,
>>
>> a few things to consider:
>>
>> 0) There is a Phenix mailing list for Phenix specific questions (phenixbb);
>>
>> 1) Bin completeness depends on (obviously) how binning is done (number of 
>> reflections per bin or number of bins or binning in d^2 or d^3 spacing or 
>> log-binning etc etc etc) -- all of this will affect the number of 
>> reflections in the "highest resolution bin" and corresponding statistics. 
>> And..."how binning is done" wildly differs by the program used or even 
>> within the same program!
>>
>> 2) Refinement (in Phenix) as well as many other Phenix tools apply temporary 
>> reflection omission according to the Read (1999) paper (Acta Cryst. (1999). 
>> D55, 1759-1764). This means while you have so many reflections in your input 
>> reflection file, the actual reflections used in calculations and reported 
>> statistics may be different. Most logs files keep a good record of this, 
>> meaning you can track this down by inspecting logs files carefully.
>>
>> Let me know if you need more assistance with this issue.
>>
>> Cheers,
>> Pavel
>>
>> On Wed, Jan 17, 2024 at 1:43 PM David Briggs <david.bri...@crick.ac.uk> 
>> wrote:
>>> Hi Liliana,
>>>
>>> Two things leap out at me when I look at your data summary.
>>>
>>> (1) Your data probably do not go to 1.77Å. The CC1/2 in your outer shell is 
>>> below any of the usual thresholds. There are discussions to be had about 
>>> what the threshold is, but normally CC1/2 values of 0.5 or sometimes 0.3 
>>> are used. You should also consider I/sigI.
>>>
>>> (2) I believe that by default Phenix.refine excludes weaker reflections 
>>> from refinement, which leads to the discrepancy in completeness statistics. 
>>> As your data do not extend as far as what is contained in your mtz file, 
>>> Phenix excludes those essentially "empty" reflections. Judging by the 
>>> Phenix refine output, I would estimate your data goes to somewhere around 
>>> 1.9Å
>>>
>>> The program Pairef can help inform your choice of high-resolution cutoff.
>>>
>>> This can be run from CCP4cloud, but is also available for Phenix, I believe.
>>>
>>> See https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8248825/
>>>
>>> Hope this helps,
>>>
>>> Dave
>>>
>>>
>>> Dr David C. Briggs CSci MRSB
>>> Principal Laboratory Research Scientist
>>> Signalling and Structural Biology Lab
>>> The Francis Crick Institute
>>> London, UK
>>> ==
>>> about.me/david_briggsFrom: CCP4 bulletin board <CCP4BB@JISCMAIL.AC.UK> on 
>>> behalf of Liliana Margent <lmarg...@gradcenter.cuny.edu>
>>> Sent: Wednesday, January 17, 2024 9:19:36 PM
>>> To: CCP4BB@JISCMAIL.AC.UK <CCP4BB@JISCMAIL.AC.UK>
>>> Subject: [ccp4bb] Resolution Discrepancy in Data Set
>>>
>>> External Sender: Use caution.
>>>
>>>
>>> Hello all,
>>>
>>> I hope this message finds you well.
>>>
>>> In my current data set, I’ve encountered a discrepancy between the 
>>> completeness in the high-resolution shells in merged statistics vs the 
>>> refinement statistics. For example, when I look at my merged statistics 
>>> file, output by Xia2 dials, the completeness in the high-resolution shells 
>>> are 97.6%. When I take this data and subsequently refine it in PHENIX I get 
>>> extremely different completeness ranges in the high-resolution shells, but 
>>> I cannot figure out why this large difference is occurring. I’m reaching 
>>> out to you, our esteemed community, for any insights or advice you might 
>>> have. Has anyone else faced a similar challenge? If so, how did you 
>>> navigate through it?  Your experiences and suggestions could be invaluable 
>>> in helping me understand and resolve this issue.
>>>
>>> Thank you in advance for your time and expertise.
>>>
>>> Best regards,
>>> Liliana
>>>
>>> see a side-by-side image of the files I mention in the message in 
>>> https://drive.google.com/file/d/1Y783MzlnqVwXRCtiLeV0Y2EpmkeDL2nd/view?usp=sharing
>>>
>>> ########################################################################
>>>
>>> To unsubscribe from the CCP4BB list, click the following link:
>>> https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1
>>>
>>> This message was issued to members of http://www.jiscmail.ac.uk/CCP4BB, a 
>>> mailing list hosted by http://www.jiscmail.ac.uk/, terms & conditions are 
>>> available at https://www.jiscmail.ac.uk/policyandsecurity/
>>> The Francis Crick Institute Limited is a registered charity in England and 
>>> Wales no. 1140062 and a company registered in England and Wales no. 
>>> 06885462, with its registered office at 1 Midland Road London NW1 1AT
>>>
>>> To unsubscribe from the CCP4BB list, click the following link:
>>> https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1
>>
>>
>> To unsubscribe from the CCP4BB list, click the following link:
>> https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1
>
>
>
> To unsubscribe from the CCP4BB list, click the following link:
> https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1

-----
Randy J. Read
Department of Haematology, University of Cambridge
Cambridge Institute for Medical Research     Tel: +44 1223 336500
The Keith Peters Building
Hills Road                                                       E-mail: 
rj...@cam.ac.uk
Cambridge CB2 0XY, U.K.                              
www-structmed.cimr.cam.ac.uk


########################################################################

To unsubscribe from the CCP4BB list, click the following link:
https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1

This message was issued to members of www.jiscmail.ac.uk/CCP4BB, a mailing list 
hosted by www.jiscmail.ac.uk, terms & conditions are available at 
https://www.jiscmail.ac.uk/policyandsecurity/

Re: [ccp4bb] Resolution Discrepancy in Data Set

Reply via email to