Re: [ccp4bb] question about SIGF

Ethan Merritt Thu, 18 Aug 2011 09:52:37 -0700

On Thursday, August 18, 2011 08:19:13 am G Y wrote:
> Dear all,
> 
> I am a student in crystallography.  So not quite familiar with some even
> basic concepts.
> 
> In shelx .hkl file or ccp4 .mtz file there is a column SIGF which is related
> to standard deviation of the structure factor. I read through many text book
> for crystallography, there are many formulas about this topic. Sometimes it
> is a square of sigma, sometimes it is not.
> 
> My question is:
> 
> 1. What is the exact mathematical formula for SIGF or SIGFP in ccp4 or shelx
> format file?


There is none.  The column in the file simply holds whatever value was
input from some data-processing program.  You need to take a step back
and ask how each individual program calculates, or estimates, sigma(F).

> 
> 2. If it can be calculated from F, why it is necessary to include it in ccp4
> or shelx reflection file (they have F already) ?

It cannot be calculated from F.

> 3. Is this value really important in structure determination? Why and how?
> As I understood, during data collection each reflection measured several
> times, so there is a deviation from the average F. That is the meaning of
> SIGF. But how to use this value in structure determination? Is there some
> kind of correction or refinement on F according to SIGF?

It tells you how much faith to put in that particular Fobs.
If the sigma value is small (compared to Fobs) then you would tend to have
confidence that the Fobs value is accurate, and therefore rate your current
model better if it correctly predicts Fobs.  Conversely, if the sigma value
is large then you would tend to think that Fobs is unreliable and therefore
not penalize your current model if Fcalc != Fobs.
This line of reasoning explains why the primary use of sigma is to weight
the various residuals during refinement.   Typically the weight assigned
to each observation is (1 / sigma^2).

Although it goes a step or two beyond the specific question you asked,
I suggest that you work through Randy Read's notes on maximum likelihood,
weighting, and least-squares.
    http://www-structmed.cimr.cam.ac.uk/Course/Likelihood/likelihood.html
This may lead to a deeper understanding of how accounting for sigma
leads to a better model.

        Ethan
        




> And also when multiplicity during data collection is low, the SIGF would not
> be so interested. So is that means the SIGF would not be so important in
> some measurements?
> 
> Any kind reply from you guys would be greatly appreciated. Many thanks!
> 
> Best regards,
> G
> 

-- 
Ethan A Merritt
Biomolecular Structure Center,  K-428 Health Sciences Bldg
University of Washington, Seattle 98195-7742

Re: [ccp4bb] question about SIGF

Reply via email to