Re: [ccp4bb] Best method for weighted averaging of Friedel pairs?

ccp4 Mon, 20 Feb 2012 15:16:55 -0800

This is a case where it is really helpful to keep some record of theunmerged integrated data.

And again rejecting the odd outlier does no harm to most analyses..

I like to use scala to check for outliers looking at all i+ I-measurements; if there is a wild discrepancy for weak anomalous signalsyou have probably found an outlier which is best rejected.

If you have a huge anomalous signal with good redundancy you probablyshouldnt use Imean and the anomalous difference will be I= -I- with SD =sqrt (VarI+ VarI-).

Most software which uses that signal will check for outliers in theAnom difference lists too, and it is usually safest to exclude them fromanom site searches, and phase calculations.

Eleanor
  On 14.02.2012 06:30, Tim Gruene wrote:

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Dear Markus,

why don't you reintegrate the data with hkl2000 telling the programto

treat them as non-anomalous data-set? This should give you scalepack

output with the Bijvoet pairs merged and overcome the problem youdescribe.


Cheers,
Tim

On 02/13/2012 06:19 PM, Markus Meier wrote:

On 11/02/12 02:52 PM, Bryan Lepore wrote:
did you ever get a response on this? it is interesting but nobody
posted publicly.

-Bryan
Dear Bryan,

so far no one replied ... so please find my answer below. If someone
disagrees, please post.

None of the methods I have described are appropriate.
If the negative Bijvoet mates and the positive Bijvoet mates havebeenmerged separately to one intensity value for each (i.e. I+ or I-)plusthe associated standard deviation (sigI+ or sigI-), any weightedmethodto calculate the mean will bias the intensity to either the I+ orthe I-.
Therefore the only appropriate method is to use the unweighted mean:

Imean = 0.5*( I+ + I- )
sigImean = 0.5 * sqrt( sigI+^2 + sigI-^2 )
The only CCP4 program I found that actually does this is mtzMADmod.Thismethod also has the advantage that the original intensity values ofI+and I- can be reconstructed from the mean and the anomalousdifference
(albeit with the loss of the original standard deviations).

Method 1 (scalepack2mtz)
should not be used. The resulting value is not the best estimate
(maximum likelihood)

Method 2 (in book by B. Rupp)
gives the maximum likelihood average in case that the reflectionsareequivalent and is thus appropriate for the merging of the negative(orpositive) set of Bijvoet mates, centric reflections (where theanomalous
differences are zero) or in the case of an non-anomalous dataset the
merging of symmetry equivalent reflections.

Method 3
gives a more realistic sigma value in the case that the individual
intensity values are far apart and their individual standarddeviations
are small. Consider the example I have posted:

I+: 23841.50 sigI+: 634.01 I-: 9628.57, sigI-: 264.75
Method 2: Imean=11738.95, sigIMean=244.31
Method 3: Imean=11738.95, sigIMean=7106.47

If the I+ and I- values above actually were symmetry equivalent
reflections in an non-anomalous dataset, the sigImean from method 2is
ridiculously small and method 3 gives a far more realistic value. If
method 3 is the best mathematical solution to this problem I am notable
to judge and I have to trust the statistician (or programmer) who
implemented this solution.

Cheers,
Markus

On 10/02/12 01:47 PM, Markus Meier wrote:
Dear all,
I have a anomalous dataset, processed in HKL2000. Scalepack outputsafile containing the separately merged sets of the Friedel pairs I-andI+ and their standard deviations sigI+ and sigI-. Scalepack doesnot
output the averaged intensities (Imean) and the standard deviations
(sigIMean).

The CCP4 program truncate that I use to convert the intensities to
amplitudes requires Imean, I- and I+ and the respective standard
deviations in its input file.

I have now found at least three different methods to generate the
averaged intensities from the Friedel pairs:

1) scalepack2mtz

   uses standard deviations for the weights:
   weights w = 1/sigI

   Imean = (w+*I+ + w-*I- ) / (w+ + w-)
   sigImean = 1 / (w+ + w-)
2) Method described in Biomolecular crystallography by BernhardRupp, p.
332/333
   to average symmetry equivalent reflections

   uses variances for the weights:
   weight w = 1/sigI^2

   Imean = (w+*I+ + w-*I- ) / (w+ + w-)
   sigImean = 1 / sqrt(w+ + w-)

3) Method used in cctbx
   function miller.set.average_bijvoet_mates() that calls generic
merge.merge_equivalent_obs():

   same as methods 2, except that

   sigImean is the larger of either
     a) sigImean = 1 / sqrt(w+ + w-)
     or
     b) sigImean = sqrt( wvariance )

   where wvariance =
     (w+ + w-) / [ (w+ + w-)^2 - (w+^2 + w-^2) ] *
     [ w+*(F+ - Imean)^2 + w-*(F- - Imean)^2 ]
What are the advantages and disadvantages of each method? Shouldmethod
1 be used at all?

Some example from my dataset:
Reflection (1, 1, 0), space group P3 2 1

I+: 23841.50 sigI+: 634.01 I-: 9628.57, sigI-: 264.75
Method 1: Imean=13815.32, sigImean=186.76
Method 2: Imean=11738.95, sigIMean=244.31
Method 3: Imean=11738.95, sigIMean=7106.47

Thanks a lot!

Cheers,
Markus


- --
- --
Dr Tim Gruene
Institut fuer anorganische Chemie
Tammannstr. 4
D-37077 Goettingen

GPG Key ID = A46BEE1A

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.10 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/

iD8DBQFPOUi5UxlJ7aRr7hoRArTUAJ4uF/sx2CcdWeETz1G+r19vi7W6SQCgzlAp
nSEYj5gbOlOvZ1+KBYRUrdM=
=qg83
-----END PGP SIGNATURE-----

Re: [ccp4bb] Best method for weighted averaging of Friedel pairs?

Reply via email to