Re: [ccp4bb] Problems with phasing a protein (1300aa)

Jim Pflugrath Mon, 23 Mar 2009 09:22:16 -0700

On Mon, 23 Mar 2009, Phil Evans wrote:

I'm happy to change the column titles if it makes it clearer. Actually the"I/sigma" column in the Scala output is not very useful:it is <I> / RMSscatter, ie the mean intensity/mean error, for individualobservations, not taking into account multiple measurements. Because it isratio of means (rather than a mean of ratios), it can behave oddly dependingon the distribution of intensities, for instance giving an overall valuewhich is outside the range of values in resolution bins. It is the ratio ofthe previous two columns.
On the other hand the column labelled "Mn(I)/sd" is the mean of ratios foreach reflection, ie< <I>/σ(<I>) > and does take into account themultiplicity of measurements, so is much more relevant as an indicator ofdata quality
see
http://www.ccp4wiki.org/~ccp4wiki/wiki/index.php?title=Scaling_experimental_intensities_with_Scala

FWIW, dtscaleaverage in the d*TREK has the same two columns of I/sigI andlabels them "I/sig unavg" and "I/sig avg"

Of course, going from one to the other takes into account the multiplicityof the observations with the ASSUMPTION that the errors are random andnormally distributed. With all the systematic and erratic errors inmeasurements, I'm not sure this assumption is always valid.


Note to students:  I think this will be on the quiz I hand out next week!

Jim

Scala also outputs a convenient "Table 1" summary
...

Re: [ccp4bb] Problems with phasing a protein (1300aa)

Reply via email to