Re: [ccp4bb] Problems with phasing a protein (1300aa)

James Holton Mon, 23 Mar 2009 08:54:29 -0700

I guess when I talk about signal-to-noise I assume the one that is mostrelevant to the task at hand. So, to me, I/sigma(I) at the phasing stepwould be the average intensity (I) divided by the sigma (standarddeviation) assigned to it AFTER scaling/mergeing. I admit that the"I/sigma" column from SCALA is potentially confusing, but if you aredealing with spot intensities, this is the first I/sigma you thinkabout, so I guess this is what Phil was thinking.

Personally, I think the descriptions of the columns in this table areclear if you read the caption before the table in the SCALA output, butTassos is right that an alarming number of people have never done this.RTFM?

When it doubt, use mtzdmp to see what the average values of the datacolumns really are.


-James Holton
MAD Scientist

Anastassis Perrakis wrote:

I like to think of things in terms of signal-to-noise, and one can use a
rearrangement of the Crick-Magdoff equation to tell you what the I/sigma
of your data set needs to be for delta-F to be greater thansigma(delta-F):
I/sigma(I) > 1.3*sqrt(Daltons/sites)/f"

where:
I/sigma(I) is the signal-to-noise ratio of the data set required to
solve it by MAD/SAD
Daltons   is the molecular weight of the protein in amu
sites         is the number of Se sites
f"            is the f" of those sites (in "electrons")
let me see .... we recently solved a 200 residues protein, 4 mol AU,with 2 Se per mol, total 8 Se.Since 160 residues were ordered, I will make for you a discount,18,000 D/monomer, 70,000 in AU.
I truncated data to 4.2 for Se search.

1.3*sqrt(70000/8)/6.5= 19

Statistics from Scala:
N 1/d^2 Dmin(A) Rmrg Rfull Rcum Ranom Nanom Av_I SIGMAI/sigma sd Mn(I/sd)1 0.0098 10.11 0.048 0.049 0.048 0.036 349 4967 41911.9 342 25.82 0.0196 7.15 0.050 0.044 0.049 0.031 707 5360 46211.6 372 28.83 0.0293 5.84 0.089 0.062 0.057 0.047 975 1634 2247.3 177 19.44 0.0391 5.06 0.065 0.048 0.059 0.039 1140 2107 20710.2 218 21.05 0.0489 4.52 0.061 0.043 0.060 0.034 1315 2523 22711.1 253 21.96 0.0587 4.13 0.072 0.051 0.062 0.035 1470 2142 2239.6 242 20.27 0.0685 3.82 0.091 0.061 0.066 0.042 1605 1566 2037.7 219 16.18 0.0782 3.57 0.128 0.086 0.071 0.052 1737 1034 1865.6 199 12.39 0.0880 3.37 0.189 0.137 0.077 0.074 1859 667 1813.7 187 8.910 0.0978 3.20 0.314 0.224 0.085 0.129 1940 374 1702.2 178 5.5
So, that would support your argument.

HOWEVER that would mean looking at the M(I/sd) in the table!!!
"Mn(I/sd)" is not the same as "I/sigma" in Scala notation!!!! Mostpeople think of I/sigma(I) in your notation,to be the I/sigma in the scala output, or the I devided by sd in theDenzo output. These are (very) different.I am not sure which you meant since I/sigma(I) is not the fullnotation (place the <> in the favorite place first ...),but it seems correct if you meant Mn(I/sd) which most people do notquote or use much ;-)
Greetings,
Tassos


A.

Re: [ccp4bb] Problems with phasing a protein (1300aa)

Reply via email to