-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 tg@slartibartfast:~/tmp$ phenix.python run.py 0.001 627.413 -4.01639e+06 303880 0.1 275.984 275.247 435.678 0.5 92.2049 92.206 93.6615 1 47.8941 47.8936 47.9421 10 3.54414 3.54415 3.5439 100 0.217171 0.21717 0.21714
weird numbers. A proper description would have 6e/A^3 for a C at x=(0,0,0) with B=0. How are these numbers 'not inaccurate'? Cheers, Tim On 09/19/2012 06:47 PM, Pavel Afonine wrote: > Hi James, > > using dynamic N-Gaussian approximation to form-factor tables as > described here (pages 27-29): > > http://cci.lbl.gov/publications/download/iucrcompcomm_jan2004.pdf > > and used in Phenix since 2004, avoids both: singularity at B=0 and > inaccurate density values (compared to the raw forma-factor tables) > for B->0. > > Attached is the script that proves this point. To run, simply > "phenix.python run.py". > > Pavel > > On Sun, Sep 16, 2012 at 11:32 PM, James Holton <jmhol...@lbl.gov> > wrote: > >> Yes, the constant term in the "5-Gaussian" structure factor >> tables does become annoying when you try to plot electron density >> in real space, but only if you try to make the B factor zero. If >> the B factors are ~12 (like they are in 1m1n), then the electron >> density 2.0 A from an Fe atom is not -0.2 e-/A^3, it is 0.025 >> e-/A^3. This is only 1% of the electron density at the center of >> a nitrogen atom with the same B factor. >> >> But if you do set the B factor to zero, then the electron density >> at the center of any atom (using the 5-Gaussian model) is >> infinity. To put it in gnuplot-ish, the structure factor of Fe >> (in reciprocal space) can be plotted with this function: >> Fe_sf(s)=Fe_a1*exp(-Fe_b1*s*s)**+Fe_a2*exp(-Fe_b2*s*s)+Fe_a3*** >> exp(-Fe_b3*s*s)+Fe_a4*exp(-Fe_**b4*s*s)+Fe_c >> >> where: Fe_c = 1.036900; Fe_a1 = 11.769500; Fe_a2 = 7.357300; >> Fe_a3 = 3.522200; Fe_a4 = 2.304500; Fe_b1 = 4.761100; Fe_b2 = >> 0.307200; Fe_b3 = 15.353500; Fe_b4 = 76.880501; and "s" is >> sin(theta)/lambda >> >> applying a B factor is then just multiplication by exp(-B*s*s) >> >> >> Since the terms are all Gaussians, the inverse Fourier transform >> can actually be done analytically, giving the real-space version, >> or the expression for electron density vs distance from the >> nucleus (r): >> >> Fe_ff(r,B) = \ >> +Fe_a1*(4*pi/(Fe_b1+B))**1.5***safexp(-4*pi**2/(Fe_b1+B)*r*r) \ >> +Fe_a2*(4*pi/(Fe_b2+B))**1.5***safexp(-4*pi**2/(Fe_b2+B)*r*r) \ >> +Fe_a3*(4*pi/(Fe_b3+B))**1.5***safexp(-4*pi**2/(Fe_b3+B)*r*r) \ >> +Fe_a4*(4*pi/(Fe_b4+B))**1.5***safexp(-4*pi**2/(Fe_b4+B)*r*r) \ >> +Fe_c *(4*pi/(B))**1.5*safexp(-4*pi****2/(B)*r*r); >> >> Where here applying a B factor requires folding it into each >> Gaussian term. Notice how the Fe_c term blows up as B->0? This >> is where most of the series-termination effects come from. If you >> want the above equations for other atoms, you can get them from >> here: >> http://bl831.als.lbl.gov/~**jamesh/pickup/all_atomsf.**gnuplot<http://bl831.als.lbl.gov/~jamesh/pickup/all_atomsf.gnuplot> >> >> http://bl831.als.lbl.gov/~**jamesh/pickup/all_atomff.**gnuplot<http://bl831.als.lbl.gov/~jamesh/pickup/all_atomff.gnuplot> >> >> This "infinitely sharp spike problem" seems to have led some >> people to conclude that a zero B factor is non-physical, but >> nothing could be further from the truth! The scattering from >> mono-atomic gasses is an excellent example of how one can observe >> the B=0 structure factor. In fact, gas scattering is how the >> quantum mechanical self-consistent field calculations of electron >> clouds around atoms was experimentally verified. Does this mean >> that there really is an infinitely sharp "spike" in the middle >> of every atom? Of course not. But there is a "very" sharp >> spike. >> >> So, the problem of "infinite density" at the nucleus is really >> just an artifact of the 5-Gaussian formalism. Strictly speaking, >> the "5-Gaussian" structure factor representation you find in >> ${CLIBD}/atomsf.lib (or Table 6.1.1.4 in the International Tables >> volume C) is nothing more than a curve fit to the "true" values >> listed in ITC volume C tables 6.1.1.1 (neutral atoms) and 6.1.1.3 >> (ions). These latter tables are the Fourier transform of the >> "true" electron density distribution around a particular >> atom/ion obtained from quantum mechanical self-consistent field >> calculations (like those of Cromer, Mann and many others). >> >> The important thing to realize is that the fit was done in >> _reciprocal_ space, and if you look carefully at tables 6.1.1.1 >> and 6.1.1.3, you can see that even at REALLY high angle >> (sin(theta)/lambda = 6, or 0.083 A resolution) there is still >> significant elastic scattering from the heavier atoms. The >> purpose of the "constant term" in the 5-Gaussian representation >> is to try and capture this high-angle "tail", and for the really >> heavy atoms this can be more than 5 electron equivalents. In >> real space, this is equivalent to saying that about 5 electrons >> are located within at least ~0.03 A of the nucleus. That's a >> very short distance, but it is also not zero. This is because >> the first few shells of electrons around things like a Uranium >> nucleus actually are very small and dense. How, then, can we >> have any hope of modelling heavy atoms properly without using a >> map grid sampling of 0.01A ? Easy! The B factors are never >> zero. >> >> Even for a truly infinitely sharp peak (aka a single electron), >> it doesn't take much of a B factor to spread it out to a >> reasonable size. For example, applying a B factor of 9 to a >> point charge will give it a full-width-half max (FWHM) of 0.8 A, >> the same as the "diameter" of a carbon atom. A carbon atom with >> B=12 has FWHM = 1.1 A, the same as a "point" charge with B=16. >> Carbon at B=80 and a point with B=93 both have FWHM = 2.6 A. As >> the B factor becomes larger and larger, it tends to dominate the >> atomic shape (looks like a single Gaussian). This is why it is >> so hard to assign atom types from density alone. In fact, with >> B=80, a Uranium atom at 1/100th occupancy is essentially >> indistinguishable from a hydrogen atom. That is, even a modest B >> factor pretty much "washes out" any sharp features the atoms >> might have. Sometimes I wonder why we bother with "form >> factors" at all, since at modest resolutions all we really need >> is Z (the atomic number) and the B factor. But, then again, I >> suppose it doesn't hurt either. >> >> >> So, what does this have to do with series termination? Series >> termination arises in the inverse Fourier transform (making a map >> from structure factors). Technically, the "tails" of a Gaussian >> never reach zero, so any sort of "resolution cutoff" always >> introduces some error into the electron density calculation. >> That is, if you create an arbitrary electron-density map, convert >> it into structure factors and then "fft" it back, you do _not_ >> get the same map that you started with! How much do they differ? >> Depends on the RMS value of the high-angle structure factors that >> have been cut off (Parseval's theorem). The "infinitely sharp >> spike" problem exacerbates this, because the B=0 structure >> factors do not tend toward zero as fast as a Gaussian with the >> "atomic width" would. >> >> So, for a given resolution, when does the B factor get "too >> sharp"? Well, for "protein" atoms, the following B factors will >> introduce an rms error in the electron density map equal to about >> 5% of the peak height of the atoms when the data are cut to the >> following resolution: d B 1.0 <5 1.5 8 2.0 27 2.5 45 3.0 65 >> 3.5 86 4.0 >99 >> >> smaller B factors than this will introduce more than 5% error at >> each of these resolutions. Now, of course, one is often not >> nearly as concerned with the average error in the map as you are >> with the error at a particular point of interest, but the above >> numbers can serve as a rough guide. If you want to see the >> series-termination error at a particular point in the map, you >> will have to calculate the "true" map of your model (using a >> program like SFALL), and then run the map back and forth through >> the Fourier transform and resolution cutoff (such as with SFALL >> and FFT). You can then use MAPMAN or Chimera to probe the >> electron density at the point of interest. >> >> But, to answer the OP's question, I would not recommend trying to >> do fancy map interpretation to identify a mystery atom. Instead, >> just refine the occupancy of the mystery atom and see where that >> goes. Perhaps jiggling the rest of the molecule with "kick maps" >> to see how stable the occupancy is. Since refinement only does >> forward-FFTs, it is formally insensitive to series termination >> errors. It is only map calculation where series termination can >> become a problem. >> >> Thanks to Garib for clearing up that last point for me! >> >> -James Holton MAD Scientist >> >> >> >> On 9/15/2012 3:12 AM, Tim Gruene wrote: >> > Dear Ian, > > provided that f(s) is given by the formula in the Cromer/Mann > article, which I believe we have agreed on, the inset of Fig.1 of > the Science article we are talking about is claimed to be the graph > of the function g, which I added as pdf to this email for better > readability. > > Irrespective of what has been plotted in any other article > meantioned throughout this thread, this claim is incorrect, given > a_i, b_i, c > 0. > > I am sure you can figure this out yourself. My argument was not > involving mathematical programs but only one-dimensional calculus. > > Cheers, Tim > > On 09/14/2012 04:46 PM, Ian Tickle wrote: > >>>>> On 14 September 2012 15:15, Tim Gruene >>>>> <t...@shelx.uni-ac.gwdg.de> wrote: >>>>> >>>>>> -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 >>>>>> >>>>>> Hello Ian, >>>>>> >>>>>> your article describes f(s) as sum of four Gaussians, >>>>>> which is not the same f(s) from Cromer's and Mann's paper >>>>>> and the one used both by Niu and me. Here, f(s) contains >>>>>> a constant, as I pointed out to in my response, which >>>>>> makes the integral oscillate between plus and minus >>>>>> infinity as the upper integral border (called 1/dmax in >>>>>> the article Niu refers to) goes to infinity). >>>>>> >>>>>> Maybe you can shed some light on why your article uses a >>>>>> different f(s) than Cromer/Mann. This explanation might >>>>>> be the answer to Nius question, I reckon, and feed my >>>>>> curiosity, too. >>>>>> >>>>> Tim & Niu, oops yes a small slip in the paper there, it >>>>> should have read "4 Gaussians + constant term": this is >>>>> clear from the ITC reference given and the >>>>> $CLIBD/atomsf.lib table referred to. In practice it's >>>>> actually rendered as a sum of 5 Gaussians after you >>>>> multiply the f(s) and atomic Biso factor terms, so unless >>>>> Biso = 0 (very unphysical!) there is actually no constant >>>>> term. My integral for rho(r) certainly doesn't oscillate >>>>> between plus and minus infinity as d_min -> zero. If yours >>>>> does then I suspect that either the Biso term was forgotten >>>>> or if not then a bug in the integration routine (e.g. can >>>>> it handle properly the point at r = 0 where the standard >>>>> formula for the density gives 0/0?). I used QUADPACK >>>>> (http://people.sc.fsu.edu/~**jburkardt/f_src/quadpack/**quadpack.html<http://people.sc.fsu.edu/~jburkardt/f_src/quadpack/quadpack.html> >>>>> >>>>> ) >>>>> which seems pretty good at taking care of such >>>>> singularities (assuming of course that the integral does >>>>> actually converge). >>>>> >>>>> Cheers >>>>> >>>>> -- Ian >>>>> >>>>> - -- - -- > Dr Tim Gruene Institut fuer anorganische Chemie Tammannstr. 4 > D-37077 Goettingen > > GPG Key ID = A46BEE1A > >>> >> - -- - -- Dr Tim Gruene Institut fuer anorganische Chemie Tammannstr. 4 D-37077 Goettingen GPG Key ID = A46BEE1A -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.12 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/ iD8DBQFQWuRdUxlJ7aRr7hoRAix4AJ9u/dT5RQuuL9wY0+2BTQre9TdsywCeJ5Jg uWoFQGip/L4ZTbGQvaMAIHQ= =Gdhk -----END PGP SIGNATURE-----