[ccp4bb] happy/sad maps

James Holton Fri, 28 Apr 2023 08:49:09 -0700

Its still April, but this one isn't a joke.

The smiley-face electron density in the left panel of the attached imagehas the remarkable property that any attempt to sharpen or blur the mapturns it into the frowny-face on the right. If you'd like to try thisyourself, the hidden_frown.map file is available in this tarball:

https://bl831.als.lbl.gov/~jamesh/bugreports/fft_042423.tgz

In fact, any use of an FFT, even with the sharpening B set to zero,turns the smiley into a frowny face. There is no way to get the smileyface back (except opening the file again). Yes, that's right, even justa simple back-and-forth FFT: turning this hidden_frown.map intostructure factors and then back into a map again, gives you a frownyface. This happens using coot, ccp4 and phenix.

Wait, what!? Isn't a Fourier transform supposed to preserveinformation? As in: you can jump back and forth between real andreciprocal space with impunity? Without introducing error? Well, yes,it is SUPPOSED to work like that, but the 3D FFT algorithms ofstructural biology have a ... quirk. If you start with structure factorsand make a map out of them, you can convert it back-and-forth as oftenas you want with 100% preservation of information. However, if youstart with a real-space map (such as from cryoEM), a back-and-forthconversion gives you a different map. This new map can then betransformed back-and-forth all you want and be 100% preserved. It hasbeen "christened" by the FFT, but it is not the same as the startingmap, which is impossible to recover from the FFT-transformed data.Information has been lost. It is fine for crystallography (which startswith structure factors), but for techniques such as cryoEM that startwith maps, using an FFT changes the data.

What information is being lost? Sharp edges. These turn into ripplescovering all of real and reciprocal space. Do real-world data have sharpedges? Well, the all-or-nothing masks we use to model bulk solvent areone example. Also, if you "mask off" otherwise smooth density with anall-or-nothing mask, you will get similar ripples. Another example of asharp edge might be the large changes between adjacent pixels you seewhen a single electron hits a detector. For example, if you make a mapwith just one non-zero voxel and run it back-and-forth through FFT youwill find that voxel loses from 50% to 99% of its value (depending onthe size of the map). How much does this actually impact cryo-EM data? That is my question.

What evil magic did I wield to make this map? Well, I drew a smiley andfrowny face by hand, converted them to maps, and then I generated randomnoise within the boundaries of the smiley face. I ran this noisy mapback-and-forth through FFT, and then subtracted the map that survivesthe FFT from the pre-FFT map. This cheshire_smile.map has theinteresting property that all of the structure factors calculated fromit are zero. It has an RMSD of 1.4, but after a back-and-forth FFT thisRMSD drops to 1e-7. I generated the hidden_frown.map by simply summingthe frowny.map with cheshire_smile.map.

But isn't this map getting less noisy? Yes it is, but the interpretationclearly changes as well.

Why does this happen? It is because of a finite resolution cutoff. Oh!What a relief! You don't have super-high resolution, do you? Well, no,almost nobody has signal out beyond 1.0 A, but we do have noise. Indiffraction data this noise is removed by simply not measuring it. Formap data, however, the problem is that noise at very high frequencies(small-number resolutions) is hard to avoid. This is because of anotherphenomenon NMR spectroscopists are very familiar with: aliasing, or"folding". If any high-spatial frequency noise exists above half thesampling rate (or "Nyquist frequency") it still gets recorded, but showsup in a lower-frequency Fourier coefficient. It is not possible toremove such aliasing noise after digitization. Upon discretization ofthe signal (FFT or no) all these high frequencies join with lowerfrequency terms, and so survive any low-pass filtering. Darn.

Why am I bringing this up? Because if there is noise out beyond the FFTresolution limit it implies there is also noise out beyond theNyquist-Shannon limit as well. If that is the case, direct-space imagingdata may be a lot noisier than it needs to be. In general, in digitalsignal processing of things like sound an analog low-pass filter isalways installed at the input of any digitizer. Perhaps this is whyde-focusing works better than being at focus?

What is the solution? Well, for things like the bulk solvent mask I'dsay some real-space "feathering" is called for before performing FFTs. Same goes for masked density like that used to compute CCmask. It mayalso be worth looking into the digitization process of "image first"structural biology methods?

My question for the BB: can someone explain how Nyquist folding ishandled in cryoEM data processing?


-James Holton
MAD Scientist

########################################################################

To unsubscribe from the CCP4BB list, click the following link:
https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1

This message was issued to members of www.jiscmail.ac.uk/CCP4BB, a mailing list 
hosted by www.jiscmail.ac.uk, terms & conditions are available at 
https://www.jiscmail.ac.uk/policyandsecurity/

[ccp4bb] happy/sad maps

Reply via email to