Its still April, but this one isn't a joke.
The smiley-face electron density in the left panel of the attached image
has the remarkable property that any attempt to sharpen or blur the map
turns it into the frowny-face on the right. If you'd like to try this
yourself, the hidden_frown.map file is available in this tarball:
https://bl831.als.lbl.gov/~jamesh/bugreports/fft_042423.tgz
In fact, any use of an FFT, even with the sharpening B set to zero,
turns the smiley into a frowny face. There is no way to get the smiley
face back (except opening the file again). Yes, that's right, even just
a simple back-and-forth FFT: turning this hidden_frown.map into
structure factors and then back into a map again, gives you a frowny
face. This happens using coot, ccp4 and phenix.
Wait, what!? Isn't a Fourier transform supposed to preserve
information? As in: you can jump back and forth between real and
reciprocal space with impunity? Without introducing error? Well, yes,
it is SUPPOSED to work like that, but the 3D FFT algorithms of
structural biology have a ... quirk. If you start with structure factors
and make a map out of them, you can convert it back-and-forth as often
as you want with 100% preservation of information. However, if you
start with a real-space map (such as from cryoEM), a back-and-forth
conversion gives you a different map. This new map can then be
transformed back-and-forth all you want and be 100% preserved. It has
been "christened" by the FFT, but it is not the same as the starting
map, which is impossible to recover from the FFT-transformed data.
Information has been lost. It is fine for crystallography (which starts
with structure factors), but for techniques such as cryoEM that start
with maps, using an FFT changes the data.
What information is being lost? Sharp edges. These turn into ripples
covering all of real and reciprocal space. Do real-world data have sharp
edges? Well, the all-or-nothing masks we use to model bulk solvent are
one example. Also, if you "mask off" otherwise smooth density with an
all-or-nothing mask, you will get similar ripples. Another example of a
sharp edge might be the large changes between adjacent pixels you see
when a single electron hits a detector. For example, if you make a map
with just one non-zero voxel and run it back-and-forth through FFT you
will find that voxel loses from 50% to 99% of its value (depending on
the size of the map). How much does this actually impact cryo-EM data?
That is my question.
What evil magic did I wield to make this map? Well, I drew a smiley and
frowny face by hand, converted them to maps, and then I generated random
noise within the boundaries of the smiley face. I ran this noisy map
back-and-forth through FFT, and then subtracted the map that survives
the FFT from the pre-FFT map. This cheshire_smile.map has the
interesting property that all of the structure factors calculated from
it are zero. It has an RMSD of 1.4, but after a back-and-forth FFT this
RMSD drops to 1e-7. I generated the hidden_frown.map by simply summing
the frowny.map with cheshire_smile.map.
But isn't this map getting less noisy? Yes it is, but the interpretation
clearly changes as well.
Why does this happen? It is because of a finite resolution cutoff. Oh!
What a relief! You don't have super-high resolution, do you? Well, no,
almost nobody has signal out beyond 1.0 A, but we do have noise. In
diffraction data this noise is removed by simply not measuring it. For
map data, however, the problem is that noise at very high frequencies
(small-number resolutions) is hard to avoid. This is because of another
phenomenon NMR spectroscopists are very familiar with: aliasing, or
"folding". If any high-spatial frequency noise exists above half the
sampling rate (or "Nyquist frequency") it still gets recorded, but shows
up in a lower-frequency Fourier coefficient. It is not possible to
remove such aliasing noise after digitization. Upon discretization of
the signal (FFT or no) all these high frequencies join with lower
frequency terms, and so survive any low-pass filtering. Darn.
Why am I bringing this up? Because if there is noise out beyond the FFT
resolution limit it implies there is also noise out beyond the
Nyquist-Shannon limit as well. If that is the case, direct-space imaging
data may be a lot noisier than it needs to be. In general, in digital
signal processing of things like sound an analog low-pass filter is
always installed at the input of any digitizer. Perhaps this is why
de-focusing works better than being at focus?
What is the solution? Well, for things like the bulk solvent mask I'd
say some real-space "feathering" is called for before performing FFTs.
Same goes for masked density like that used to compute CCmask. It may
also be worth looking into the digitization process of "image first"
structural biology methods?
My question for the BB: can someone explain how Nyquist folding is
handled in cryoEM data processing?
-James Holton
MAD Scientist
########################################################################
To unsubscribe from the CCP4BB list, click the following link:
https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1
This message was issued to members of www.jiscmail.ac.uk/CCP4BB, a mailing list
hosted by www.jiscmail.ac.uk, terms & conditions are available at
https://www.jiscmail.ac.uk/policyandsecurity/