Sorry for the confusion, I was going for brevity.

Any time you do a thought experiment you make a fake-data data set, the "true" phases and "true" amplitudes become the ones you put into the simulation process.  This is by definition.  Is there potential for circular reasoning?  Of course!  But you can do controls:

  If you start with an ordinary single-conformer coordinate model and flat bulk solvent from refmac to make your Ftrue, then what you will find is that even after adding all plausible experimental errors to the data the final Rwork/Rfree invariably drop to small-molecule levels of 3-4%.  This is true even if you prune the structure back, shake it, and rebuild it in various ways.  The difference features always guide you back to Rwork/Rfree = 3/4%. However, if you refine with phenix.refine, you will find Rwork/Rfree stall at around 10-11%.  This is because Ftrue came from refmac and refmac and phenix.refine have somewhat different bulk solvent models.  If Ftrue comes from phenix and you refine with refmac you get similar "high" R values.  High for a small molecule anyway. And, of course, if you get Ftrue from phenix and refine with phenix you also get final Rwork/Rfree = 3/4%. If you do more things that automated building doesn't do, like multi-headed side chains, or get the bulk solvent from an MD simulation, then you can get "realistic" Rwork/Rfree in the 20%s.  All of this is the main conclusion from this paper: https://dx.doi.org/10.1111/febs.12922

But, in all these situations with various types of "systematic error" thrown in, because you know Ftrue and PHItrue you can compare different kinds of maps to this ground "truth" and see which is closest when you compare electron density. In my experience, this is the 2mFo-DFc map, phased with PHIcalc from the model. You might think that replacing PHIcalc with PHItrue would make the map even better because PHItrue is a "better" phase than PHIcalc, but it turns out this actually make things worse!  That's what is counter-intuitive: 2mFo-DFc amplitudes are "designed" to be used with the slightly-wrong phase of PHIcalc, not PHItrue.

That's what I was trying to say.

-James Holton
MAD Scientist


On 12/5/2018 7:36 PM, Keller, Jacob wrote:
That said, model phases are not so bad.  In fact, in all my experiments with fake data the 
model-phased 2mFo-DFc map always has the best correlation to the "true" map.  If you 
substitute the "true" phases and use the 2mFo-DFc coefficients you actually make things 
worse. Counter-intuitive, but true.
I don't understand what you mean by true and fake here--can you clarify? How 
are the true map and phases generated (from an original true model, I assume?), 
and how are the fake data generated? (Also from the true model?) I am wondering 
whether there is some circular reasoning?

JPK

########################################################################

To unsubscribe from the CCP4BB list, click the following link:
https://www.jiscmail.ac.uk/cgi-bin/webadmin?SUBED1=CCP4BB&A=1

Reply via email to