Dear Ian and contributors to this interesting thread,
(please, scroll down a little bit) Am 15.09.10 23:34, schrieb Ian Tickle:
I should just like to point out that the main source of the disagreement here seems to be that people have very different ideas about what a 'model' is or should be. Strictly a model is a purely mathematical construct, in this case it consists of the appropriate equation for the calculated structure factor and the best-fit values of the various parameters (scattering factors, atomic positions, occupancies, B factors, TLS parameters etc.) that appear in it. A mathematical model is inevitably going to be an imperfect representation of reality, but hopefully it's the best one we can come up with, in the sense of best explaining the data without significant overfitting. The problem arises because many users of the PDB, and I suspect many contributors to this BB, particularly non-crystallographers, don't see it like that, because they view a PDB file as a physical model, i.e. not as the best fit to the data (assuming that the non-crystallographers even know what the data are!), but the closest representation of reality. The difference between the N-H bond lengths that Ed referred to illustrates the distinction between the mathematical and the physical model. The mathematical model requires that the bond length is 0.86 Ang because that value gives the best fit of the assumed spherical scattering factor of H to the deformation density of the X-H covalent bond. The physical model requires that it be 1.00 Ang because that is the internuclear distance found by spectroscopic methods& predicted by QM calculations. The same goes for B factors and TLS: to a large extent they are a mathematical construct whose purpose is to provide an optimal fit to the data. The connection of Bs& TLS with reality is tenuous at best, nevertheless people obviously would like to have a physical interpretation such as rigid-body correlated motion. The fact that Bragg scattering provides no information about correlated motion (you need to measure the diffuse scattering for that) doesn't seem to deter them! I have no doubt in my mind that it is the mathematical model that should be published, because hopefully it's the best available interpretation of the data. Whether that involves publishing the riding H atoms explicitly, or alternatively the formulae and parameters that were used to calculate their positions I don't mind, as long as I can faithfully reproduce the Fcalcs to check the validity of the model. Then users of the PDB are free to *interpret* the mathematical models as physical models in a appropriate manner (e.g. by adjusting the bond lengths to H), and crystallographers have the untainted mathematical models needed to reproduce the Fcalcs.
so, wouldn't be the deposition of the final model's Fcalc, Phic (and their weights) along with the final coordinates be the best solution? The final Fcalc are our best model and can be used to reproduce the final statistics (which would remove the sfcheck annoyance) and to reproduce the final electron density maps, and the coordinates can be used for what ever purpose they are needed, irrespective of adding riding hydrogens or not.
Best regards, Dirk. -- ******************************************************* Dirk Kostrewa Gene Center Munich, A5.07 Department of Biochemistry Ludwig-Maximilians-Universität München Feodor-Lynen-Str. 25 D-81377 Munich Germany Phone: +49-89-2180-76845 Fax: +49-89-2180-76999 E-mail: kostr...@genzentrum.lmu.de WWW: www.genzentrum.lmu.de *******************************************************