Dear sunyeping

The critical step in calculating a 3D superposition is the decision of which 
atom in one structure to fit on which atom in the second structure.
Actually fitting the transformation matrix is trivial in comparison. 

In PyMOL you have different ways to determine what you fit:
If you use “pairfit” or “super”, you explicitly tell the program which atoms to 
fit to which, and you will get an error message if the two selections do not 
contain exactly the same number of atoms, e.g. because if you have alternative 
locations for some of the atoms.

Super will generate an initial 3D alignment and then iteratively improve the 
alignment by eliminating the atom pairs that show the highest divergence. This 
generates a 3D alignment that concentrates on the structural core and ignores 
highly flexible loops: This is ideal to generate an image that illustrates that 
two proteins are structurally related, but will not give you a highly 
reproducible rmsd value that you can compare, because you do not exactly know 
what has been fitted. 
With pairfit, you know exactly which atoms are fitted to which, but it can be 
tedious to enumerate  exactly all the atoms you want to fit. Its advantag is 
that in comparing a whole series of structures, you can force it to compare 
exactly the same part of the structures and thereby get rmsd values you can 
compare even if the labels are not identical in the two structures, because you 
explicitly tell the program which atoms to pair for the fit.
Fit relies on corresponding atoms having the same label in the two structures. 
Align uses a sequence alignment to identify corresponding atoms (look carefully 
at the sequence alignment and the alignment object to check whether this makes 
sense), it works well for comparing snapshots of the same protein along a 
dynamics run or for proteins with identical or at least very similar sequence, 
while cealign iteratively optimizes the atom pairing to achieve a 
sequence-independent structural alignment, and thereby is able to give you a 
fairly robust alignment of two structures with divergent sequences but similar 
structures, e.g. GPCRs or antibodies.

Which method you use depends on the purpose of your alignment (illustration of 
similarity versus quantification of the goodness of the fit),
the sequence similarity and the structural similarity of the two or more 
structures you are comparing. You also have to decide if you do an all-atom 
fit, if you want to fit only the backbone atoms, or only the “guide atoms” 
(Calpha in the case of proteins, C4’ for nucleic acid). In any case, if are 
citing rmsd values, you have to indicate clearly what you have been doing, 
otherwise a comparison of rmsd values is meaningless.

best regards

Annemarie

____________________________________________________________

Dr. Annemarie Honegger
Dept. of Biochemistry
Zürich University
Winterthurerstrasse 190
8057 Zürich
Switzerland

e-mail:   honeg...@bioc.uzh.ch
phone:   +41 44 635 55 62
fax:        +41 44 635 57 12     



> On 11 Jul 2019, at 14:01, pymol-users-requ...@lists.sourceforge.net wrote:
> 
> Send PyMOL-users mailing list submissions to
>       pymol-users@lists.sourceforge.net
> 
> To subscribe or unsubscribe via the World Wide Web, visit
>       https://lists.sourceforge.net/lists/listinfo/pymol-users
> or, via email, send a message with subject or body 'help' to
>       pymol-users-requ...@lists.sourceforge.net
> 
> You can reach the person managing the list at
>       pymol-users-ow...@lists.sourceforge.net
> 
> When replying, please edit your Subject line so it is more specific
> than "Re: Contents of PyMOL-users digest..."
> 
> 
> Today's Topics:
> 
>   1. about cealign and align (sunyeping)
> 
> 
> ----------------------------------------------------------------------
> 
> Message: 1
> Date: Thu, 11 Jul 2019 10:12:31 +0800
> From: "sunyeping" <sunyep...@aliyun.com>
> To: "pymol-users" <pymol-users@lists.sourceforge.net>
> Subject: [PyMOL] about cealign and align
> Message-ID:
>       <4c149e92-9843-4c82-b441-edad40e0ce5d.sunyep...@aliyun.com>
> Content-Type: text/plain; charset="utf-8"
> 
> Dear all,
> 
> I am now trying to compare two structures by alignment in pymol. The two 
> proteins are similar structures from the same family with about 500 residues. 
> I find that the "cealign" command gives much better result that the "align" 
> command. With the "cealign" command, the two structures aligned well, but 
> with "align" command two structure don't aligned well. The rmsd value given 
> with the "cealign" is much smaller than that given with the "align". 
> 
> However, when I try the align a small region of the two structures, I get the 
> reverse result. I create new objects corresponding to the small region I am 
> interested in from the two whole proteins, respectively and align the two new 
> objects with "align" and "cealign". I find the rmsd value given with "align" 
> is much smaller that than given with "cealign". 
> 
> I wish know the work principle behind "align" and "celign", the calculation 
> formula of them and what cause the difference between aligning the whole 
> proteins and small regions. Could you help me with these?
> 
> Best regards.
> -------------- next part --------------
> An HTML attachment was scrubbed...
> 
> ------------------------------
> 
> 
> 
> ------------------------------
> 
> Subject: Digest Footer
> 
> _______________________________________________
> PyMOL-users mailing list
> PyMOL-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/pymol-users
> Unsubscribe: 
> https://sourceforge.net/projects/pymol/lists/pymol-users/unsubscribe
> 
> ------------------------------
> 
> End of PyMOL-users Digest, Vol 158, Issue 13
> ********************************************

_______________________________________________
PyMOL-users mailing list
Archives: http://www.mail-archive.com/pymol-users@lists.sourceforge.net
Unsubscribe: 
https://sourceforge.net/projects/pymol/lists/pymol-users/unsubscribe

Reply via email to