Re: [ccp4bb] External: Re: [ccp4bb] AlphaFold: more thinking and less pipetting (?)

Ian Tickle Tue, 08 Dec 2020 06:02:56 -0800

Hi Marko

I hope he hasn't confused resolution with RMSD error:


"Just keep in mind that (1) a lower RMSD represents a better predicted
structure, and that (2) most experimental structures have a resolution
around 2.5 Å. Taking this into consideration, about a third (36%) of Group
427’s submitted targets were predicted with a root-mean-square deviation
(RMSD) under 2 Å, and 86% were under 5 Å, with a total mean of 3.8 Å."

Cheers

-- Ian



On Tue, 8 Dec 2020 at 13:51, Marko Hyvonen <mh...@cam.ac.uk> wrote:

> Here is another take on this topic, by Carlos Quteiral (@c_outeiral), from
> a non-crystallographer's point of view, covering many of the points discussed
> in this thread  (incl. an example of the model guiding correction of the
> experimental structure).
>
>
> https://www.blopig.com/blog/2020/12/casp14-what-google-deepminds-alphafold-2-really-achieved-and-what-it-means-for-protein-folding-biology-and-bioinformatics/
>
> Marko
>
> On 08/12/2020 13:25, Tristan Croll wrote:
>
> This is a number that needs to be interpreted with some care. 2 Å crystal
> structures in general achieve an RMSD of 0.2 Å on the portion of the
> crystal that's resolved, including loops that are often only in
> well-resolved conformations due to physiologically-irrelevant crystal
> packing interactions. The predicted models, on the other hand, are in
> isolation. Once you get to the level achieved by this last round of
> predictions, that starts making fair comparison somewhat more difficult*.
> Two obvious options that I see: (1) limit the comparison only to the stable
> core of the protein (in which case many of the predictions have RMSDs in
> the very low fractions of an Angstrom), or (2) compare ensembles derived
> from MD simulations starting from the experimental and predicted structure,
> and see how well they overlap.
>
> -- Tristan
>
> * There's one more thorny issue when you get to this level: it becomes
> more and more possible (even likely) that the prediction gets some things
> right that are wrong in the experimental structure.
> ------------------------------
> *From:* CCP4 bulletin board <CCP4BB@JISCMAIL.AC.UK>
> <CCP4BB@JISCMAIL.AC.UK> on behalf of Ian Tickle <ianj...@gmail.com>
> <ianj...@gmail.com>
> *Sent:* 08 December 2020 13:04
> *To:* CCP4BB@JISCMAIL.AC.UK <CCP4BB@JISCMAIL.AC.UK>
> <CCP4BB@JISCMAIL.AC.UK>
> *Subject:* Re: [ccp4bb] External: Re: [ccp4bb] AlphaFold: more thinking
> and less pipetting (?)
>
>
> There was a little bit of press-release hype: the release stated "a score
> of around 90 GDT is informally considered to be competitive with results
> obtained from experimental methods" and "our latest AlphaFold system
> achieves a median score of 92.4 GDT overall across all targets. This means
> that our predictions have an average error (RMSD
> <https://en.wikipedia.org/wiki/Root-mean-square_deviation_of_atomic_positions>)
> of approximately 1.6 Angstroms <https://en.wikipedia.org/wiki/Angstrom>,".
>
> Experimental methods achieve an average error of around 0.2 Ang. or better
> at 2 Ang. resolution, and of course much better at atomic resolution (1
> Ang. or better), or around 0.5 Ang. at 3 Ang. resolution.  For
> ligand-binding studies I would say you need 3 Ang. resolution or better.
> 1.6 Ang. error is probably equivalent to around 4 Ang. resolution.  No
> doubt that will improve with time and experience, though I think it will be
> an uphill struggle to get better than 1 Ang. error, simply because the
> method can't be better than the data that go into it and 1-1.5 Ang.
> represents a typical spread of homologous models in the PDB.  So yes very
> competitive if you're desperate for a MR starting model, but not quite yet
> there for a refined high-resolution structure.
>
> Cheers
>
> -- Ian
>
>
> On Tue, 8 Dec 2020 at 12:11, Harry Powell - CCP4BB <
> 0000193323b1e616-dmarc-requ...@jiscmail.ac.uk> wrote:
>
> Hi
>
> It’s a bit more than science by press release - they took part in CASP14
> where they were given sequences but no other experimental data, and did
> significantly better than the other homology modellers (who had access to
> the same data) when judge by independent analysis. There were things wrong
> with their structures, sure, but in almost every case they were less wrong
> than the other modellers (many of whom have been working on this problem
> for decades).
>
> It _will_ be more impressive once the methods they used (or equivalents)
> are implemented by other groups and are available to the “public” (I
> haven’t found an AlphaFold webserver to submit a sequence to, whereas the
> other groups in the field do make their methods readily available), but
> it’s still a step-change in protein structure prediction - it shows it can
> be done pretty well.
>
> Michel is right, of course; you can’t have homology modelling without
> homologous models, which are drawn from the PDB - but the other modellers
> had the same access to the PDB (just as we all do…).
>
> Just my two ha’porth.
>
> Harry
>
> > On 8 Dec 2020, at 11:33, Goldman, Adrian <adrian.gold...@helsinki.fi>
> wrote:
> >
> > My impression is that they haven’t published the code, and it is science
> by press-release.  If one of us tried it, we would - rightly - get hounded
> out of time.
> >
> > Adrian
> >
> >
> >
> >> On 4 Dec 2020, at 15:57, Michel Fodje <michel.fo...@lightsource.ca>
> wrote:
> >>
> >> I think the results from AlphaFold2, although exciting and a
> breakthrough are being exaggerated just a bit.  We know that all the
> information required for the 3D structure is in the sequence. The protein
> folding problem is simply how to go from a sequence to the 3D structure.
> This is not a complex problem in the sense that cells solve it
> deterministically.  Thus the problem is due to lack of understanding and
> not due to complexity.  AlphaFold and all the others trying to solve this
> problem are “cheating” in that they are not just using the sequence, they
> are using other sequences like it (multiple-sequence alignments), and they
> are using all the structural information contained in the PDB.  All of this
> information is not used by the cells.   In short, unless AlphaFold2 now
> allows us to understand how exactly a single protein sequence produces a
> particular 3D structure, the protein folding problem is hardly solved in a
> theoretical sense. The only reason we know how well AlphaFold2 did is
> because the structures were solved and we could compare with the
> predictions, which means verification is lacking.
> >>
> >> The protein folding problem will be solved when we understand how to go
> from a sequence to a structure, and can verify a given structure to be
> correct without experimental data. Even if AlphaFold2 got 99% of structures
> right, your next interesting target protein might be the 1%. How would you
> know?   Until then, what AlphaFold2 is telling us right now is that all
> (most) of the information present in the sequence that determines the 3D
> structure can be gleaned in bits and pieces scattered between homologous
> sequences, multiple-sequence alignments, and other protein 3D structures in
> the PDB.  Deep Learning allows a huge amount of data to be thrown at a
> problem and the back-propagation of the networks then allows careful
> fine-tuning of weights which determine how relevant different pieces of
> information are to the prediction.  The networks used here are humongous
> and a detailed look at the weights (if at all feasible) may point us in the
> right direction.
> >>
> >>
> >> From: CCP4 bulletin board <CCP4BB@JISCMAIL.AC.UK> On Behalf Of Nave,
> Colin (DLSLtd,RAL,LSCI)
> >> Sent: December 4, 2020 9:14 AM
> >> To: CCP4BB@JISCMAIL.AC.UK
> >> Subject: External: Re: [ccp4bb] AlphaFold: more thinking and less
> pipetting (?)
> >>
> >> The subject line for Isabel’s email is very good.
> >>
> >> I do have a question (more a request) for the more computer scientist
> oriented people. I think it is relevant for where this technology will be
> going. It comes from trying to understand whether problems addressed by
> Alpha are NP, NP hard, NP complete etc. My understanding is that the
> previous successes of Alpha were for complete information games such as
> Chess and Go. Both the rules and the present position were available to
> both sides. The folding problem might be in a different category. It would
> be nice if someone could explain the difference (if any) between Go and the
> protein folding problem perhaps using the NP type categories.
> >>
> >> Colin
> >>
> >>
> >>
> >> From: CCP4 bulletin board <CCP4BB@JISCMAIL.AC.UK> On Behalf Of Isabel
> Garcia-Saez
> >> Sent: 03 December 2020 11:18
> >> To: CCP4BB@JISCMAIL.AC.UK
> >> Subject: [ccp4bb] AlphaFold: more thinking and less pipetting (?)
> >>
> >> Dear all,
> >>
> >> Just commenting that after the stunning performance of AlphaFold that
> uses AI from Google maybe some of us we could dedicate ourselves to the
> noble art of gardening, baking, doing Chinese Calligraphy, enjoying the
> clouds pass or everything together (just in case I have already prepared my
> subscription to Netflix).
> >>
> >> https://www.nature.com/articles/d41586-020-03348-4
> >>
> >> Well, I suppose that we still have the structures of complexes (at the
> moment). I am wondering how the labs will have access to this technology in
> the future (would it be for free coming from the company DeepMind -
> Google?). It seems that they have already published some code. Well,
> exciting times.
> >>
> >> Cheers,
> >>
> >> Isabel
> >>
> >>
> >> Isabel Garcia-Saez              PhD
> >> Institut de Biologie Structurale
> >> Viral Infection and Cancer Group (VIC)-Cell Division Team
> >> 71, Avenue des Martyrs
> >> CS 10090
> >> 38044 Grenoble Cedex 9
> >> France
> >> Tel.: 00 33 (0) 457 42 86 15
> >> e-mail: isabel.gar...@ibs.fr
> >> FAX: 00 33 (0) 476 50 18 90
> >> http://www.ibs.fr/
> >>
> >>
> >> To unsubscribe from the CCP4BB list, click the following link:
> >> https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1
> >>
> >>
> >>
> >> --
> >>
> >> This e-mail and any attachments may contain confidential, copyright and
> or privileged material, and are for the use of the intended addressee only.
> If you are not the intended addressee or an authorised recipient of the
> addressee please notify us of receipt by returning the e-mail and do not
> use, copy, retain, distribute or disclose the information in or attached to
> the e-mail.
> >> Any opinions expressed within this e-mail are those of the individual
> and not necessarily of Diamond Light Source Ltd.
> >> Diamond Light Source Ltd. cannot guarantee that this e-mail or any
> attachments are free from viruses and we cannot accept liability for any
> damage which you may sustain as a result of software viruses which may be
> transmitted in or with the message.
> >> Diamond Light Source Limited (company no. 4375679). Registered in
> England and Wales with its registered office at Diamond House, Harwell
> Science and Innovation Campus, Didcot, Oxfordshire, OX11 0DE, United Kingdom
> >>
> >>
> >>
> >> To unsubscribe from the CCP4BB list, click the following link:
> >> https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1
> >>
> >>
> >> To unsubscribe from the CCP4BB list, click the following link:
> >> https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1
> >>
> >
> >
> > To unsubscribe from the CCP4BB list, click the following link:
> > https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1
> >
>
> ########################################################################
>
> To unsubscribe from the CCP4BB list, click the following link:
> https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1
>
> This message was issued to members of www.jiscmail.ac.uk/CCP4BB, a
> mailing list hosted by www.jiscmail.ac.uk, terms & conditions are
> available at https://www.jiscmail.ac.uk/policyandsecurity/
>
>
> ------------------------------
>
> To unsubscribe from the CCP4BB list, click the following link:
> https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1
>
> ------------------------------
>
> To unsubscribe from the CCP4BB list, click the following link:
> https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1
>
>
> --
>
> Marko Hyvonen
> Department of Biochemistry, University of cambridgemh...@cam.ac.uk
> +44 (0)1223 766 044
> @HyvonenGrouphttp://hyvonen.bioc.cam.ac.uk
>
>
> ------------------------------
>
> To unsubscribe from the CCP4BB list, click the following link:
> https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1
>

########################################################################

To unsubscribe from the CCP4BB list, click the following link:
https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1

This message was issued to members of www.jiscmail.ac.uk/CCP4BB, a mailing list 
hosted by www.jiscmail.ac.uk, terms & conditions are available at 
https://www.jiscmail.ac.uk/policyandsecurity/

Re: [ccp4bb] External: Re: [ccp4bb] AlphaFold: more thinking and less pipetting (?)

Reply via email to