Skip to main content

Get the Reddit app

Scan this QR code to download the app now
Or check it out in the app stores

AlphaFold


[R] AlphaFold 2
r/MachineLearning

ml. Beginners please see learnmachinelearning


Members Online
[R] AlphaFold 2

Seems like DeepMind just caused the ImageNet moment for protein folding.

Blog post isn't that deeply informative yet (paper is promised to appear soonish). Seems like the improvement over the first version of AlphaFold is mostly usage of transformer/attention mechanisms applied to residue space and combining it with the working ideas from the first version. Compute budget is surprisingly moderate given how crazy the results are. Exciting times for people working in the intersection of molecular sciences and ML :)

Tweet by Mohammed AlQuraishi (well-known domain expert)
https://twitter.com/MoAlQuraishi/status/1333383634649313280

DeepMind BlogPost
https://deepmind.com/blog/article/alphafold-a-solution-to-a-50-year-old-grand-challenge-in-biology

UPDATE:
Nature published a comment on it as well
https://www.nature.com/articles/d41586-020-03348-4




Exploring AlphaFold
r/bioinformatics

## A subreddit to discuss the intersection of computers and biology. ------ A subreddit dedicated to bioinformatics, computational genomics and systems biology.


Members Online
Exploring AlphaFold

Hi r/bioinformatics**,**

I'm curious about AlphaFold and its applications, especially in wet lab work. Has anyone in the community worked with AlphaFold, and what has your experience been? I'm particularly interested in understanding the delimitation of AlphaFold and exploring potential features that could be added for broader use. Any insights or suggestions would be greatly appreciated!




[R] DeepMind Open Sources AlphaFold Code
r/MachineLearning

ml. Beginners please see learnmachinelearning


Members Online
[R] DeepMind Open Sources AlphaFold Code

"Last year we presented #AlphaFold v2 which predicts 3D structures of proteins down to atomic accuracy. Today we’re proud to share the methods in @Nature w/open source code. Excited to see the research this enables. More very soon!"

https://twitter.com/demishassabis/status/1415736975395631111

I did not see this one coming, I got to admit it.


Biggest science news of 2021: DeepMind solves 98.5 per cent of human protein structures using its AlphaFold model
r/science

This community is a place to share and discuss new scientific research. Read about the latest advances in astronomy, biology, medicine, physics, social science, and more. Find and submit new publications and popular science coverage of current research.


Members Online


AlphaFold found thousands of possible psychedelics. Will its predictions help drug discovery?
r/EverythingScience

/r/EverythingScience is the sister subreddit to /r/science. With a broader rule set than /r/science, it is the place for high quality scientific content that doesn't necessarily reference a peer-reviewed paper from the last 6 months.


Members Online

Demis Hassabis-led company Isomorphic Labs teams up with two major pharmaceutical companies to use next generation of AlphaFold in drug discovery

[D] In the AlphaFold 2 paper, can someone explain figure 2 for me?
r/MachineLearning

ml. Beginners please see learnmachinelearning


Members Online
[D] In the AlphaFold 2 paper, can someone explain figure 2 for me?

Paper here: https://www.nature.com/articles/s41586-021-03819-2

Figure 2 here: https://www.nature.com/articles/s41586-021-03819-2/figures/2

In Figure 2, they show a series of charts but I don't understand what is the main point the authors want to make. There's a brief paragraph that mentions its:

we observe high side-chain accuracy when the backbone prediction is accurate (Fig. 2b) and we show that our confidence measure, the predicted local-distance difference test (pLDDT), reliably predicts the Cα local-distance difference test (lDDT-Cα) accuracy of the corresponding prediction (Fig. 2c). We also find that the global superposition metric template modelling score (TM-score)27 can be accurately estimated (Fig. 2d). Overall, these analyses validate that the high accuracy and reliability of AlphaFold on CASP14 proteins also transfers to an uncurated collection of recent PDB submissions, as would be expected

My interpretation of Figure 2 A is that it shows a histogram plot the fraction of protein chains that are a certain angstrom. This is straightforward.

Fig 2.b. is pretty straightforward. It's a plot moving to the top right showing the correlation between backbone accuracy and side-chain accuracy.

lDDT-Cα measures backbone accuracy using only Cα atoms, whereas, lDDT (local difference distance test) is a measure of structural agreement.

Fig 2.c. is where it gets confusing for me. The text mentions that:

lDDT-Cα = 0.997 × pLDDT − 1.17 (Pearson’s r = 0.76).

n = 10,795 protein chains

This should be the blue line in the plot right? And the dots are individual protein chains, ie. the n = 19,795

So, does the blue line represents the prediction and the blue points are the "ground truth" of protein chains? And is that why the blue line isn't drawn through the cloud of blue points but is slightly off to the right?

Also, is the shaded region of the linear fit in the figure caption referring to the blue points in the inset or something else?

The shaded region of the linear fit represents a 95% confidence interval estimated from 10,000 bootstrap samples.

Fig 2.d. I'm still confused about this. It should be similar to figure 2.c.



Deepmind’s AlphaFold Revealed the Structures of all the Proteins Known to Science, Expanding the AlphaFold DB by Over 200x
r/bioinformatics

## A subreddit to discuss the intersection of computers and biology. ------ A subreddit dedicated to bioinformatics, computational genomics and systems biology.


Members Online
Deepmind’s AlphaFold Revealed the Structures of all the Proteins Known to Science, Expanding the AlphaFold DB by Over 200x
r/cbirt
Deepmind’s AlphaFold Revealed the Structures of all the Proteins Known to Science, Expanding the AlphaFold DB by Over 200x
upvotes

[D] AlphaFold just released a database of 200 million protein structures. How would you use this data as an ML engineer?
r/MachineLearning

ml. Beginners please see learnmachinelearning


Members Online
[D] AlphaFold just released a database of 200 million protein structures. How would you use this data as an ML engineer?

The structure of a protein determines its functionality. Researchers have used this data in the past to design new drugs, vaccines, and enzymes. You can access the database for free here - https://www.deepmind.com/blog/alphafold-reveals-the-structure-of-the-protein-universe

This new database will allow researchers to gain a deeper understanding of protein families, how they interact and evolve, etc. Deepmind has written some use cases here - https://www.deepmind.com/blog/alphafold-reveals-the-structure-of-the-protein-universe

How would you use it? What would you like to explore or predict with it?




Deepmind’s AlphaFold Revealed the Structures of all the Proteins Known to Science, Expanding the AlphaFold DB by Over 200x
r/biology

A place to discuss all things biology! We welcome people and content from all related fields.


Members Online
Deepmind’s AlphaFold Revealed the Structures of all the Proteins Known to Science, Expanding the AlphaFold DB by Over 200x
r/cbirt
Deepmind’s AlphaFold Revealed the Structures of all the Proteins Known to Science, Expanding the AlphaFold DB by Over 200x
upvotes


Role of Bioinformaticians in AlphaFold
r/bioinformatics

## A subreddit to discuss the intersection of computers and biology. ------ A subreddit dedicated to bioinformatics, computational genomics and systems biology.


Members Online
Role of Bioinformaticians in AlphaFold

Were Computational Biologists and Bioinformaticians heavily involved in the creation of Alpha Fold? Or is most of its success due to the Deep Learning and Reinforcement Learning researchers working on it?

Also, do you think future "big leaps" in Biological research will come out of domain related research from within Computational Biology and Bioinformatics or rather non domain specific research (like NLP models that are able to analyze multiple scientific papers at once or further deep learning research along the lines of Alpha Fold)?


[R] AlphaFold predictions are valuable hypotheses and accelerate but do not replace experimental structure determination
r/MachineLearning

ml. Beginners please see learnmachinelearning


Members Online
[R] AlphaFold predictions are valuable hypotheses and accelerate but do not replace experimental structure determination

This feedback from scientific practice should definitely be taken into account by the community: https://www.nature.com/articles/s41592-023-02087-4

Quote:

our results show that AlphaFold predictions are not better representations of the contents of a crystal than the models deposited in the PDB, as the deposited models agree much more closely with experimental data where the predicted and deposited models differ





  • ## A subreddit to discuss the intersection of computers and biology. ------ A subreddit dedicated to bioinformatics, computational genomics and systems biology. members
  • Everything pertaining to the technological singularity and related topics, e.g. AI, human enhancement, etc. members
  • ml. Beginners please see learnmachinelearning members
  • Reddit’s home for Artificial Intelligence (AI) members
  • A place to post news and discuss the frontiers of biochemistry and biotechnology. Please refrain from posting home videos with songs and raps. members
  • Welcome to /r/deepmind, a subreddit focused on the artificial intelligence company known for AlphaGo and AlphaZero. members
  • A subreddit devoted to the field of Future(s) Studies and evidence-based speculation about the development of humanity, technology, and civilization. -------- You can also find us in the fediverse at - https://futurology.today members
  • This community is a place to share and discuss new scientific research. Read about the latest advances in astronomy, biology, medicine, physics, social science, and more. Find and submit new publications and popular science coverage of current research. members
  • Subreddit dedicated to the news and discussions about the creation and use of technology and its surrounding issues. members
  • News about any technological application that uses biological systems, living organisms, or derivatives thereof, to make or modify products or processes for specific use. | Or we can talk about career advice. Whatever. members
  • Slate Star Codex was a blog by Scott Alexander about human cognition, politics, and medicine. He now blogs at Astral Codex Ten: https://astralcodexten.substack.com/ members
  • A place to discuss all things biology! We welcome people and content from all related fields. members
  • Ask a science question, get a science answer. members
  • members
  • members
  • A subreddit dedicated to all things BOINC, a platform enabling the public to volunteer their computer's processing capability towards research projects distributed across the globe. Users can decide which projects they participate in, using the free and open-source BOINC client software. members