In the COVID era, computational biology is having a heyday – and machine learning is playing a massive role. With billions upon billions of compounds to search through for any given therapeutic application, strictly brute-force simulations are wildly unfeasible, necessitating more artificially intelligent methods of whittling down the options. Now, researchers from IRB Barcelona’s Structural Bioinformatics and Network Biology lab have developed a deep learning method that predicts the biological activity of any given molecule – even in the absence of experimental data.
The researchers, led by Patrick Aloy, are applying deep machine learning to a massive dataset: the Chemical Checker, which provides processed, harmonized, and integrated bioactivity data on 800,000 small molecules and is also produced by the Structural Bioinformatics and Network Biology lab. In total, any given molecule has 25 bioactivity “spaces,” but for most molecules, data on only a few are known – if that.
Using the new deep learning tool, that’s changing. The Chemical Checker database contains data on all 25 bioactivity spaces from each of those 800,000 molecules, and the tool, having been trained on that data, can predict all the bioactivity spaces of any molecules with incomplete bioactivity data. “The new tool … allows us to forecast the bioactivity spaces of new molecules, and this is crucial in the drug discovery process as we can select the most suitable candidates and discard those that, for one reason or another, would not work,” explained Aloy.
Of course, the prediction isn’t perfect, and assessing molecules with more available data will allow the tool to produce higher-confidence predictions. Some molecules, as well, prove simply more or less difficult for the tool to assess. “All models are wrong, but some are useful,” said Martino Bertoni, first author on the paper describing the research. “A measure of confidence allows us to better interpret the results and highlight which spaces of bioactivity of a molecule are accurate and in which ones an error rate can be contemplated.”
The researchers chose a challenging case for validation: a cancer-related transcription factor that was broadly considered an “undruggable” target. The tool identified 131 compounds that fit the target by predicting their bioactivity spaces, and their ability to degrade the target was experimentally confirmed.
The research described in this article was published as “Bioactivity descriptors for uncharacterized chemical compounds” in the June 2021 issue of Nature Communications. The article was written by Martino Bertoni, Miquel Duran-Frigola, Pau Badia-i-Mompel, Eduardo Pauls, Modesto Orozco-Ruiz, Oriol Guitart-Pla, Víctor Alcalde, Víctor M. Diaz, Antoni Berenguer-Llergo, Isabelle Brun-Heath, Núria Villegas, Antonio García de Herreros and Patrick Aloy. To read it, click here.
April 24, 2024
April 23, 2024
- G42 Selects Qualcomm to Boost AI Inference Performance
- Veritas Strengthens Cyber Resilience with New AI-Powered Solutions
- CERN’s Edge AI Data Analysis Techniques Used to Detect Marine Plastic Pollution
- Alteryx and DataCamp Partner to Bring Analytics Upskilling to All
- SymphonyAI Announces IRIS Foundry, an AI-powered Industrial Data Ops Platform
April 22, 2024
- Jülich’s New AI Foundation Models Aim to Advance Scientific Applications
- Cognizant and Microsoft Expand Partnership to Deploy Generative AI Across Multiple Industries
- Gulp Data and Datarade Partner to Empower Enterprises to Monetize Data
- Fullstory Launches Data Direct to Enhance Corporate Understanding of Behavioral Data
April 19, 2024
- Carahsoft to Showcase Cutting-Edge Solutions with 70+ Partners at GEOINT 2024
- BrainChip Highlights the 2nd Generation Akida at tinyML Summit 2024
- MathCo Named Microsoft Solutions Partner for Data and AI
- Salesforce Survey: Data Will Make or Break Workers’ Trust in AI
- Weights & Biases Announces Expanded Integration with NVIDIA NIM
- Dataminr Introduces ReGenAI to Enhance Real-Time Event Monitoring
- Cisco Reimagines Security for Data Centers and Clouds in Era of AI
- Gurucul Enhances Federated Search Capabilities Across Multiple Data Sources
- SAS-Sponsored Study Highlights Talent Shortages and Strategic Gaps in GenAI Adoption
- Redgate Launches Enterprise Edition of Redgate Monitor for Large-Scale Databases
Most Read Features
Sorry. No data so far.
Most Read News In Brief
Sorry. No data so far.
Most Read This Just In
Sorry. No data so far.
Sponsored Partner Content
-
Get your Data AI Ready – Celebrate One Year of Deep Dish Data Virtual Series!
-
Supercharge Your Data Lake with Spark 3.3
-
Learn How to Build a Custom Chatbot Using a RAG Workflow in Minutes [Hands-on Demo]
-
Overcome ETL Bottlenecks with Metadata-driven Integration for the AI Era [Free Guide]
-
Gartner® Hype Cycle™ for Analytics and Business Intelligence 2023
-
The Art of Mastering Data Quality for AI and Analytics
Sponsored Whitepapers
Contributors
Featured Events
-
AI & Big Data Expo North America 2024
June 5 - June 6Santa Clara CA United States -
AI Hardware & Edge AI Summit Europe
June 18 - June 19London United Kingdom -
AI Hardware & Edge AI Summit 2024
September 10 - September 12San Jose CA United States -
CDAO Government 2024
September 18 - September 19Washington DC United States