In the COVID era, computational biology is having a heyday – and machine learning is playing a massive role. With billions upon billions of compounds to search through for any given therapeutic application, strictly brute-force simulations are wildly unfeasible, necessitating more artificially intelligent methods of whittling down the options. Now, researchers from IRB Barcelona’s Structural Bioinformatics and Network Biology lab have developed a deep learning method that predicts the biological activity of any given molecule – even in the absence of experimental data.
The researchers, led by Patrick Aloy, are applying deep machine learning to a massive dataset: the Chemical Checker, which provides processed, harmonized, and integrated bioactivity data on 800,000 small molecules and is also produced by the Structural Bioinformatics and Network Biology lab. In total, any given molecule has 25 bioactivity “spaces,” but for most molecules, data on only a few are known – if that.
Using the new deep learning tool, that’s changing. The Chemical Checker database contains data on all 25 bioactivity spaces from each of those 800,000 molecules, and the tool, having been trained on that data, can predict all the bioactivity spaces of any molecules with incomplete bioactivity data. “The new tool … allows us to forecast the bioactivity spaces of new molecules, and this is crucial in the drug discovery process as we can select the most suitable candidates and discard those that, for one reason or another, would not work,” explained Aloy.
Of course, the prediction isn’t perfect, and assessing molecules with more available data will allow the tool to produce higher-confidence predictions. Some molecules, as well, prove simply more or less difficult for the tool to assess. “All models are wrong, but some are useful,” said Martino Bertoni, first author on the paper describing the research. “A measure of confidence allows us to better interpret the results and highlight which spaces of bioactivity of a molecule are accurate and in which ones an error rate can be contemplated.”
The researchers chose a challenging case for validation: a cancer-related transcription factor that was broadly considered an “undruggable” target. The tool identified 131 compounds that fit the target by predicting their bioactivity spaces, and their ability to degrade the target was experimentally confirmed.
The research described in this article was published as “Bioactivity descriptors for uncharacterized chemical compounds” in the June 2021 issue of Nature Communications. The article was written by Martino Bertoni, Miquel Duran-Frigola, Pau Badia-i-Mompel, Eduardo Pauls, Modesto Orozco-Ruiz, Oriol Guitart-Pla, Víctor Alcalde, Víctor M. Diaz, Antoni Berenguer-Llergo, Isabelle Brun-Heath, Núria Villegas, Antonio García de Herreros and Patrick Aloy. To read it, click here.
April 25, 2024
- Cleanlab Launches New Solution to Detect AI Hallucinations in Language Models
- University of Maryland’s Smith School Launches New Center for AI in Business
- SAS Advances Public Health Research with New Analytics Tools on NIH Researcher Workbench
- NVIDIA to Acquire GPU Orchestration Software Provider Run:ai
April 24, 2024
- AtScale Introduces Developer Community Edition for Semantic Modeling
- Domopalooza 2024 Sets a High Bar for AI in Business Intelligence and Analytics
- BigID Highlights Crucial Security Measures for Generative AI in Latest Industry Report
- Moveworks Showcases the Power of Its Next-Gen Copilot at Moveworks.global 2024
- AtScale Announces Next-Gen Product Innovations to Foster Data-Driven Industry-Wide Collaboration
- New Snorkel Flow Release Empowers Enterprises to Harness Their Data for Custom AI Solutions
- Snowflake Launches Arctic: The Most Open, Enterprise-Grade Large Language Model
- Lenovo Advances Hybrid AI Innovation to Meet the Demands of the Most Compute Intensive Workloads
- NEC Expands AI Offerings with Advanced LLMs for Faster Response Times
- Cribl Wins Fair Use Case in Splunk Lawsuit, Ensuring Continued Interoperability
- Rambus Advances AI 2.0 with GDDR7 Memory Controller IP
April 23, 2024
- G42 Selects Qualcomm to Boost AI Inference Performance
- Veritas Strengthens Cyber Resilience with New AI-Powered Solutions
- CERN’s Edge AI Data Analysis Techniques Used to Detect Marine Plastic Pollution
- Alteryx and DataCamp Partner to Bring Analytics Upskilling to All
- SymphonyAI Announces IRIS Foundry, an AI-powered Industrial Data Ops Platform
Most Read Features
Sorry. No data so far.
Most Read News In Brief
Sorry. No data so far.
Most Read This Just In
Sorry. No data so far.
Sponsored Partner Content
-
Get your Data AI Ready – Celebrate One Year of Deep Dish Data Virtual Series!
-
Supercharge Your Data Lake with Spark 3.3
-
Learn How to Build a Custom Chatbot Using a RAG Workflow in Minutes [Hands-on Demo]
-
Overcome ETL Bottlenecks with Metadata-driven Integration for the AI Era [Free Guide]
-
Gartner® Hype Cycle™ for Analytics and Business Intelligence 2023
-
The Art of Mastering Data Quality for AI and Analytics
Sponsored Whitepapers
Contributors
Featured Events
-
AI & Big Data Expo North America 2024
June 5 - June 6Santa Clara CA United States -
AI Hardware & Edge AI Summit Europe
June 18 - June 19London United Kingdom -
AI Hardware & Edge AI Summit 2024
September 10 - September 12San Jose CA United States -
CDAO Government 2024
September 18 - September 19Washington DC United States