In the COVID era, computational biology is having a heyday – and machine learning is playing a massive role. With billions upon billions of compounds to search through for any given therapeutic application, strictly brute-force simulations are wildly unfeasible, necessitating more artificially intelligent methods of whittling down the options. Now, researchers from IRB Barcelona’s Structural Bioinformatics and Network Biology lab have developed a deep learning method that predicts the biological activity of any given molecule – even in the absence of experimental data.
The researchers, led by Patrick Aloy, are applying deep machine learning to a massive dataset: the Chemical Checker, which provides processed, harmonized, and integrated bioactivity data on 800,000 small molecules and is also produced by the Structural Bioinformatics and Network Biology lab. In total, any given molecule has 25 bioactivity “spaces,” but for most molecules, data on only a few are known – if that.
Using the new deep learning tool, that’s changing. The Chemical Checker database contains data on all 25 bioactivity spaces from each of those 800,000 molecules, and the tool, having been trained on that data, can predict all the bioactivity spaces of any molecules with incomplete bioactivity data. “The new tool … allows us to forecast the bioactivity spaces of new molecules, and this is crucial in the drug discovery process as we can select the most suitable candidates and discard those that, for one reason or another, would not work,” explained Aloy.
Of course, the prediction isn’t perfect, and assessing molecules with more available data will allow the tool to produce higher-confidence predictions. Some molecules, as well, prove simply more or less difficult for the tool to assess. “All models are wrong, but some are useful,” said Martino Bertoni, first author on the paper describing the research. “A measure of confidence allows us to better interpret the results and highlight which spaces of bioactivity of a molecule are accurate and in which ones an error rate can be contemplated.”
The researchers chose a challenging case for validation: a cancer-related transcription factor that was broadly considered an “undruggable” target. The tool identified 131 compounds that fit the target by predicting their bioactivity spaces, and their ability to degrade the target was experimentally confirmed.
The research described in this article was published as “Bioactivity descriptors for uncharacterized chemical compounds” in the June 2021 issue of Nature Communications. The article was written by Martino Bertoni, Miquel Duran-Frigola, Pau Badia-i-Mompel, Eduardo Pauls, Modesto Orozco-Ruiz, Oriol Guitart-Pla, Víctor Alcalde, Víctor M. Diaz, Antoni Berenguer-Llergo, Isabelle Brun-Heath, Núria Villegas, Antonio García de Herreros and Patrick Aloy. To read it, click here.
March 28, 2024
- MineOS Unveils AI Asset Discovery
- Cloudera Survey Reveals 90% of IT Leaders Believe that Unifying the Data Lifecycle on a Single Platform is Critical for Analytics and AI
- Snowflake Enhances Secure, Cross-Cloud Collaboration for High Value Business Outcomes with Snowflake Data Clean Rooms
- Domo Announces Winners of the 2024 Community Ovation Awards
March 27, 2024
- New MLPerf Inference Benchmark Results Highlight the Rapid Growth of Generative AI Models
- Qlik Advances Real-time Data Analytics with Solace PubSub+ Platform Integration
- Samsung Unveils Expanded CXL Memory Module Portfolio at Memcon 2024, Enhancing AI and HPC
- Celestial AI Closes $175M Series C Funding Round Led by US Innovative Technology Fund
- Databricks Launches DBRX: A New Standard for Efficient Open Source Models
- Astronomer Unveils New Capabilities in Astro to Streamline Enterprise Data Orchestration
- DataVisor Introduces Enhanced Anti-Money Laundering Solution to Support Financial Institutions
March 26, 2024
- Dremio Announces General Availability on Microsoft Azure
- Elastic Study Highlights Soaring Optimism in Generative AI Investment Despite Data and Security Challenges
- Dataiku and PwC Bring Practical AI Solutions to Regulated Industries
- HPE Leverages GenAI to Enhance AIOps Capabilities of HPE Aruba Networking Central Platform
- DataStax and Microsoft Collaborate to Make it Easier to Build Enterprise Generative AI and RAG Applications with Legacy Data
- SQream and Qantm AI Partner to Solve Enterprise Data and AI Challenges
- Neo4j Announces Collaboration with Microsoft to Advance GenAI and Data Solutions
- InterSystems Expands IRIS Data Platform with Vector Search to Support Next-Gen AI Applications
March 25, 2024
Most Read Features
Sorry. No data so far.
Most Read News In Brief
Sorry. No data so far.
Most Read This Just In
Sorry. No data so far.
Sponsored Partner Content
-
Supercharge Your Data Lake with Spark 3.3
-
Learn How to Build a Custom Chatbot Using a RAG Workflow in Minutes [Hands-on Demo]
-
Overcome ETL Bottlenecks with Metadata-driven Integration for the AI Era [Free Guide]
-
Gartner® Hype Cycle™ for Analytics and Business Intelligence 2023
-
The Art of Mastering Data Quality for AI and Analytics
Sponsored Whitepapers
Contributors
Featured Events
-
Data Universe
April 10 - April 11New York United States -
Call & Contact Center Expo
April 24 - April 25Las Vegas NV United States -
AI & Big Data Expo North America 2024
June 5 - June 6Santa Clara CA United States -
AI Hardware & Edge AI Summit 2024
September 10 - September 12San Jose CA United States