January 28, 2020

New Library Adds Causality to ML Models

George Leopold

A new open source library is designed to help data scientists and domain experts jointly develop machine learning models based on causal relationships rather than just data correlations. The developers of the new CausalNex library argue that running machine learning projects without considering causality can lead to faulty conclusions.

QuantumBlack, a data analytics unit of McKinsey & Co., said CausalNex is its second open source release after Kedro, a library aimed at production ML code. Its new machine learning project is designed to help data scientists infer causal relationships in data. Dependencies in data can be expressed in a network graph that can then be inspected by domain experts. The collaborative approach is designed to eliminate spurious correlations in ML models, QuantumBlack said.

The library for causal reasoning applies “what-if” analysis to Bayesian networks on the assumption that the probabilistic model is more intuitive in describing causality than traditional ML frameworks based on correlation analysis and pattern recognition.

“CausalNex is built on our collective experience to leverage Bayesian networks to identify causal relationships in data so that we can develop the right interventions from analytics,” the developer said.

Bayesian networks have been used previously to build causal models, but the process often requires multiple libraries used to test how different data points relate to or influence each other. The requirement for multiple libraries often hindered subject objects who could otherwise spot relationships among variables with relative ease.

A single CausalNex library allows data scientists to develop network graphs that can be amended by subject experts. That collaboration builds trust in the resulting models. “Causal relationships are more accurate if we can easily encode or augment domain expertise in the graph model,” developers noted.

The company began work on CausalNex in 2018 as clients’ machine learning projects ran into causality issues. In February 2019, researchers combined data gleaned from internal projects into the software that underlies the new library.

Prior to its release to the open source community, QuantumBlack developers refined the library for implementation of Bayesian networks.

CausalNex is available now on GitHub.

Recent items:

Can Markov Logic Take Machine Learning to the New Level?

Why Knowledge Graphs are Foundational to Artificial Intelligence

Applications: Enterprise Analytics

Technologies: Frameworks

Sectors: Financial Services, Other

Vendors: QuantumBlack

Tags: Bayesian networks, casualty, CausalNex, Kedro, machine learning, open source software

Only registered users may comment. Register using the form below.

Check off newsletters you would like to receive*
- HPCwire
- EnterpriseTech
- Datanami
- Technology Conferences & Events
- Advanced Computing Job Bank
- Technology Product Showcase
Email*
Name*
First Last
Organization*
Job Function*
Industry*
Country*
City*
State*
Province*
- Please check here to receive valuable email offers from Datanami on behalf of our select partners.

New Library Adds Causality to ML Models

Join the discussion Cancel reply

Only registered users may comment. Register using the form below.

April 19, 2024

April 18, 2024

April 17, 2024

Sponsored Partner Content

Get your Data AI Ready – Celebrate One Year of Deep Dish Data Virtual Series!

Supercharge Your Data Lake with Spark 3.3

Learn How to Build a Custom Chatbot Using a RAG Workflow in Minutes [Hands-on Demo]

Overcome ETL Bottlenecks with Metadata-driven Integration for the AI Era [Free Guide]

Gartner® Hype Cycle™ for Analytics and Business Intelligence 2023

The Art of Mastering Data Quality for AI and Analytics

Leading Solution Providers

Tabor Network

Sponsored Whitepapers

Building an Operational Data Warehouse for Real-time Analytics

Can You Use Kafka as a Database?

Sponsored Multimedia

The Power of DataOps: Bring Automation to Life
No Comments

Tactical Steps for Cloud Migration
No Comments

Immuta Data Access Platform
No Comments

Data Mesh: Fact or Fiction?
No Comments

Contributors

Featured Events

Call & Contact Center Expo

AI & Big Data Expo North America 2024

AI Hardware & Edge AI Summit 2024

CDAO Government 2024

New Library Adds Causality to ML Models

Join the discussion Cancel reply

Only registered users may comment. Register using the form below.

April 19, 2024

April 18, 2024

April 17, 2024

Most Read Features

Most Read News In Brief

Most Read This Just In

Sponsored Partner Content

Leading Solution Providers

Tabor Network

Sponsored Whitepapers

Sponsored Multimedia

Contributors

Featured Events

Share

Copy short link