DataRobot unveiled several enhancements in its automated machine learning platform today, including the introduction of features like composable ML and continuous AI. The company, which is holding a virtual conference today and tomorrow, also bought Zepl, the data science notebook startup founded by the backers of Apache Zepplin.
During a session at DataRobot’s AI Experience Worldwide virtual conference, Nenshad Bardoliwalla, the company’s senior vice president of product, led attendees through a roundup of the company’s 2020 accomplishments, and laid out a roadmap for what DataRobot hopes to do in 2021.
“2020 was a busy year at DataRobot,” said Bardoliwalla, who joined DataRobot in late 2019 when it acquired Paxata, the data preparation company he co-founded. “Perhaps we had a little more time on keyboards, not having to commute during the pandemic.”
For starters, DataRobot completed the integration of the Paxata data prep tools, he said. It also rounded out its end-to-end data pipelines to support model training, inference, and continouous improvement, Bardoliwalla said.
“We enhanced AutoML significantly with the addition of visual AI to automate and democratize computer vision models, location AI to add geospatial awareness, and automated deep learning to help you try new approaches on your tabular text and your image data,” he told the audience. “We also added automatic feature discovery to automate the laborious and repetitive feature engineering activities from multiple related datasets into a single operation.”
2020 also saw DataRobot bolster its time-series capabilities with automated anomaly detection; add support for Keras deep learning libraries; roll out new MLOps agents to support remote models; add a champion-challenger framework to stress-test ML models; introduce an application gallery; and roll out a value tracker for AI.
“I believe we set the bar high for innovation,” Bardoliwalla said. “And now we plan to surpass it.”
The company already rolled out its first new release in March with the launch of DataRobot version 7.0. That release brought enhancements in MLOps, model baseline comparison, computer vision, and data prep.
With today’s announcement, the company is once again bolstering capabilities across its expansive AI platform, including the introduction of what the company is calling “composable ML,” which Bardoliwalla called “another first in the field of data science and machine learning.”
Composable ML is a feature for the most advanced “code-first” data scientists that gives them more capabilities to tweak and tune their ML models, Bardoliwalla said.
“You’ll be able to copy, edit, and even reconfigure DataRobot’s blueprints to fit the specific needs of your use case and the best practices unique to your business,” he told the audience in today’s session. “Within any DataRobot blueprint, you’ll be able to integrate your own Python and R code to add new custom tabs into our model. You can even bring your own training models built from scratch by your best data scientists and run them in our head-to-head model competition. Once you see how they place on our leaderboard, you can decide which model” to use.
Meanwhile, the new “Continuous AI” feature is designed to bolster the DataRobot MLOps product to fine-tune the retraining policies for machine learning models.
“You will now be able to set automatic retraining policies for your production models based on a schedule of your choosing, or on the occurrence of a significant event, such as the detection of data drift or a dip in model accuracy,” Bardoliwalla said.
Continuous AI will leverage DataRobot’s champion-challenger framework to analyze the performance of production models and recommend when the model should be replaced. It’s all about automating the tasks that are mostly done manually by data science practitioners today, Bardoliwalla said.
“Just imagine how much time and energy you could have saved this time last year when Covid made everyone’s models inaccurate, forcing lengthy re-training and replacement projects,” he said. “Now your modes will stay accurate for as a long as they live in your production environment, no matter what crazy thing is going on across the world.”
This release also brings a new bias and fairness tool that monitors for disparate impacts by ML models. DataRobot has been at the forefront of tackling the bias concerns in AI, having produced the “State of AI Bias” report in 2019, and hiring a new Global AI Ethicist, Haniyeh Mahmoudian, three weeks ago. It also released a model grader, which is a new tool to evaluate existing AI models and generate an automatic scorecard grading them across data auality, robustness, accuracy, and fairness.
The company also debuted its no code AI app builder for practitioners who don’t want to write code in Python or R. The software presents a GUI that allows users to build ML models in a drag-and-drop fashion using widgets and pre-built templates.
Finally, DataRobot acquired Zepl, the Silicon Valley firm behind the open source Apache Zeppelin data science notebook. Zepl, which had raised $13.1 million, owns a cloud product that allows users to analyze data and develop ML models in the interactive Zepplin environment. The offering, which emerged from beta in 2018, works with multiple languages and production environments, including Apache Spark environments, such as Databricks’.
DataRobot says it will integrate Zepl as “a cloud-native, self-service notebook” in its AI environment. In addition to enabling users to build models in a notebook environment (similar to Jupyter), they’ll be able to collaborate with other data scientists using Zepl’s integration with DataRobot, the company says.
“We have always known that to lead the AI market, we must embrace all creators of AI systems, from analysts and citizen data scientists who prefer using a GUI to advanced data scientists who love to code,” said Dan Wright, who took over the CEO seat at DataRobot in March following the departure of co-founder Jeremy Achin.
“Through the addition of Zepl, we now give advanced data scientists more flexibility to use our enterprise AI platform within their existing workflows, including the ability to use their own code. By incorporating Zepl into the DataRobot platform, we plan to further democratize data science across every enterprise and significantly accelerate our code-centric roadmap,” he says.
The company’s AI Experience Worldwide continues tomorrow at https://aiworldwide.datarobot.com.
Related Items:
DataRobot Introduces AI-Powered Anti-Pandemic Initiative ‘ContagionNET’
DataRobot Eyes IPO After Another VC Haul
Apache Zeppelin Launches Latest Data Science Notebook
March 18, 2024
- Snowflake Partners with NVIDIA to Deliver Full-Stack AI Platform for Customers
- HPE Debuts End-to-End AI-Native Portfolio for Generative AI
- NetApp Partners with NVIDIA to Transform Generative AI with Advanced Data Retrieval Tech
- Pure Storage Accelerates Enterprise AI Adoption to Meet Growing Demands with NVIDIA AI
- OpenText Announces OpenText World Europe 2024: A Gathering of Global Minds in London, Munich, and Paris
- Machine Learning Summit 2024 Kicks Off Shanghai Leg
- Algolia Ecommerce Report: AI-Driven Search Experiences Go Mainstream, Supercharge Revenue Generation
- SQream Demonstrates Next-Level AI and ML Data Processing Capabilities at NVIDIA GTC
- Voltron Data Advances Theseus, Making It the First Petabyte Scale Query Engine for Large Scale Data Processing
- Cisco Completes Acquisition of Splunk
March 14, 2024
- InfluxData Collaborating with AWS to Bring InfluxDB and Time Series Analytics to Developers Around the World
- Starfish Storage Celebrates a Decade of Leadership in Metadata-Driven Unstructured Data Management
- StreamNative Simplifies Data Streaming with New Apache Kafka Offering
- Airbyte Announces Industry-Leading 5,000 Data Connectors Built with No-Code Builder
- Microsoft and Oracle Expand Partnership to Satisfy Global Demand for Oracle Database@Azure
- Three-Quarters of Organizations Have Reached High Levels of AI Maturity, LXT Survey Finds
- Yugabyte Achieves PCI DSS Level 1 Compliance, Validating Secure and Scalable Distributed PostgreSQL for Financial Institutes
- Data product management pioneer Mindfuel closes €3.75m seed funding to launch across Europe
March 13, 2024
Most Read Features
Sorry. No data so far.
Most Read News In Brief
Sorry. No data so far.
Most Read This Just In
Sorry. No data so far.
Sponsored Partner Content
-
Supercharge Your Data Lake with Spark 3.3
-
Learn How to Build a Custom Chatbot Using a RAG Workflow in Minutes [Hands-on Demo]
-
Overcome ETL Bottlenecks with Metadata-driven Integration for the AI Era [Free Guide]
-
Gartner® Hype Cycle™ for Analytics and Business Intelligence 2023
-
The Art of Mastering Data Quality for AI and Analytics
Sponsored Whitepapers
Contributors
Featured Events
-
Memory Con 2024
March 26 - March 27Mountain View CA United States -
Data Universe
April 10 - April 11New York United States -
Call & Contact Center Expo
April 24 - April 25Las Vegas NV United States -
AI & Big Data Expo North America 2024
June 5 - June 6Santa Clara CA United States -
AI Hardware & Edge AI Summit 2024
September 10 - September 12San Jose CA United States