
Release of Dataiku 6 Features Elasticity Support, Sustainable AI, and Cross-Team Collaboration
NEW YORK, Dec. 4, 2019 –Today Dataiku released the latest version of its leading Enterprise AI and machine learning platform, Dataiku 6, which includes the ability for users to easily spin up and manage Kubernetes clusters from inside the Dataiku platform. In addition to elasticity, Dataiku 6 offers a suite of new features to empower organizations to build sustainable AI systems.
Dataiku 6 underscores the company’s continued commitment to helping businesses build AI that stands the test of time, ensuring that they are not hampered by inevitable and unforeseen developments (technological, regulatory, or otherwise) in the space. Other highlights include a new plugin store to more easily expand the power of Dataiku and improved subpopulation analysis for better model performance and avoiding model bias.
“The world of AI is moving incredibly fast, but companies can’t wait for it to slow down in order to get started. That means organizations need to ensure they are future-proofing their approach to AI,” said Florian Douetteau, Dataiku CEO. “Dataiku 6 enables enterprises to do just that by offering more features for white box AI, collaboration, efficiency, and elastic resource management to allow businesses’ AI to evolve along with the technology.”
Dataiku 6 future-proofs the path to Enterprise AI via:
Elasticity: Dataiku 6 enables users to easily spin up and manage Kubernetes clusters (on AWS, Azure, or GCP) from inside the Dataiku platform. This means that non-admin users can now quickly spin up Kubernetes clusters for optimized execution of Spark or in-memory jobs. Admins can also isolate and manage compute power so that every team gets exactly what they need to run their analysis and deploy Enterprise AI at scale.
In addition, Snowflake users will experience faster runtimes in Dataiku with the new optimized sync with WASB and native execution of Spark jobs in Snowflake. Dataiku 6 also makes it lightning fast to execute long, multi-step SQL data pipelines, allowing for an optimized compute and storage environment when working with SQL data.
White Box AI: Dataiku 6 has two new visual capabilities (partial dependence plots and subpopulation analysis) that enable users to deep-dive into key aspects of model behavior that can help teams avoid undesirable model biases. With subpopulation analysis, users can easily weed out these unintended model biases and create a more transparent and fair deployment of AI. Meanwhile, partial dependence plots help people understand complex models visually by surfacing the relationship between a feature and the target.
Cross-Team Collaboration and Efficiency: Improved IDE integrations (RStudio, VS Code, SublimeText, PyCharm) enable coders to work in their environment of choice while fueling collaboration on the Dataiku platform. Better visualization is critical for communicating data-driven systems and decisions to business stakeholders as well as for data scientists to understand and track the progress of AI projects. Dataiku 6 makes it seamless to work with external data visualization tools like Qlik and Tableau.
With the collaboration features added in Dataiku 6, data analysts can easily leverage the new plugins store and reuse code created by data engineers and data scientists in their everyday workflows. With features like custom model plugins for visual machine learning and custom charts, coders can now create and share beautiful visuals and custom machine learning models with non-coders.
For an in-depth look at Dataiku 6, read the release notes.
To receive a demo of Dataiku, including a look at the new features in action, sign up for the webinar on December 11th.
About Dataiku
Dataiku is the centralized data platform that democratizes the use of data science, machine learning, and AI in the enterprise. With Dataiku, businesses are uniquely empowered to move along their data journey from data preparation to analytics at scale to Enterprise AI. By providing a common ground for data experts and explorers, a repository of best practices, shortcuts to machine learning and AI deployment/management, and a centralized, controlled environment, Dataiku is the catalyst for data-powered companies.
Source: Dataiku
August 18, 2025
- Oracle Deploys OpenAI GPT-5 Across Database and Cloud Applications Portfolio
- Penn and Ai2 Release 400K Synthetic Images to Advance AI’s Visual Reasoning
- MSU: Decades of Data Point to Widespread Butterfly Loss in Midwest
August 15, 2025
- SETI Institute Awards Davie Postdoctoral Fellowship for AI/ML-Driven Exoplanet Discovery
- Los Alamos Sensor Data Sheds Light on Powerful Lightning Within Clouds
- Anaconda Report Reveals Need for Stronger Governance is Slowing AI Adoption
August 14, 2025
- EDB Accelerates Enterprise AI Adoption with NVIDIA
- Oracle to Offer Google Cloud’s Gemini AI Models Through OCI Generative AI
- Grafana Labs Launches Public Preview of AI Assistant for Observability and Monitoring
- Striim Launches 5.2 with New AI Agents for Real-Time Predictive Analytics and Vector Embedding
- G42 Launches OpenAI GPT-OSS Globally on Core42’s AI Cloud
- SuperOps Launches Agentic AI Marketplace Partnering with AWS
- MinIO Launches MinIO Academy as AI Adoption Drives Demand for Object Storage Expertise
- DataBank Reports 60% of Enterprises Already Seeing AI ROI or Expect to Within 12 Months
- SnapLogic Surpasses $100M ARR as Founder Gaurav Dhillon Retires
August 13, 2025
- KIOXIA Advances AI Server Infrastructure Scalability, Accelerating Storage Performance and Density
- Qubrid AI Debuts 2-Step No-Code Platform to Chat Directly with Proprietary Data
- Redpanda Provides Kafka-Compatible Streaming for NYSE Cloud Services
- Treasure Data Introduces ‘No Compute’ Pricing, Delivering Predictable Economics with Hybrid CDP Architecture
- Couchbase: New Enterprise Analytics Brings Next-Gen JSON Analytics to Self-Managed Deployments
- Top 10 Big Data Technologies to Watch in the Second Half of 2025
- Rethinking Risk: The Role of Selective Retrieval in Data Lake Strategies
- LinkedIn Introduces Northguard, Its Replacement for Kafka
- Apache Sedona: Putting the ‘Where’ In Big Data
- What Are Reasoning Models and Why You Should Care
- Scaling the Knowledge Graph Behind Wikipedia
- Why Metadata Is the New Interface Between IT and AI
- Why OpenAI’s New Open Weight Models Are a Big Deal
- LakeFS Nabs $20M to Build ‘Git for Big Data’
- Doing More With Your Existing Kafka
- More Features…
- Mathematica Helps Crack Zodiac Killer’s Code
- Promethium Wants to Make Self Service Data Work at AI Scale
- BigDATAwire Exclusive Interview: DataPelago CEO on Launching the Spark Accelerator
- The Top Five Data Labeling Firms According to Everest Group
- McKinsey Dishes the Goods on Latest Tech Trends
- Supabase’s $200M Raise Signals Big Ambitions
- Solidigm Celebrates World’s Largest SSD with ‘122 Day’
- AI Skills Are in High Demand, But AI Education Is Not Keeping Up
- Google Pushes AI Agents Into Everyday Data Tasks
- Collate Focuses on Metadata Readiness with $10M Series A Funding
- More News In Brief…
- Seagate Unveils IronWolf Pro 24TB Hard Drive for SMBs and Enterprises
- OpenText Launches Cloud Editions 25.3 with AI, Cloud, and Cybersecurity Enhancements
- Gartner Predicts 40% of Generative AI Solutions Will Be Multimodal By 2027
- StarTree Adds Real-Time Iceberg Support for AI and Customer Apps
- Gathr.ai Unveils Data Warehouse Intelligence
- Deloitte Survey Finds AI Use and Tech Investments Top Priorities for Private Companies in 2024
- LF AI & Data Foundation Hosts Vortex Project to Power High Performance Data Access for AI and Analytics
- Dell Unveils Updates to Dell AI Data Platform
- Zscaler Unveils Business Insights with Advanced Analytics for Smarter SaaS Spend and Resource Allocation
- Collibra Acquires Deasy Labs to Extend Unified Governance Platform to Unstructured Data
- More This Just In…