
ChatGPT Gives Kinetica a Natural Language Interface for Speedy Analytics Database

(SuPatMaN/Shutterstock)
It would normally take quite a bit of complex SQL to tease a multi-pronged answer out of Kinetica’s high-speed analytics database, which is powered by GPUs but wire-compataible with Postgres. But with the new natural language interface to ChatGPT unveiled today, non-technical users can get answers to complex questions written in plain English.
Kinetica was incubated by the U.S. Army over a decade ago to pour through huge mounds of fast-moving geospatial and temporal data in search of terrorist activity. By leveraging the processing capability of GPUs, the vector database could run full table scans on the data, whereas other databases were forced to winnow down the data with indexes and other techniques (it has since embraced CPUs with Intel’s AVX-512).
With today’s launch of its new Conversational Query feature, Kinetica’s massive processing capability is now within the reach of workers who lack the ability to write complex SQL queries. That democratization of access means executives and others with ad-hoc data questions are now able to leverage the power of Kinetica’s database to get answers.
The vast majority of database queries are planned, which enables organizations to write indexes, de-normalize the data, or pre-compute aggregates to get those queries to run in a performant way, says Kinetica co-founder and CEO Nima Negahban.

A user can submit a natural langauge query directly on the Kinetica dashboard, which ChatGPT converts to SQL for execution
“With the advent of generative large language models, we think that that mix is going to change to where a lot bigger portion of it’s going be ad hoc queries,” Negahban tells Datanami. “That’s really what we do best, is do that ad hoc, complex query against large datasets, because we have that ability to do large scans and leverage many-core compute devices better than other databases.”
Conversational Query works by converting a user’s natural language query into SQL. That SQL conversion is handled by OpenAI’s ChatGPT large language model (LLM), which proven itself to be a quick learner of language–spoken, computer, and otherwise. OpenAI API then returns the finalized SQL, and users can then choose to execute it against the database directly from the Kinetica dashboard.
Kinetica is leaning on the ChatGPT model to understand the intent of language, which is something that it’s very good at. For example, to answer the question “Where do people hang out the most?” from a massive database of geospatial data of human movement, ChatGPT is smart enough to know that “hang out” is a synonym for “dwell time,” which is how the data is officially identified in the database. (The answer, by the way, is 7-Eleven.)
Kinetica is also doing some work ahead of time to prepare ChatGPT to generate good SQL through its “hydration” process, says Chad Meley, Kinetica’s chief marketing officer.
“We have native analytic functions that are callable through SQL and ChatGPT, through part of the hydration process, becomes aware of that,” Meley says. “So it can use a specific time-series join or spatial join that we make ChatGPT aware of. In that way, we go beyond your typical ANSI SQL functions.”
The SQL generated by ChatGPT isn’t perfect. As many are aware, the LLM is prone to seeing things in the data, the so-called “hallucination” problem. But even though it’s SQL isn’t completely free of defect, ChatGPT is still quite useful at this state, says Negahban, who was a 2018 Datanami Person to Watch.
“I’ve seen that it’s kind of good enough,” he says. “It hasn’t been [wildly] wrong in any queries it generates…I think it will be better with GPT-4.”
In the end analysis, by the time it takes a SQL pro to write the perfect seven-way join and get it over to the database, the opportunity to act on the data may be gone. That’s why the pairing of a “good enough” query generator with a database as powerful as Kinetica can make a different for decision-makers, Negahban says.
“Having an engine like Kinetica that can actually do something with that query without having to do planning beforehand” is the big get, he says. “If you try to do some of these queries with the Snowflake, or insert your database du jour, they really struggle because that’s just not what they’re built for. They’re good at other things. What we’re really good at, as an engine, is to do ad hoc queries no matter the complexity, no matter how many tables are involved. So that really pairs well with this ability for anyone to generate SQL across all their data asking questions about all the data in their enterprise.”
Conversational Query is available now in the cloud and on-prem versions of Kinetica.
Related Items:
ChatGPT Dominates as Top In-Demand Workplace Skill: Udemy Report
Bank Replaces Hundreds of Spark Streaming Nodes with Kinetica
Preventing the Next 9/11 Goal of NORAD’s New Streaming Data Warehouse
May 26, 2023
- Alteryx Launches Maveryx Community to Celebrate the Power of Analytics for All
- The 2023 Innovative Data Infrastructure Forum: Envisioning the Yottabyte Era
- Airbyte’s Pioneering State of Data Survey Sheds Light on Global Data Engineering Practices
- ZEE5 Moves to ScyllaDB NoSQL to Enhance User Experience at Scale with Predictable Costs
- IBM and Microsoft to Sponsor Carruthers and Jackson’s Annual Summer School for Data Leaders
May 25, 2023
- New Software by Integral Enables Companies to Maximize Dataset Customization and Privacy
- MongoDB and Alibaba Cloud Extend Global Partnership
- TomTom and Alteryx Deepen Partnership with New Location Intelligence Offering
- Habu Announces Collaboration With Microsoft Azure to Deliver Zero-Trust Data Clean Room
- Opaque Systems and Confidential Computing Consortium Unveil Speaker Lineup for First-Ever Confidential Computing Summit
- Snorkel AI Announces 3rd Annual Future of Data-Centric AI Conference
- SEALSQ Unveils AI-Powered Data Pipelines for Real-Time IoT Satellite and Semiconductor Data Processing
- Ocient’s Latest Release Elevates Hyperscale Data Analytics with Real-time Features and BI Tools Support
May 24, 2023
- DataStax and ThirdAI Announce Partnership to Democratize Access to Advanced AI Tech
- Alteryx Unveils Unified Platform Experience to Accelerate Analytics Automation
- Census Launches dbt_census_utils, the First dbt Macros for Data Activation
- Alteryx Announces New Generative AI Capabilities to Supercharge Analytics Democratization
- Dataiku Survey Reveals 84% of Companies Are Using Low- and No-Code Platforms to Achieve AI Goals
- Domino Data Lab Gears Up for Rev 4, a Crucial Convergence of AI Luminaries and Fortune 100 Leaders
- Alteryx’s New Research Highlights the Role of Data and Tech in Enhancing Decision Intelligence Across Organizations
Most Read Features
- Tableau Jumps Into Generative AI with Tableau GPT
- Data Mesh Vs. Data Fabric: Understanding the Differences
- Vector Databases Emerge to Fill Critical Role in AI
- Which BI and Analytics Vendors Are Incorporating ChatGPT, and How
- Google Claims Its TPU v4 Outperforms Nvidia A100
- LLMs Are the Dinosaur-Killing Meteor for Old BI, ThoughtSpot CEO Says
- Data Quality Is Getting Worse, Monte Carlo Says
- The Semantic Layer Architecture: Where Business Intelligence is Truly Heading
- Open Source Provides Path to Real-Time Stream Processing
- Big Data File Formats Demystified
- More Features…
Most Read News In Brief
- Microsoft Unifies Data Management, Analytics, and ML Into ‘Fabric’
- Mathematica Helps Crack Zodiac Killer’s Code
- Nine Things I Learned at Tableau Conference 2023
- Informatica Claims 80% Speedup for Data Management Tasks with LLMs
- Big Data Career Notes: May 2023 Edition
- AI Chatbots: A Hedge Against Inflation?
- IBM Embraces Iceberg, Presto in New Watsonx Data Lakehouse
- We’re Still in the ‘Wild West’ When it Comes to Data Governance, StreamSets Says
- Venture Capital Funding Plummets, But AI Investment Growing Strong
- ChatGPT Gives Kinetica a Natural Language Interface for Speedy Analytics Database
- More News In Brief…
Most Read This Just In
- ServiceNow and Hugging Face Release StarCoder LLM for Code Generation
- Sumo Logic Names Joe Kim as President and CEO
- New Relic Debuts Grok Generative AI Observability Assistant
- Google Cloud’s Generative AI Revolutionizing Workplace Applications: Major Enterprise Partnerships Announced
- Pega Announces Pega GenAI to Infuse Generative AI Capabilities in Pega Infinity ’23
- Red Hat OpenShift AI Accelerates Generative AI Adoption Across the Hybrid Cloud
- Pinecone Raises $100M Series B to Provide Long-Term Memory for AI
- Francisco Partners Completes Acquisition of Sumo Logic
- MariaDB Unveils Distributed SQL Vision at OpenWorks 2023, Boosting Scalability for MySQL and PostgreSQL Communities
- Google Cloud Unveils A3 GPU Supercomputer: Next-Gen Power for Advanced AI Models
- More This Just In…
Sponsored Partner Content
Sponsored Whitepapers
Sponsored Multimedia
Contributors
Featured Events
-
IEEE Conference on Artificial Intelligence 2023
June 5 @ 8:00 am - June 6 @ 5:00 pmSanta Clara CA United States -
Enterprise Data Summit
June 7 -
CDAO Insurance 2023
June 13 - June 14 -
ODSC Europe 2023
June 14 - June 15London United Kingdom