DataStax just announced the general availability of its vector search capability in Astra DB, its DBaaS built on Apache Cassandra.
Vector search is a must-have capability for building generative AI applications. In machine learning, vector embeddings are the distilled representations of raw training data and act as a filter for running new data through during inference. Training a large language model results in potentially billions of vector embeddings.
Vector databases store these embeddings and perform a similarity search to find the best match between a user’s prompt and the vectorized training data. Instead of searching with keywords, embeddings allow users to conduct a search based on context and meaning to extract the most relevant data.
There are native databases specifically built to manage vector embeddings, but many relational and NoSQL databases (like Astra DB) have been modified to include vector capabilities due to the demand surrounding generative AI.
This demand is palpable: McKinsey estimates that generative AI could potentially add between $2.6 and $4.4 trillion in value to the global economy. DataStax CPO Ed Anuff noted in a release that databases capable of supporting vectors are crucial to tapping into the potential of generative AI as a sustainable business initiative.
“An enterprise will need trillions of vectors for generative AI so vector databases must deliver limitless horizontal scale. Astra DB is the only vector database on the market today that can support massive-scale AI projects, with enterprise-grade security, and on any cloud platform. And, it’s built on the open source technology that’s already been proven by AI leaders like Netflix and Uber,” he said.
DataStax says one advantage of vector search within Astra DB is that it can help reduce AI hallucinations. LLMs are prone to fabricating information, called hallucinating, which can be damaging to business. This vector search release includes Retrieval Augmented Generation (RAG), a capability that grounds search results within specific enterprise data so that the source of information can be easily pinpointed.
Data security is another factor to consider with generative AI deployment, as many AI use cases involve sensitive data. DataStax says Astra DB is PCI, SOC2, and HIPAA enabled so that companies like Skypoint Cloud Inc., which offers a data management platform for the senior living healthcare industry, can use Astra DB as a vector database for resident health data.
“Envision it as a ChatGPT equivalent for senior living enterprise data, maintaining full HIPAA compliance, and significantly improving healthcare for the elderly,” said Skypoint CEO Tisson Mathew in a statement.
To support this release, DataStax also created a Python library called CassIO aimed at accelerating vector search integration. The company says this software framework easily integrates with popular LLM software like LangChain and can maintain chat history, create prompt templates, and cache LLM responses.
The new vector search capability is available on Astra DB for Microsoft Azure, AWS, and Google Cloud. The company also says vector search will be available for customers running DataStax Enterprise, the on-premises, self-managed offering, within the month.
Matt Aslett of Ventana Research expects generative AI adoption to grow rapidly and says that through 2025, one-quarter of organizations will deploy generative AI embedded in one or more software applications.
“The ability to trust the output of generative AI models will be critical to adoption by enterprises. The addition of vector embeddings and vector search to existing data platforms enables organizations to augment generic models with enterprise information and data, reducing concerns about accuracy and trust,” he said.
Related Items:
Vector Databases Emerge to Fill Critical Role in AI
DataStax Bolsters Real-Time Machine Learning with Kaskada Buy
DataStax Nabs $115 Million to Help Build Real-Time Applications
May 16, 2024
- Melissa Launches Data Marketplace, One-Stop Shop for Comprehensive Customer Data
- Qlik Connect 2024 to Explore Ethical AI and Integrated Data Strategies
- MetaRouter Launches ACCELERATE to Simplify Server-Side Data Collection
May 15, 2024
- HPE Expands HPE GreenLake Cloud Solutions to Simplify Hybrid Cloud Management
- WEKA Nets $140M in Series E Funding at $1.6B Valuation
- Alteryx AiDIN Copilot Leverages Google Cloud’s Gemini AI for Enhanced Data Interaction
- DataStax to Launch New Hyper-Converged Data Platform with Vector Search and GenAI Capabilities
- Hazelcast Showcases Real-Time Data Platform at 2024 Gartner Summit
- StreamNative Launches Ursa, a Kafka-Compatible Data Streaming Engine
- Anomalo Brings Its AI-Powered Data Quality Platform to Google Cloud Marketplace
May 14, 2024
- NetApp Unveils Unified Data Storage Built for the AI Era
- SensiML Launches First Complete Open-Source AutoML Solution for Edge AI/ML Development
- ScaleOut Software Expands Digital Twins Potential with Version 3 Release
- SolarWinds Unveils Enhanced Database Performance Analyzer with Advanced PostgreSQL Support
- New dbt Cloud Enhancements Empower Organizations with Trustworthy Data At Scale
- Spectra Logic and Geyser Data Partner to Launch Industry-First Tape-as-a-Service Offering
- Gartner Says Data & Analytics Leaders Must Use Collective Intelligence to Drive Business Value
May 13, 2024
Most Read Features
Sorry. No data so far.
Most Read News In Brief
Sorry. No data so far.
Most Read This Just In
Sorry. No data so far.
Sponsored Partner Content
-
Get your Data AI Ready – Celebrate One Year of Deep Dish Data Virtual Series!
-
Supercharge Your Data Lake with Spark 3.3
-
Learn How to Build a Custom Chatbot Using a RAG Workflow in Minutes [Hands-on Demo]
-
Overcome ETL Bottlenecks with Metadata-driven Integration for the AI Era [Free Guide]
-
Gartner® Hype Cycle™ for Analytics and Business Intelligence 2023
-
The Art of Mastering Data Quality for AI and Analytics
Sponsored Whitepapers
Contributors
Featured Events
-
AI & Big Data Expo North America 2024
June 5 - June 6Santa Clara CA United States -
CDAO Canada Public Sector 2024
June 18 - June 19 -
AI Hardware & Edge AI Summit Europe
June 18 - June 19London United Kingdom -
AI Hardware & Edge AI Summit 2024
September 10 - September 12San Jose CA United States -
CDAO Government 2024
September 18 - September 19Washington DC United States