

Enterprise metadata management (EMM) tools are fast becoming a necessity in this new age when companies are drowning in a flood of metadata. NoSQL databases are a popular alternative to their relational database counterparts for many use cases, including EMM.
The most basic type of NoSQL database is the key-value store, a database composed of a key and an associated data value (which can be a number, a string, or even another set of key-value pairs). A key-value store’s simple structure has speed and performance advantages for certain use cases where horizontal scaling is needed, such as serving queries on read-only data on large websites with heavy traffic.
A popular embedded key-value store is RocksDB, an open source project with a large community of contributors created in 2012. Now there’s a new key-value player in town: Speedb. The company bills its data storage engine as a drop-in replacement for RocksDB.
According to Speedbd, not all key-value stores are created equal in terms of performance and scale, especially for managing today’s growing volumes of metadata that are increasingly important for business operations. Speedb lists some of the problems RocksDB users have faced, including I/O hangs and stalls, code instability, database size limitations, excessive tuning, sharding, and write amplification.

A still from an introductory video shows the capabilities of Speedb. Source: Speedb
Adi Gelvan, Speedb co-founder and CEO, was working for Infinidat where he needed an EMM solution that did not involve beefing up system hardware or restructuring his entire existing data stack. He landed on RocksDB but quickly realized that it would not be scalable past 100GB. It was suggested he would need to shard RocksDB by breaking down his datasets into more manageable chunks and assigning each one to its own node with its own storage engine, but Gelvan was not thrilled with the idea of the extra work and complexity that would entail.
It was then, in Nov. 2020, that Gelvan and his co-founders decided to build their own data engine that was hyper-scalable, high-performance and used less computing power, and Speedb was born.
RocksDB is organized as a log-structured merge (LSM) tree, a structure that maintains the key-value pairs needed in a key-value store. For their engine, Gelvan and his colleagues re-implemented and improved upon the LSM tree structure to increase its efficiency while making sure it was compatible with the RocksDB API in order to appeal to current RocksDB users. The co-founders claim their engine can process 100x more data, at 10x the speed, using 80% fewer resources than RocksDB.
Speedb’s website explains how it works: “The Speedb Data Engine is based on a revolutionary compaction method that reduces the write amplification factor (WAF) from ~30 to ~5. As a result, Speedb eliminates processing latency issues and throughput drops, which are frequently encountered when using traditional LSM-trees, while significantly reducing CPU utilization and memory consumption.
“By redesigning the RocksDB I/O and job schedulers we were able to further improve performance stability and reduce stalls. We have also redesigned RocksDB’s flow control mechanism to eliminate spikes in user latency. Based on these technological breakthroughs, Speedb supports unprecedently fast writes even on large datasets while keeping a B-Tree like read performance.”
The engine also features a probabilistic index that uses a hierarchical data map and consumes less than 3 bytes per object, regardless of its size. Indexes for large datasets can be stored in DRAM which allows for mapping hundreds of billions of objects with only one media access per read. For enterprise users, Speedb is optimized with real-time monitoring, adaptive auto-tuning of system parameters, advanced reporting capabilities, and enterprise-grade support and customization for specific use-cases.
Among Speedb’s first business partners is Redis, an in-memory data structure store with offices in the same Tel Aviv building where Gelvan developed Speedb. In December 2021, about a month after the Speedb engine’s official launch, the company announced a $4 million funding round and the Redis partnership. As the storage engine for the Redis on Flash database, Speedb says Redis on Flash users can double their throughput and capacity while reducing latency, driving up performance, and reducing costs.
This week, Speedb revealed that hybrid cloud security firm XM Cyber has adopted Speedb to improve its Attack Path Management platform, a product that “lets customers continuously see their on-premises and cloud networks through the eyes of an attacker, and spot attacks before they happen.” The platform provides simulations of attack paths and uses Apache Flink (which uses RocksDB) for data operations. As the company’s metadata grew, RocksDB slowed it all down with memory bottlenecks and performance degradation, but now, Speedb has provided between 8.7 and 10.2x greater performance for the platform.
“I’ve rarely seen such a fast, elegant and simple solution to a deep-tech challenge as we experienced with Speedb,” said Yaron Shani, senior researcher and technology lead at XM Cyber. “Speedb’s impact was instantaneous, after simply replacing a few lines in the Docker files. Its dramatic improvement in memory utilization and performance allows us to give our customers better products and services than ever before. During the process of working together, we even discovered unique problems we were unaware of. Speedb is now deployed in our main build that goes to all customers, large and small.”
To learn more about the Speedb data engine, visit this link.
Related Items:
Five Emerging Trends in Enterprise Data Management
There’s a NoSQL Database for That
The Future of Data Management: It’s Already Here
August 8, 2022
August 5, 2022
- Datajoin Raises $3.5M Seed to Fix Marketers’ Broken Tech Stack
- Talend Announces Support for Amazon Redshift Serverless
August 4, 2022
- Ahana Awarded Industry Recognition for Big Data Analytics and Presto Innovations
- NeuroBlade Wins Innovation Award at Flash Memory Summit 2022
- Forrester Recognizes Zenoss in New AIOps Report
- Next Pathway Partners with Microsoft to Migrate Customers to Azure
- Census Achieves Premier Partner Status with Snowflake
- SingleStore Selects AppDirect to Power Real-Time Marketplace
- Zilliz Announces Key Contributions to Milvus 2.1
August 3, 2022
- VisualCortex and i-PRO Collaborate on Enterprise-Wide Computer Vision Tech Deployments
- ManageEngine Releases SaaS Version of Analytics Plus
- Pliops Collaborates with Partners to Break Through Data Scalability Barriers at FMS
- mParticle Acquires AI Startup Vidora
- Tricentis Survey Reveals Majority of Organizations Recognize Value in AI-Augmented DevOps
- Gigamon: Deep Observability Forecasted to Grow From $278M to $2B by 2026
- Pavilion Announces Flash Array 7X Database Performance Gains
- Micro Focus’ CyberRes Partners with Google Cloud
- SentinelOne and Cribl Partner to Deliver Data Flexibility Across Cybersecurity and Observability
- Seagate Addresses Hyperscale Workloads with New Enterprise-Class Nytro SSDs
Most Read Features
- How Intuit Is Building AI, Analytics, and Streaming on One Lakehouse
- A Dozen Questions for Databricks CTO Matei Zaharia
- The Race to Ensure Post Quantum Data Security
- Databricks Claims 30x Advantage in the Lakehouse, But Does It Hold Water?
- Altair Gives Legacy SAS Code a New Place to Run
- Esri Puts Federal Climate Change Action on the Map
- Esri Melds GIS with AI, Graph, and Analytics
- Big Data File Formats Demystified
- Data Mesh Vs. Data Fabric: Understanding the Differences
- StarRocks Brings Speedy OLAP Database to the Cloud
- More Features…
Most Read News In Brief
- Meta Releases AI Model That Translates Over 200 Languages
- Mathematica Helps Crack Zodiac Killer’s Code
- IBM Research Open-Sources Deep Search Tools
- FeatureByte Raises $5.7M to Fix the Weakest Link in AI
- SingleStore is the Newest Data Unicorn with $116M Funding Round
- Google Debuts LaMDA 2 Conversational AI System and AI Test Kitchen
- TIBCO’s ModelOps Takes AI Models Out of the Lab and Onto on the Road
- PowerSchool Launches K-12 Education Cloud Platform with Snowflake
- Altoros Report Compares Two NoSQL Databases
- EMR Serverless Now Available from AWS
- More News In Brief…
Most Read This Just In
- Samsung Launches 24Gbps GDDR6 DRAM
- Fiddler Announces Updates to AI Model Management Platform
- Intel Releases Open Source AI Reference Kits
- SingleStore and Intel Collaborate to Deliver Real-Time Data Technology
- Grafana Labs Introduces K8s Monitoring in Grafana Cloud
- Palantir Named a Leader in AI/ML Platforms by Independent Research Firm
- Dataiku Joins Deloitte US Data and AI Alliance Ecosystem
- Accenture Acquires Latin American Cloud Data Firm Tenbu
- dbt Labs Announces Formal Launch of its Technology Partner Program
- Western Digital Extends HDD Technology Across Smart Video, NAS and Datacenter Channel Segments
- More This Just In…
Sponsored Partner Content
-
Everyday AI, Extraordinary People
-
Dataiku Makes the Use of Data and AI an Everyday Behavior
-
Data Fabrics as the best path for Enterprise Data Integration
-
Dataiku connects data and doers through Everyday AI
-
Leaving Legacy ETL Behind
-
Streamline Lakehouse Analytics with Matillion and Databricks SQL
-
Close the Information Gap: How to Succeed at Analytics in the Cloud
-
Who wins the hybrid cloud?
Sponsored Whitepapers
Contributors
Featured Events
-
CDAO Government
September 13 @ 1:00 pm - September 14 @ 5:00 pmWashington DC United States -
CDAO APEX West
September 19 @ 8:00 am - September 20 @ 5:00 pm -
DMWF North America
October 5 - October 6Santa Clara CA United States -
CDAO Fall
October 10 - October 12Boston MA United States -
Operational Excellence Week Canada 2022
October 17 - October 20Toronto ON Canada