Cassandra Gets Monitoring, Performance Upgrades
The latest beta release of the Apache Cassandra is designed to hit the ground running as the NoSQL database moves steadily to the cloud to provide managed services in production deployments.
Cassandra 4.0 released on Monday (July 20) is the first major update of the database since 2017, incorporating more than 1,000 bug fixes and extensive “battle” testing to improve performance in production, making it “the most stable release ever,” maintainers asserted. Performance testing included running Cassandra on clusters as large as 1,000 nodes using an array of enterprise use cases.
Cassandra promoters note that hyper-scalers such as Apple (NASDAQ: AAPL) have deployed the database in production with more than 75,000 nodes, illustrating its ability to scale.
Among the new features incorporated into version 4.0 is the ability to stream data between nodes during scaling operations such as adding a new node or datacenter during peak traffic times.
It also includes the new data access controls operating on a “per datacenter basis.” In one scenario, operators of datacenters located in the Europe and the United States could configure Cassandra to allow access to a single datacenter using a “network authorizer” feature. Data governance features are gaining traction as European authorities crack down on the cross-border movement of personal user data.
Monitoring tools are also emphasized in the Cassandra latest release. Previously, open source tools from key code contributors such as DataStax and Instaclustr were the primary tools for observing Cassandra clusters.
“Constant monitoring of key performance indicators such as latency, disk usage, and throughput is critical to maintaining an optimal deployment,” Justin Cameron, a senior software engineer at Instaclustr, wrote last year in Datanami.
Around-the-clock “monitoring is necessary because both internal and external changes to Cassandra usage patterns are very common,” Cameron added.
The latest version allows users to selectively monitor system metrics and configuration settings via a feature called Virtual Tables. Other tools allow users to record and replay production workloads to analyze performance.
Along with DataStax, the data platform developer behind Cassandra, key code contributors to the 4.0 version include Amazon Web Services (NASDAQ: AMZN) and Instaclustr.
The Cassandra 4.0 better release is here.
April 9, 2021
- Alteryx Global Inspire 2021 Conference to Showcase New Products in Analytics and Data Science
- Trifacta Announces New Community and Certification Programs for Data Workers
- Gary Hagmueller Joins Dgraph Labs as CEO
- Elastic and Confluent Partner to Develop Enhanced Experience for Kafka and Elasticsearch Users
- New UK Fellowship Programme Will Fund 6 Fellows in AI and Data Science to Support Life Science Research
April 8, 2021
- Full Agenda Released for TigerGraph’s Graph + AI Summit 2021
- AWS Announces General Availability of Amazon Lookout for Equipment
- Collibra and EVO Banco Boost Data-Driven Digital Banking
- Swarm64 and EDB Partner to Extend EDB Postgres Advanced with Faster Query Performance
- Alation Delivers Cloud-Based Platform for Data Intelligence
- Comet Raises $13M Series A for Model Development and Monitoring
- Graphistry and Pavilion Partner to Accelerate Graph Analytics Using RAPIDS and NVIDIA GPUs
- New Services in SAP HANA Cloud Lower TCO in Data-Intensive, Highly Regulated Industries
- Snorkel AI Launches Application Studio, Raises $35M Led by Lightspeed Venture Partners
- Streamlit Transforms How Data Scientists Share Data, Raises $35M Led by Sequoia
April 7, 2021
- OmniSci Announces Opportunities to Accelerate Geospatial, GPU Database Knowledge At GTC 2021
- MinIO Enables IT to Manage Kubernetes-native Object Storage
- Ataccama Report: Nearly 8 in 10 Businesses Struggle with Data Quality
- MANTA Partners with Neo4j to Provide Enhanced Graph Technology for Data Pipeline Analysis
- Qlik Collaborates with AWS to Accelerate Cloud Analytics with SAP Data
Most Read Features
- Big Data File Formats Demystified
- Synthetic Data: Sometimes Better Than the Real Thing
- Experts Disagree on the Utility of Large Language Models
- A ‘Glut’ of Innovation Spotted in Data Science and ML Platforms
- He Couldn’t Beat Teradata. Now He’s Its CEO
- Is Python Strangling R to Death?
- Why Data Science Is Still a Top Job
- Who’s Winning In the $17B AIOps and Observability Market
- Big Data Predictions: What 2020 Will Bring
- What’s the Difference Between AI, ML, Deep Learning, and Active Learning?
- More Features…
Most Read News In Brief
- Can PHP Script a Big Data Comeback?
- Data Prep Still Dominates Data Scientists’ Time, Survey Finds
- AWS Adds Explainability to SageMaker
- Global DataSphere to Hit 175 Zettabytes by 2025, IDC Says
- AWS Tackles Real-Time Data Transformation with S3 Object Lambda
- The AI Inside NASA’s Latest Mars Rover, Perseverance
- The Union of Salesforce, Tableau Yields Hybrid ‘Business Science’
- Databricks Edges Closer to IPO with $1B Round
- Data Salaries Get a COVID Bump
- Informatica Accelerates DataOps with Spark, GPUs
- More News In Brief…
Most Read This Just In
- Moody’s Analytics Wins Award for Best Use of AI in Banking or FinTech
- Aiven Raises $100M Series C to Expand Global Open Source Innovation
- Alluxio Advances Analytics and AI with NVIDIA Accelerated Computing
- GrafanaCONline Returns June 7-17, CFP Is Open Now
- AWS Announced Strategic Partnership with Hugging Face NLP Startup
- y42 Raises $2.9M to Provide a Scalable and Affordable Data Stack to Companies of All Sizes
- ThoughtSpot Acquires SeekWell to Operationalize Analytics, Push Cloud Data Insights to Business Apps
- Trifacta Announces Industry’s First Data Engineering Cloud
- PrivaceraCloud’s New Integrations Automate Identity Management Across Multi and Hybrid-Cloud Environments
- New Tool Increases Transparency and Understanding into Machine Behaviors
- More This Just In…