Follow Datanami:

Tag: cluster

Microsoft Extends Cassandra Rings with NoSQL Database Preview

Mar 2, 2021 |

Microsoft today announced the public preview of a new Cassandra database service for Azure Cosmos DB. The service, called Azure Managed Instance for Apache Cassandra, extends previous work that Microsoft did with Cassandra, and will enable customers to build new Cassandra databases and to build hybrid setups that extend existing Cassandra database rings to Microsoft’s service running in the Azure cloud. Read more…

Rethinking Log Analytics at Cloud Scale

Aug 19, 2020 |

Log analytics is soaring in popularity, and Elasticsearch has captured a lot of that growth. But running a performant Elasticsearch cluster at scale is notoriously difficult. Now a company called ChaosSearch is touting a unique approach to the scalability problem, which uses indexing and query optimization to effectively turn S3 into database that can feed huge amounts of data to upstream systems at a fraction of the cost. Read more…

Google Brings Kubernetes Operator for Spark to GCP

Jan 30, 2019 |

Those looking to run Apache Spark on clusters managed with Kubernetes will be interested in the new Spark operator for Kubernetes unveiled by Google today. The software, which is in beta, will be supported on the Google Cloud Platform. Read more…

MapR Makes Platform More Cloud-Like

Jun 26, 2018 |

MapR Technologies today unveiled a major enhancement to its big data cluster that introduces features commonly found on public cloud platforms, including support for an S3-compatible API, erasure coding, and a data-tiering function that supports external “cheap and deep” Read more…

Snowflake Taps Qubole for Deep Machine Learning in the Cloud

Feb 13, 2018 |

Organizations storing big data in Snowflake’s cloud data warehouse can now run machine learning and deep learning algorithms against that data thanks to a new partnership with Qubole.

The two companies today announced a partnership that will allow Qubole’s big data processing engines, including Apache Spark and TensorFlow, to read and write data to Snowflake’s data warehouse. Read more…

Dr. Elephant Leads the Performance Parade

Jan 12, 2018 |

I started working on big data infrastructure in 2009 when I joined Cloudera, which at the time was a small startup with about 10 engineers. It was a fun place to work. Read more…

Unraveling Hadoop and Spark Performance Mysteries

Sep 13, 2016 |

What do you do when your Spark or Hive job runs like molasses? If you’re like most end-users who lack in-depth technical skills, the answer is “not much.” Now a startup named Unravel Data is working to show you what’s actually going on in the cluster, and provide some configuration recommendations and automatic fixes as well. Read more…

ScaleOut Pushes the Bottleneck in Latest IMDG Update

Dec 2, 2015 |

Each computer architecture, by definition, has a bottleneck that prevents it from performing faster. With the latest release of its in-memory data grid (IMDG) for performing data-parallel analytics, ScaleOut Software has continued to push the bottleneck out of the CPU and put it firmly into the network’s lap. Read more…

Intel Investments Seek to Speed Big Data Rollouts

Aug 27, 2015 |

Chipmaker Intel Corp. is expanding its push into big data with an equity investment and a technology partnership with BlueData, the startup that specializes in generating virtualized big data clusters. Read more…

Sharing Infrastructure: Can Hadoop Play Well With Others?

Mar 28, 2013 |A lot of big data/Hadoop implementations are swimming against the currents of what recent history has taught about large scale computing and the result is a significant amount of waste, says Univa CEO, Gary Tyreman, who believes that Hadoop shared-infrastructure environments are on the rise. Read more…
Datanami