Follow Datanami:

Tag: cluster

LinkedIn Open Sources Tech Behind 10,000-Node Hadoop Cluster

LinkedIn last week open sourced DynoYARN, a key piece of technology that allows it to predict how appliacation performance will be impacted as it scales Hadoop to gargantuan proportions, including one 10,000-node cluster Read more…

Microsoft Extends Cassandra Rings with NoSQL Database Preview

Microsoft today announced the public preview of a new Cassandra database service for Azure Cosmos DB. The service, called Azure Managed Instance for Apache Cassandra, extends previous work that Microsoft did with Cassand Read more…

Rethinking Log Analytics at Cloud Scale

Log analytics is soaring in popularity, and Elasticsearch has captured a lot of that growth. But running a performant Elasticsearch cluster at scale is notoriously difficult. Now a company called ChaosSearch is touting a Read more…

Google Brings Kubernetes Operator for Spark to GCP

Those looking to run Apache Spark on clusters managed with Kubernetes will be interested in the new Spark operator for Kubernetes unveiled by Google today. The software, which is in beta, will be supported on the Google Read more…

MapR Makes Platform More Cloud-Like

MapR Technologies today unveiled a major enhancement to its big data cluster that introduces features commonly found on public cloud platforms, including support for an S3-compatible API, erasure coding, and a data-tieri Read more…

Snowflake Taps Qubole for Deep Machine Learning in the Cloud

Organizations storing big data in Snowflake's cloud data warehouse can now run machine learning and deep learning algorithms against that data thanks to a new partnership with Qubole. The two companies today announced Read more…

Dr. Elephant Leads the Performance Parade

I started working on big data infrastructure in 2009 when I joined Cloudera, which at the time was a small startup with about 10 engineers. It was a fun place to work. My colleagues and I got paid to work on open source Read more…

Unraveling Hadoop and Spark Performance Mysteries

What do you do when your Spark or Hive job runs like molasses? If you're like most end-users who lack in-depth technical skills, the answer is "not much." Now a startup named Unravel Data is working to show you what's ac Read more…

ScaleOut Pushes the Bottleneck in Latest IMDG Update

Each computer architecture, by definition, has a bottleneck that prevents it from performing faster. With the latest release of its in-memory data grid (IMDG) for performing data-parallel analytics, ScaleOut Software has Read more…

Intel Investments Seek to Speed Big Data Rollouts

Chipmaker Intel Corp. is expanding its push into big data with an equity investment and a technology partnership with BlueData, the startup that specializes in generating virtualized big data clusters. Along with the Read more…

Sharing Infrastructure: Can Hadoop Play Well With Others?

A lot of big data/Hadoop implementations are swimming against the currents of what recent history has taught about large scale computing and the result is a significant amount of waste, says Univa CEO, Gary Tyreman, who believes that Hadoop shared-infrastructure environments are on the rise. Read more…

How Ford is Putting Hadoop Pedal to the Metal

Ford Motor, like any other company at its scale, has been contending with a slew of big data problems--a matter that is being complicated by ever-growing additions to the data feed, from its own internal feeds to the terabytes of vehicle-fed machine data. We talked in depth with Ford's data science lead about the company's tough vendor and technology decisions around Hadoop and where the real value of such.... Read more…

Self-Service Data Mining, Hold the Bottlenecks

Business line analysts are too often stuck in gyrations between the database admins and the database itself, says Platfora CEO, Ben Werther.  The legacy database model will have to be replaced with more efficient processes, argues Werther, who believe his company's scale-out, in-memory solution might be what the data doctor ordered. Read more…

Expedia Adds Notes to Big Data Symphony

This week at the SAS Premier Business Leadership Summit, we spoke to Joe Megibow, VP and General Manager of Expedia.com, a travel giant that spun out of Microsoft that has had to contend with the process of evolution of 16 years of hardware and software investments have led to... Read more…

MapR Update Extends Hadoop Accessibility

This week MapR Technologies, announced a new version of its Hadoop distribution, including several bug fixes as well as new accessibility options. Among these are the ability to try a virtual MapR cluster as well as expanded application opportunities. Read more…

Elastic MapReduce Lead Traces Big Data Clouds

At the AWS Gov 2011 summit, general manager for Amazon Web Services' Elastic Map Reduce offering, Peter Sirota provided some prime use cases for its hosted Hadoop service and highlighted innovative ways MapReduce is changing commerce. Read more…

Datanami