Tag: Hadoop

Enforcing Hadoop SLAs in a Big YARN World

Jul 23, 2014 |

The Apache Hadoop community has done a truly amazing job developing a scalable and versatile platform for big data analytic workloads. And with the recent introduction of YARN in Hadoop 2, we’re now able to run multiple analytic engines on our clusters simultaneously. Unfortunately, the prospect for resource contention has also gone up, and that will likely increase demand for service level agreement (SLA) enforcement. YARN made its big introduction just as companies started to move their Hadoop deployments out Read more…

Teradata Acquires Revelytix, Hadapt

Jul 22, 2014 |

Teradata Corp., the analytic data platform vendor, said it has expanded its big data portfolio with a pair of recent acquisitions. Teradata, based in Dayton, Ohio, said July 22 it has acquired the assets of Revelytix, an information management specialist, along with big data technologists and intellectual property from Hadapt. The Revelytix deal was completed on July 16; the Hadapt acquisition on July 17, Teradata said. Terms of the two acquisitions were not disclosed, the company said, because they are Read more…

HP Throws Trafodion Hat into OLTP Hadoop Ring

Jul 14, 2014 |

Hewlett-Packard last month quietly unveiled Trafodion, an ANSI-compliant relational SQL database that’s now available as an open source productd. With two decades of development at HP and the new capability to run on top of HBase, Trafodion could provide a big boost to efforts to run transactional workloads on Hadoop. The database technology behind Trafodion (which is Welsh for “transaction”) has been around for a long time at Hewlett-Packard, but it was dancing perilously close to the waste bin of Read more…

Where Does Spark Go From Here?

Jul 11, 2014 |

The excitement behind Apache Spark reached an apex last week during the 2014 Spark Summit put on by Databricks, the company behind the in-memory analytics phenomenon. With a large community of users and growing support from software vendors, the future for Spark certainly appears bright. But there’s a large amount of work ahead to fulfill the promise of Spark, including hardening various components. Providing an easier-to-use alternative to MapReduce is the first use case for Spark, which is said to Read more…

How Hadoop is Remaking Travel and Expense Reporting at Concur

Jul 7, 2014 |

If you’re like most people, filling out an expense report ranks right up there with getting a haircut or visiting the dentist. But thanks to the advanced analytics work that Concur is doing with Cloudera and Hadoop, the expense report process is becoming not only more enjoyable for business travelers, but more helpful too. Concur started working with Hadoop about three years ago as a place to combine and “munge” all the disparate data sources that the company deals with. Read more…

Data Virtualization and Big Data Stacks—Three Common Use Cases

Jul 2, 2014 |

There is a critical need among businesses today for a platform that enables them to access any kind of data from anywhere it lives without necessarily moving it to a central location like a data warehouse. In fact, every new innovation in technology or business is driven by better use of increasingly complex and disparate data. Data virtualization represents a straightforward way to deal with the complexity, heterogeneity, and volume of information coming at us, while meeting the needs of Read more…

See Spark Run on NoSQL, DataStax Says

Jun 30, 2014 |

DataStax today announced that Apache Spark is included in the latest release of its NoSQL databases, including open source Cassandra and DataStax Enterprise version 4.5. Those databases now include the in-memory Spark tools, thereby giving customers a new option for running analytic workloads on transactional data. Apache Spark has garnered loads of attention as a potent analytic toolset for Hadoop. Developers are eager for Spark, which offers a single API for accessing a treasure trove of machine learning, graph, SQL, Read more…

MapR Announces $110M Investment Led by Google

Jun 30, 2014 |

Hadoop distributor MapR Technologies said today it has completed a $110 million financing round led by heavy hitters Google Capital and the venture capital arm of mobile chip specialist Qualcomm. San Jose-based MapR said Google Capital led the equity-financing round with $80 million. Also participating was Qualcomm Ventures, along with existing investors Lightspeed Venture Partners, Mayfield Fund, New Enterprise Associates and Redpoint Ventures. MapR also announced completion of a $30 million line of credit led by Silicon Valley Bank. The financing Read more…

Apache Spark Gets YARN Approval from Hortonworks

Jun 26, 2014 |

Hortonworks today announced that Apache Spark is certified to work with YARN, the quarterback calling plays in next-gen Hadoop v2 clusters. The YARN stamp of approval clears the way for Hortonworks to fully support Spark for machine learning analytic workloads in a new release of HDP later this year. The prospect of running the in-memory Spark framework on Hadoop has garnered a lot of interest over the past year and a half, in particular as a faster and easier-to-program replacement Read more…

Google Re-Imagines MapReduce, Launches DataFlow

Jun 25, 2014 |

It’s well known in the industry that more than 10 years ago Google invented MapReduce, the technology at the heart of first-generation Hadoop. It’s less well known that Google moved away from MapReduce several years ago. Today at its Google I/O 2014 conference, the Web giant unveiled a possible successor to MapReduce called Dataflow, which it’s selling through its hosted cloud service. Google Cloud Dataflow is a managed service for creating data pipelines that ingest, transform, and analyze massive amounts Read more…