Tag: Hive

Hortonworks Touts Hive Speedup, ACID to Prevent ‘Dirty Reads’

Apr 4, 2017

If you’re considering using Hadoop for SQL-based analytics and BI, you’ll be interested in the latest news out of Hortonworks, which today unveiled a new version of its flagship data platform that boasts a fast new release of Apache Hive, as well as a new ACID merge function that can prevent “dirty reads.” Read more…

Hadoop Has Failed Us, Tech Experts Say

Mar 13, 2017

The Hadoop dream of unifying data and compute in a distributed manner has all but failed in a smoking heap of cost and complexity, according to technology experts and executives who spoke to Datanami. Read more…

How PRGX Is Making Its AS/400-to-Hadoop Migration Work

Oct 28, 2016

Many companies are using big data technologies to build new applications that can take advantage of emerging data streams, like sensor data or social media. It’s not often you see established back-office applications being migrated to Hadoop, but that’s just what PRGX is doing with a trusty old AS/400 application. Read more…

Big Performance Gains Seen Across SQL-on-Hadoop Engines

Oct 18, 2016

You can’t really go wrong these days when it comes to picking a SQL-on-Hadoop engine. As long as you stick to the mainstream open source products like Hive, Impala, Spark SQL, and Presto, your SQL queries are likely running 2-4x faster than they did earlier this year, without changing your queries or buying more hardware. Read more…

ODPi Tackles Hive with Latest Hadoop Runtime Spec

Sep 27, 2016

ODPi today unveiled the second major release of its Runtime Specification, which is aimed at setting a standard for Hadoop components to ensure greater interoperability among distributions and third-party products. New additions to the spec include Apache Hive and the Hadoop Compatible File System (HCFS). Read more…

From Hadoop to Zeta: Inside MapR’s Convergence Conversion

Mar 8, 2016

If you’re a regular Datanami reader, you likely know MapR Technologies as a Hadoop distributor, one of the three “pure play” providers alongside Hortonworks and Cloudera. But with its integrated NoSQL database, a modified distributed file system, and an integrated stream processing engine that shipped today (key elements of its so-called Zeta Architecture), it’s become increasingly difficult to put MapR in the Hadoop bucket. Read more…

SQL-on-Hadoop Test: Each Engine Has ‘Sweet Spots’

Feb 25, 2016

Business intelligence has emerged as the top workload for Hadoop, ahead of data science and ETL. That has prompted benchmarkers to zero in on the performance of leading SQL-on-Hadoop engines for BI use cases. Read more…

Google Releases Cloud Processor For Hadoop, Spark

Feb 24, 2016

Google took the wraps off its managed Apache Hadoop and Spark service this week, saying its cloud data processing platform is intended to reduce the cost and ease the management of processing big datasets. Read more…

Distributed Computing Tops List of Hottest Job Skills

Jan 27, 2016

If you have cloud and distributed computing skills, your job prospects for 2016 are golden. That’s because those particular job skills, which parallel the rise of Hadoop and other distributed computing frameworks, topped a LinkedIn analysis of the top 25 skills to help you find a new job this year. Read more…

Picking the Right SQL-on-Hadoop Tool for the Job

Jan 13, 2016

SQL is, arguably, the biggest workload many organizations run on their Hadoop clusters. And there’s good reason why: The combination of a familiar interface (SQL) along with a modern computing architecture (Hadoop) enables people to manipulate and query data in new and powerful ways. Read more…