Follow Datanami:

Tag: impala

Cloudera Bringing Impala to AWS Cloud

Apache Impala, the SQL-based analytical database that originated at Cloudera, will soon be available as a managed service on the Amazon Web Services cloud, the Hadoop software distributor announced today at AWS Re:Invent Read more…

Application Management Gets Unraveled

It's all about enterprise applications, we are told, with big data apps among the most critical. Hence, a growing focus on managing application performance has fueled new monitoring approaches such as operational data sc Read more…

Hadoop Has Failed Us, Tech Experts Say

The Hadoop dream of unifying data and compute in a distributed manner has all but failed in a smoking heap of cost and complexity, according to technology experts and executives who spoke to Datanami. "I can't find a Read more…

Big Performance Gains Seen Across SQL-on-Hadoop Engines

You can't really go wrong these days when it comes to picking a SQL-on-Hadoop engine. As long as you stick to the mainstream open source products like Hive, Impala, Spark SQL, and Presto, your SQL queries are likely runn Read more…

Arrow Aims to Defrag Big In-Memory Analytics

You probably haven't heard of Apache Arrow yet. But judging by the people behind this in-memory columnar technology and the speed at which it just became a top-level project at the Apache Software Foundation, you're goni Read more…

Picking the Right SQL-on-Hadoop Tool for the Job

SQL is, arguably, the biggest workload many organizations run on their Hadoop clusters. And there's good reason why: The combination of a familiar interface (SQL) along with a modern computing architecture (Hadoop) enabl Read more…

Inside Yellow Pages’ SQL-on-Hadoop Journey

Like many national companies, scale is important for Yellow Pages, or YP as the company is now known. So when the Georgia-based local marketing firm set out to find a suitable SQL engine to deliver real-time analytics a Read more…

Three Ways Zoomdata Makes Big Data Pop

When it comes to big data visualization tools, there's no shortage of players. Tableau, Qlik, Spotfire, and Microstrategy are established incumbents with big followings. But there's a fresh crop of visualization tools ma Read more…

Actian Claims ‘Permanent Performance Advantage’ with SQL-on-Hadoop Tool

The SQL-on-Hadoop sweepstakes are by no means over. What's been dubbed the "gateway drug" for Hadoop is just starting to gain traction. But according to Actian, its SQL-on-Hadoop offering, dubbed Vortex, is out to an ear Read more…

How Advances in SQL on Hadoop Are Democratizing Big Data–Part 2

In a previous article, we discussed several key advances in SQL on Hadoop that are making Big Data capabilities increasingly accessible to analytics organizations. While SQL democratizes Big Data by leveraging convention Read more…

How Advances in SQL on Hadoop Are Democratizing Big Data–Part 1

September 2014 marked the anniversary of Edgar F. Codd’s 1969 introduction of “A Relational Model of Data for Large Shared Data Banks”, which is a compellation of research and theories that ultimately provided the Read more…

Cloudera Shuffles Its Product Deck in Pursuit of ‘Data Hub’ Strategy

Cloudera today unveiled a new three-tiered product packaging strategy for its Hadoop software, including a new high-end "Data Hub" Edition designed to help it compete against the likes of IBM and Pivotal. The company also announced the availability of the Spark stream processing and machine learning engine. Read more…

White-Glove Hadoop Cloud Service Launched by Altiscale

Hadoop users that need more handholding than Amazon can provide may want to check out the new Hadoop as a Service (HaaS) offering launched today by Altiscale. Founded by veteran technologists from Yahoo and AltaVista, the company intends to provide a high-touch experience for running and--more importantly--optimizing production Hadoop workloads in a private cloud. Read more…

Zooming Through Historical Data with Streaming Micro Queries

Stream processing engines, such as Storm and S4, are commonly used to analyze real-time data as it flows into an organization. But did you know you can use this technology to analyze historical data too? A company called ZoomData recently showed how. Read more…

Finding Big Data Treasure in the Cloud

Heading into 2014, one of the big data trends that will intensify is the transition toward end-to-end data analytic services hosted in the cloud. One of the promising big data cloud services is Treasure Data, a Silicon Valley company that offers an interesting mix of MapReduce, columnar databases, and intelligent agent technology that's aimed at helping clients get a quick return on their big data investments. Read more…

Cloudera Releases Impala Into the Wild

The week, Cloudera announced that Impala, their SQL on Hadoop initiative, is moving out of public beta and into general availability. Datanami spoke with Cloudera to get an overview of the state of Impala, how they see it stacking up in the competitive landscape, and where the space will move in the near future. Read more…

Self-Service Data Mining, Hold the Bottlenecks

Business line analysts are too often stuck in gyrations between the database admins and the database itself, says Platfora CEO, Ben Werther.  The legacy database model will have to be replaced with more efficient processes, argues Werther, who believe his company's scale-out, in-memory solution might be what the data doctor ordered. Read more…

Visualizing Big Data’s Key Partner

Visualization is vital to managing big data. The proper charts, graphs, and other representations of large datasets can let business users see trends they would not know existed otherwise. And after having a busy week at Hadoop World, Tableau appears to be on the forefront of the big data visualization market. Read more…

Cloudera Runs Real-Time with Impala

This week at Strata Hadoop World we sat down Cloudera CEO, Mike Olson, to talk about the company's recent open sourced effort to add a real-time aspect to Hadoop, thus taking it beyond its batch-only roots. We also hit on emerging issues in the ecosystem, including how their competitive advantages stack up against others in the... Read more…

Datanami