Vendors » Cloudera

Features

Apache Beam’s Ambitious Goal: Unify Big Data Development

Apr 22, 2016 |

If you’re tired of using multiple technologies to accomplish various big data tasks, you may want to consider Apache Beam, a new distributed processing tool from Google that’s now incubating at the ASF. One of the challenges of big data development is the need to use lots of different technologies, frameworks, APIs, languages, and software development kits. Depending on what you’re trying to do–and where you’re trying to do it–you may choose MapReduce for batch processing, Apache Spark SQL for Read more…

Can Big Data Deliver on the Huge Expectations of Precision Medicine?

Apr 18, 2016 |

Big data can do a lot of things. But can it allow doctors to create drugs that are tailor-made to your genes, reduce the cost of healthcare for a billion people, and let us all live to 150? Only time will tell, but there are certainly those in the industry, like Cloudera co-founder Mike Olson, who are bullish on big data’s potential to create a healthcare renaissance through precision medicine. There are many ways big data technologies are creeping into Read more…

Reporter’s Notebook: 6 Key Takeaways from Strata + Hadoop World

Apr 5, 2016 |

The big data ecosystem was on full display at last week’s Strata + Hadoop World conference in San Jose. At the ripe old age of 10, Hadoop is still the driving force, but newer frameworks like Spark and Kafka are gaining steam. Here are some of the top trends your Datanami editor pulled from the show based on observations and discussions with attendees and vendors. Let’s start with the biggest news from Strata, which was the rise of Kafka and Read more…

Cutting On Random Digital Mutations and Peak Hadoop

Apr 1, 2016 |

In a wide-ranging Strata + Hadoop World talk on Wednesday that reminds us why we like Doug Cutting so much, the father of Hadoop riffed on the evolution of big data tech, the power of open source, the promise of Flink, and the possibility of “peak Hadoop” at the ripe old age of 10. “It’s easy to say it’s all hype,” the Cloudera chief architect and Apache Hadoop co-founder said during a presentation on the next 10 years of Hadoop Read more…

ODPi Defines Hadoop Runtime Spec; Operations Up Next

Mar 28, 2016 |

Today the ODPi issued the first set of documents that describes a standard distribution of basic runtime components for Hadoop, including YARN, HDFS, and MapReduce. Going forward, the organization is preparing a management specification for Hadoop as it considers which Hadoop problem area it will tackle next. The ODPi was founded a year ago  on the eve of the Spring Strata + Hadoop World conference as the Open Data Platform initiative to help reign in some of the complexity that’s impacting Read more…

News In Brief

Resolving Hadoop’s Storage Gap

Mar 28, 2016 |

Over the past several years, the Hadoop ecosystem has made great strides in its real-time access capabilities, narrowing the gap compared to traditional database technologies. With systems such as Impala and Spark, analysts can now run complex queries or jobs over large datasets within a matter of seconds. With systems such as Apache HBase and Apache Phoenix, applications can achieve millisecond-scale random access to arbitrarily-sized datasets. These improvements have allowed Hadoop to expand the set of applications for which it Read more…

Data Warehouse Market Ripe for Disruption, Gartner Says

Mar 14, 2016 |

While mega-vendors with names like IBM (NYSE: IBM) and Oracle (NYSE: ORCL) continue to lead the data warehousing space, shifts in the market are creating opportunities for smaller vendors to innovate in areas like cloud deployments and streaming data, Gartner says in its latest Magic Quadrant report. Disruption is accelerating in the market for data warehousing solutions, Gartner says in its February report. New requirements–such as the need to store and analyze an increasingly diverse array of data types–are leading Read more…

Cloudera Targets Hadoop SQL Workloads with CDH 5.5

Nov 19, 2015 |

Cloudera is aiming to improve how SQL workloads run on Hadoop with today’s release of Cloudera Enterprise 5.5, which brings support for Spark SQL, support for JSON data types in Impala, better security on Impala and Hive, and the beta of a new SQL workload optimization tool. SQL has been lingua franca for accessing and manipulating data within databases for decades, and so it should come as surprise that SQL is big on Hadoop, even though it’s not a database, Read more…

BlueTalon Delivers Fine-Grain Protection of HDFS Data

Sep 29, 2015 |

BlueTalon today announced that its big data security software now supports the enforcement of fine-grained authorization policies and masking of HDFS data. This will enable customers to control access to Hadoop data using the same tool they use to control access to enterprise data warehouses and relational databases, the company says. One of the problems with implementing data-centric security in a big data world is the fact that data lives in multiple places. While many organizations are building big data Read more…

Inside Platfora’s Transition to Apache Spark

Sep 22, 2015 |

Platfora has embraced Apache Spark as the underlying data processing engine in version 5 of Big Data Discovery, which it announced today. But the company hasn’t completely gotten rid of MapReduce in its Hadoop application, and the reason may surprise you. Platfora ‘s Big Data Discovery combines many of the steps involved in analyzing big data–from data cleansing and transformation to data visualization–into one solution that runs natively on  Hadoop. Platfora was one of the first analytic tool vendors to target Read more…

This Just In

Cloudera and MongoDB Join Forces

Apr 29, 2014 |

Cloudera, the leader in enterprise analytic data management powered by Apache Hadoop, and MongoDB, the database for modern applications, today announced a strategic partnership that will transform how organizations approach big data.

Saint Peter’s University Becomes a Member of Cloudera’s Academic Partnership

Apr 25, 2014 |

Saint Peter’s University today announced its official membership in Cloudera’s Academic Partnership. Cloudera is the leader in analytic big data management powered by Apache Hadoop, an essential tool for working with big data. The agreement will provide the master’s program in data science with industry-leading resources to help streamline and accelerate the adoption of Hadoop. Cloudera has developed the Academic Partnership program to help bridge the big data talent gap and to aid in the development of the next generation of data scientists.

Diyotta DI Suite Now Cloudera Enterprise 5 Certified

Apr 18, 2014 |

Diyotta, a leading big data integration company, today announced that their Big Data Integration platform, Diyotta DI Suite is now certified on Cloudera Enterprise 5, Cloudera’s next generation enterprise analytic platform powered by Apache Hadoop. Diyotta’s engineering team had been working with Cloudera’s partner engineering team to show the true commitment to the company’s technology innovation by certifying on Cloudera Enterprise 5 before it was GA.

Cloudera Introduces Big Data Training Course

Apr 16, 2014 |

Cloudera, the leader in enterprise analytic data management powered by Apache Hadoop, today announced the industry’s first hands-on Big Data training course that teaches developers to build end-to-end applications for a Hadoop-based enterprise data hub (EDH). The new four-day course, called Designing and Building Big Data Applications, prepares data professionals to use an EDH’s full capabilities to build custom, converged applications that enable their organizations to achieve greater value from data and solve real-world problems.

Jaspersoft Teams with Cloudera for Hadoop Reporting and Analytics

Apr 15, 2014 |

Jaspersoft, the Intelligence Inside applications and business processes, today announced product integration with Cloudera, the leader in enterprise analytic data management powered by Apache Hadoop, to extend the company’s leadership in reporting and analytics for Big Data. With this integration, Jaspersoft becomes one of the first business intelligence (BI) providers to support both Cloudera 5 Enterprise Data Hub and Cloudera Impala for Hadoop analytics.