Too many big data initiatives are science projects that take months of effort, risk failure and require highly trained data scientists with scarce skills. According to a CSC survey, 55 percent of big data projects aren’t completed and many others fall short of their objectives.Read more...
Vendors » Cloudera
The excitement behind Apache Spark reached an apex last week during the 2014 Spark Summit put on by Databricks, the company behind the in-memory analytics phenomenon. With a large community of users and growing support from software vendors, the future for Spark certainly appears bright. But there’s a large amount of work ahead to fulfill the promise of Spark, including hardening various components. Providing an easier-to-use alternative to MapReduce is the first use case for Spark, which is said to Read more…
If you’re like most people, filling out an expense report ranks right up there with getting a haircut or visiting the dentist. But thanks to the advanced analytics work that Concur is doing with Cloudera and Hadoop, the expense report process is becoming not only more enjoyable for business travelers, but more helpful too. Concur started working with Hadoop about three years ago as a place to combine and “munge” all the disparate data sources that the company deals with. Read more…
Hortonworks today announced that Apache Spark is certified to work with YARN, the quarterback calling plays in next-gen Hadoop v2 clusters. The YARN stamp of approval clears the way for Hortonworks to fully support Spark for machine learning analytic workloads in a new release of HDP later this year. The prospect of running the in-memory Spark framework on Hadoop has garnered a lot of interest over the past year and a half, in particular as a faster and easier-to-program replacement Read more…
Hadoop is quickly gaining momentum as the foundation for a new class of massively parallel applications that works against petabytes of data. But as customers grow their test clusters beyond a handful of nodes, and with the bad taste of the Target breach still hanging around, the lack of security controls threatens to derail large production Hadoop deployments. Hadoop comes from open source, and there’s no shortage of open source Apache projects aimed at building security functionality into the core Read more…
The phrase “traditional Hadoop” was heard early and often at this week’s 2014 Hadoop Summit. While first-generation Hadoop technologies unlocked previously unseen potential in big data sets, it pales in comparison to what we’ll see with second-gen Hadoop, we’re told. Hope springs eternal in Silicon Valley, where tech’s brightest minds make huge fortunes changing the world and “having the courage to re-imagine the future” is a personal trait as common as liking children and small animals. So when you bring Read more…
News In Brief
At first glance, the partnership that Cloudera and MongoDB unveiled today is a bit of a head scratcher. While the two companies are arguably the biggest software vendors in the nascent space, they swim in opposite ends of the big data pool. It turns out, that’s exactly why the companies felt they needed to work together.
Cloudera’s motto is “Ask bigger questions.” In light of today’s news regarding a mind-boggling $900 million funding round and a $4.1 billion valuation, one big question rises to the surface: What in the world is Cloudera going to do with all that money?
Intel today announced it will cease development of its own Hadoop distribution and instead will invest in and promote Cloudera’s distribution to its customers. The surprise move confirms a rumor that surfaced last week that had Intel taking a significant stake in Cloudera during its latest round of venture funding.
Cloudera today announced an additional $160 million in funding, bringing the Hadoop distributor’s total venture haul to $300 million. The software company now values itself at more than $2 billion, according to Bloomberg, which also reported that Hadoop rival Intel has invested in the firm.
Not long ago, the rules for what constituted a data warehouse were fairly well defined. The schema was fixed, you could say, and was based primarily on relational database technology designed to process structured data. My, how times have changed. Last week, Gartner for the first time accepted non-relational technologies–including those based on Hadoop and NoSQL–in its annual Magic Quadrant for Data Warehouses report.
This Just In
Cloudera, the leader in enterprise analytic data management powered by Apache Hadoop, and MongoDB, the database for modern applications, today announced a strategic partnership that will transform how organizations approach big data.
Saint Peter’s University today announced its official membership in Cloudera’s Academic Partnership. Cloudera is the leader in analytic big data management powered by Apache Hadoop, an essential tool for working with big data. The agreement will provide the master’s program in data science with industry-leading resources to help streamline and accelerate the adoption of Hadoop. Cloudera has developed the Academic Partnership program to help bridge the big data talent gap and to aid in the development of the next generation of data scientists.
Diyotta, a leading big data integration company, today announced that their Big Data Integration platform, Diyotta DI Suite is now certified on Cloudera Enterprise 5, Cloudera’s next generation enterprise analytic platform powered by Apache Hadoop. Diyotta’s engineering team had been working with Cloudera’s partner engineering team to show the true commitment to the company’s technology innovation by certifying on Cloudera Enterprise 5 before it was GA.
Cloudera, the leader in enterprise analytic data management powered by Apache Hadoop, today announced the industry’s first hands-on Big Data training course that teaches developers to build end-to-end applications for a Hadoop-based enterprise data hub (EDH). The new four-day course, called Designing and Building Big Data Applications, prepares data professionals to use an EDH’s full capabilities to build custom, converged applications that enable their organizations to achieve greater value from data and solve real-world problems.
Jaspersoft, the Intelligence Inside applications and business processes, today announced product integration with Cloudera, the leader in enterprise analytic data management powered by Apache Hadoop, to extend the company’s leadership in reporting and analytics for Big Data. With this integration, Jaspersoft becomes one of the first business intelligence (BI) providers to support both Cloudera 5 Enterprise Data Hub and Cloudera Impala for Hadoop analytics.