Follow Datanami:

Tag: Hadoop

Building an Open Cloud Data Lake Future

The explosion of data and the need for business agility to leverage that data for competitive advantage are driving a massive surge of data lake innovation. We’ve moved past first-generation on-premises Hadoop-based da Read more…

To Centralize or Not to Centralize Your Data–That Is the Question

Should you strive to centralize your data, or leave it scattered about? It seems like it should be a simple question, but it’s actually a tough one to answer, particularly because it has so many ramifications for how d Read more…

Cloudera Delivers Private Cloud Amid Public Speculation of Sale

Cloudera today announced that it has begun a limited tech preview for the on-premise version of its enterprise data lake, called Cloudera Data Platform (CDP) Private Cloud, which is its first product to support Kubernete Read more…

LinkedIn Open Sources Kube2Hadoop

Hadoop and Kubernetes have fundamentally different ways of authenticating users, exposing a security gap for organizations that want to access HDFS data from Kubernetes-based applications. Thanks to the new Kube2Hadoop t Read more…

Cloudera Prefers Red Hat for Kubernetes, But YARN Not Going Away

When Cloudera ships the on-premise version of its latest Hadoop distribution later this year, it will work with a Kubernetes container orchestration system from Red Hat, the company announced today. But the introduction Read more…

Yahoo’s Vespa Takes a Whack at CORD-19 Data

Verizon Media (formerly Yahoo) is giving its new Vespa search engine a chance to show what it can do against CORD-19, the collection of scholarly articles about COVID-19. The company is inviting the public to try using V Read more…

Rob Bearden Returns to Lead Cloudera’s Second Act

When Cloudera ran into trouble last June following poor financial results, the board jettisoned senior leadership, including CEO Tom Reilly and Mike Olson, its chief strategy officer. Those moves would open up a path for Read more…

Top 12 Datanami Stories of 2019

2019 was an eventful year in the big data space, with enough intersecting story lines to keep a big data watcher enmeshed for hours – if not days -- on end. We did our best to trace the story lines out for you, dear re Read more…

Big Data Predictions: What 2020 Will Bring

With just over a week left on the 2019 calendar, it’s now time for predictions. We’ll run several stories featuring the 2020 predictions of industry experts and observers in the field. It all starts today with what i Read more…

2019: A Big Data Year in Review – Part One

At the beginning of the year, we set out 10 big data trends to watch in 2019. We correctly called some of what unfolded, including a renewed focus on data management and continued rise of Kubernetes (that wasn’t hard t Read more…

Druid Developer Ramps Cloud-Native Approach

A real-time analytics startup founded by the authors of the Apache Druid database have raised additional funding as they ramp up product development based on the open-source data store. Imply said it has raised $30 mi Read more…

Simplifying the Big Data Lake Experiences in the Cloud

The cloud is a hot spot for big data lakes these days, thanks largely to the greater technological simplicity and lower upfront costs of getting started in the public cloud. But as organizations grow their cloud data lak Read more…

Architecting Your Data Lake for Flexibility

The terms 'Big Data' and 'Hadoop' have come to be almost synonymous in today's world of business intelligence and analytics. The promise of easy access to large volumes of heterogeneous data, at low cost compared to trad Read more…

Cloudera Begins New Cloud Era with CDP Launch

Eleven years after its founding, Cloudera fulfilled its name in a big way today with the launch of Cloudera Data Platform (CDP), its new flagship data platform that allows customers to securely manage and govern their da Read more…

On the Origin of Business Insight in a Data-Rich World

Where does business insight come from? In what circumstances does it arise? Does it erupt spontaneously when a critical mass of data has been reached? Or does it only present itself after a methodical analysis has been c Read more…

How Sumo Logic Turns the Event Data Tsunami into Continuous Intelligence

The ongoing explosion of event data can easily overwhelm your ability to make sense of it. Modern applications -- many containerized and running on the cloud -- generate huge streams of log data describing everything tha Read more…

Google Charts Data Stack Overhaul with Kubernetes

Google today announced that its hosted Spark and Hadoop distribution, Cloud Dataproc, can now run on Kubernetes, at least as an alpha. It may not sound particularly groundbreaking. But when you consider the work that wen Read more…

Cloudera Rebounds in Q2, Beats Expectations

After a tough first quarter that led to the ouster of its CEO, Cloudera delivered better financial results in the second quarter, according to figures released this week. While the company is still running at a loss and Read more…

Seeing the Big Picture on Big Data Market Shift

Hidden from view in the "I want to be data-driven" conversation are the nitty-gritty details of how actually to become a data-driven organization. The grand hope is that artificial intelligence, in the guise of machine l Read more…

MinIO Enjoying Role in Emerging Cloud Architecture

In the post-Hadoop world, object storage systems have become the new favored place to park petabytes of data, with Amazon S3 leading the way among cloud providers. But when organizations are looking for on-premise object Read more…

Datanami