Follow Datanami:

Tag: HDFS

How Facebook Accelerates SQL at Extreme Scale

Serving SQL queries on a petabyte of data is one thing, but delivering it at Facebook’s scale is something else entirely. Earlier this year, the social media giant implemented the Alluxio distributed file system into i Read more…

LinkedIn Open Sources Kube2Hadoop

Hadoop and Kubernetes have fundamentally different ways of authenticating users, exposing a security gap for organizations that want to access HDFS data from Kubernetes-based applications. Thanks to the new Kube2Hadoop t Read more…

Rob Bearden Returns to Lead Cloudera’s Second Act

When Cloudera ran into trouble last June following poor financial results, the board jettisoned senior leadership, including CEO Tom Reilly and Mike Olson, its chief strategy officer. Those moves would open up a path for Read more…

Big Data Predictions: What 2020 Will Bring

With just over a week left on the 2019 calendar, it’s now time for predictions. We’ll run several stories featuring the 2020 predictions of industry experts and observers in the field. It all starts today with what i Read more…

2019: A Big Data Year in Review – Part One

At the beginning of the year, we set out 10 big data trends to watch in 2019. We correctly called some of what unfolded, including a renewed focus on data management and continued rise of Kubernetes (that wasn’t hard t Read more…

Cloudera Begins New Cloud Era with CDP Launch

Eleven years after its founding, Cloudera fulfilled its name in a big way today with the launch of Cloudera Data Platform (CDP), its new flagship data platform that allows customers to securely manage and govern their da Read more…

Seeing the Big Picture on Big Data Market Shift

Hidden from view in the "I want to be data-driven" conversation are the nitty-gritty details of how actually to become a data-driven organization. The grand hope is that artificial intelligence, in the guise of machine l Read more…

Re-Imagining Big Data in a Post-Hadoop World

In the big data battle for architectural supremacy, the cloud is clearly winning and Hadoop is clearly losing. Customers are shying away from investing in monolithic Hadoop clusters in favor of more nimble (if not less e Read more…

Databricks Donates Delta Code to Open Source

Databricks today announced that it's open sourcing the code behind Databricks Delta, the Apache Spark-based product it designed to help keep data neat and clean as it flows from sources into its cloud-based analytics env Read more…

Intel Builds Analytics, Database Use Cases for Optane

Intel offered a list of use cases for its Optane DC persistent memory technology during a company event this week, including Twitter’s effort to scale its Hadoop clusters using Optane and SAP HANA’s database improvem Read more…

Can On-Prem S3 Compete with HDFS for Analytic Workloads?

In the battle for big data storage supremacy, Hadoop is still in the running. It may no longer be the 800-lb gorilla, but the demonstrated scalability of the Hadoop Distributed File System (HDFS) makes it a potent conten Read more…

Hadoop Gets Improved Hooks to Cloud, Deep Learning

Organizations that adopt the latest version 3.2 release of Apache Hadoop will get new integration hooks into the AWS and Azure clouds, as well as access to a new deep learning project called Hadoop Submarine. Hadoop m Read more…

Hadoop 3.0 Likely to Arrive Before Christmas

It's looking like big data developers will get an early holiday present as work on Hadoop version 3.0 nears completion. And while Hadoop 3.0 brings compelling new features, including a 50% increase in capacity and upward Read more…

Committers Talk Hadoop 3 at Apache Big Data

The upcoming delivery of Apache Hadoop 3 later this year will bring big changes to how customers store and process data on clusters. Here at the annual Apache Big Data show in Miami, Florida, a pair of Hadoop project com Read more…

ODPi Tackles Hive with Latest Hadoop Runtime Spec

ODPi today unveiled the second major release of its Runtime Specification that's geared at setting a standard for Hadoop components to ensure greater interoperability among distributions and third-party products. New add Read more…

Investments in Fast Data Analytics Surge

Companies are quickly ramping up their investments fast data analytics and real-time stream processing frameworks and lowering spending on batch technologies in an attempt to get on top of growing data volumes and veloci Read more…

Hadoop Past, Present, and Future

Every few years the technology industry seems to be consumed with a shiny new object that gets hyped far beyond reality. At worst, the inevitable bursting of the hype bubble leads to the disappearance of the technology f Read more…

Resolving Hadoop’s Storage Gap

Over the past several years, the Hadoop ecosystem has made great strides in its real-time access capabilities, narrowing the gap compared to traditional database technologies. With systems such as Impala and Spark, analy Read more…

BlueTalon Delivers Fine-Grain Protection of HDFS Data

BlueTalon today announced that its big data security software now supports the enforcement of fine-grained authorization policies and masking of HDFS data. This will enable customers to control access to Hadoop data usin Read more…

Does InfiniBand Have a Future on Hadoop?

Hadoop was created to run on cheap commodity computers connected by slow Ethernet networks. But as Hadoop clusters get bigger and organizations press the upper limits of performance, they're finding that specialized gear Read more…

Datanami