Follow Datanami:

Tag: Spark

How Uber Uses Spark and Hadoop to Optimize Customer Experience

If you've ever used Uber, you're aware of how ridiculously simple the process is. You press a button, a car shows up, you go for a ride, and you press another button to pay the driver. But there's a lot more going on beh Read more…

Cloudera Unveils Kudu, a Fast New Storage Option for Hadoop

Hadoop has been evolving from its batch-oriented roots into a more real-time system for some time. That evolution gained momentum today with Cloudera's announcement of Kudu, a new  in-memory data store for Hadoop design Read more…

Cutting: Spark an ‘All-Around Win’ for Hadoop

Hadoop co-creator Doug Cutting said today that Apache Spark is "very clever" and is "pretty much an all-around win" for Hadoop, adding that it will enable developers to build better and faster data-oriented applications Read more…

Inside Platfora’s Transition to Apache Spark

Platfora has embraced Apache Spark as the underlying data processing engine in version 5 of Big Data Discovery, which it announced today. But the company hasn't completely gotten rid of MapReduce in its Hadoop applicatio Read more…

Apache Spark Gets IBM Mainframe Connection

IBM's recent embrace of Apache Spark is beginning to generate dividends in the form of open source contributions for a mainframe big data link to Spark. Big data software vendor Syncsort, Woodcliff Lake, N.J., said Tu Read more…

Intel Investments Seek to Speed Big Data Rollouts

Chipmaker Intel Corp. is expanding its push into big data with an equity investment and a technology partnership with BlueData, the startup that specializes in generating virtualized big data clusters. Along with the Read more…

Medical Insight Set to Flow from Semantic Data Lakes

The potential for data analytics to disrupt healthcare delivery is large, and getting larger by the day. But in many cases, the need to hammer data into a structured format creates a barrier to productivity. Now a hospit Read more…

How Spark Democratizes Analytic Value from Hadoop Lakes

So you've installed Hadoop and built a data lake to house all the bits and bytes that your organization previously discarded. So now what? If you follow the advice from industry experts, the next step on your analytics j Read more…

Does InfiniBand Have a Future on Hadoop?

Hadoop was created to run on cheap commodity computers connected by slow Ethernet networks. But as Hadoop clusters get bigger and organizations press the upper limits of performance, they're finding that specialized gear Read more…

Inside WebTrends’ Big Data Analytics Pipeline

WebTrends has been collecting and analyzing Web data on behalf of its customers since it was founded way back in 1993. Considering the exponenetial growth of the Net since then, it's not a stretch to say WebTrends was do Read more…

How Apache Spark Is Helping IT Asset Management

There's been a lot of energy focused on how big data technology can improve the sales, marketing, service, and support departments of corporations. Tools like Hadoop, Spark, and NoSQL databases are changing the rules for Read more…

IBM, Databricks Join Forces to Advance Spark

IBM has jumped on the Apache Spark bandwagon, revealing it would throw its considerable weight behind the open source in-memory processing framework that has been gaining momentum over the last year. Separately, Datab Read more…

Basho Goes Vertical with Big Data Stack

Basho Technologies made a name for itself in the NoSQL database world by developing a scalable key-value store called Riak that's used by the likes of Time Warner, The Weather Company, and Comcast. Today the company disc Read more…

How Machine Learning Is Eating the Software World

Marc Andreessen famously said in 2011 that software was eating the world. Four years later, that trend has accelerated, only now it appears that machine learning technology is on the cusp of eating software, and that alg Read more…

Wanted: A Plug-In Architecture for Hadoop Development

Hadoop is hard. There's just no way around that. Setting up and running a cluster is hard, and so is developing applications that make sense of, and create value from, big data. What Hadoop really needs now, says former Read more…

Google Cloud Dataflow Now Open for Business

Google today formally took the wraps off Cloud Dataflow, the hosted offering designed to allow developers with average Java and Python skills to build sophisticated analytic "pipelines" that process huge amounts of data. Read more…

From Spiders to Elephants: The History of Hadoop

Have you ever wonder where this thing called Hadoop came from, or even why it's here? Marko Bonaci has wondered such things, too. In fact, he wondered about them so much that he decided to write a History of Hadoop chapt Read more…

Deep Dive Into Oracle’s Emerging Big Data Stack

Oracle has a lot of turf to protect in the multi-billion-dollar relational database market, where it owns a dominant share of the market. That creates a natural tension when it comes to big data technologies like Hadoop Read more…

Tachyon Nexus Gets $7.5M to Productize Big Data File System

Tachyon Nexus, the company founded to productize the Tachyon in-memory file system developed at the AMPlab, has received $7.5 million in venture capital funding from Andreessen Horowitz, the Wall Street Journal reported Read more…

Pinterest Shoots ‘Pinball’ Into Open Source

Pinterest announced yesterday that it's making the workflow management software it developed to manage big data pipelines, called Pinball, available as open source. Now anybody can use the same technology that Pinterest Read more…

Datanami