Tag: Spark

Hadoop Engines Compete in Comcast Query ‘Smackdown’

Jun 22, 2017

Who rules the ring when it comes to Hadoop SQL query engine performance? Can flashy newcomers like Presto and Spark take an established giant like MapReduce to the mat? Comcast recently held a competition to crown the best Hadoop engine, and the answer may surprise you. Read more…

Yahoo’s Massive Hadoop Scale on Display at Dataworks Summit

Jun 16, 2017

Yahoo put its massive Hadoop investment on display this week at Dataworks Summit, the semi-annual big data conference that it co-hosts with Hortonworks.

While Hadoop is no longer the conference headliner that it once was, the platform is still critical for the daily operations of Yahoo, which officially became part of Verizon Communications this week when the $4.5 billion acquisition finally closed. Read more…

Hortonworks Shifts Focus to Streaming Analytics

Jun 14, 2017

Hortonworks started life providing a Hadoop distribution that allowed customers to process big data at rest. But these days, the company has shifted much of its attention and resources to streaming analytics, or processing big data in motion. Read more…

Spark’s New Deep Learning Tricks

Jun 7, 2017

Imagine being able to use your Apache Spark skills to build and execute deep learning workflows to analyze images or otherwise crunch vast reams of unstructured data. That’s the gist behind Deep Learning Pipelines, a new open source package unveiled yesterday by Databricks. Read more…

Pepperdata Takes On Spark Performance Challenges

May 24, 2017

Apache Spark has revolutionized how big data applications are developed and executed since it emerged several years ago. But troubleshooting slow Spark jobs on Hadoop clusters is not an easy task. Read more…

Cloudera Unveils Altus to Simplify Hadoop in the Cloud

May 24, 2017

Running Hadoop, whether on-premises or in the cloud, is neither simple nor easy. Administrators with specialized skills are needed to configure, manage, and maintain the clusters for their clients, who are data scientists, engineers, and analysts. Read more…

Google/ASF Tackle Big Computing Trade-Offs with Apache Beam 2.0

May 19, 2017

Trade-offs are a part of life, in personal matters as well as in computers. You typically cannot have something built quickly, built inexpensively, and built well. Pick two, as your grandfather would tell you. Read more…

Masking Technical Complexity in the Security Data Lake

May 2, 2017

Today’s growing cybersecurity threat demands a sophisticated response, one that increasingly involves the use of big data technologies like parallel file systems and machine learning. However, some security experts warn that the growing number and complexity of big data security tools could be hindering the cause. Read more…

Iguazio Re-Architects the Stack for Continuous Analytics

Apr 18, 2017

When it comes to modern big data architectures, you will typically find lots of different components, engines, and moving parts, each of which tackles part of the problem. One vendor with a bold vision of re-architecting the stack with a more streamlined approach is Iguazio, which is building a singular product based on Flash that delivers continuous analytics on big and fast data. Read more…

Learning from Your Data: Essential Considerations

Apr 13, 2017

For any organization undergoing digital transformation, a primary consideration is how to find, capture, manage, and analyze big data. Such organizations are looking to big data and data science to facilitate the discovery of analytics that will enable informed decision-making. Read more…