Follow Datanami:

Tag: Spark

Microsoft Expands Hadoop on Azure

Apr 16, 2019 |

Microsoft has upgraded its open source analytics services running on Azure with a new version of Hadoop incorporating enhancements of Apache Hive and other open source analytics frameworks.

The software giant (NASDAQ: Read more…

How Databricks Keeps Data Quality High with Delta

Apr 8, 2019 |

Data lakes have sprung up everywhere as organizations look for ways to store all their data. But the quality of data in those lakes has posed a major barrier to getting a return on data lake investments. Read more…

MapR to Autoscale Spark and Drill Via Prebuilt Kubernetes Containers

Apr 2, 2019 |

MapR Technologies today announced a technology preview of pre-built containers for Kubernetes that will give customers new capabilities for dynamically scaling their containerized Spark and Drill applications based on demand. Read more…

A Decade Later, Apache Spark Still Going Strong

Mar 8, 2019 |

Don’t look now but Apache Spark is about to turn 10 years old. The open source project began quietly at UC Berkeley in 2009 before emerging as an open source project in 2010. Read more…

Data Engineering Continues to Move the Employment Needle

Mar 5, 2019 |

Interested in a career in big data? You could do well by investing your time and effort in acquiring data science skills. But you may do even better by turning yourself into a data engineer, which is a title that continues to see substantial demand in the job market. Read more…

Microsoft Invests in Databricks

Feb 5, 2019 |

Databricks, the high-flying analytics startup founded by the creators of Apache Spark, announced yet another venture funding haul this week as it hustles to meet what it says is growing demand for its analytics platform. Read more…

Presto Backers Bolster Its Open Source Origins

Jan 31, 2019 |

A new industry group will promote Presto, the popular open source distributed SQL query engine launched by Facebook engineers in 2012 as a follow-on to Apache Hive.

The Presto Software Foundation launched on Thursday (Jan. Read more…

Build on the AWS Cloud with Your Eyes Wide Open

Jan 9, 2019 |

Building data applications on public clouds like Amazon Web Services is a no brainer for many organizations these days. The tools for ingesting, storing, and processing data in the cloud are rapidly maturing, and best of all, they’re largely pre-integrated, which saves data scientists and engineers time and money. Read more…

Movie Recommendations with Spark Collaborative Filtering

Nov 2, 2018 |

Collaborative filtering (CF)[1] based on the alternating least squares (ALS) technique[2] is another algorithm used to generate recommendations. It produces automatic predictions (filtering) about the interests of a user by collecting preferences from many other users (collaborating). Read more…

Nvidia Platform Pushes GPUs into Machine Learning, High Performance Data Analytics

Oct 10, 2018 |

GPU leader Nvidia, generally associated with deep learning, autonomous vehicles and other higher-end AI-related workloads (and gaming, of course), is mounting an open source end-to-end GPU acceleration platform and ecosystem directed at machine learning and data analytics, domains heretofore within the CPU realm. Read more…

Do NOT follow this link or you will be banned from the site!