Follow Datanami:

Tag: Spark

LinkedIn’s Translation Engine Linked to Presto

Dec 11, 2020 |

An SQL translation engine unveiled this week by LinkedIn is integrated with other open-source SQL query engines like Presto in a combination aimed at bulging data lakes.

The Microsoft unit’s Coral engine handles analysis and rewrite along with translation duties. Read more…

Data Exchange Maker Harbr Closes Series A

Nov 17, 2020 |

Harbr, a London startup that helps organizations like Moody’s Analytics to create their own custom data exchanges, yesterday announced that it has completed a Series A round of financing, netting $38.5 million for the growing concern. Read more…

The Past and Future of In-Memory Computing

Oct 26, 2020 |

When Nikita Ivanov co-founded GridGain Systems back in 2005, he envisioned in-memory computing going mainstream and becoming a massive category unto itself within a few years. That obviously didn’t pan out, but on the eve of the In-Memory Computing Summit 2020 taking place later this week, the GridGain CTO is still bullish on the future in-memory computing, particularly for powering stream processing. Read more…

Aerospike Gives Legacy Infrastructure a Real-Time Boost

Sep 15, 2020 |

A database connector upgrade released this week by Aerospike Inc. links open source frameworks like Apache Spark data streaming to existing enterprise data infrastructure.

Among the goals is providing backward compatibility with mainframes and other relational data stores that are currently unable to handle data analytics and machine learning workloads. Read more…

Microsoft Now Developing Its Own Hadoop

Aug 26, 2020 |

Hadoop might be dead, but that’s not stopping public cloud providers from using it. The latest to make a move is Microsoft Azure, which in July announced that it would begin developing its own distribution under its HDInsight brand. Read more…

To Centralize or Not to Centralize Your Data–That Is the Question

Jul 14, 2020 |

Should you strive to centralize your data, or leave it scattered about? It seems like it should be a simple question, but it’s actually a tough one to answer, particularly because it has so many ramifications for how data systems are architected, particularly with the rise of cloud data lakes. Read more…

Intel Updates Optane, Expands NAND SSD Offerings

Jun 19, 2020 |

Intel Corp. remains persistent in upgrading its Optane persistent memory series. The chip maker (NASDAQ: INTC) said this week its second generation Optane series is tuned to the latest version of its Xeon Scalable processors (codenamed Cooper Lake), and boosts memory bandwidth by an average of 25 percent. Read more…

Staying On Top of ML Model and Data Drift

Jun 16, 2020 |

A lot of things can go wrong when developing machine learning models. You can use poor quality data, mistake correlation for causation, or overfit your model to the training data, just to name a few. Read more…

Will Databricks Build the First Enterprise AI Platform?

Dec 2, 2019 |

Ali Ghodsi might have one of the best jobs in technology right now. As the CEO of Databricks, Ghodsi just completed an oversubscribed $400 million round of funding that gave the company a $6.2 billion valuation. Read more…

Simplifying the Big Data Lake Experiences in the Cloud

Oct 16, 2019 |

The cloud is a hot spot for big data lakes these days, thanks largely to the greater technological simplicity and lower upfront costs of getting started in the public cloud. Read more…

Do NOT follow this link or you will be banned from the site!