
Tag: Pandas
Starburst Brings Dataframes Into Trino Platform
Starburst customers who prefer to manipulate data using dataframes as opposed to regular SQL will be happy with a pair of announcements made today. That includes the introduction of PyStarburst, which provides a PySpark- Read more…
Inside Pandata, the New Open-Source Analytics Stack Backed by Anaconda
Anaconda made some news yesterday when it announced support for Pandata, a new open-source stack. But just what is Pandata, and should it be on your big data radar? According to the Pandata GitHub page, Pandata is a c Read more…
Deephaven Streamlines Access to Real-Time Analytics Platform
Getting Deephaven’s real-time analytics system up and running will be easier thanks to a new installation technique using a standard Python library. The open source software also sports a new integration with Jupyter a Read more…
Spark Gets Closer Hooks to Pandas, SQL with Version 3.2
The Apache Spark community last week announced Spark 3.2, a significant new release of the distributed computing framework. Among the more exciting features are deeper support for the Python data ecosystem, including the Read more…
Scaling to Great Heights at the Ray Summit
If you haven’t yet heard about Ray, the open source Python framework for building distributed applications, then next week’s Ray Summit will provide a compelling introduction to what might be one of the cornerstone t Read more…
Program Synthesis Moves a Step Closer to Reality
As data scientists and software developers sort through the plethora of tools and APIs ranging from Python to Apache Spark, automation schemes are emerging to help programmers navigate those tools and the accompanying in Read more…
Dremio Donates Fast Analytics Compiler to Apache Foundation
Dremio has donated the Gandiva Initiative -- a LLVM-based execution kernel designed to speed up analytical workloads – to the Apache Software Foundation, where it will become available to anybody who wants it as part o Read more…