IBM Acquires Observability Platform Databand.ai

IBM has announced the acquisition of data observability software vendor Databand.ai. Today’s announcement marks IBM’s fifth acquisition of 2022. The company says the acquisition “further strengthens IBM’s softwar Read more…

Cloudera Picks Iceberg, Touts 10x Boost in Impala

Cloudera is now supporting the open source Apache Iceberg table format in its cloud data platform, or lakehouse, the vendor announced yesterday. The move will help to ensure transactional integrity in the big data enviro Read more…

FBI Warns of Deepfakes in Remote Hiring

Have you ever been talking to someone face-to-face on a video chat and heard them sneeze without actually seeing them sneeze? No, this is not a new physics-bending symptom of Covid. It could be a warning that the person Read more…

A DECADE OF DATANAMI: 2011-2021

Check out our special Decade of Datanami series, which chronicled the major advances in the fields of big data from 2011 to 2021. From the rise of Apache Spark to the rapid expansion of the cloud, Datanami has been there Read more…

Opaque Systems Raises $22M Series A to Boost Confidential Computing

Opaque Systems announced it has raised $22 million in Series A funding bringing its total financing to $31.6 million. The San Francisco-based firm is focused on enabling collaborative analytics and AI in confidential com Read more…

It’s Not ‘Mobile Spark,’ But It’s Close

On April 1, 2015, Apache Spark PMC member Reynold Xin wrote a compelling blog detailing plans to deliver a mobile version of Spark. It was all a joke, of course: Spark was a heavy bit of code designed for distributed sys Read more…

Amazon re:MARS Highlights Intelligence from Shops to Space

Amazon hosted its inaugural re:MARS event back in 2019, with “MARS” here standing for machine learning, automation, robotics, and space. For the past two years, the event has been on hold due to the pandemic—but th Read more…

Databricks Scores ACM SIGMOD Awards for Spark and Photon

Databricks announced it has won two awards at the ACM SIGMOD (Association for Computing Machinery’s Special Interest Group on Management of Data) Conference in Philadelphia. Apache Spark was awarded the SIGMOD Sy Read more…

Why the Open Sourcing of Databricks Delta Lake Table Format Is a Big Deal

Databricks introduced Delta back in 2019 as a way to gain transactional integrity with the Parquet data table format for Spark cloud workloads. Over time, Delta evolved to become its own table format and also to become m Read more…

John Snow Labs Releases Spark NLP 4.0

Today, John Snow Labs announced the release of Spark NLP 4.0, the latest version of its NLP library built on Apache Spark ML. Spark NLP 4.0 features new question answering annotators, major performance improvements, opti Read more…

HPE’s GreenLake Cloud Service Goes Private

The last few years have seen a mass exodus to the public cloud, but many companies are bringing back computing resources internally for reasons that include data security and government regulations. HPE at the Discove Read more…

HPE Relaunches MapR Tech as Data Fabric Offering

HPE today unveiled GreenLake for Data Fabric, a new hybrid cloud data management solution designed to reduce challenges associated with data silos. The new offering, which piggybacks with a new private cloud solution HPE Read more…

Databricks Opens Up Its Delta Lakehouse at Data + AI Summit

Databricks, which had faced criticism of running a closed lakehouse, is open sourcing most of the technology behind Delta Lake, including its APIs, with the launch of Delta Lake 2.0. That was one of a number of announcem Read more…

Coding for the Edge: Six Lessons for Success

Edge computing is expanding dramatically as organizations rush to realize the benefits in latency, flexibility, cost, and performance that the edge can deliver. IDC estimates that global spending on edge hardware, softwa Read more…

Starburst Acquires Fellow Trino Supplier, Varada

Starburst solidified its position in the market for next-gen data analytics engines yesterday with the acquisition of Varada, a former competitor that developed and sold an analytics engine based on Presto. Terms of the Read more…

Video Analytics Platform VisualCortex is the New CV Kid in Town

A new video analytics platform, VisualCortex, has been launched globally. The company says its enterprise-grade platform is capable of facilitating any real-time or historical video analytics use cases, unlike single-use Read more…

Meet Datanami 2022 Person to Watch Adam Selipsky

When former Amazon Web Services CEO Andy Jassy left to take the top job at the parent company, it opened up a spot for Adam Selipsky to take over the world's biggest cloud platform. The timing was fortuitous for Seli Read more…

As Cloud Coverage Expands, Salaries Also Rise

The march to the cloud continued in 2021, as hyperscalers opened more data centers and customers ran more workloads in the cloud. As the cloud becomes the de facto standard platform for transactional and analytical worklo Read more…

A/B Test Like You’re Airbnb

A/B testing is a critical and oft-overlooked element of data science at high-flying tech firms like Google, Netflix, and Uber, but it can be difficult to set up a testing infrastructure. Now you can implement A/B testing Read more…