


EraDB Says Elasticsearch Clone Is More Scalable, Easier to Run

Feb 10, 2021

EraDB today took the covers off EraSearch, a distributed log management tool built atop the startup’s S3-based database service. The company claims the Kubernetes-based EraSearch offering is API-compatible with Elasticsearch, but is more scalable and easier to manage than the popular open source log management tool it seeks to replace. Read more…

Empowering the Data Consumer: Living, and Breathing Data Governance, Security, and Regulations

Feb 9, 2021

Every organization wants to leverage its data as a strategic asset and scale data usage while delivering speed and self-service and ensuring security. To meet the growing demand for analytics in large enterprises, an automated approach to data privacy and governance is critical. Read more…

Apache Iceberg: The Hub of an Emerging Data Service Ecosystem?

Feb 8, 2021

Engineers at Netflix and Apple created Apache Iceberg several years ago to address the performance and usability challenges of using Apache Hive tables in large and demanding data lake environments. Read more…

Kyligence Doubles Down on Cubes in the Cloud

Jan 22, 2021

Multidimensional analysis, or the use of pre-computed OLAP cubes, used to be a primary method for quickly obtaining useful information from reams of data, but has largely fallen out of favor in the age of big data and big compute. Read more…

Governance, Privacy, and Ethics at the Forefront of Data in 2021

Jan 21, 2021

As we continue to gear up for 2021, it’s hard to envision data governance, privacy, and ethics not becoming more pressing topics for companies. While the rewards of using data are great, so too are the risks connected with abusing, losing, and misusing data, and those risks are becoming ever more clear. Read more…

News In Brief

Penguin’s DeepData Integrates Red Hat Ceph Storage

Apr 2, 2021

Penguin Computing, the HPC, AI and enterprise data center vendor, is collaborating with IBM’s Red Hat unit and Seagate Technology to offer scalable storage options for data-driven applications. The software-defined approach is positioned as a complement to existing monolithic appliance storage that is proving unable to keep pace with big data. Read more…

AWS Adds Explainability to SageMaker

Mar 31, 2021

Amazon Web Services is adding an AI explainability reporting feature to its SageMaker machine learning model builder aimed at improving model accuracy.

SageMaker Autopilot now generates a model explainability report via SageMaker Clarify, the Amazon tool used to detect algorithmic bias while increasing the transparency of machine learning models. Read more…

Restoring Supply Chains, Reducing Waste Via Explainable AI

Mar 30, 2021

The problem is wasteful supply chains clogged with excess inventory and defective parts. The goal is creating “perfect flow” for manufacturers and retailers to reduce waste running into the hundreds of billions of dollars annually. Read more…

With Integration Complete, Cloudera Re-Launches SQL Stream Builder

Mar 29, 2021

In October, Cloudera acquired Eventador, which developed a SQL-based streaming data analysis solution based on Apache Flink. With the work to integrate that product with the Cloudera Data Platform (CDP) complete, the company today re-launched it as Cloudera SQL Stream Builder. Read more…

Fiverr Adds Data Science Recruiting Category

Mar 25, 2021

The latest attempt to address the data science skills gap comes from Fiverr International, a recruiter of freelance technical talent, which unveiled a new vertical segment this week dedicated to data-related skills and services. Read more…

This Just In

O’Reilly Announces 2021 Superstream Series Lineup and Dates

Jan 14, 2021

BOSTON, Jan. 14, 2021 — O’Reilly, a premier source for insight-driven learning on technology and business, today announced its 2021 Superstream Series lineup. Beginning this month, O’Reilly will present its most popular conference franchises— Software Architecture, Infrastructure & Read more…

DL Framework ‘SmallTrain 0.2.0’ Released for Professional and Commercial Use

Dec 30, 2020

KYOTO, Japan, Dec. 30, 2020 — Geek Guild Co., Ltd. announced the Open-Source Software (OSS) project, “SmallTrain,” which generates user-friendly deep learning models for high accuracy and high functionality as a standalone deep learning library and as a wrapper for TensorFlow and PyTorch. Read more…

Unveils Flex-Code Data Connectors for Rapid Data Ingestion

Dec 17, 2020

PALO ALTO, Calif., Dec. 17, 2020 —, a data engineering company, today announced significant advancements to the Ascend Unified Data Engineering Platform with the addition of flex-code data connectors, a first-of-its-kind connector framework for data ingestion in the Apache Spark ecosystem that bridges the data connectivity worlds of databases, lakes, warehouses, APIs, and more. Read more…

DataStax Delivers New Open-Source API Stack for Modern Data Apps

Dec 10, 2020

SANTA CLARA, Calif., Dec. 10, 2020 — DataStax has announced a new API stack for modern data apps. Stargate, an open-source API framework for data, first unveiled this summer, is now generally available in DataStax’s Astra cloud database and for free download. Read more…

MLCommons Launches and Unites 50+ Tech and Academic Leaders in AI, ML

Dec 3, 2020

SAN FRANCISCO, Dec. 3, 2020 — Today, MLCommons, an open engineering consortium, launches its industry-academic partnership to accelerate machine learning innovation and broaden access to this critical technology for the public good. Read more…