Follow Datanami:

Vendor » Elasticsearch

Features

MinIO Enjoying Role in Emerging Cloud Architecture

In the post-Hadoop world, object storage systems have become the new favored place to park petabytes of data, with Amazon S3 leading the way among cloud providers. But when organizations are looking for on-premise object Read more…

Cloudera Commits to 100% Open Source

The old Cloudera developed and distributed its Hadoop stack using a mix of open source and proprietary methods and licenses. But the new Cloudera will be 100% open source, just like Hortonworks, its one-time Hadoop rival Read more…

Crawling the Web for Data and Profit

The Web serves as a vast, renewable resource for the most valuable thing in existence: data. However, getting useful data from the Web isn't always an easy task. Luckily, there are a handful of open source and commercial Read more…

Hidden Anomalies No Match for LivePerson’s Machine Learning Engine

LivePerson knows a thing or two about customer service. After all, the company runs the global chat and messaging infrastructure that connects 18,000 businesses like Citibank and Home Depot with millions of their own cus Read more…

The Serverless ETL Nextdoor

When Nextdoor set out to rebuild its data ingestion pipeline so it could better handle billions of files per day, it brought in all the usual suspects in the real-time racket: Spark, Kafka, Flume, etc. In the end, howeve Read more…

Dremio Emerges from Stealth with Multi-Threat Middleware

If your business analysts are struggling to connect, prepare, and query data from multiple sources in a timely and cost effective manner, you might be interested in learning about Dremio, a new open source software compa Read more…

How These Banking, Energy, and Pharma Firms Use Spark

Few frameworks have gained so much popularity as quickly as Apache Spark.  The open source technology may not be ubiquitous yet in the analytics world, but it's fast approaching that point. Spark has certainly caught Read more…

JanusGraph Picks Up Where TitanDB Left Off

Yesterday marked the formal launch of JanusGraph, a new Linux Foundation project formed to continue development of the TitanDB graph database. JanusGraph is a fork TitanDB, the distributed graph database that was orig Read more…

Solr or Elasticsearch–That Is the Question

That is the common question I hear: Which one is better, Solr or Elasticsearch? Which one is faster?  Which one scales better?  Which one can do X, and Y, and Z?  Which one is easier to manage?  Which one should we u Read more…

Datanami