Applications » Complex Event Processing


Big Data So Easy a Caveman Could Do It?

Feb 26, 2015 |

Let’s face it: big data isn’t easy. If you’re building a big data application today, you’re up to your eyeballs in things like R and Java, MapReduce and Pig, and Storm and Kafka. There’s a reason data scientists are so hard to find that they’re compared to unicorns. But in the future, the big data application assembly process may be dumbed down to the point where, as the insurance commercial says, even a caveman could do it. That’s the approach Read more…

Apache Spark Continues to Spread Beyond Hadoop

Feb 13, 2015 |

Apache Spark is most often thought of as a faster replacement for MapReduce, the batch-oriented programming framework that enabled first-gen Hadoop to gain traction 10 years ago. Indeed, Spark was initially created with Hadoop in mind. But Spark’s promise of speedy and accessible analytics is catching on with other data stores beyond Hadoop, including relational databases. Spark’s relationship to Hadoop is strong. All of the major Hadoop distributors include the in-memory processing framework with their particular Hadoop distributions. Core Spark Read more…

DataStax Dips Into Graph Waters, Pulls Out a Titan

Feb 3, 2015 |

Growing interest in graph database technology led DataStax to acquire Aurelius, the company behind the open source TitanDB graph database. DataStax tells Datanami that it plans to make the TitanDB technology available as an optional feature running on its commercial NoSQL database. TitanDB is one of a handful of open source graph databases aimed at enabling people to group and query large sets of connected data. The open source project, which began 2.5 years ago, has garnered a strong following Read more…

Interana Nabs $20M to Continue Work on Time-Series Analytics

Jan 21, 2015 |

Hadoop may be the top headliner when it comes to big data analytics, but it’s not the only game in town. A case in point is Interana, a Silicon Valley startup that today unveiled $20 million in funding to ramp up development of its tool for exploring massive amounts of time-series data. Interana’s software is based on a distributed column-based database that enables fast access to vast amounts of time-series data. Running on x86 servers, the database itself can process Read more…

Can Big Data Give Us Bionic Brains?

Jan 20, 2015 |

The rise of big data and analytics promises to transform many aspects of our lives, chief among them how we synthesize data and make decisions. Some theorize that big data will augment our own cognitive capabilities to the point where it’s like we have bionic brains. According to Deloitte Analytics, the field of cognitive analytics is an extension of cognitive computing, and is made up of three main components: machine learning, natural language processing, and advanced analytics. These technologies, combined Read more…

News In Brief

Cloudera Brings Kafka Under Its ‘Data Hub’ Wing

Feb 18, 2015 |

Cloudera is making Apache Kafka a supported part of its Hadoop distribution, the company announced today. While Kafka still doesn’t run on Hadoop, Cloudera says the changes it is instituting will help CDH customers build real-time analytics applications that span Hadoop and Kafka. Kafka is an open source message broker that’s designed to handle massive flows of streaming, real-time data, such as log data. The software was originally developed at LinkedIn, which uses it to process hundreds of millions of Read more…

MapR Delivers Bi-Directional Replication with Distro Refresh

Feb 18, 2015 |

A new release of the MapR Distribution including Hadoop unveiled today will enable companies to perform real-time, bi-directional data replication between Hadoop clusters that are thousands of miles apart. The new table replication feature was added to MapR-DB, the NoSQL database included with the high-end edition of MapR’s commercial Hadoop offering. As Hadoop adoption grows, companies are finding it increasingly difficult to ensure that they’re acting on the latest, freshest data. This fast-data problem is particularly evident in organizations that Read more…

Project Myriad Brings Hadoop Closer to Mesos

Feb 12, 2015 |

One of the challenges of running Hadoop is resource management. The process of spinning up and managing hundreds, if not tens of thousands, of server nodes in a Hadoop cluster (and spinning them down, moving them, etc.) is way too hard to do manually. Automation must come to the table to help Hadoop take the next step forward in its evolution. The big question is how it will unfold. One answer to that question came to the forefront yesterday when a Read more…

MarkLogic Goes All In with JavaScript

Feb 9, 2015 |

One of the knocks on MarkLogic has been the need for specialized training to develop applications that run on its NoSQL database. With today’s unveiling of MarkLogic 8, the company has addressed that criticism by enabling programmers to access all of the database’s functionality using the lingua franca of the Web today: JavaScript. JavaScript rose to prominence over the past decade as the language to write browser-side code that powered Web 2.0 applications. As JavaScript code flourished on the Web, Read more…

Tachyon Support Coming to Big Data Hypervisor

Feb 3, 2015 |

Organizations that are deploying Apache Spark to do data science on big data may be inclined to invest in Tachyon, the in-memory file system that was developed next to Spark at the AMPlab. Getting Spark and Tachyon spun up and deployed on bare metal can be a hassle, but it’s a business opportunity to BlueData, which is aiming to be the VMware of big data. Tachyon is a distributed, in-memory file system designed to enable reliable file sharing at memory-speed Read more…
