Real-World Hadoop Takes Center Stage at Strata

Sep 30, 2015 |

There’s no faster path to budgetary oblivion than implementing technology for technology’s sake. In today’s super-heated big data environment, it’s easy to get all worked up over technologies like Hadoop without carefully considering the business justifications at the same time. At the Strata + Hadoop World conference today, Cloudera co-founder Mike Olson did his best to steer the conversation to real-world Hadoop solutions. Cloudera’s chief strategy officer “Iron” Mike Olson kicked off today’s Strata keynote extravaganza with a reminder about Read more…

MapR Gives Hadoop Distro More NoSQL Smarts

Sep 29, 2015 |

Hadoop and NoSQL databases share some similarities, but the platforms typically live within different levels of the big data spectrum. Now MapR Technologies is working to break those barriers down by adding a major piece of NoSQL functionality to its Hadoop distribution. At the Strata + Hadoop World conference today, MapR Technologies announced that it’s now supporting the storage of JSON documents in MapR-DB, the enterprise NoSQL data store that ships with its Hadoop distribution. As a spruced-up version Read more…

Cutting: Spark an ‘All-Around Win’ for Hadoop

Sep 24, 2015 |

Hadoop co-creator Doug Cutting said today that Apache Spark is “very clever” and is “pretty much an all-around win” for Hadoop, adding that it will enable developers to build better and faster data-oriented applications than MapReduce ever could. Cutting talked at length about Spark during today’s Cloudera webinar, titled “Uniting Spark and Hadoop: The One Platform Initiative.” The Hadoop distributor, which employs Cutting as its chief architect, has ramped up the pro-Spark messaging since the launch of the One Platform Read more…

One Deceptively Simple Secret for Data Lake Success

Sep 21, 2015 |

Gartner turned heads last year when it declared that the majority of data lake projects would end in failure. As a self-avowed “old dog” of data warehousing, EMC’s Bill Schmarzo vehemently agrees with that assessment, but says there is one simple secret to having success with a data lake project. Before we disclose Schmarzo’s secret formula (which really is unnervingly simple), let’s talk about how not to go about building a data lake. For starters, you can’t begin with the Read more…

News In Brief

Pivotal Opens Up HAWQ, MADlib

Oct 6, 2015 |

Pivotal Software has turned over its SQL-on-Hadoop engine along with its MADlib machine-learning tool to the open source community as it seeks to extend the reach of its interactive SQL engine deeper into the Hadoop ecosystem. Pivotal, which announced in April it was teaming with Hortonworks by combining its big data suite with its partner’s Hadoop platform, said recently it is contributing its HAWQ engine and MADlib framework to the Apache Software Foundation, giving each “incubation” status within the open source group. Read more…

Big Data Plays Arresting Role in White Collar Crime

Oct 6, 2015 |

User-friendly analytics tools with graphical presentation features, plus big data solutions and the ability to more easily search and track fiscal anomalies, give financial crime fighters a much stronger arsenal in their ongoing battle against white collar crime. These offenses cost taxpayers up to $600 billion annually, but the complexities of these cases mean many financial scammers elude justice for years, if not forever. That could – and should – be changing, however, as investigators leverage powerful databases, big data, Read more…

Data ‘Assimilation’ Seen as Weak Link in U.S. Weather Forecasts

Oct 5, 2015 |

Second-guessing over the accuracy of the U.S. weather forecasting model relative to its European counterparts resumed last week as early predictions that Hurricane Joaquin would strike the U.S. east coast proved inaccurate. Apart from the sodden residents of some southeastern coastal states, most people were spared when the hurricane drifted out into the Atlantic Ocean after pounding the Bahamas. Early models from the U.S. National Weather Service had the Category 4 storm hitting the east coast anywhere Read more…

CMU and Boeing Establish Aerospace Data Analytics Lab

Oct 2, 2015 |

Carnegie Mellon University and The Boeing Company (NYSE: BA) announced plans this week to establish the Boeing/Carnegie Mellon Aerospace Data Analytics Lab, a new academic research initiative that will leverage the university’s leadership in machine learning, language technologies and data analytics. This is more evidence of the collision between big data and HPC spurring academic-industry collaboration. The goal, say CMU and Boeing, is to find ways to use artificial intelligence and big data to capitalize on the enormous amount of data Read more…

Cisco, Paxata Join Forces on Data Prep

Oct 1, 2015 |

Data preparation specialist Paxata announced a partnership this week with Cisco Systems designed to advance the networking giant’s data preparation capabilities on its emerging big data platform. The partnership, in which Paxata’s technology will underpin the Cisco (CSCO) data prep offering, was unveiled this week at Strata + Hadoop World. The deal underscores the growing momentum of the data prep market as data sets grow in size and complexity. The Cisco platform will use Paxata’s machine intelligence algorithms along with Read more…

This Just In

Clarabridge Acquires Engagor

May 21, 2015 |

RESTON, VA., May 21 — Clarabridge, Inc., the leading provider of Customer Experience Management (CEM) solutions for the world’s top brands, today announced the acquisition of Engagor, the most comprehensive platform for real-time social customer service and engagement. The combined offering provides a complete, end-to-end technology solution for marketers, customer care organizations and operations teams to create more profitable customer relationships. Founded by Folke Lemaitre in 2011, Belgium-based Engagor offers a robust social listening and engagement platform for marketers and customer Read more…

MemSQL Launches Community Edition: World’s Fastest In-Memory Database Now Available to All

May 20, 2015 |

SAN FRANCISCO, CA – May 20 – MemSQL, the leader in real-time databases for transactions and analytics, today announced the most significant release of MemSQL to date. MemSQL 4 – which includes a new Community Edition – empowers interconnected enterprises to aggregate and report on real-time data, accelerating the growth trajectories of their digital businesses. This release brings to market groundbreaking capabilities such as the industry’s first real-time, distributed geospatial intelligence and the MemSQL Spark Connector to operationalize Apache Spark. Read more…

Apache Unveils Hadoop 2

Oct 17, 2013 |

The Apache Software Foundation, which oversees the 150 or so open source projects under the famous Apache umbrella, this week announced Hadoop 2 – the latest version of the popular software framework for distributed computing.