Big Data • Big Analytics • Big Insight

Technologies » Middleware

Features

Taming the Wild Side of Hadoop Data

Jan 28, 2015 |

Organizations are attracted to Hadoop because it lets them store huge amounts and different types of data, and worry about structuring it later. But that “anything goes” philosophy has a downside, and can threaten to turn a big data lake into bottomless pit. Today Hortonworks unveiled a plan to give data management on Hadoop just a bit more structure. Dubbed the Data Governance Initiative (DGI), Hortonworks aims to lead the development of new open source software that help organizations track Read more…

Three Ways Big Data and HPC Are Converging

Jan 27, 2015 |

Big data is becoming much more than just widespread distribution of cheap storage and cheap computation on commodity hardware.  Big data analytics may soon become the new “killer app” for high performance computing (HPC). There is more to big data than large amounts of information.  It also pertains to massive distributed activities such as complex queries and computations (a.k.a analytics).  In other words, deriving value through computation is just as “big” as the size of the data sets themselves.  In Read more…

Who’s Buying Into Big Data? It’s Not Who You Might Think

Jan 23, 2015 |

In any fast-moving technological field, there’s going to be a fairly wide distribution between those who believe the hype and push its adoption, and those who move forward cautiously. But according to a new report from Teradata, the “perception gap” for big data is quite large and is hurting organization’s ability to take advantage of data analytics. In a new report called “The Virtuous Cycle of Big Data,” Teradata says it’s none other than the chief executives who are wearing Read more…

Interana Nabs $20M to Continue Work on Time-Series Analytics

Jan 21, 2015 |

Hadoop may be the top headliner when it comes to big data analytics, but it’s not the only game in town. A case in point is Interana, a Silicon Valley startup that today unveiled $20 million in funding to ramp up development of its tool for exploring massive amounts of time-series data. Interana‘s software is based on a distributed column-based database that enables fast access to vast amounts of time-series data. Running on X86 servers, the database itself can process Read more…

Pulling Insights from Unstructured Data – Nine Key Steps

Jan 16, 2015 |

Data, data everywhere, but not a drop to use. Companies are increasingly confronted with floods of data, including “unstructured data” which is information from within email messages, social posts, phone calls, and other sources that isn’t easily put into a traditional column. Making sense and actionable recommendations from structured data is difficult, and doing so from unstructured data is even harder. Despite the challenge, the benefits can be substantial. Companies that commit to examining unstructured data that comes from devices Read more…

News In Brief

Watson’s Take on the State of the Union

Jan 29, 2015 |

Every year, as the State of the Union address comes to a close, analysts, political correspondents and politicians alike come forward to offer unique take on the president’s speech. This year, however, one new voice is chiming in: that of IBM’s Watson. Using a recording of President Obama’s address, MSNBC tapped the IBM Watson User Modeling service (one of IBM’s Watson Developer Cloud services) to take a deeper look at the language, tone, and topics covered to compare its outlook Read more…

VoltDB Reaches Out to Hadoop for Fast Data Analytics

Jan 28, 2015 |

VoltDB has made a name for itself by building a fast in-memory distributed relational database that processes transactions and does some real-time analytics too. With today’s launch of VoltDB version 5, the company is smoothing its integration with Hadoop in hopes of helping customers connect the dots between big data and fast data analytics. Hadoop is increasingly the repository for organization’s big unstructured data, such as call logs, click streams, and location information. All that stuff goes into the Hadoop Read more…

MapR Ponies Up with Free Hadoop Training

Jan 27, 2015 |

Hadoop is hard. There’s just no way around it. Without a set of specific technical skills, the chances of somebody developing or running a Hadoop cluster are not good. A new training program unveiled today by MapR Technologies may help to put a dent in the pent-up demand for Hadoop skills. MapR Technologies‘ Hadoop On-Demand Training is an online program designed to give developers, analysts, and administrators real-world Apache Hadoop skills. The program, which you can read more about at Read more…

Why Machine Learning Is A Priority for Andreessen Horowitz in 2015

Jan 26, 2015 |

Marc Andreessen and Ben Horowitz are legendary in the Silicon Valley, having invested early in firms like Twitter, Facebook, Box, Skype, and Groupon. So when the Sand Hill Road experts with an eye for picking winners put machine learning near the top of its list of trends to watch out for in 2015, it was a cue that the technology is on the cusp of having a breakout year. The way that Andreessen Horowitz sees things, machine learning and big data Read more…

Cloudera Teams with Google to Run Dataflow on Spark

Jan 20, 2015 |

Cloudera and Google today announced that they’re working together to get Dataflow–the big data pipeline model Google publicly launched last June–to run on Apache Spark, thereby giving customers more freedom to run their big data applications wherever they see fit. Google Cloud Dataflow is a managed service for creating data pipelines that ingest, transform, and analyze massive amounts of data, in either batch or streaming modes, using the same SDK and API. The service is based internal Google technologies like Read more…

This Just In