Big Data • Big Analytics • Big Insight

Taming Apache Storm for Real-Time Analytics

Apache Storm is gaining a foothold among organizations looking to do real-time analytics on streaming data. However,...

Training Day: CrowdFlower Sets Human-Generated Data Free

Data scientists who are looking for high quality sets of curated data on which to train their...

How to Get a ‘Network Effect’ from Your Big Data Lake

One of the hidden benefits of being a data-driven organization is a so-called “network effect” that occurs...

The 3 Key Steps to Building a Predictive App with Machine Learning

Machine learning is the technology that allows businesses to make sense of vast quantities of data, make...

In The Spotlight

Accelerating Hadoop® Workflows to Yield Greater Application Efficiency

As enterprise-critical decision support fully embraces big data, confusion has grown over how best to satisfy the increasing demand for ever-larger data analytics. Some have questioned whether Hadoop will continue to scale reliably and serve as the primary workhorse for enterprise production-level data analytics. To meet the need for more scale, breakthrough technologies have recently removed any doubt about how to extend the useful life and scale of enterprise-critical Hadoop applications.

News In Brief

Where Does InfiniDB Go From Here?

Last September, the company behind InfiniDB, Calpont, went out of business. Up stepped MariaDB, the company behind the open source relational database, to serve as a steward for the product and provide support to customers. The big question on...


Fujitsu Adding Column-Oriented Processing Engine to PostgreSQL

Fujitsu Laboratories announced last week that it has developed a column-oriented data storage and processing engine that can quickly analyze large amounts of data stored in a PostgreSQL database. The technology, which utilizes vector processing,...


Actian Claims ‘Permanent Performance Advantage’ with SQL-on-Hadoop Tool

The SQL-on-Hadoop sweepstakes are by no means over. What's been dubbed the "gateway drug" for Hadoop is just starting to gain traction. But according to Actian, its SQL-on-Hadoop offering, dubbed Vortex, is out to an early--and permanent--lead...


‘Data and Goliath’ A Portrait of Big Data Abuses

A new book by security expert Bruce Schneier is raising serious questions about the state of privacy in the big data age, and whether giving corporations and government access to the most intimate details of our lives in exchange for convenience...


Apache Spark Ecosystem Continues To Build

Apache Spark was everywhere at the recent Strata + Hadoop World conference. From Tableau's new Spark interface to the new Spark as a service (SaaS) offerings and Intel's new Spark initiative, the big data framework was very hard to...


U.S. Names First Chief Data Scientist

An industry veteran and college math professor who is partially credited with coining the title "data scientist" has been named the nation's first chief data scientist. The White House announced the appointment of DJ Patil to the new post...


Snowflake Differentiates Itself in Strata Startup Showcase

Snowflake Computing, a provider of big data warehousing as a service, took home top honors at the Startup Showcase event held during last week's Strata + Hadoop World conference. The award is a boost to the Silicon Valley company, which aims to be...


IBM Embraces Hadoop in ‘BigInsight’ Push

IBM jumped onto the Hadoop bandwagon this week with the introduction of its BigInsights for Apache Hadoop offering along with machine learning with R statistical computing and other features designed to handle data analysis at massive...


Will Poor Data Security Handicap Hadoop?

Companies around the world are looking to Hadoop as a platform on which to perform big data analytics. Every day, petabytes of data flow into Hadoop clusters with the aim of giving those companies a competitive edge. However, the overall lack of...


Cloudera Brings Kafka Under Its ‘Data Hub’ Wing

Cloudera is making Apache Kafka a supported part of its Hadoop distribution, the company announced today. While Kafka still doesn't run on Hadoop, Cloudera says the changes it is instituting will help CDH customers build real-time analytics...