Big Data • Big Analytics • Big Insight

News in Brief

Fujitsu Adding Column-Oriented Processing Engine to PostgreSQL

Mar 4, 2015 |

Fujitsu Laboratories last week announced that it’s developed a column-oriented data storage and processing engine that can quickly analyze large amounts of data stored on a PostgreSQL database. The technology, which utilizes vector processing, is being showcased this week at a conference in Japan. Fujitsu has a long history developing big systems designed to handle heavy transactional loads. It was a close development partner of Sun Microsystems for 64-bit Sparc servers until Sun was acquired by Oracle. Today the $4.5 Read more…

Actian Claims ‘Permanent Performance Advantage’ with SQL-on-Hadoop Tool

Mar 2, 2015 |

The SQL-on-Hadoop sweepstakes are by no means over. What’s been dubbed the “gateway drug” for Hadoop is just starting to gain traction. But according to Actian, its SQL-on-Hadoop offering, dubbed Vortex, is out to an early–and permanent–lead in the performance department. At the recent Strata + Hadoop World show, Actian pitted Vortex against Cloudera’s Impala right in the booth, where it largely re-created the results of a 2014 TPC Decision Support (TPC-DS) benchmark test that showed Vortex completing a job Read more…

‘Data and Goliath’ A Portrait of Big Data Abuses

Mar 2, 2015 |

A new book by security expert Bruce Schneier is raising serious questions about the state of privacy in the big data age, and whether giving corporations and government access to the most intimate details of our lives in exchange for convenience and security is a tradeoff we should be making. Since 9/11, Schneier has been an outspoken critic of the government’s sometimes ham-handed approach to security. Take the airport security checkpoints, for example. Is the economic loss from asking everybody Read more…

Apache Spark Ecosystem Continues To Build

Feb 25, 2015 |

Apache Spark was everywhere at the recent Strata + Hadoop World conference. From Tableau’s new Spark interface to the new Spark as a service (SaaS) offerings and Intel’s new Spark initiative, the big data framework was very hard to miss. Intel jumped on Spark’s bandwagon last week when it announced it was forming a new initiative around the in-memory framework. “We have engaged with Databricks, one of the pioneers of Apache Spark, to advance analytics capability for the Spark on Read more…

U.S. Names First Chief Data Scientist

Feb 24, 2015 |

An industry veteran and college math professor who is partially credited with coining the title “data scientist” has been named the nation’s first chief data scientist. The White House announced the appointment of DJ Patil to the new post last week. Patil also will serve as the Obama administration’s deputy chief technology officer for data policy, the White House said. Patil most recently served as a vice president at RelateIQ, a customer relationship management specialist acquired by Salesforce in July Read more…

Snowflake Differentiates Itself in Strata Startup Showcase

Feb 23, 2015 |

Snowflake Computing, a big data warehousing as a service provider, took home top honors at the Startup Showcase event held during last week’s Strata + Hadoop World conference. The award is a boost to the Silicon Valley company, which aims to be a one-stop shop for analyzing data generated on the cloud. Snowflake emerged from stealth mode in October with $26 million in cash and a vision to create an “elastic data warehouse” that lives in the cloud. The company, Read more…

IBM Embraces Hadoop in ‘BigInsight’ Push

Feb 19, 2015 |

IBM jumped onto the Hadoop bandwagon this week with the introduction of its BigInsights for Apache Hadoop offering along with machine learning with R statistical computing and other features designed to handle data analysis at massive scale. The introduction coincides with the launch of an industry initiative by IBM and others to promote Apache Hadoop and big data technologies in enterprises. IBM BigInsights for Apache Hadoop comes with a broad data science toolset to query data, visualize and carry out Read more…

Will Poor Data Security Handicap Hadoop?

Feb 19, 2015 |

Companies around the world are looking to Hadoop as a platform on which to perform big data analytics. Every day, petabytes of data are flowing into Hadoop clusters with the aim of giving them a competitive edge. However, the overall lack of built-in security threatens to hamper the open source platform’s spread before it’s really gotten off the ground. If you set out to build a big data platform today, chances are good that data security would be one of Read more…

Cloudera Brings Kafka Under Its ‘Data Hub’ Wing

Feb 18, 2015 |

Cloudera is making Apache Kafka a supported part of its Hadoop distribution, the company announced today. While Kafka still doesn’t run on Hadoop, Cloudera says the changes it is instituting will help CDH customers build real-time analytics applications that span Hadoop and Kafka. Kafka is an open source message broker that’s designed to handle massive flows of streaming, real-time data, such as log data. The software was originally developed at LinkedIn, which uses it to process hundreds of millions of Read more…

MapR Delivers Bi-Directional Replication with Distro Refresh

Feb 18, 2015 |

A new release of the MapR Distribution including Hadoop unveiled today will enable companies to perform real-time, bi-directional data replication between Hadoop clusters that are thousands of miles apart. The new table replication feature was added to MapR-DB, the NoSQL database included with the high-end edition of MapR’s commercial Hadoop offering. As Hadoop adoption grows, companies are finding it increasingly difficult to ensure that they’re acting on the latest, freshest data. This fast-data problem is particularly evident in organizations that Read more…