Big Data • Big Analytics • Big Insight

Sectors » Other


When Big Data Becomes Too Much Data

Oct 27, 2014 |

About 2.5 exabytes of data will be generated today, or roughly the amount of data that was generated from the dawn of time until 2004. What’s in there, and will any of it be useful? The reality is the amount of data is so vast, its quality so dubious, and our abilities so relatively weak that most of it will have no impact whatsoever. In a perfect world, each additional byte of data we generate and absorb would shave a Read more…

Today’s Baseball Analytics Make Moneyball Look Like Child’s Play

Oct 24, 2014 |

Baseball has always been a game of numbers and statistics. But thanks to an explosion of data over the past seven years and the advent of new analytic software running on supercomputers, the game is on the cusp of changes that will make Moneyball look like it belongs in the minor leagues. When the San Francisco Giants take the field against the Kansas City Royals in game three of the World Series tonight, you can bet that the choices made Read more…

Spark Smashes MapReduce in Big Data Benchmark

Oct 10, 2014 |

Databricks today released benchmark results for Apache Spark running the Sort Benchmark, a competition for measuring the sorting performance of large clusters. Spark running on Hadoop sorted 100 TB of data in 23 minutes, three times faster than the previous record held by Yahoo using MapReduce on Hadoop. The result, Databricks says, are due to targeted improvements the Spark community made to improve performance, and should lay to rest any concerns about Spark’s scalability. Databricks, which is the commercial outfit Read more…

Top Three Things Not To Do in Excel

Oct 2, 2014 |

Let’s face it: We all have seen a crazy Microsoft Excel spreadsheet or encountered one of its dreaded “Not Responding” messages. Unfortunately, the flexibility and ease of Excel makes it the ideal candidate for inappropriate use and widespread abuse. As the most widely used analytical tool in the world, Excel has indeed come a long way since the days of Visicalc and MultiPlan. Modern Excel 2013 and the latest Power BI add-ins do sizzle in demonstrations, but there are analyses Read more…

Five Steps to Running ETL on Hadoop for Web Companies

Sep 1, 2014 |

Mention ETL (Extract, Transform and Load) and eyes glaze over. The thought goes: “That stuff is old and meant for clunky enterprise data warehouses. What does it have to do with my Internet/Web/ecommerce application?” Quite a lot, actually. ETL did originate in enterprise IT where data from online databases is Extracted, then Transformed to normalize it and finally Loaded into enterprise data warehouses for analysis. Although Internet companies feel they have no use for expensive, proprietary data warehouses, the fact Read more…

News In Brief

Dairy Industry Asks: Got Big Data?

Oct 31, 2014 |

The bucolic days of the family dairy farm are long gone. Even in places like “America’s Dairyland” (aka, Wisconsin), huge dairy operations that milk thousands of Holstein cows twice a day are an increasingly common site. These days, the dairy industry is all about production. Enter big data technology as “big dairy” becomes the primary supplier of milk, cheese and, at least in the author’s home state, fried cheese curds. Among the big data applications being embraced by corporate farmers Read more…

S. Korea Eyes Big Data to Reduce Car Accidents

Oct 8, 2014 |

Among the many striking features of the bustling city of Seoul, South Korea, are its connectedness, a function of its extensive deployment of broadband networks, and its roaring, non-stop traffic. An electronic sign in the city’s shopping district actually keeps track of the number of Korean auto fatalities. The total seemingly increases by the minute. Hence, the Korean government wants to use spatial, weather and other big data sources to provide drivers with what officials call an accident forecast service Read more…

Twitter Funds MIT ‘Social Machines’ Effort

Oct 3, 2014 |

Machines could become more social thanks to a new Twitter-funded initiative at the Massachusetts Institute of Technology’s vaunted Media Lab that will seek to develop new technologies to make sense of social chatter ranging from tweets to data streams to digital content. MIT Media Lab announced Oct. 1 the creation of a Laboratory for Social Machines using $10 million in funding from Twitter over the next five years. Twitter said it also would provide the new MIT lab with full Read more…

MongoDB Teams with Weather Channel on Digital Alerts

Oct 1, 2014 |

Somebody is finally trying to do something about the weather. The Weather Channel has begun transitioning its digital platforms, including its mobile apps running on iOS and Android, to MongoDB’s database. The new platform will allow the Weather Channel to serve weather alerts and other real-time information to an estimated 40 million users, the database specialist announced Oct. 1. MongoDB said it would serve as the “data store” for all Weather Channel feeds and user information delivered by its digital Read more…

Stinger Initiative Prepares for .next Phase

Sep 4, 2014 |

The Hadoop developer community that recently delivered the final tweaks to the Stinger Initiative, an effort to bring SQL capabilities to Apache Hive, said its effort would focus on further enhancements to SQL for supporting real-time access in Hive along with support for transactional capabilities. In a blog post, Hortonworks developers Alan Gates and Raj Bains reported that 145 developers from 44 companies have contributed 390,000 lines of code over the last 13 months to the Stinger Initiative. They Read more…

This Just In

MapR Announces November Speaking Lineup

Oct 31, 2014 |

SAN JOSE, Calif., Oct. 31 —  MapR Technologies, Inc., provider of the top-ranked distribution for Apache Hadoop, today announced its November speaking lineup, including TDWI Solution Summit, ad:tech, Lucene/Solr Revolution, and the Internet of Things (IoT) Expo. Highlights Include: TDWI Solution Summit  The TDWI Solution Summit is an exclusive, hosted gathering of experienced professionals, industry thought leaders, and top solution providers for real-world tips and best practices on the evolving data ecosystem. This event’s focus is Big Data Analytics for Read more…

Cloudera Announces Formation of Cloudera Labs

Oct 30, 2014 |

PALO ALTO, Calif., Oct. 30 — Cloudera, the leader in enterprise analytic data management powered by Apache Hadoop, announced the formation of Cloudera Labs, a virtual center for fostering innovations in incubation within Cloudera’s engineering R&D, and fast-tracking promising open source initiatives on the leading edge of adoption. The goal of Cloudera Labs is to bring more use cases, productivity, and value to developers by seeking and exploring new solutions to their problems through the development of future standard technologies that will power Read more…

Alteryx Announces Release of New Book

Oct 30, 2014 |

IRVINE, Calif., Oct. 30 – Alteryx, Inc., the leader in data blending and advanced analytics, today announced the release of Data Blending For Dummies, Alteryx Special Edition, a new book that explains how data analysts can use data blending to support business decision making. This Special Edition release, now available in eBook format, is a new addition to the ‘For Dummies’ line up— the world’s best-selling reference series and a branded imprint of Wiley. “As the amount of data available to analysts continues Read more…

TIBCO Introduces Open Source Version of Jaspersoft 5.6

Oct 29, 2014 |

PALO ALTO, Calif., Oct. 29 — TIBCO Software Inc., a global leader in infrastructure and business intelligence software, today announced the open source release of version 5.6 of the company’s award-winning TIBCO Jaspersoft business intelligence platform, available for free download on the Jaspersoft Community site. This new version includes several product updates, including new Big Data connectors, interactive reporting upgrades, an updated OLAP engine, and performance enhancements to TIBCO Jaspersoft Studio. In addition, the company announced that more than 450,000 users have registered with the Read more…

Simba Technologies Brings SQL to MongoDB

Oct 28, 2014 |

VANCOUVER, B.C., Oct. 28 — Today Simba Technologies Inc., the worldwide leader in Big Data connectivity, announced the availability of version 1.7 of its innovative MongoDB ODBC Driver, providing best-in-class connectivity for MongoDB data sources. The release features several ground-breaking advances, notably a breakthrough SQL interface for MongoDB, industry-first support for ODBC 3.8, and better performance. “With this release, Simba Technologies takes MongoDB ODBC connectivity to a new level,” says Simba Technologies CTO George Chow. “By design, the MongoDB database architecture employs Read more…