Applications » Enterprise Analytics

Features

Big Data Outliers: Friend or Foe?

Sep 2, 2014 |

The bigger your dataset, the greater your chance of stumbling into an outlier. It’s practically a certainty you’ll find isolated, unexpected, and possibly bizarre data you never expected to see in your data. But how you respond to these outliers could mean the difference between big data success and failure. How should you deal with data outliers? The answer is simple: It depends. On the one hand, the presence of outliers may be a sign of serious data quality issues, Read more…

Five Steps to Running ETL on Hadoop for Web Companies

Sep 1, 2014 |

Mention ETL (Extract, Transform and Load) and eyes glaze over. The thought goes: “That stuff is old and meant for clunky enterprise data warehouses. What does it have to do with my Internet/Web/ecommerce application?” Quite a lot, actually. ETL did originate in enterprise IT where data from online databases is Extracted, then Transformed to normalize it and finally Loaded into enterprise data warehouses for analysis. Although Internet companies feel they have no use for expensive, proprietary data warehouses, the fact Read more…

Hadoop Labor Update: Cloudera Talks Impala 2.0 as Hortonworks Previews Kafka

Aug 29, 2014 |

Say what you will about Hadoop (and we do), the big data platform is evolving at an incredible rate. This week, two of the biggest Hadoop distributors, Hortonworks and Cloudera, shared how they’re working to improve two key aspects of the platform: real-time data pipelining via Apache Kafka and SQL-based data warehousing via Impala. Let’s start with Cloudera. This week, the Hadoop distributor announced that the upcoming release of Impala 2.0 will add much more complete SQL functionality to CDH, Read more…

Who IBM’s Server Group Turns To for Machine Data Analytics

Aug 28, 2014 |

IBM’s engineering prowess is second to none, and its Systems and Technology Group builds the computers that run the world’s biggest companies. But when IBM’s STG unit went looking for a way to predict failures by analyzing log data returned by its customers’ servers and storage arrays, it looked externally to a little-known machine data analytics startup from Santa Clara. Glassbeam got its start five years ago, before the Internet of Things (IOT) became the industry’s hottest buzzword and out-hyped Read more…

Why Hadoop Isn’t the Big Data Solution You Think It Is

Aug 26, 2014 |

Hadoop carries a lot of promise in the IT world for the way it has democratized access to massively parallel storage and computational power. But the level of hype that surrounds Hadoop is disproportionate to its present capabilities, raising the possibility of a big data letdown of elephantine proportions. The emergence of Hadoop as a next-generation platform for parallel computing has piqued the interest of customers and investors alike. What mid-sized company looking for a big data edge wouldn’t want Read more…

News In Brief

EnterpriseDB Throws Down PostgreSQL Gauntlet

Aug 29, 2014 |

A new database development environment released earlier this month has been tweaked to leverage the NoSQL capabilities in PostgreSQL, the open source object-relational database tool, to build next-generation web applications. EnterpriseDB, the Postgres specialist based in Bedford, Mass., said its Postgres Extended Datatype Development Kit would allow application developers to use Postgres for the types of applications that previously required a specialized NoSQL-only tool. To enable this, the kit is said to expand Postgres capabilities for handling document databases with Read more…

From Data Wrangling to Data Harmony

Aug 18, 2014 |

More and better automation tools such as machine-learning technologies are needed to free data scientists from mundane “data-wrangling” chores. Those tools would allow scientists to focus on gleaning insights from prepared data, a range of experts told the New York Times in a recent survey of the state of big data. The newspaper reported that data scientists spend from 50 percent to 80 percent of their time organizing data, or “data janitor work,” before they could begin sifting through it Read more…

Analytics Drives Tesla Customer Loyalty

Aug 8, 2014 |

As the auto industry is transformed by upstarts like Tesla Motors, analytics is being applied to bring customers and car makers closer together. The Tesla S twin-engine electric sedan is a case in point, having been variously described by one observer of auto technology as an “iPad on wheels” and “highly modular,” a reference to the amount of computing power in the mid-range electric car. As the technology pundit Rob Enderle notes, “Tesla often knows about a problem before the Read more…

HP Taps Hortonworks to Supply Hadoop for HAVEn

Jul 24, 2014 |

Hewlett-Packard and Hortonworks today announced a strategic partnership that will see the vendors work together around Hadoop. In exchange for HP’s $50 million investment in Hortonworks, HP gets the right to distribute the Hortonworks Data Platform (HDP) as the Hadoop component of its HAVEn big data suite. While HP offers a range of IT solutions for big data problems, the IT giant has not had a very aggressive Hadoop strategy. It strengthened its Hadoop story in June 2013, when it Read more…

Teradata Acquires Revelytix, Hadapt

Jul 22, 2014 |

Teradata Corp., the analytic data platform vendor, said it has expanded its big data portfolio with a pair of recent acquisitions. Teradata, based in Dayton, Ohio, said July 22 it has acquired the assets of Revelytix, an information management specialist, along with big data technologists and intellectual property from Hadapt. The Revelytix deal was completed on July 16; the Hadapt acquisition on July 17, Teradata said. Terms of the two acquisitions were not disclosed, the company said, because they are Read more…

This Just In

ProKarma Joins Hortonworks System Integrator Partner Program

Sep 2, 2014 |

BEAVERTON, Ore., Sept. 2 – ProKarma, a global IT solutions company, today announced a partnership with Hortonworks, a leading contributor to and provider of enterprise Apache Hadoop, that will allow its clients to benefit from enterprise-level big data solutions, including enhanced integration capabilities around Hadoop to solve their unique business challenges. By joining the Hortonworks Systems Integrator Partner Program, ProKarma can rely on Hadoop to help strengthen its ability to drive and lead efforts related to advanced analytics and emerging technologies, including big Read more…

Trifacta Teams with Tableau to Simplify the Analysis of Big Data on Hadoop

Aug 28, 2014 |

SAN FRANCISCO, Calif., Aug. 28 — Trifacta, a leading Data Transformation platform provider, today announced deep integration with Tableau, a global leader in rapid-fire, easy-to-use business analytics software. The integration builds on Trifacta’s strategic partnership with Tableau. With the release of Trifacta Data Transformation Platform 1.5, Tableau customers now have the option of writing the output of Trifacta data transformations directly to a Tableau Data Extract format or registering the output with Hadoop’s HCatalog for more scalable, interactive data discovery Read more…

Alteryx to Sponsor Tableau Conference

Aug 28, 2014 |

Aug. 28 – Alteryx, Inc., the leader in data blending and advanced analytics, announced that it will once again be the top sponsor for the upcoming 2014 Tableau Conference (TC14). At the conference, which takes place in the Seattle area September 8-12, Alteryx will highlight how business analysts can dramatically reduce the time to create the best analytical data set, so they can spend more time creating opportunities with data, and less time preparing it. Alteryx also will distribute a new version Read more…

HP Business Intelligence Modernization Services Unveiled

Aug 26, 2014 |

PALO ALTO, Calif., Aug. 26 – HP today announced HP Business Intelligence Modernization Services designed to help enterprises understand, manage and leverage their data to improve customer engagement, create new business opportunities and reduce costs. Most existing business intelligence environments can provide analytics and reporting, but are not designed to deliver insights from new formats and higher volumes of data. As demand for access to information increases, enterprises are being challenged to process unstructured data and vast amounts of sensor data Read more…

IBM and Inspur to Advance Big Data and Analytics Innovation in China

Aug 26, 2014 |

ARMONK, N.Y. and JINAN, China, Aug. 26 – IBM and Inspur today announced the companies are committing to make Big Data and Analytics and transaction processing solutions available to customers and ISVs in China, giving them the ability to access and make sense of huge volumes of information in real-time. The two companies have collaborated and enabled IBM DB2 and IBM WebSphere Application Server software to operate on Inspur TS K1 Systems. In addition, Inspur will leverage an OpenPOWER Foundation reference design and capabilities to create innovative system solutions. Read more…