Features

Five Steps to Running ETL on Hadoop for Web Companies

Sep 1, 2014

Mention ETL (Extract, Transform and Load) and eyes glaze over. The thought goes: “That stuff is old and meant for clunky enterprise data warehouses. What does it have to do with my Internet/Web/ecommerce application?” Quite a lot, actually. ETL did originate in enterprise IT where data from online databases is Extracted, then Transformed to normalize it and finally Loaded into enterprise data warehouses for analysis. Although Internet companies feel they have no use for expensive, proprietary data warehouses, the fact Read more…

Hadoop Labor Update: Cloudera Talks Impala 2.0 as Hortonworks Previews Kafka

Aug 29, 2014

Say what you will about Hadoop (and we do), the big data platform is evolving at an incredible rate. This week, two of the biggest Hadoop distributors, Hortonworks and Cloudera, shared how they’re working to improve two key aspects of the platform: real-time data pipelining via Apache Kafka and SQL-based data warehousing via Impala. Let’s start with Cloudera. This week, the Hadoop distributor announced that the upcoming release of Impala 2.0 will add much more complete SQL functionality to CDH, Read more…

How a Top Ad Firm Stopped Fearing Big Data and Learned to Love Analytics

Aug 18, 2014

Competition in the $500-billion global advertising industry is brutal, and ad firms that fail to deliver are quickly shown the door by their clients. But in this age of big data analytics, delivering results is not enough, as clients increasingly demand to know exactly how their media buys boosted demand. As the sixth-largest media buying and marketing firm in the world, Havas Media has created and executed print, online, display, and TV advertising strategies for some of the world’s Read more…

TPC Crafts More Rigorous Hadoop Benchmark From TeraSort Test

Aug 18, 2014

While Moore’s Law has made computing and storage capacity less expensive with each passing year, the amount of data that companies are storing and the number and sophistication of the algorithms that they want to employ on that data to perform analytics are growing faster than the prices are dropping. And that means the bang for the buck of the underlying hardware and the analytics software that runs atop it matters. The trouble is that benchmarking systems takes far too Read more…

Here’s Another Option for Hadoop Enterprise Search

Aug 8, 2014

The software stacks of many Hadoop distributions feature Apache Lucene and Solr as the enterprise search component. But the folks at the French firm Sinequa say Hadoop customers will get more actual work done–and quickly analyze massive amounts of poly-structured data from dozens of other sources in multiple languages–by using its enterprise search solution. Hadoop, machine learning algorithms, and graph databases may get most of the headlines in our big data world, but good old search engines continue to be Read more…

News In Brief

Stinger Initiative Prepares for .next Phase

Sep 4, 2014

The Hadoop developer community that recently delivered the final tweaks to the Stinger Initiative, an effort to bring SQL capabilities to Apache Hive, said its Stinger.next effort would focus on further enhancements to SQL for supporting real-time access in Hive along with support for transactional capabilities. In a blog post, Hortonworks developers Alan Gates and Raj Bains reported that 145 developers from 44 companies have contributed 390,000 lines of code over the last 13 months to the Stinger Initiative. They Read more…

Big Data Salaries, Job Offers on the Rise

Sep 3, 2014

As demand for predictive analytics professionals grows, median base salaries and retention bonuses continue to rise as more companies in more industries jump on the big data bandwagon. According to a salary survey of predictive analytics professionals released Sept. 3 by Burtch Works, the executive recruiting consultant, median base salaries over the last 12 months ranged from $95,000 for data analysts to $145,000 for managers. Seventy-one percent of data workers surveyed said they were eligible for bonuses while 91 percent Read more…

Poll: SAS Use Surges for Data Mining

Aug 26, 2014

A recent poll querying data scientists on which programming and statistics languages they used in 2014 for analytics, data mining and data science found that four main languages dominated. The data mining community web site KDnuggets reported earlier this month that respondents identified R, Python, SAS and SQL (in that order) as their preferred programming languages. Fully 91 percent of respondents used one of the four languages. The R programming language led the way, cited by 49 percent of respondents Read more…

Performance Analytics Tackles Sports Injuries

Aug 25, 2014

In the human demolition derby known as the National Football League, season-ending, often career-threatening, injuries are already piling up like linemen on a loose football. And it’s only preseason! An injury to a star player (the St. Louis Rams lost their starting quarterback Sam Bradford to his second torn ACL in as many years over the weekend) can ruin a team’s season before it begins. That translates into empty seats in stadiums, lost revenues from missing the playoffs and potentially years of Read more…

Gauging Human Emotions at the Stroke of a Key

Aug 22, 2014

Indian researchers using text pattern analysis and “keystroke dynamics” claim they have designed a computer program that can accurately recognize a computer user’s emotions. “Depending on the emotion,” the researchers claimed, they accurately recognized “emotional states” most of the time, indeed with great precision: 87 percent. Writing in the journal Behavior & Information Technology, the team reported asking volunteer typists to note their emotional state after typing “fixed” text along with prescribed intervals of regular computer use. Read more…

This Just In

Splunk Appoints Mark Carges to Board of Directors

Sep 15, 2014

SAN FRANCISCO, Calif., Sept. 15 — Splunk Inc., provider of the leading software platform for real-time Operational Intelligence, today announced that Mark Carges has been appointed to the company’s Board of Directors effective as of September 9, 2014. “We are pleased to welcome Mark to the Splunk board,” said Godfrey Sullivan, Chairman and CEO, Splunk. “Mark’s deep technical expertise together with his understanding of business model migration to the cloud provides us with invaluable insight and perspective.” “I am thrilled to be Read more…

IDC Publishes New Big Data Study

Sep 15, 2014

FRAMINGHAM, Mass., Sept. 15 — In a newly published study, International Data Corporation (IDC) forecasts that the Big Data technology and services market will grow at a 26.4% compound annual growth rate (CAGR) to $41.5 billion through 2018, or about six times the growth rate of the overall information technology market. Although IDC believes that Big Data will continue to represent a fast-growing multibillion-dollar worldwide opportunity for the next five years, overall growth of the individual segments that make up this Read more…

MapR Appoints Patrik Svanström Vice President, EMEA

Sep 11, 2014

SAN JOSE, Calif., Sept. 11 — MapR Technologies, Inc., provider of the top-ranked distribution for Apache Hadoop, today announced the appointment of Patrik Svanström as vice president, EMEA. Svanström will be responsible for MapR business activities across the region, including sales and partner development, strategic planning, customer engagement and revenue growth. With over 20 years of experience in enterprise software and infrastructure markets, Svanström has held executive management positions at start-ups and multinational companies, including Informatica, HP, EMC and Oracle. “Patrik Read more…

Talend Appoints New CTO and Head of Engineering

Sep 11, 2014

Sept. 11 — Talend, the global big data integration software leader, today announced that 20-year enterprise veteran Laurent Bride has joined the company as Chief Technology Officer and Head of Engineering. Reporting to CEO Mike Tuchen, Bride will play a key role in shaping Talend’s technology strategy as the company embarks on its next phase of growth in big data, applications and cloud. Most recently, Bride served as CTO at Axway, a market leader in governing the flow of data. Prior to joining Read more…

Vormetric Partners with DataStax

Sep 11, 2014

SAN FRANCISCO, Calif., Sept. 11 — Vormetric, a leader in enterprise data security for physical, virtual, big data, public, private and hybrid cloud environments, today, at the fifth annual Cassandra Summit, announced a partnership with DataStax, the company that delivers Apache Cassandra to the enterprise. Vormetric enhances the existing enterprise-class security features already available from DataStax’s massively scalable distributed database technology with protection for data-at-rest that includes encryption, enhanced access controls and security intelligence. With Vormetric’s Transparent Encryption and Vormetric Application Encryption solutions, organizations can safely use sensitive data within Read more…