Tag: Hadoop

Trifacta Brings Partners Into Data Prep Fold

May 24, 2016 |

The market for self-service data preparation tools is having a golden moment in the sun, with analyst firms like Gartner deciding that it does, in fact, have legs to stand on its own. The health of that market is also why Trifacta today launched a formal business partner program. With the new Wrangler Partner Program, Trifacta aims to bring a variety of types of firm into the self-service data prep fold, including system integrators, consulting firms, software vendors, and Hadoop Read more…

Kafka Creators Tackle Consistency Problem in Data Pipelines

May 24, 2016 |

One of the big questions surrounding the rise of real-time stream processing applications is consistency. When you have a distributed application involving thousands of data sources and data consumers, how can you be sure that the data going in one side comes out the other unchanged? That’s the challenge that Confluent is addressing with today’s launch of new software for Apache Kafka. If you’re moving big data today, you’re probably using Apache Kafka, or at least looking at it. The Read more…

How Spark and Hadoop Are Advancing Cancer Research

May 23, 2016 |

The combination of Spark and Hadoop has supercharged big data analysis across many industries and use cases by lowering the barrier of entry to advanced analytics and thereby enabling data scientists to create data-driven products that weren’t previously possible. But one area where Spark and Hadoop are having an especially strong impact revolves around cancer research. Cancer killed about 590,000 Americans last year, according to the Centers for Disease Control. That makes it the second leading causes of death in Read more…

Skills Gap Also Includes ‘Failure to Communicate’

May 17, 2016 |

The data science skills gap continues to widen, with emerging automation tools like machine learning only just now starting to take up some of the slack. PayScale, the online salary database, released a report Tuesday (May 17) on the state of the “skills economy” that ranks data analytics, programming and cloud computing skills among the most sought-after by U.S. employers. Nevertheless, the skills survey also highlights a continuing lack of writing and other communications skills among recent college graduate along Read more…

Hadoop Past, Present, and Future

May 17, 2016 |

Every few years the technology industry seems to be consumed with a shiny new object that gets hyped far beyond reality. At worst, the inevitable bursting of the hype bubble leads to the disappearance of the technology from relevance (remember Internet browsing on your TV?), but more often the hype subsides until a real but narrower focus for the technology is found. It’s been a decade since Hadoop was first created as an Apache top-level project, and during that decade Read more…

Data Gravity Pulls to the Cloud

May 16, 2016 |

Last month, Spotify grabbed headlines by announcing plans to get rid of its data centers and move onto Google’s Cloud Platform (GCP), claiming that the storage, compute and network services in the cloud are as high quality as on-premise alternatives. While few people take a second look at a digital-native company choosing to store data in the cloud, it seems to be generally accepted that for certain companies and industries, the cloud just isn’t a fit. Hadoop distribution vendors like Cloudera and Read more…

ODPi Offers Olive Branch to Apache Software Foundation

May 11, 2016 |

The rift between the Open Data Platform Initiative (ODPi) and the Apache Software Foundation (ASF) is on the mend, thanks in part to a peace offering by ODPi, an admission of being indelicate, and a $40,000 check. It may not pacify everybody in the Apache Hadoop community who feel threatened by ODPi’s presence, but at least it’s a start. With its financial commitment, ODPi becomes a gold sponsor in the ASF, which manages 350 open source projects, about 10 percent Read more…

DataRobot Looks to Cut Data Science Backlog

May 9, 2016 |

The data science automation specialist DataRobot Inc. is gaining traction in the big data market for its machine-learning application as new investors like Intel Capital fund its expanding operations. Boston-based DataRobot has so far raised more than $57 million in four equity investment rounds, including a $33 million funding round completed in February. Along with Intel Capital, Recruit Strategic Partners joined the startup’s fourth funding round as new investors. The company’s machine-learning platform runs either on top of Hadoop as Read more…

Hortonworks Grows Revenue 85 Percent

May 4, 2016 |

Hortonworks (NASDAQ: HDP), a much-analyzed stock as the only pure-play Hadoop distributor that’s public, reported $41.3 million in revenue for the first quarter of 2016, an 85 percent increase over the same quarter last year. Support subscription revenue during the quarter was up sharply from $13.1 million to $27.6 million, an increase of 110 percent compared to the first quarter of 2015, which was Hortonworks’ first quarter as a public company following an IPO in late 2014. Professional services revenue Read more…

How Credit Card Companies Are Evolving with Big Data

May 3, 2016 |

As overseers of our digitized marketplaces, credit card companies have a bird’s eye view of what we buy. If you want to know what American consumers are interested in, there’s perhaps no better way that to examine their purchase histories, so it’s no wonder that credit card companies, such as American Express, Capital One, JP Chase Morgan, and Citibank are at the forefront of big data mining. One of the credit card companies doing a lot of work in this Read more…