Too many big data initiatives are science projects that take months of effort, risk failure and require highly trained data scientists with scarce skills. According to a CSC survey, 55 percent of big data projects aren’t completed and many others fall short of their objectives.Read more...
Vendors » EMC
There is a lot of debate in the big data space about tools and technology, and which ones are best. Is SQL better than NoSQL? Hadoop or Spark? What about R or Python? Of course no single tool or technology is the best for all situations, and you would do well to pick the right tool or technology for the job at hand.
While still in its infancy, the big data technology trend has made a lot of substantial progress since it gained traction at the beginning of this decade. The year 2013 was a big year with advances being made in virtually every quarter of the space. In this feature, we take a look at some of the significant trends that have crossed our desks in the past year — wrapped up and presented to you with a pretty bow. Out with the old, in with the new — it’s the Datanami 2013 Year in Review!
Startup company, Pivotal, announced today that they will be receiving a strategic investment from GE along the lines of $105 million dollars, representing a 10% equity stake in the company that was spun out of EMC to challenge the existing cloud software establishment.
According to the McKinsey Global Institute, businesses in the United States alone will be short 140,000 to 190,000 data scientists by the year 2018. Greenplum intends to help solve this problem with a complete open sourcing of their Chorus platform and the resulting partnership with Kaggle, a website which fosters growth in the data science community by hosting data mining competitions among its 57,000 participants.
ABBA is the source of more than just upbeat disco grooves. These days, the band’s acronym is being revived to back an EMC project that hopes to use Flash technology to spin big data into fast data. Although we too agree that there’s just not enough gold iamé in data-intensive computing R&D, there….
News In Brief
Not long ago, the rules for what constituted a data warehouse were fairly well defined. The schema was fixed, you could say, and was based primarily on relational database technology designed to process structured data. My, how times have changed. Last week, Gartner for the first time accepted non-relational technologies–including those based on Hadoop and NoSQL–in its annual Magic Quadrant for Data Warehouses report.
Among Hadoop distributors, MapR Technologies’ product offering is the strongest, Amazon Web Services has the biggest Hadoop market presence, and IBM and Pivotal Software have the best market strategy, according to a new Forrester Wave report released yesterday that slices and dices the Hadoop market and analyzes the various Hadoop product offerings and business strategies.
With Oracle’s OpenWorld going on this week, there is a lot of talk about what the future holds for IT. Two of the most powerful CEO’s in IT took the stage this week to discuss the trends that are shaping the future.
Today, April 15, 2013, is tax day in the United States. Millions of Americans are assuredly scrambling to file their returns today and most will be doing so electronically. Specifically, the IRS expects 80 percent of tax returns, or 250 million, to be filed online. That represents a significant big data problem in identifying the veracity of said filings.
Greenplum and EMC announced a 1000-node test cluster that will be used by select institutions to validate code running on the Hadoop framework. The companies say the the new service will further big data development by providing an environment to….
This Just In
RainStor, provider of the most efficient enterprise database for Big Data, has successfully completed product testing, resulting in validation of its database on EMC Corporation’s Isilon Scale-Out network-attached storage (NAS) running on the Hadoop Distributed File System (HDFS).