Too many big data initiatives are science projects that take months of effort, risk failure and require highly trained data scientists with scarce skills. According to a CSC survey, 55 percent of big data projects aren’t completed and many others fall short of their objectives.
Vendors » Oracle
While Moore’s Law has made computing and storage capacity less expensive with each passing year, the amount of data that companies are storing and the number and sophistication of the algorithms that they want to employ on that data to perform analytics are growing faster than the prices are dropping. And that means the bang for the buck of the underlying hardware and the analytics software that runs atop it matters. The trouble is that benchmarking systems takes far too…
This fall, thousands of budding data scientists around the country will begin their march toward advanced degrees and the near certainty of multiple job offers upon graduation. At least seven new graduate-level programs are set to fire up at colleges and universities around the country, while a handful of new undergraduate programs are also popping up to give data scientists a head start on their schooling. The demand for data scientists remains extremely high as companies across all industries race to…
Analytic platforms that generate insights from data in real time are mature enough for enterprises to begin adopting them, Forrester says in its latest report. While open source streaming analytic products like Apache Storm are proving popular, Forrester says they lack key functionality found in the offerings of proprietary vendors, such as top-rated Software AG. You don’t need a Forrester analyst to know that streaming analytics is red hot at the moment. If Hadoop has opened our eyes to what…
Once dismissed as exotic toys, in-memory databases are gaining new ground in the data center. The latest in-memory believer is Oracle, which in July will ship the new in-memory option for its 12c database, which it claims will boost analytic workloads by 1,000x. That likely won’t be enough to match dedicated in-memory setups, such as EXASOL’s. Oracle is taking a page out of rival SAP’s HANA playbook, which says enterprises want a single hybrid database that can simultaneously process transaction…
Splice Machine today announced the commercial availability of its relational database management system (RDBMS) for Hadoop. By building a SQL-compliant RDBMS atop HBase, Splice Machine is giving customers another place to run workloads that are reaching the limits of standard databases running on commodity hardware. When you think about Hadoop, chances are good that you think about analytic workloads running against large amounts of semi-structured data. It’s for doing data mining against Web logs or running pattern matching algorithms against…
News In Brief
Not long ago, the rules for what constituted a data warehouse were fairly well defined. The schema was fixed, you could say, and was based primarily on relational database technology designed to process structured data. My, how times have changed. Last week, Gartner for the first time accepted non-relational technologies, including those based on Hadoop and NoSQL, in its annual Magic Quadrant for Data Warehouses report.
According to the big data number crunchers at Wikibon, spending on big data hardware, software, and services added up to $18.6 billion in 2013. That’s about $500 million, or roughly 3 percent, bigger than it projected the market to be a year ago when it did its last forecast. On a year-over-year basis, actual big data spending grew 58 percent.
The best way to boost your salary in 2014 may be to learn a new data analytics tool. According to the 2013 Data Science Salary Survey from O’Reilly, there was a strong correlation between data scientists and analysts who used the highest number of tools and those who had the highest salaries. What’s more, those who used open source tools, such as R and Hadoop, tended to bring home more bacon than those who used commercial products, such as SAS and Teradata.
Cloudera today unveiled results of internal tests that show its Impala SQL engine is twice as fast as an unnamed commercial data warehouse system and 24 times faster than Apache Hive. The results also show that Impala scales in nearly a linear fashion on Hadoop, which Cloudera says will give customers the confidence they need to not only absorb legacy business intelligence workloads, but also build the next generation of SQL-powered data analytic applications.
With Oracle’s OpenWorld going on this week, there is a lot of talk about what the future holds for IT. Two of the most powerful CEOs in IT took the stage this week to discuss the trends that are shaping the future.
This Just In
Oracle today announced that it has signed an agreement to acquire BlueKai, the industry’s leading cloud-based big data platform that enables companies to personalize online, offline and mobile marketing campaigns with richer and more actionable information about targeted audiences.
Oracle today announced Oracle Big Data Appliance X4-2, which includes the entire Cloudera Enterprise technology stack and 33 percent more storage capacity for a total of 864 terabytes.
NGDATA, a customer intelligence management company, today announced that it has achieved Gold partner status in Oracle PartnerNetwork (OPN).
Univa, the Data Center Automation company, today announced that the company will assume support of Oracle Grid Engine customers. As a part of the transaction, Univa acquired Oracle Grid Engine assets and intellectual property making it the sole commercial provider of Grid Engine software.
Kapow Software, a Kofax company and leading big data integration software provider, announced it is working with Oracle on data integration for Oracle Endeca Information Discovery. In combination with Kapow Software’s data integration technology, which will be marketed and sold by Oracle as Oracle Endeca Information Discovery Web Data Acquisition Toolkit, organizations can now use the enterprise data discovery platform for rapid, intuitive exploration and analysis of information from any unstructured external data source.