Cloudera Unveils Altus to Simplify Hadoop in the Cloud

Cloudera Unveils Altus to Simplify Hadoop in the Cloud

Running Hadoop, whether on-premise or in the cloud, is neither simple nor easy. Administrators with specialized skills are needed to configure, manage, and maintain the clusters for their clients, who are data scientists, engineers, and analysts. Read more…

How to Build a Big Data Culture

How to Build a Big Data Culture

Data has become a critical corporate asset for keeping a competitive edge in the market and making smarter decisions company-wide. Read more…
Universities Roll Out New Data Science Programs

Universities Roll Out New Data Science Programs

College grads looking to take the next step in their data science careers will have several new graduate-level data science programs to enroll in this fall, including a new Master’s of Applied Data Science at Syracuse University, a new Doctorate of Data Science at the University of Tennessee, and a new

Demand from students for graduate-level data science degrees, as well as demand from companies for more skilled data scientists to crunch big data, led Syracuse University leaders to create the new 18-month, multi-disciplinary program, says Associate Professor Jeff Saltz. Read more…

Predicting Hurricane Damage with Machine Learning

Predicting Hurricane Damage with Machine Learning

Next week marks the start of a hurricane season that weather experts predict will generate an above-average number of tropical storms. Read more…

Datanami Headlines

Under Contract: Crowd-Sourced Zillow Algorithm

Zillow, the home sales tracking web site, has transformed the real estate business, but its proprietary algorithm for estimating home values hasn’t always measured up.

In tacit acknowledgment of that shortfall, the Seattle-based company launched a machine learning competition this week that will award $1 million to the individual or team that comes up with the best improvement to its “Zestimate” Read more…

Pepperdata Takes On Spark Performance Challenges

Apache Spark has revolutionized how big data applications are developed and executed since it emerged several years ago. But troubleshooting slow Spark jobs on Hadoop clusters is not an easy task. In fact, it may even be harder than MapReduce. Read more…

Nvidia’s Huang Sees AI ‘Cambrian Explosion’

The processing power and cloud access to developer tools used to train machine-learning models are making artificial intelligence ubiquitous across computing platforms and data framework, insists Nvidia CEO Jensen Huang.

One consequence of this AI revolution will be “a Cambrian explosion of autonomous machines” Read more…

Digital Psychometrics and its Discontents: Online Behavior for Psychological Analytics

Occasionally we hear about predictive data modeling aimed at gaining psychological insight that at first blush seems – even to tech sophisticates who pooh-pooh AI alarmism – like a primitive version of something unsettling to come, something too penetrating for comfort, something with the potential to compromise our privacy, even our sense of independence. Read more…

Largest Tech Fund Targets AI

A huge private equity fund launched over the weekend by Japan’s Softbank Group and a Saudi sovereign wealth fund targets artificial intelligence and other technologies while giving the new $93 billion fund the option of acquiring a larger stake in processor maker ARM Ltd. Read more…

Google/ASF Tackle Big Computing Trade-Offs with Apache Beam 2.0

Trade-offs are a part of life, in personal matters as well as in computers. You typically cannot have something built quickly, built inexpensively, and built well. Pick two, as your grandfather would tell you. But apparently when you’re Google and you, in concert with the Apache Software Foundation, just delivered a grand unifying theory of programming and processing in the form of Apache Beam 2.0, those old rules may not apply anymore. Read more…

GE Invests on Data Prepper Tamr

The investment arm of GE, a steady customer of startup Tamr Inc., has made a strategic investment in the data preparation platform vendor and placed a director on its board of directors.

Tamr was spun out from the Massachusetts Institute of Technologies’ Computer Science and Artificial Intelligence Laboratory in 2014. Read more…

Committers Talk Hadoop 3 at Apache Big Data

The upcoming delivery of Apache Hadoop 3 later this year will bring big changes to how customers store and process data on clusters. Here at the annual Apache Big Data show in Miami, Florida, a pair of Hadoop project committers from Cloudera shared details on how the changes will impact YARN and HDFS. Read more…

‘Cloud TPU’ Bolsters Google’s ‘AI-First’ Strategy

Google fleshed out its artificial intelligence efforts during its annual developers conference this week with the roll out of an initiative called Google.ai that serves as a clearinghouse for machine learning research, tools and applications designed to help automate AI development. Read more…

The Motivation for Native Graph Databases

Building a dependable database management system is a difficult task. We must be aware of the design trade-offs in the construction of a database management system and understand how those trade-offs impact end-user problems that we want to help solve. Read more…

Data Deals Heat Up

It’s looking more like buyers’ market as the number of data analytics deals grows.

Seeking to break out of the pack of business intelligence and analytical tool vendors, Tibco Software said it would acquire data science platform vendor Statistica. Read more…

How Big Data Can Save Lives at the Hospital

As a trauma nurse at one of the largest hospitals in the country, Charles Boicey saw his share of sickness and injuries, amazing recoveries and unfortunate deaths. But for all the good he and his fellow clinicians were able to do, Boicey was frustrated that he didn’t have the information to do more. Read more…

Massive Autonomous Vehicle Sensor Data: What Does It Mean?

It was reported by Ford that connected car sensors generate 25 gigabytes of data per hour, and then by the WSJ that a typical autonomous vehicle generates 4 terabytes of data in 90 minutes, and then by Intel of 45 terabits per hour.  Read more…