Fall Strata 2016 Coverage
Feature Articles from Fall Strata 2016
This week’s Strata + Hadoop World conference in New York City was expected to draw more than 7,000 attendees, making it the biggest big data conference on the planet. It’s also a showcase for Cloudera, which is the main sponsor of the show along with O’Reilly.
“Bring the compute to the data.” That was Hadoop’s calling card and solution for the problem of moving big data. However, the rise of cloud repositories and streaming technologies is causing Hadoop distributors to question whether that architecture is the best one going forward. Datanami seeks answers this week at Strata + Hadoop World. Read more…
Streaming data technology that’s been simmering on the back burner for the past few years will be the main entrée at this week’s Strata + Hadoop World conference in New York City.
There’s a profound shift currently underway in the big data community, as companies look for better ways to manage the huge flows of data occurring across their networks, and find faster ways to make business decisions. Read more…
The big data community has been dealing with the data scientist shortage ever since big data became a thing. Now we’re learning that there’s possibly an even bigger shortage of another type of data professional: the data engineer.
Data engineer is a relatively new position that’s a hybrid of sorts between a data analyst and a data scientist. Read more…
Tonight we’ll hear from the two main candidates applying to be the next president of the United States. The debate should give voters a clearer picture of how the next president will govern. But what we’re not likely to hear is how data science will power good decision-making once they’re in the White House. Read more…
Here’s some good news for young data science professionals looking for that first job: your skills are in high demand and will help you land a job with an average starting salary close to $120,000. But there’s also some bad news: the field is evolving so quickly that you will continually need to refresh your skills. Read more…
News in Brief from Fall Strata 2016
Yahoo is releasing the deep learning algorithm that it uses to detect “not safe for work” (NSFW) images to the open source community, the Web giant announced last week.
Anywhere from 4% to 30% of the Internet is composed of pornographic content, according to a 2011 article in Forbes. Read more…
A data lake startup whose platform utilizes Docker application containers to run an open source analytics engine is the winner of the startup showcase during this week’s Strata + Hadoop World in New York.
SAP completed its acquisition of big data analytics startup Altiscale Inc. this week, saying it would fold Altiscale’s data cloud and Hadoop services into its existing SAP HANA cloud and emerging analytics efforts.
Project DataWorks, a new initiative launched this week by IBM to advance its analytics push, seeks to forge a cloud-based analytics platform that combines different data types with its Watson cognitive computing technology.
The DataWorks initiative also reflects IBM’s (NYSE: IBM) embrace last year of the Apache Spark in-memory computing framework. Read more…
The rise of microservices in recent years is one of the general IT trends that have paralleled the emergence of big data technology. This week at the Strata + Hadoop World conference, MapR will be talking about how it plans to embrace the development and management of microservices in its converged data platform. Read more…
The application of machine learning to predictive analytics continues apace as a way to improve IT operations, data security and business intelligence. Among those offering frequent platform upgrades is real-time “operational intelligence” specialist Splunk Inc., which this week rolled out the latest versions of its IT, security and analytics packages that seek to “operationalize” Read more…
Companies operating multiple Apache Kafka clusters in on-premises and cloud data centers will benefit from a handful of new enterprise-level features unveiled at the Strata + Hadoop World conference today by Confluent, the commercial open source company behind the popular big data message bus.
Confluent today announced that its enterprise-strength Kafka offering, called Confluent Enterprise, is getting three key new capabilities in the version 3.1 release that will ship next month: multi-datacenter replication, automated cross-cluster data balancing, and a cloud-migration facility. Read more…
ODPi today unveiled the second major release of its Runtime Specification, which is aimed at setting a standard for Hadoop components to ensure greater interoperability among distributions and third-party products. New additions to the spec include Apache Hive and the Hadoop Compatible File System (HCFS). ODPi also announced that more ISVs have completed interoperability testing. Read more…
That rumbling sound you hear is Apache Spark entering production deployments in public clouds along with surging use of the cluster-computing framework’s streaming and machine learning capabilities, according to a new vendor survey that also found more diverse users and use cases.
Databricks Inc., the San Francisco-based startup behind Apache Spark, released survey results on Tuesday (Sept. Read more…
A transaction manager for the NoSQL database HBase has been approved as an open-source incubator project, according to project sponsor Yahoo.
The HBase transaction manager dubbed “Omid” (“Hope” in Persian) is the latest in a string of Hadoop ecosystem projects backed by Yahoo that also includes Pig, Storm and YARN. Read more…
MemSQL today unveiled a new release of its in-memory relational database that can process a real-time flow of messages from Apache Kafka using “exactly once” semantics. The NewSQL database accomplished the feat by creating a new “Create Pipeline” SQL command, and in part by bypassing Apache Spark.
You would be hard pressed to find a business executive who does not believe data initiatives are critical to company growth. Still, a large number of companies say their initial data initiatives have failed due to issues like “data inflexibility.”
That’s the key finding of a data analytics study compiled Monday (Sept. Read more…
This Just In from Fall Strata 2016
- InfoTrellis Unveils AllSight Version 4.6 – (9/30/2016)
- Hortonworks Announces Updates to Azure HDInsight Hadoop Cloud Offering – (9/29/2016)
- New Features Introduced for Confluent Enterprise – (9/28/2016)
- Cask Partners With Tableau to Deliver Rapid Insights From Big Data – (9/28/2016)
- Cloudera Announces New Technology Enhancements to Core Platform – (9/28/2016)
- Cloudera Reveals 2016 Data Impact Award Winners – (9/28/2016)
- Hortonworks to Showcase Latest Offerings at Strata + Hadoop World – (9/28/2016)
- BlueData Announces Fall Release for Enterprise Edition of EPIC Software – (9/27/2016)
- BlueTalon Test Drive Unveiled – (9/27/2016)
- Splice Machine Announces Native PL/SQL Support to Accelerate Migrations From Oracle to Hadoop – (9/27/2016)
- MapR Unveils Support for Event-Driven Microservices on Converged Data Platform – (9/27/2016)
- ClearStory Data Makes Advancements to Spark-Based BI Platform – (9/27/2016)
- Paxata Connect Now Available – (9/27/2016)
- Attunity Enterprise Manager Now Available – (9/27/2016)
- Dataguise Announces Availability of Sensitive Data Governance Dashboard – (9/27/2016)
- Cask Announces Further Integration With Cloudera Enterprise – (9/27/2016)
- Big Data Applications Now ODPi Interoperable – (9/27/2016)
- Continuum Analytics to Speak at Strata + Hadoop World – (9/27/2016)
- MemSQL 5.5 Released – (9/26/2016)
- Pentaho Announces Five New Data Integration Enhancements – (9/26/2016)
- Zoomdata and Cloudera Partner to Deliver Customer Insights Solution – (9/26/2016)
- MemSQL Unveils Roadmap for Ongoing Integration With Apache Spark – (9/26/2016)
- Zoomdata Announces Partnership to Provide Unified Visual Analytics Front End for Teradata Data Platforms – (9/26/2016)
- Maana Unveils Winter ’17 Knowledge Platform – (9/26/2016)
- Zoomdata Offering Free 30-Day Trial in AWS Marketplace – (9/26/2016)