Top Big Data Startups Honored at Strata
Hadoop virtualization startup BlueData took home the blue ribbon at the Startup Showcase competition during the Strata + Hadoop World conference last week. Also gaining recognition were Metanautix, an SQL data compute engine; graph analytics startup GraphLab; and SlamData, which develops an SQL add-on for MongoDB.
As we previously reported, BlueData was founded in 2012 by two former VMware engineers, Kumar Sreekanti and Tom Phelan, who wanted to dramatically reduce the complexity in Hadoop deployments. The company’s offering, dubbed EPIC, provides a layer of virtualization and automation that enables non-technical users to spin up Hadoop clusters with as few as five clicks.
Being named the top startup at Strata gives BlueData an additional level of credibility–not only among the Hadoop distributors, who have to be concerned about the creeping level of complexity in Hadoop, but also with anybody looking to use Apache Spark. Last week Databricks, the company behind Spark, announced that it has certified EPIC. That means BlueData customers can now spin up multi-tenant Spark clusters in a matter of minutes, without involving any Hadoop.
Nabbing second place was Metanautix, a big data startup based in Silicon Valley. Founded by the former Google engineer who headed the development of Dremel, Theo Vassilakis, and the Facebook engineer who developed the social media giant’s photo repository, Apostolos Lerios, Metanautix aims to create a SQL-powered super tool that can access practically any database or data store, and addresses a range of needs, including ad hoc queries, ETL, and transactional server jobs.
“Our goal is basically, wherever you have your data, we want to be able to go read it,” Vassilakis told Datanami in an interview this summer. “If it’s NoSQL, we’ll go read it from NoSQL. If it’s a server or if the Web or HDFS, we’ll go get it.”
Graph database startup GraphLab nabbed third place in the Strata Startup Showcase. The Seattle, Washington-based company is looking to commercialize an open source graph database created by CEO Carlos Guestrin at Carnegie Mellon University in 2009. The first release of its software, which was officially unveiled this summer, is said to lower the barrier of entry to using machine learning algorithms to analyze large sets of data without the services of a data scientist.
Taking the prize for audience favorite is SlamData, a Boulder, Colorado company that was founded earlier this year to simplify analytics on NoSQL databases. The company’s co-founders, Jeff Carr and John De Goes, developed an ANSI SQL compatible interface that makes NoSQL data available to anyone, and eliminates the need to learn new technologies or write ETL code to analyze NoSQL data. The open source software currently just supports MongoDB, but will soon support the NoSQL data stores from Apache Spark, Elasticsearch, and Cassandra.
The judges also looked at several other big data firms, who were finalists in this year’s competition, including:
- X15 Software, a provider of machine data platforms;
- Waterline Data Science, a developer of Hadoop data inventory management software;
- Pepperdata, a developer of Hadoop performance tools;
- VisualDNA, which blends psychology with big data;
- SequenceIQ, a developer of tools for making predictive applications;
- AtScale, which helps Tableau and Impala work better together;
- Plotly, an online analytics and data visualization tool;
- and E8 Security, a provider of machine learning-based security tools.
July 9, 2020
- Spectra Logic Publishes ‘Digital Data Storage Outlook 2020’
- MariaDB Announces $25M Funding Round to Scale SkySQL Operations
- Domo Updates its COVID-19 Global Tracker with National Paycheck Protection Program Data from the SBA
- Cloudian Launches Operations in Australia and New Zealand
- NHS Trusts Advance Use of Analytics to Manage Patient Infection Status, Staff Exposure During Pandemic
- cnvrg.io and NetApp Partner to Deliver MLOps Dataset Caching
- Columbia Professor Confronts Healthcare Inequality in Time of COVID-19
- Oracle Autonomous Database Now Available in Customer Data Centers
- Researchers Receive NIH Funding to Develop Data-Driven Strategies in COVID-19 Fight
- FingerMotion Launches Big Data Insurance Solution
July 8, 2020
- Circonus Announces Free 45-Day Trial of its Kubernetes Monitoring Solution
- Talend Donates Nearly $3M in Data Skills Courses, Technologies to Higher Education
- HNI Corporation Taps Ascend.io to Fuel Operational Analytics
- GridGain Announces Nebula Managed Service For Apache Ignite and GridGain In-Memory Computing Platforms
- Data Mining System Unearths US Counties Most at Risk for COVID Deaths
- Researchers Create Online Resources for Data Exploration, Visualization, and Discovery for the Pan-Cancer Project
- Looker Now Available in the Microsoft Azure Marketplace
- Exasol Launches Exasol V7
- AIMART Enhances Investigative Intelligence Portfolio with Siren Partnership
- StataCorp Contributes Data Science Software to COVID-19 Research Database for Academic and Medical Research
Most Read Features
- Big Data File Formats Demystified
- Nvidia Destroys TPCx-BB Benchmark with GPUs
- How to Build a Better Machine Learning Pipeline
- BI Tools — Are They Enough to Build a Data-Driven Culture?
- How COVID-19 Is Impacting the Market for Data Jobs
- Databricks Brings Data Science, Engineering Together with New Workspace
- Understanding Your Options for Stream Processing Frameworks
- SAS Provides Big Data Solutions for… Bees?
- Is Python Strangling R to Death?
- Databricks Cranks Delta Lake Performance, Nabs Redash for SQL Viz
- More Features…
Most Read News In Brief
- New Report Ranks Countries by COVID-19 Safety
- Spark 3.0 Brings Big SQL Speed-Up, Better Python Hooks
- IBM Brings Back a Netezza, Attacks Yellowbrick
- Blurred Lines: SAS and Microsoft To Go Deep in Analytics Partnership
- New Map Shows Hundreds of Counties in the COVID-19 Endgame — and Thousands on the Uptick
- NIH Launches Massive Initiative for COVID-19 Patient Data Analytics
- Researchers Explore Link Between American Individualism and Poor COVID-19 Response
- U.S. Special Ops Launches $600M Analytics Effort
- Data Prep Still Dominates Data Scientists’ Time, Survey Finds
- Bitnine Looks to Scale PostgreSQL
- More News In Brief…
Most Read This Just In
- HSBC Joins Data Privacy Firm Privitar’s Series C Financing Round with $7M Investment
- D2iQ Unveils KUDO for Kubeflow to Accelerate Enterprise-Grade Machine Learning on Kubernetes
- SAS Debuts Tools to Gauge Risks and Impacts of Reopening
- Databricks Introduces Delta Engine, Acquires Redash
- Cloudera Debuts its Cloudera Data Platform Private Cloud
- The Linux Foundation Cloud Engineer Bootcamp Announced
- Technology Aims to Provide Cloud Efficiency for Databases During Data-Intensive COVID-19 Pandemic
- BP Invests $5M in Geospatial Analytics Software Company Satelytics
- Alation Launches Data Governance Initiatives
- New Actian Vector for Hadoop Enables Real-time and Operational Analytics
- More This Just In…