Datameer’s Tool Seeks to Boost Data Lake Adoption
As big data sets keep getting bigger, vendors are pitching different approaches for exploring them faster and at scale, with the goal of delivering real-time analytics.
Among the frameworks being unveiled at this week’s Strata conference in New York City is an “interactive visual exploration” tool from analytics software vendor Datameer. The company, which downplays the need for hiring big data science teams, claims its platform can help unify data ingestion and preparation by adding visual exploration of data lakes while leveraging Hadoop.
San Francisco-based Datameer said Tuesday (Sept. 26) that its visualization tool, based on its proprietary dynamic indexing technology, extends its Visual Explorer platform beyond integration and data preparation to include what it claims is “unrestrained interactive visual exploration of extremely large data sets.”
Datameer CEO Christian Rodatus said the company’s approach extends beyond earlier attempts to remake the enterprise data warehouse stack via SQL or online analytical processing. Those approaches “required many hops between different tools, duplicating data and creating latency, which limited data exploration,” Rodatus argued.
Datameer’s interactive visual approach addresses a “common pain point,” he added: the current need to switch between tools used to prepare and curate data. The lack of an interactive exploration tool has created a gap between business analysts and data lakes. The addition of its Hadoop-native visual application is designed to eliminate duplication and other processing steps to accelerate the process of generating data sets that are ready to analyze, the company argues.
Stefan Groschupf, Datameer’s founder, has also argued that the rush to hire teams of data scientists is often unnecessary. “More and more companies are understanding that there are very specific use case where data science teams are incredibly valuable, but not everything needs a data scientist,” he told Datanami in July.
Hence, the company said its new tool is designed to bridge the “last mile” functionality gap between data preparation and analysis.
In addition, the company said data engineers and business analysts could use a familiar spreadsheet-like interface to analyze data, generating charts and graphs that visualize an entire data set. Those features, the company asserts, provide an “ad-hoc” analytics capability that would “ultimately foster greater adoption of the data lake.”
Emphasizing speed, the Datameer framework is designed to overcome the latency between the availability of raw data and its preparation for analysis. While data engineers focus on ingestion and preparation, analysts can build on the resulting data pipelines, the company claims. The platform is designed to then convert data analysis into “reusable” data pipelines.
Datameer said its Visual Explorer is available now in a private beta version for select customers. The company will provide live demonstrations during the Strata Data Conference.