Bigstep Adds Spark Service to Bare-Metal Cloud
A Spark-based analytics engine released last week runs on a low-latency, software-defined bare metal fabric and can scale Spark clusters and the IT infrastructure that supports them.
Bigstep, which has dual headquarters in London and Chicago, said its real-time Spark service is designed to speed deployment of real-time data streaming applications as more Spark implementations shift to the cloud. Among the emerging uses for such implementations are the Internet of Things and algorithm decision-making, Bigstep Founder and CEO Lucas Roh noted in a statement.
Roh added that Bigstep’s bare metal cloud platform would help reduce development requirements for its real-time Spark service. The company also said it would offer a “pay-per-use” container-based Spark cluster optimized for real-time streaming applications that use multiple concurrent Spark contexts.
A rapid prototyping feature includes a built-in Jupyter interface for Scala, Python or R programming languages. Jupyter Notebook is designed to allow data scientists to combine code, graphs, dashboards and descriptive texts within the same document while performing operations interactively.
Meanwhile, real-time data streaming applications run in parallel with other container application clusters.
The company added that its bare-metal fabric could run multiple Spark versions, including Spark 2.0, using the same pool of resources.
The introduction of the Spark platform follows the company’s release in early October of a real-time application container service designed specifically for streaming applications. The company said it container service also targets emerging infrastructure based on micro-services and memory-intensive workloads requiring low latency and higher performance.
The container service is based on Docker and can run distributed streaming applications on the company’s bare metal cloud. Those applications can be built on Spark Streaming, Apache Flink or the Heron streaming replacement for Apache Storm, the company said.
Persistent storage requirements have recently overtaken security as the top barrier to adoption of application containers in production. Hence, Bigstep stressed its real-time container service offers high-end persistent support, including storage volumes that follow containers as they move across clusters.
Meanwhile, the company said customers could deploy its new Spark service alongside either Zoomdata and Bigstep data lakes, or use it with applications deployed on-premises or on its managed container service.
July 7, 2020
- Dynatrace Expands AI-Powered Observability for Kubernetes Environments
- AtScale Expands COVID-19 Insights Model to Help Global Organizations Understand the New Normal
- Qubole Expands Integration with Informatica’s AI-driven Data Engineering Integration
- Virtual Esri User Conference to Explore How GIS Interconnects the World
- Dremio to Host Cloud Data Lake Conference
July 6, 2020
- Data Science Team at Columbia to Enhance Probabilistic Programming
- Exasol Announces Partnership with TEKsystems Global Services
- Syniti Addresses Growing Demand with Geographic Expansion of Europe, Africa to include Middle East Team
July 2, 2020
- Anaconda Releases 2020 State of Data Science Survey Results
- Big Data Analytics Among Top Three Deployment Priorities for Enterprises, Says Frost & Sullivan
- Informatica Acquires Compact Solutions
- Confluent Announces Infinite Retention for Apache Kafka in Confluent Cloud
- BP Invests $5M in Geospatial Analytics Software Company Satelytics
- Data Visualization Gets Artificial Intelligence Boost with $5M NSF Grant
- LSU CS Professor Studies COVID-19 Disparities on Social Media
July 1, 2020
- OmniSci Powers New Website Enabling Public to View House-by-House Information On Flint Water Crisis
- Aerospike Adds New Partners to Meet Growing Demand in APAC Region
- Informatica, The ADAPT Research Centre Collaborate to Accelerate AI Research, Development
- Huawei’s Data Virtualization Engine openLooKeng Goes Open Source
- Zoic Labs Creates Interactive Data Visualization Tool, Connecting Scientists with COVID-19 Research Data
Most Read Features
- Big Data File Formats Demystified
- Nvidia Destroys TPCx-BB Benchmark with GPUs
- How to Build a Better Machine Learning Pipeline
- BI Tools — Are They Enough to Build a Data-Driven Culture?
- How COVID-19 Is Impacting the Market for Data Jobs
- Databricks Brings Data Science, Engineering Together with New Workspace
- Understanding Your Options for Stream Processing Frameworks
- SAS Provides Big Data Solutions for… Bees?
- MongoDB Steps Up Game with MongoDB Cloud
- Couchbase Nabs $105M as it Readies Cloud Offering
- More Features…
Most Read News In Brief
- New Report Ranks Countries by COVID-19 Safety
- Spark 3.0 Brings Big SQL Speed-Up, Better Python Hooks
- IBM Brings Back a Netezza, Attacks Yellowbrick
- New Map Shows Hundreds of Counties in the COVID-19 Endgame — and Thousands on the Uptick
- Blurred Lines: SAS and Microsoft To Go Deep in Analytics Partnership
- U.S. Special Ops Launches $600M Analytics Effort
- NIH Launches Massive Initiative for COVID-19 Patient Data Analytics
- War Unfolding for Control of Elasticsearch
- AWS Upgrades SageMaker Labeling Tool
- Nebula Graph Joins Database Race
- More News In Brief…
Most Read This Just In
- HSBC Joins Data Privacy Firm Privitar’s Series C Financing Round with $7M Investment
- D2iQ Unveils KUDO for Kubeflow to Accelerate Enterprise-Grade Machine Learning on Kubernetes
- SAS Debuts Tools to Gauge Risks and Impacts of Reopening
- The Linux Foundation Cloud Engineer Bootcamp Announced
- Databricks Introduces Delta Engine, Acquires Redash
- Cloudera Debuts its Cloudera Data Platform Private Cloud
- Technology Aims to Provide Cloud Efficiency for Databases During Data-Intensive COVID-19 Pandemic
- Alation Launches Data Governance Initiatives
- New Actian Vector for Hadoop Enables Real-time and Operational Analytics
- MariaDB Announces the General Availability of MariaDB Community Server 10.5
- More This Just In…