July 6, 2017

NEC Claims Vector CPU Outperforms Spark

George Leopold

An arms race is shaping up in the machine-learning sector with the claim by NEC Corp. that its approach based on its vector processor accelerates data processing by more than a factor of 50 compared to the Apache Spark cluster-computing framework.

The Japanese computer and IT vendor (NEC; TSE: 6701) said this week its vector processor leverages “sparse matrix” data structures to accelerate processor performance in executing machine learning algorithms. NEC said it also developed middleware incorporating sparse matrix structures to simplify machine-learning tasks. The middleware can be launched from Python or Spark platforms in the same format as the machine-learning library, it added.

The vector processing approach would extend “low-cost analysis” to more users since it reduces the number of servers required for “large-scale data analysis that was formerly only available to large companies,” Yuichi Nakamura, general manager of NEC’s System Platform Research Laboratories, noted in a statement.

NEC vector processing card

NEC launched development of its vector processor in 2014. Among the applications targeted at the time was allowing individual researchers to access datacenters processing large amounts of data. Along with large data sets, the vector technology also targets image analysis, the company said.

The vector processing approach is designed to accelerate sparse matrix computing in which most of the elements in a matrix have a value of zero. Sparse matrices use different data analysis and storage protocols depending on the application.

NEC said it developed a hybrid format for using sparse matrices in which processing is executed by column or row depending on number of non-zero elements. “This enables the high-speed processing of machine learning without decreasing the processing efficiency of vector computers,” it claimed.

The data processing framework also reduced “communication volume” require for machine learning through a data compression technique. For machine learning, the results of data processing are updated depending on the values in various columns. Using the sparse matrix approach, columns with no values are not updated.

“By removing these non-updated portions from communication, the reduction of communication volume is achieved,” NEC said.

The accompanying middleware that runs on Python or Spark was implemented in C++ and MPI (Message Passing Interface). As with Spark, NEC’s middleware was designed to run across multiple processors.

As for its performance claims, NEC said it compared a use case in which Spark was executed in a cluster of servers while its middleware ran on the company’s SX-ACE vector computer. The 50-fold increase in data processing performance was based on both platforms running 64 cores.

NEC unveiled its vector-processing framework for machine learning during this week’s International Symposium on Parallel and Distributed Computing in Innsbruck, Austria.

In June, the Japanese company announced it was investing $10 million in an Indian data analytics center that will focus on growing demand in the region for Hadoop processing and storage.

Recent items:

Kinetica Gets Fuzzy With In-GPU Algorithms

Fujitsu Adding Column-Oriented Processing Engine to PostgreSQL

Applications: Artificial Intelligence, Enterprise Analytics

Technologies: Frameworks, Processors, Storage

Sectors: Academia, Biosciences, Financial Services, Healthcare, Manufacturing, Other, Retail

Vendors: NEC

Tags: apache spark, machine learning, python, sparse matrix, vector processor

Only registered users may comment. Register using the form below.

Check off newsletters you would like to receive*
- HPCwire
- EnterpriseTech
- Datanami
- Technology Conferences & Events
- Advanced Computing Job Bank
- Technology Product Showcase
Email*
Name*
First Last
Organization*
Job Function*
Industry*
Country*
City*
State*
Province*
- Please check here to receive valuable email offers from Datanami on behalf of our select partners.

NEC Claims Vector CPU Outperforms Spark

Join the discussion Cancel reply

Only registered users may comment. Register using the form below.

April 25, 2024

April 24, 2024

April 23, 2024

Sponsored Partner Content

Get your Data AI Ready – Celebrate One Year of Deep Dish Data Virtual Series!

Supercharge Your Data Lake with Spark 3.3

Learn How to Build a Custom Chatbot Using a RAG Workflow in Minutes [Hands-on Demo]

Overcome ETL Bottlenecks with Metadata-driven Integration for the AI Era [Free Guide]

Gartner® Hype Cycle™ for Analytics and Business Intelligence 2023

The Art of Mastering Data Quality for AI and Analytics

Leading Solution Providers

Tabor Network

Sponsored Whitepapers

Top 6 Strategies for Reducing Data Warehouse Costs

Building an Operational Data Warehouse for Real-time Analytics

Sponsored Multimedia

The Power of DataOps: Bring Automation to Life
No Comments

Tactical Steps for Cloud Migration
No Comments

Immuta Data Access Platform
No Comments

Data Mesh: Fact or Fiction?
No Comments

Contributors

Featured Events

AI & Big Data Expo North America 2024

CDAO Canada Public Sector 2024

AI Hardware & Edge AI Summit Europe

AI Hardware & Edge AI Summit 2024

CDAO Government 2024

NEC Claims Vector CPU Outperforms Spark

Join the discussion Cancel reply

Only registered users may comment. Register using the form below.

April 25, 2024

April 24, 2024

April 23, 2024

Most Read Features

Most Read News In Brief

Most Read This Just In

Sponsored Partner Content

Leading Solution Providers

Tabor Network

Sponsored Whitepapers

Sponsored Multimedia

Contributors

Featured Events

Share

Copy short link