April 16, 2013

Finding SGI’s Needle with UV Big Brain

Ian Armas Foster

Two intuitively simple processes, searching and comparing, require two quite different skill sets when applied to large datasets. By literally moving a haystack around, SGI CTO Eng Lim Goh demonstrated the capabilities of the company’s UV Big Brain to accomplish both tasks. “The UV Brain gives you a massive coherent shared memory in order to carry a huge haystack and it has many processes around that big memory to allow you to do parallel processing of that comparison,” Goh said.

Searching for a needle in a haystack is a common idiom denoting the search for a significant thing amidst a swath of insignificance. As such, it is not surprising to see companies like SGI use said idiom when discussing the UV Big Brain computer.

It is slightly more surprising to see a literal presentation of hay being moved around to represent Hadoop nodes. As Goh noted to begin his demonstration, “In a Hadoop cluster, you basically start with a huge haystack. Then you divide and conquer, as follows.”

This approach, as Goh explained, is preferable when dealing with a situation where the nodes do not have to interact with each other. A search for a unique term, for example, fits the description, as the information related to that query is not dependent on knowing all of the information.

“You split the haystack up into multiple smaller haystacks. Each of these smaller haystacks is on a node in a Hadoop cluster…At each node level, you don’t need to talk to your neighbor.” Therefore, the workload can be split among the nodes and run in parallel. “Concurrently, each of these nodes are doing the same thing,” Goh said. This splitting and parallelization was represented by the simple splitting of the actual hay. It should be noted that while Goh did not appear to find the needle, his processing time was somewhat limited.

Either way, such an approach would not work as well for comparison queries.

As Goh held up one piece of hay against another and noted the height difference, he noted that such a process could be split up among different nodes in the cluster and the same amount of operations would happen. However, instead of those operations happening in an isolated environment, they interact across the network, putting strain on the connections and potentially creating bottlenecks.

“Every time you reach out, you are stressing that Hadoop network,” Goh explained. This does not stop a machine like UV Big Brain from making such queries, of course. Instead, the system reportedly adjusts so all information can be contained in one node, lessening or in some cases eliminating the network strain.

Related Articles

What Can Enterprises Learn From Genome Sequencing?

SGI Spreads Strategic Wings with DataRaptor

SGI Plants Big Data Seeds in HPC

Applications: Data Mining, Research Analytics

Technologies: Network, Systems

Vendors: SGI

Tags: SGI, UV Big Brain

Only registered users may comment. Register using the form below.

Check off newsletters you would like to receive*
- HPCwire
- EnterpriseTech
- Datanami
- Technology Conferences & Events
- Advanced Computing Job Bank
- Technology Product Showcase
Email*
Name*
First Last
Organization*
Job Function*
Industry*
Country*
City*
State*
Province*
- Please check here to receive valuable email offers from Datanami on behalf of our select partners.

Finding SGI’s Needle with UV Big Brain

Join the discussion Cancel reply

Only registered users may comment. Register using the form below.

April 18, 2024

April 17, 2024

April 16, 2024

Sponsored Partner Content

Get your Data AI Ready – Celebrate One Year of Deep Dish Data Virtual Series!

Supercharge Your Data Lake with Spark 3.3

Learn How to Build a Custom Chatbot Using a RAG Workflow in Minutes [Hands-on Demo]

Overcome ETL Bottlenecks with Metadata-driven Integration for the AI Era [Free Guide]

Gartner® Hype Cycle™ for Analytics and Business Intelligence 2023

The Art of Mastering Data Quality for AI and Analytics

Leading Solution Providers

Tabor Network

Sponsored Whitepapers

Building an Operational Data Warehouse for Real-time Analytics

Can You Use Kafka as a Database?

Sponsored Multimedia

The Power of DataOps: Bring Automation to Life
No Comments

Tactical Steps for Cloud Migration
No Comments

Immuta Data Access Platform
No Comments

Data Mesh: Fact or Fiction?
No Comments

Contributors

Featured Events

Call & Contact Center Expo

AI & Big Data Expo North America 2024

AI Hardware & Edge AI Summit 2024

CDAO Government 2024

Finding SGI’s Needle with UV Big Brain

Join the discussion Cancel reply

Only registered users may comment. Register using the form below.

April 18, 2024

April 17, 2024

April 16, 2024

Most Read Features

Most Read News In Brief

Most Read This Just In

Sponsored Partner Content

Leading Solution Providers

Tabor Network

Sponsored Whitepapers

Sponsored Multimedia

Contributors

Featured Events

Share

Copy short link