January 10, 2013

Testing Cisco’s Unified Fabric

Ian Armas Foster

Networking big data can be a hassle, especially when that data is spread out over several data centers separated by some distance.

Yong-Hee Jeon of the Catholic University of Daegu in South Korea examined the efficiency of the Cisco Unified Fabric and its attempt to ease networking in managing big data applications. Specifically, he ran business intelligence and ETL tasks using Hadoop MapReduce over the Cisco Fabric to determine its runtime performance.

To assess the versatility of the Cisco network, it was important to test both BI and ETL functions. Both would take in about a Terabyte of data, but the BI would be expected to analyze it and output only a Megabyte while ETL is expected to convert the one TB into workable formats such that it can be used later by BI applications.

According to the Internet Research Group, network considerations need to be taken into account before Hadoop clusters are installed. Several companies are finding themselves buying those clusters without much of a thought as to how set up the network that will govern it, leading to undesirable results. “The scalability and usability of a Hadoop cluster may be damaged without understanding the role of WAN in the application of enterprise Hadoop,” Jeon said.

Cisco introduced their fabric as a means to enhance companies’ ability to manage their big data. The idea is to minimize I/O and computing bottlenecks by moving the computing itself to the data, a principle that has taken hold in the industry in the last couple of years. In this way, large files can be split up and spread across the fabric. A diagram of the fabric and how it relates to I/O is shown below.

“To efficiently process massive amounts of data, it was noted that it is important to move computing to where the data is using a distributed file system, rather than a central system for data,” Jeon said about Cisco’s purpose for the fabric. “Cisco proposes that a single large file is split into blocks, and the blocks are distributed among the nodes of the Hadoop cluster.”

When Jeon ran his BI and ETL tests on the fabric, he noted that the traffic—the aforementioned bottlenecks that slow networking down and delay runtime—was minimal over BI operations where a lot of data had to be analyzed and output. With ETL functions, spikes occurred before the Hadoop Reducers took effect, lending credence to a slight I/O bottleneck.

The main issue, however, took place during an ETL-related “reduce-shuffle phase.” As Jeon puts it, “it is shown that there is a significant amount of traffic because the entire data set needs to be shuffled across the network. The spikes are made up of many short-lived flows from all the nodes in the job and can potentially create temporary burst trigger short-lived buffer and I/O congestion.”

The good news is that, according to Jeon, these bursts do not last long. However, Jeon left out discrete numbers in his report, meaning the true effectiveness of the fabric is perhaps difficult to quantify.

Related Articles

GE Plugs Second Internet

IEEE Tags Top Tech Trends of 2013

Cisco Works with Oracle, Hadoop ahead of Hadoop World

Applications: Research Analytics

Technologies: Network

Tags: Cisco, Hadoop, Networking

Only registered users may comment. Register using the form below.

Check off newsletters you would like to receive*
- HPCwire
- EnterpriseTech
- Datanami
- Technology Conferences & Events
- Advanced Computing Job Bank
- Technology Product Showcase
Email*
Name*
First Last
Organization*
Job Function*
Industry*
Country*
City*
State*
Province*
- Please check here to receive valuable email offers from Datanami on behalf of our select partners.

Testing Cisco’s Unified Fabric

Join the discussion Cancel reply

Only registered users may comment. Register using the form below.

April 26, 2024

April 25, 2024

April 24, 2024

Sponsored Partner Content

Get your Data AI Ready – Celebrate One Year of Deep Dish Data Virtual Series!

Supercharge Your Data Lake with Spark 3.3

Learn How to Build a Custom Chatbot Using a RAG Workflow in Minutes [Hands-on Demo]

Overcome ETL Bottlenecks with Metadata-driven Integration for the AI Era [Free Guide]

Gartner® Hype Cycle™ for Analytics and Business Intelligence 2023

The Art of Mastering Data Quality for AI and Analytics

Leading Solution Providers

Tabor Network

Sponsored Whitepapers

Top 6 Strategies for Reducing Data Warehouse Costs

Building an Operational Data Warehouse for Real-time Analytics

Sponsored Multimedia

The Power of DataOps: Bring Automation to Life
No Comments

Tactical Steps for Cloud Migration
No Comments

Immuta Data Access Platform
No Comments

Data Mesh: Fact or Fiction?
No Comments

Contributors

Featured Events

AI & Big Data Expo North America 2024

CDAO Canada Public Sector 2024

AI Hardware & Edge AI Summit Europe

AI Hardware & Edge AI Summit 2024

CDAO Government 2024

Testing Cisco’s Unified Fabric

Join the discussion Cancel reply

Only registered users may comment. Register using the form below.

April 26, 2024

April 25, 2024

April 24, 2024

Most Read Features

Most Read News In Brief

Most Read This Just In

Sponsored Partner Content

Leading Solution Providers

Tabor Network

Sponsored Whitepapers

Sponsored Multimedia

Contributors

Featured Events

Share

Copy short link