March 4, 2013

Big IO and the Real Time Future

Isaac Lopez

No stranger to massive data in the HPC context, Cray points to one area that it’s had its paws in for a number of years. Climate data has always presented processing and movement challenges, and according to the supercomputing company, the data sets used for international climate projects are “almost 70 times larger today than they were less than 20 years ago.” This constant expansion is creating storage, archival and sharing challenges for the data collectors.

Cray refers to this as “Big IO,” and believes that the work being done now at the petascale level will have lasting impacts in science and business, especially where real time data collection and processing is a requirement. Cray says it is particularly proud of its work with the Korea Meteorological Administration (KMA).

“Big data” is nothing new to the KMA, which has been using supercomputing tools and sensor data for decades to do everything from weather modeling and climate prediction to air pollution monitoring and earthquake and marine meteorology. In 2005, Cray and the KMA joined forces to launch the Earth Systems Research Center (ESRC), aimed at maximizing the utility of advanced high performance computing facilities. Since its inception, the partnership says it has been involved in funding 23 projects with the Korean academic community in such areas as severe weather and high resolution global modeling.

“Each operating [weather prediction and climate modeling] center usually has large sets of data (terabytes and petabytes) that need to move fast, both in terms of IO and throughput – and reliability,” says Cray in discussing its ideas on “Big IO.” Cray says that Big IO solutions are required wherever real time analysis is the goal.

“These efforts are increasingly necessary as the global climate changes and many government bodies begin to search for ways to adapt to the changing weather conditions,” says Cray. Real time weather modeling is both data and processor intensive, with anywhere from terabytes to petabytes of data having to move in and out of the processor quickly in order to create the model.

But many enterprises jumping into big data today don’t really need the intense and persistent processing power of a weather modeler, right? That view may be short sighted, suggest some data scientists. We may soon see a day when enterprises model their own business climates, with machines making automated decisions based on the data.

In the meantime, Cray has jumped on the Hadoop bandwagon, announcing that its Xtreme line of supercomputers will include the Intel distribution for Apache Hadoop software, joining the company’s other big data solutions, including the Cray Sonexion storage system and YarcData’s uRiKa appliance for graph analytics.

Related Articles

Intel Hitches Xeon to Hadoop Wagon

Cray Grabs Intel Hadoop Distribution for Xtreme Supers

Cray Big Data Arm Reaches Out to W3C to Push SPARQL, RDF Standards

Applications: Complex Event Processing

Technologies: Processors, Storage, Systems

Sectors: Other

Vendors: Cray

