October 8, 2015

Health Care Emerges as Hadoop Use Case

George Leopold

Unstructured data is estimated to account for nearly 80 percent of health care information, a hodge-podge of physicians’ notes, sensor data from medical devices, lab results, X-ray and MRI images along with clinical and financial data. Add to the mix several layers of patient privacy and medical IT regulations and you get a use case ripe for the application of big data analytics.

MapR Technologies is among the Hadoop specialists eying the big data market for health care estimated by market researchers like McKinsey & Co. to be worth as much as $450 million in health care cost savings. Using tools like Hadoop to gain access to unstructured medical data that is growing exponentially is seen as one way to increase efficiency and cut costs while improving patient care.

MapR’s and other distributions of Apache Hadoop are being promoted as among the tools needed to capture patient information as medicine moves to an “evidence-based” approach that collects clinical data and feeds it into clinical and other analytics platforms. Among the advertised outcomes are earlier detection and diagnoses of diseases along with more effective treatment based on information like a patient’s genetic makeup. Clinical analytics also can be used to adjust drug dosages to limit side effects, improve effectiveness and reduce costs for expensive drugs.

For its part, MapR is citing a list of health care and life sciences use cases as ideal for big data technology in general and, specifically, for its distribution of Hadoop. A prime example is genome processing and DNA sequencing in which current big data architectures leverage high-performance computing and either storage area networks or network attached storage.

MapR, San Jose, Calif., argues that these architectures are facing networking bottlenecks, slowing the distributed sorting of medical data. Hence, MapR and others have begun converging storage and computing on a single platform as a way to reduce the cost of storing large volume of sequencing data. Meanwhile, its Hadoop distribution aims to provide real-time access to clinical data stores.

In another use case, MapR said an unidentified health care organization that collects petabytes of treatment and claims data is using its Hadoop distribution to create a new data repository service for it enterprise customers. Customers could use the repository to run an expanded set of analytics on their own data.

MapR promotes its architecture as meeting multi-tenancy requirements through a volume-based isolated environment that also provides secure access for end users.

Combining unstructured data from multiple sources is also being used to improve diagnostic accuracy. So-called “assisted diagnosis” combines medical expert systems with individual data sets and big data algorithms. MapR’s Hadoop distribution allows predictive modeling and machine learning to crunch large sample sizes to help pin down a diagnosis, the company said.

Recent items:

Health Alliance Looks to Leverage, Secure Patient Data

NIH Effort Looks to Compress Big Genomics Data

Applications: Predictive Analytics, Research Analytics

Technologies: Storage

Sectors: Biosciences, Healthcare

Tags: apache hadoop, Big Data Storage, health analytics, mapr

Only registered users may comment. Register using the form below.

Check off newsletters you would like to receive*
- HPCwire
- EnterpriseTech
- Datanami
- Technology Conferences & Events
- Advanced Computing Job Bank
- Technology Product Showcase
Email*
Name*
First Last
Organization*
Job Function*
Industry*
Country*
City*
State*
Province*
- Please check here to receive valuable email offers from Datanami on behalf of our select partners.

Health Care Emerges as Hadoop Use Case

Join the discussion Cancel reply

Only registered users may comment. Register using the form below.

April 17, 2024

April 16, 2024

April 15, 2024

Sponsored Partner Content

Get your Data AI Ready – Celebrate One Year of Deep Dish Data Virtual Series!

Supercharge Your Data Lake with Spark 3.3

Learn How to Build a Custom Chatbot Using a RAG Workflow in Minutes [Hands-on Demo]

Overcome ETL Bottlenecks with Metadata-driven Integration for the AI Era [Free Guide]

Gartner® Hype Cycle™ for Analytics and Business Intelligence 2023

The Art of Mastering Data Quality for AI and Analytics

Leading Solution Providers

Tabor Network

Sponsored Whitepapers

Building an Operational Data Warehouse for Real-time Analytics

Can You Use Kafka as a Database?

Sponsored Multimedia

The Power of DataOps: Bring Automation to Life
No Comments

Tactical Steps for Cloud Migration
No Comments

Immuta Data Access Platform
No Comments

Data Mesh: Fact or Fiction?
No Comments

Contributors

Featured Events

Call & Contact Center Expo

AI & Big Data Expo North America 2024

AI Hardware & Edge AI Summit 2024

CDAO Government 2024

Health Care Emerges as Hadoop Use Case

Join the discussion Cancel reply

Only registered users may comment. Register using the form below.

April 17, 2024

April 16, 2024

April 15, 2024

Most Read Features

Most Read News In Brief

Most Read This Just In

Sponsored Partner Content

Leading Solution Providers

Tabor Network

Sponsored Whitepapers

Sponsored Multimedia

Contributors

Featured Events

Share

Copy short link