Big Data • Big Analytics • Big Insight

Technologies » Storage

Features

Selecting the Right Database for the Right Job

Oct 6, 2014 |

The Internet and mobile devices have created tremendous opportunities to engage with consumers in new ways. At the same time, they have brought unprecedented challenges for managing the vast volumes of structured and unstructured data being pumped in from these channels. At Adform, we face those opportunities and challenges first-hand. We enable media agencies, trading desks, advertisers and publishers to easily scale and deliver programmatic display advertising, rich media, and video, across desktop and mobile devices. Offers and other content Read more…

How Baidu Uses Deep Learning to Drive Success on the Web

Sep 22, 2014 |

The Chinese Web giant Baidu is investing heavily in deep learning technologies as it seeks to drive intelligence from big data using high performance computing (HPC). From speech and facial recognition to language transaction and Web search, Baidu relies on deep learning and artificial intelligence technologies to improve a range of customer-facing applications. Dr. Ren Wu, a distinguished scientist at Baidu’s Deep Learning Institute (IDL), discussed Baidu’s use of deep learning technologies in a keynote presentation at Tabor Communication’s recent Read more…

Tape Gets Second Wind as Big Data Mounts

Sep 16, 2014 |

Think tape is dead in our big data world? Think again. This week, the National Center for Supercomputing Applications (NCSA) announced that it bought 20PB of tape capacity to expand the world’s biggest data archive. Meanwhile, the LTO Program has plotted out a roadmap for the next decade that will eventually see a single LTO tape cartridge storing 120 TB of data. While tape is looked down upon in our speed-obsessed culture, the old standby has proven its relevance in Read more…

What’s Driving the Explosion of Government Data?

Sep 11, 2014 |

Most people already know that the world produces massive amounts of data every day, and government organizations are a huge source of that data. For instance, it is estimated that next year the Department of Energy will create 300 terabytes of data per day from analyzing light sources, and by 2020, the department will create 15 petabytes of data per year analyzing high-energy physics. That’s a lot of data. But where does all this data come from? The amount of Read more…

How a Web Analytics Firm Turbo-Charged Its Hadoop ETL

Sep 10, 2014 |

The Web analytics firm comScore knows a thing or two about managing big data. With tens of billions of data points added to its 400-node Hadoop cluster every day, the company is no stranger to scalability challenges. But there’s one ETL optimization trick in particular that helped comScore save petabytes of disk and improve data processing times in the process. ComScore is one of the biggest providers of Web analytics used by publishers, advertising firms, and their clients. If you Read more…

News In Brief

EMC, Pivotal Add Compute to Hadoop Data Lake

Oct 15, 2014 |

Storage vendor EMC Corp. and cloud specialist Pivotal have partnered to roll out a new version of a Data Lake Hadoop Bundle that adds a compute option to the big data product along with accelerated analytics that come with scaled out storage and computing along with analytics software. As data lakes gain momentum as scalable repositories for data generated from current and advanced workloads, EMC and Pivotal are positioning Data Lake Hadoop Bundle 2.0 as a tool for plumbing the Read more…

Zoomdata Raises $17M

Oct 7, 2014 |

Venture capitalists remain bullish on data analytics startups. Stream processing engine specialist Zoomdata Inc. said Oct. 6 it has raised $17 million in a Series B funding round led by Accel Partners. Zoomdata, Reston, Va., said it would use the funds to accelerate product development while expanding marketing and sales efforts. The startup has come up with a way to use a stream processing engine to analyze historical as well as real-time data. It combines open source stream processing engines Read more…

Peaxy Launches Scalable Data Manager

Sep 30, 2014 |

Data management specialist Peaxy Inc. (pronounced “peak-see”) is targeting the enormous growth of unstructured data in industrial sectors with its Hyperfiler system designed to harness disparate data through controlled access and a “consistent data path.” San Jose-based Peaxy said this week its data management tool targets engineers and analysts who must search for datasets across geographic locations, platforms and storage devices. The unstructured data is then pieced together to aid, for example, the product lifecycle spanning design and simulation. By Read more…

Data Tools Help Researchers Tackle Pediatric Cancer

Sep 10, 2014 |

A genomic data analysis platform being installed at the National Cancer Institute will deliver rapid analysis of billions of data points required when sequencing human DNA and other genomic data. Translational Genomics Research (TGen) said Sept. 10 it is providing the National Cancer Institute with high performance computing and bioinformatics support along with specialized tools designed to support pediatric cancer research programs. Those efforts include personalized medicine trial for pediatric cancer patients being conducted by the Neuroblastoma and Medulloblastoma Translational Read more…

MapR Reports Accelerated OpenTSDB Performance

Sep 9, 2014 |

Eyeing new Internet of Things (IoT) applications, MapR Technologies said its open-source distribution of Apache Hadoop “ingested” more than 100 million data points per second. The performance benchmark for the MapR distribution with its in-Hadoop NoSQL database, MapR-DB, was achieved using only four nodes of a ten-node cluster. By accelerating its OpenTSDB software by a factor of 1,000 on a small cluster, MapR claimed the performance clears the way for managing huge amounts of data along with IoT and other Read more…

This Just In

Attunity Replicate 4.0 Introduced

Oct 20, 2014 |

Oct. 20 — Attunity Ltd., a leading provider of information availability software solutions, introduced today its new certified data integration solution for the Teradata Appliance for Hadoop from Teradata, the big data analytics and marketing applications company. With this rollout, Attunity Replicate has extended its capability to automate data loading and replication to and from Hadoop to the Teradata Database and Teradata Aster Database within the Teradata Unified Data Architecture (UDA), easily and efficiently. The solution, tested and certified with the Teradata Appliance for Hadoop, will Read more…

Advancements Made to Teradata Database

Oct 20, 2014 |

NASHVILLE, Tenn., Oct. 20 – Teradata Corp., the big data analytics and marketing applications company, today announced engineering advancements to the Teradata Database that deliver analytic performance and system efficiency through new memory and CPU optimizations. These enhancements strengthen Teradata’s approach to in-memory computing and enable customers to seamlessly and automatically realize the greatest benefit from their investment in memory. “Teradata is relentlessly dedicated to engineering a smarter, simpler way to leverage memory and CPU to drive performance,” said Scott Gnau, president, Teradata Labs. Read more…

Tresata Analytics Platform 4.0 Released

Oct 17, 2014 |

NEW YORK, N.Y., Oct. 17 — Tresata Inc., a provider of Hadoop-powered predictive analytics software, announced the release of Tresata Analytics Platform 4.0, the latest upgrade to its class-leading customer intelligence management software. This release delivers several category-defining features – real-time execution of analytical processes using Spark, intuitive data scientist driven user interface, rapid integration API and enhanced intelligence discovery and delivery capabilities – while retaining its core advantage of having been architected to run entirely in Hadoop. “We are really Read more…

Actian Express – Hadoop SQL Edition Released

Oct 16, 2014 |

NEW YORK, N.Y., Oct. 16 — Actian Corporation (“Actian”), the Hadoop analytics company, has launched the Actian Analytics Platform – Express Hadoop SQL Edition, a free community version of the industry’s first end-to-end analytics platform running 100 percent inside of Hadoop. With no limits on the number of Hadoop nodes, and data up to 500GB, Actian Express – Hadoop SQL Edition supercharges Hadoop adoption and accelerates time-to-value for organizations that have been struggling to get value from their Hadoop investments. Hadoop has Read more…

Waterline Data Science Joins MapR Advantage Partner Program

Oct 16, 2014 |

NEW YORK, N.Y., Oct. 16 — Waterline Data Science today announced at Strata + Hadoop World New York that it has joined the MapR Advantage Partner Program. Waterline Data Science will integrate the MapR Distribution including Apache Hadoop with Waterline Data Inventory to enable data self-service on Hadoop, allowing users to find, understand, and help govern Hadoop data. Oliver Claude, Waterline Data Science CMO, states, “We’re pleased to have Anoop Dawar, senior director, product management, MapR, join our Advisory Board to help steer the partnership Read more…