Technologies » Systems

Features

AMPLab’s Tachyon Promises to Solidify In-Memory Analytics

Aug 14, 2014 |

U.C Berkeley’s AMPLab first landed on the radar screens of data scientists with Apache Spark, which promises to provide an in-memory data processing framework to replace or augment MapReduce. More recently, the tech wizzes at AMPLab have whipped up Tachyon, a new distributed file system that sits atop HDFS and aims to allow multiple Hadoop or Spark applications and jobs to access the same data at memory speeds without fears of corrupting it. The rapid rise of Apache Spark demonstrates Read more…

3D Visualizer Goes from Cybersecurity to Open Source

Aug 4, 2014 |

A 3D visualization tool developed by the network security firm OpenDNS to identify malicious websites and domains will soon be available to anybody via open source license. While OpenGraphiti was developed to solve problems in the security realm, the GPU-powered software can be used to visualize large datasets for any use case. OpenDNS security researcher Thibault Reuille started developing OpenGraphiti about a year ago to help the firm ferret out malicious websites and domains, and identify the cybercriminals controlling them. Read more…

Now and Then: The Evolution of In-Memory Computing

Jul 29, 2014 |

The history of data warehousing, big data, and analytics can be described as a constant challenge to process and analyze ever-increasing volumes of data in shorter amounts of time. Fundamentally, the single biggest factor that affects our ability to process data is the speed at which we can access it, and then do something with it. Data warehousing architectures have addressed this over the years by adopting massively parallel processing (MPP) designs, but the lower latency of in-memory computing combined Read more…

Apache Spark Gets YARN Approval from Hortonworks

Jun 26, 2014 |

Hortonworks today announced that Apache Spark is certified to work with YARN, the quarterback calling plays in next-gen Hadoop v2 clusters. The YARN stamp of approval clears the way for Hortonworks to fully support Spark for machine learning analytic workloads in a new release of HDP later this year. The prospect of running the in-memory Spark framework on Hadoop has garnered a lot of interest over the past year and a half, in particular as a faster and easier-to-program replacement Read more…

How T-Mobile Got More from Hadoop

Jun 24, 2014 |

When T-Mobile’s engineering department filled up a pair of 1 petabyte Netezza appliances from IBM, the cell phone carrier naturally turned to Hadoop. But when Hive and Pig turned out to be less than ideal query mechanisms on a cluster with half a trillion rows, the company searched for a suitable replacement, which it found from RainStor. T-Mobile had what you would call a big problem with its data warehouse; namely it was growing too fast. The cell phone company Read more…

News In Brief

Selfies Spawn Photo Analytics

Aug 21, 2014 |

An estimated 1.8 billion photos are shared on social media each day. Current “social listening tools” can’t analyze all this data. Enter Ditto Labs Inc., which touts its proprietary “visual recognition engine” that locates brand names in social media posts, then turns the information over to marketing geniuses. It’s the brave new world of “photo analytics,” according to Ditto, that uses its recognition engine to scan photos for corporate logos that might hint at how consumers “interact” with products and Read more…

From Data Wrangling to Data Harmony

Aug 18, 2014 |

More and better automation tools such as machine-learning technologies are needed to free data scientists from mundane “data-wrangling” chores. Those tools would allow scientists to focus on gleaning insights from prepared data, a range of experts told the New York Times in a recent survey of the state of big data. The newspaper reported that data scientists spend from 50 percent to 80 percent of their time organizing data, or “data janitor work,” before they could begin sifting through it Read more…

Startup Adatao Raises $13M for Big Data 2.0 Platform

Aug 13, 2014 |

Big data startup Adatao Inc. recently announced a $13 million funding round along with additions to its board of directors as it seeks to expand the customer base for big data analytics. Adatao [ah-DAY-tao], Sunnyvale, Calif., was founded in 2012 by former Google and Yahoo engineers. It is betting on Apache Spark as the best platform for deploying a simplified user interface. The interface is said to resemble Google Docs. The Series A funding round was led by Andreessen Horowitz, Read more…

Analytics Drives Tesla Customer Loyalty

Aug 8, 2014 |

As the auto industry is transformed by upstarts like Tesla Motors, analytics is being applied to bring customers and car makers closer together. The Tesla S twin-engine electric sedan is a case in point, having been variously described by one observer of auto technology as an “iPad on wheels” and “highly modular,” a reference to the amount of computing power in the mid-range electric car. As the technology pundit Rob Enderle notes, “Tesla often knows about a problem before the Read more…

This Just In

Nexenta Launches NexentaEdge 1.0

Aug 19, 2014 |

SANTA CLARA, Calif., Aug. 19 – Nexenta, the global leader of Software-Defined Storage (SDS) solutions, today announced the launch of NexentaEdge 1.0 – the industry’s first software-only scale out block and object storage solution designed to deliver high performance global inline deduplication on petabyte scale clusters. NexentaEdge is ideally suited for next-generation open source cloud infrastructures and Big Data repositories requiring performance, scale and low total cost of ownership. The solution will be demonstrated for the first time at VMworld US 2014, August 24-28 in San Read more…

Speakers Announced for EnterpriseHPC’14 Summit

Aug 18, 2014 |

CARLSBAD, Calif., Aug. 18 – The EnterpriseHPC’14 summit, the world’s first live event dedicated to exploring the emergence of advanced scale enterprise computing, today announced the line-up of speakers for the program, including keynotes Vince Gennaro, President of the Society for American Baseball Research (SABR), Ren Wu, Distinguished Scientist, Baidu IDL, Pete Ungaro, CEO of Cray, Inc., and Paul Bloch, President & Co-founder of DataDirect Networks, Inc. The summit takes place September 7-9, 2014 in Carlsbad, CA, and is dedicated to Read more…

TPC Launches TPCx-HS

Aug 18, 2014 |

SAN FRANCISCO, Calif., Aug. 18 – The Transaction Processing Performance Council (TPC) today announced the immediate availability of TPCx-HS, developed to provide verifiable performance, price / performance, availability, and optional energy consumption metrics of big data systems. TPCx-HS is the industry’s first objective benchmark enabling measurement of both hardware and software including Hadoop Runtime, Hadoop Filesystem API compatible systems and MapReduce layers. And – as the TPC’s first Express Benchmark – TPCx-HS is available via the TPC Web site in the Read more…

Zettaset Introduces Fast-PATH

Aug 14, 2014 |

MOUNTAIN VIEW, Calif., Aug. 14 – Zettaset, the leader in Big Data security, today announced Fast-PATH, an advanced software configuration management solution that automates and accelerates Hadoop deployment, significantly reducing the over-reliance on costly and time-consuming professional services that burdens today’s Big Data environment. Hadoop is rapidly evolving, but has not yet reached the level of maturity and sophistication that traditional relational databases offer. As a result, users expecting lower operational costs by using Hadoop software and infrastructure are surprised to find Read more…

PSSC Labs’ CloudOOP 12000 Certified with Cloudera Enterprise 5

Jul 14, 2014 |

LOS ANGELES, Calif., July 14 – PSSC Labs is pleased to announce their purpose built CloudOOP 12000 Big Data server is certified compatible to run Cloudera Enterprise 5.  This announcement comes just days after PSSC Labs deployed a 3+ PByte Cluster designated to run Cloudera  Enterprise 5 to a prominent New York based Ad Tech company. The CloudOOP 12000 product line includes the world’s most energy efficient, highest density enterprise storage server platforms.   In just 1U of rack space, the Read more…