
Tag: HBase

Microsoft Expands Hadoop on Azure

Microsoft has upgraded its open source analytics services running on Azure with a new version of Hadoop incorporating enhancements of Apache Hive and other open source analytics frameworks. The software giant (NASDAQ: Read more…

Masking Technical Complexity in the Security Data Lake

Today's growing cybersecurity threat demands a sophisticated response, one that increasingly involves the utilization of big data technologies like parallel file systems and machine learning. However, some security exper Read more…

Learning from Your Data: Essential Considerations

For any organization undergoing digital transformation, a primary consideration is how to find, capture, manage and analyze big data. They are looking to big data and data science to facilitate the discovery of analytics Read more…

Yahoo Unleashes HBase Transaction Manager

A transaction manager for the NoSQL database HBase has been approved as an open-source incubator project, according to project sponsor Yahoo. The HBase transaction manager dubbed "Omid" ("Hope" in Persian) is the late Read more…

Resolving Hadoop’s Storage Gap

Over the past several years, the Hadoop ecosystem has made great strides in its real-time access capabilities, narrowing the gap compared to traditional database technologies. With systems such as Impala and Spark, analy Read more…

Distributed Computing Tops List of Hottest Job Skills

If you have cloud and distributed computing skills, your job prospects for 2016 are golden. That's because those particular job skills, which parallel the rise of Hadoop and other distributed computing frameworks, topp Read more…

Medical Insight Set to Flow from Semantic Data Lakes

The potential for data analytics to disrupt healthcare delivery is large, and getting larger by the day. But in many cases, the need to hammer data into a structured format creates a barrier to productivity. Now a hospit Read more…

Yahoo Casts Real-Time OLAP Queries with Druid

Yahoo is in the process of implementing a big data tool called Druid to power high-speed real-time queries against its massive Hadoop-based data lake. Engineers at the Web giant say the open source database's combination Read more…

Google Launches Bigtable Hosting Service

Google yesterday announced the beta launch of Google Cloud Bigtable, a hosted version of its wide-column NoSQL database. The service is accessible via the standard HBase API, making it instantly integrated with the Hadoo Read more…

Wanted: A Plug-In Architecture for Hadoop Development

Hadoop is hard. There's just no way around that. Setting up and running a cluster is hard, and so is developing applications that make sense of, and create value from, big data. What Hadoop really needs now, says former Read more…

MapR Ponies Up with Free Hadoop Training

Hadoop is hard. There's just no way around it. Without a set of specific technical skills, the chances of somebody developing or running a Hadoop cluster are not good. A new training program unveiled today by MapR Techno Read more…

Microsoft Blends NoSQL with Relational DB in the Cloud

Microsoft yesterday unveiled Azure DocumentDB, a new cloud-hosted database that adds elements of a relational database, such as SQL-like queries and transactional processing, into a document-oriented NoSQL database. The Read more…

HP Throws Trafodion Hat into OLTP Hadoop Ring

Hewlett-Packard last month quietly unveiled Trafodion, an ANSI-compliant relational SQL database that's now available as an open source product. With two decades of development at HP and the new capability to run on top Read more…

Facebook Adds Another 9 to HBase Availability

As one of the largest users of Hadoop in the world, Facebook knows a thing or two about running the big data platform for a high degree of availability. Last week, the social media giant's engineering team explained how Read more…

Hortonworks Drives Stinger Home with HDP 2.1

Hortonworks today unveiled a major new release of its Hadoop distribution that puts significant new capabilities into the hands of its customers. The speed and scale of SQL processing in Apache Hive were improved with the final phase of the Stinger initiative, while the additions of Apache Storm and Apache Solr in HDP 2.1 open up new ways for customers to manipulate their data. Security and data governance were bolstered with Apache Knox and Apache Falcon, respectively, while Apache Spark is now available as a tech preview. Read more…

A Peek Inside Cisco’s Hadoop Security Machine

The Internet is the ultimate invention of man, a creation that will forever change how humans work, live, and play. But for all the good it's capable of, the Internet has also created a comfortable home for cybercriminals, who use increasingly sophisticated techniques to siphon hundreds of billions of dollars from the global economy. One company that's upping the ante in the battle against cybercriminals is Cisco, which is using a 60-node Hadoop cluster to separate criminal signals from the Internet's noise. Read more…

Cloudera Shuffles Its Product Deck in Pursuit of ‘Data Hub’ Strategy

Cloudera today unveiled a new three-tiered product packaging strategy for its Hadoop software, including a new high-end "Data Hub" Edition designed to help it compete against the likes of IBM and Pivotal. The company also announced the availability of the Spark stream processing and machine learning engine. Read more…

Keeping Tabs on Amazon EMR Performance

The Hadoop architecture may appear simple at first, but it's deceptively complex and may lull you into a false sense of security as you try to scale your cluster. If you run Hadoop on Amazon Elastic MapReduce, there are a number of potentially wallet-draining performance issues to keep an eye on. This week, Compuware launched an Amazon-resident solution to help prevent EMR jobs from running amok. Read more…

Intel Goes Graph with Hadoop Distro

Intel will be targeting big retail operations with a new graph database that it unveiled today as part of its Intel Distribution for Apache Hadoop version 3 announcement. The graph engine will enable customers to make product or customer recommendations in real time, a la Netflix or Amazon, based on existing data. The chip giant also fleshed out its Hadoop distro with a 20x speedup in encryption functions, a data tokenization option, and a handful of new machine learning algorithms aimed at solving common problems. Read more…

OLTP Clearly in Hadoop’s Future, Cutting Says

Think Hadoop is just for analytics? Think again, says Hadoop creator Doug Cutting, who last week predicted that, in the future, organizations will run all sorts of workloads on their Hadoop clusters, even online transaction processing (OLTP) workloads, the last bastion of the relational legacy. Read more…
