Tag: apache

Cloudera Commits to 100% Open Source

Jul 11, 2019 |

The old Cloudera developed and distributed its Hadoop stack using a mix of open source and proprietary methods and licenses. But the new Cloudera will be 100% open source, just like Hortonworks, its one-time Hadoop rival that it acquired in January. Read more…

Open Source is Now a Big Data Service

Aug 16, 2018 |

Open source technologies continue to make headway across a range of industries undergoing digital conversions. The big data sector has of course led the way with a growing list of Apache Foundation projects ranging from Hadoop to Spark that have made their way into data-centric enterprises coping with huge data volumes. Read more…

ODPi Tackles Hive with Latest Hadoop Runtime Spec

Sep 27, 2016 |

ODPi today unveiled the second major release of its Runtime Specification, which is aimed at setting a standard for Hadoop components to ensure greater interoperability among distributions and third-party products. New additions to the spec include Apache Hive and the Hadoop Compatible File System (HCFS). Read more…

Apache Takes Storm Into Incubation

Sep 19, 2013 |On Wednesday night, Doug Cutting, Director for the Apache Software Foundation (ASF), announced that the organization will be adding the distributed real-time computation system known as Storm as the foundation’s newest Incubator podling. Read more…

Putting Some Real Time Sting into Hive

Mar 19, 2013 |A coalition of Hive community enthusiasts reports that it has achieved a 45x performance increase for Apache Hive through an effort it has branded “The Stinger Initiative.” The group says it is aiming for a 100x improvement. Read more…

Apache Hadoop 2.0.3-Alpha Released With Future Outlook

Feb 19, 2013 |The next generation of the Apache Hadoop open-source software framework has been given an alpha release and set free in the wild, delivering the next major milestone for the Apache Hadoop community. Read more…

BioInformatics: A Data Deluge with Hadoop to the Rescue

Nov 19, 2012 |Apache Hadoop-based massively parallel processing is well suited to address many challenges in the growing field of BioInformatics. BioInformatics is not a “spectator sport”; this article explains how to get started via hands-on experience with the FDA Adverse Event Reporting System (FAERS). Read more…

Searching Big Data’s Open Source Roots

Oct 22, 2012 |The face of search has changed dramatically since the first days of Google and other search engines due to many widely-used open source technologies that enable complex queries across vast sets of multi-structured data. This week we talk with Apache Mahout, Lucene and Solr guru Grant Ingersoll, now Chief Scientist at LucidWorks, about what has... Read more…

Pentaho Stirs Open Source Kettle

Jan 31, 2012 |This week, open source business intelligence vendor Pentaho moved the code that powers the latest release of its Kettle offering to an Apache 2.0 license, strengthening ties to Hadoop and related projects under the same license.... Read more…

Hadoop Hits Primetime with Production Release

Jan 6, 2012 |After six years of refinements among big-name users including Facebook and Yahoo, the Apache Software Foundation announced this week that the first enterprise-ready version of Hadoop, dubbed 1.0, was ready for production environments. Read more…