March 9, 2015

The ASF Announces Apache Tajo v0.10.0

FOREST HILL, Md., March 9 — The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 350 Open Source projects and initiatives, announced today the availability of Apache Tajo v0.10.0, the latest version of the advanced Open Source data warehousing system in Apache Hadoop.

Apache Tajo is used for low-latency and scalable ad-hoc queries, online aggregation, and ETL (extract-transform-load process) on large data sets stored on HDFS (Hadoop Distributed File System) and other data sources. By supporting SQL standards and leveraging advanced database techniques, Tajo allows direct control of distributed execution and data flow across a variety of query evaluation strategies and optimization opportunities.

“Tajo has evolved over the last couple of years into a mature ‘SQL-on-Hadoop’ engine,” said Hyunsik Choi, Vice President of Apache Tajo. “The improved JDBC driver in this release allows users to easily access Tajo as if users use traditional RDBMSs. We have verified new JDBC driver on many commercial BI solutions and various SQL tools. It was easy and works successfully.”

Tajo v0.10.0 reflects dozens of new features and improvements, including:

  • Oracle and PostgreSQL catalog store support
  • Direct JSON file support
  • HBase storage integration (allowing users to directly access HBase tables through Tajo)
  • Improved JDBC driver for easier use with JDBC applications
  • Improved Amazon S3 support