Tag: sql

A Bottom-Up Approach to Data Quality

Sep 6, 2017 |

Despite the amazing progress we’ve made in novel data processing techniques, poor data quality remains the bane of analytics. It’s why data scientists spend upwards of 80% of their time preparing and cleansing data instead of exploring the data and building models that leverage it. Read more…

RDBMS Remains Popular As Data Sources Grow

Sep 6, 2017 |

As the number and variety of data sources continues to explode along with proliferation of third party APIs used to connect it, data repositories such as relational databases continue to thrive while emerging software services and tools are accelerating the shift to “open analytics,” Read more…

Kafka Gets Streaming SQL Engine, KSQL

Aug 28, 2017 |

Confluent today unveiled KSQL, a SQL engine for Apache Kafka designed to enable users to run continuous interactive SQL queries on streaming data. The new software, which is currently in developer preview, will lower the barrier of entry for stream processing, the vendor says. Read more…

Databricks, Flush With Cash, Steers Spark at AI

Aug 22, 2017 |

Momentum around the Apache Spark cluster computing framework continues to build with the announcement of hefty late-stage funding round that will help push the analytics platform and related artificial intelligence applications deeper into enterprises. Read more…

The New Math Driving NoSQL Analytics

Aug 3, 2017 |

NoSQL databases are extremely popular among developers thanks to their flexible schemas and rich data types like JSON. But those same attributes make getting data out of them using traditional SQL queries a real pain. Read more…

Kinetica Gets $50M for Converged GPU Analytics

Jun 29, 2017 |

Kinetica’s bold plan to build a converged real-time analytics platform that uses GPUs and in-memory techniques to power existing SQL queries alongside deep learning algorithms got a big boost today when it disclosed a $50 million Series A investment from venture capitalists. Read more…

Hadoop Engines Compete in Comcast Query ‘Smackdown’

Jun 22, 2017 |

Who rules the ring when it comes to Hadoop SQL query engine performance? Can flashy newcomers like Presto and Spark take an established giant like MapReduce to the matt? Comcast recently held a competition to crown the best Hadoop engine, and the answer may surprise you. Read more…

Hortonworks Touts Hive Speedup, ACID to Prevent ‘Dirty Reads’

Apr 4, 2017 |

If you’re considering using Hadoop for SQL-based analytics and BI, you’ll be interested in the latest news out of Hortonworks, which today unveiled a new release of its flagship data platform that boasts a fast new release of Apache Hive, as well as a new ACID merge function that can prevent “dirty reads.” Read more…

SAP Vora Gets Analytics, Cloud Upgrades

Mar 15, 2017 |

Building on its acquisition of Hadoop specialist Altiscale Inc., SAP is combining the latest release of its Vora in-memory distributed computing platform with its big data cloud as it extends the Apache Spark framework to deliver interactive analytics on Hadoop. Read more…

What’s In the Pipeline for Apache Spark?

Mar 6, 2017 |

According to Apache Spark creator Matei Zaharia, Spark will see a number of new features and enhancements to existing features in 2017, including the introduction of a standard binary data format, better integration with Kafka, and even the capability to run Spark on a laptop. Read more…