Technologies » Frameworks

Features

Under New CEO, Lucidworks Aims to Redefine Search and Itself

Sep 18, 2014 |

Lucidworks today unveiled an ambitious new enterprise search application called Fusion that uses advanced signal processing and analytics to drive a new level of personalization and push-delivery of information to users. The product is the culmination of a year-long development initiative under new CEO Will Hayes, who aims to carve a new big data niche for the Solr backer. Search is a ubiquitous component of the Internet, the original “killer app.” Type in your query, hit enter, and voila: a Read more…

MapR Puts Apache Drill into Hadoop Distro

Sep 16, 2014 |

Organizations today demand tools that provide familiar SQL-based access to data stored on HDFS. Today, MapR Technologies gave its customers yet another SQL interface when it announced support for Apache Drill 0.5 in the new release of its commercial Hadoop distribution, MapR 4.0.1. Apache Drill is an open source framework that delivers a full SQL-compliant interface that allows users to query unstructured data stored on HDFS for data discovery purposes. The project, which is backed by MapR and is an Read more…

What’s Driving the Explosion of Government Data?

Sep 11, 2014 |

Most people already know that the world produces massive amounts of data every day, and government organizations are a huge source of that data. For instance, it is estimated that next year the Department of Energy will create 300 terabytes of data per day from analyzing light sources, and by 2020, the department will create 15 petabytes of data per year analyzing high-energy physics. That’s a lot of data. But where does all this data come from? The amount of Read more…

How a Web Analytics Firm Turbo-Charged Its Hadoop ETL

Sep 10, 2014 |

The Web analytics firm comScore knows a thing or two about managing big data. With tens of billions of data points added to its 400-node Hadoop cluster every day, the company is no stranger to scalability challenges. But there’s one ETL optimization trick in particular that helped comScore save petabytes of disk and improve data processing times in the process. ComScore is one of the biggest providers of Web analytics used by publishers, advertising firms, and their clients. If you Read more…

Big Data Challenges in Social Sciences & Humanities Research

Sep 8, 2014 |

One commonly mentioned benefit of the study of history is that we may gain insight into the thoughts and behavior of our ancestors. Learning where we came from can help us better prepare for where we are going. At the launch of this age of Big Data, we are only now well positioned to accomplish that ambition. Armed with Big Data Analytics, we can improve the well-being of ourselves and our offspring, particularly in Social Sciences & Humanities Research. In Read more…

News In Brief

U.S. Cracking Down on Data Brokers

Sep 22, 2014 |

The U.S. is stepping scrutiny of big data companies that regulators increasing view as “stewards of information detailing nearly every facet of consumers’ lives.” The U.S. Federal Trade Commission (FTC) has been leading the charge with tougher enforcement of consumer protection laws. Earlier this year, it reached settlements with two data brokers for violations of the Fair Credit Reporting Act. The web site Instant Checkmate and InfoTrack Information Services both agreed to pay civil fines and permanent injunctions against continuing Read more…

Concerns About Big Data Abuses Grow

Sep 18, 2014 |

The tension between the rise of big data and concerns over privacy and fairness continues to mount as federal regulators convened this week to ponder whether big data is a “tool for inclusion or exclusion.” That was the title of a Sept. 15 Federal Trade Commission workshop examining the impact of big data on U.S. consumers, particularly the poor and underserved. “A growing number of companies are increasingly using big data analytics techniques to categorize consumers and make predictions about Read more…

IBM Moves to Make Watson Accessible to the Masses

Sep 17, 2014 |

IBM is promising data crunching for the masses with its Watson Analytics natural-language cognitive service. The big data leader said Sept. 16 the extended release of the cloud-based analytics service promises to broaden access to predictive and visual analytic tools. The free version 1 release will run on desktops as well as mobile device, IBM said. The self-service analytics package includes data refinement and warehousing services that would allow users to move beyond simple spreadsheets to analyze and visualize data. Read more…

Food For Thought: Startup Targets Nutritional Data

Sep 16, 2014 |

A U.K. data startup that has developed a semantic platform targeting nutrition and health applications said it has added more than 500,000 U.S. products to its dataset in hopes of promoting new applications for everything from nutritional information to exercise devices to restaurant menus. London-based Klappo said earlier this month that the expansion of its dataset includes 525,000 U.S. product barcodes containing ingredient information about the nutritional value of U.S. food products. It said about 100,000 cover new food ingredients Read more…

Data Execs Thinking Big, Worried about Securty, Skills

Sep 15, 2014 |

Another industry survey has concluded that big data is helping global companies identify new revenue sources while developing new products and services. But a majority of executives responding to a survey by consulting firm Accenture Analytics also said they remain concerned about data security and a growing lack of data analytics talent. New York-based Accenture said its study was based on a survey of C-level information, data, analytics, operations and financial officers from 19 countries and seven industries. “They’re recognizing Read more…

This Just In

Apache Unveils Hadoop 2

Oct 17, 2013 |

Apache Software Foundation, which oversees the 150 or so open source projects under the famous Apache umbrella, this week announced Hadoop 2 – the latest version of the popular software framework for distributed computing.