Applications » Data Mining

Features

Big Data-Powered Authentication Gives Security a Much-Needed Boost

Oct 1, 2014 |

Trillions of dollars are at stake in the ongoing battle against cybercriminals and fraudsters. Whether it’s Russian cybercriminals hacking into retail chains or a kid in New Jersey filing fraudulent claims, stolen credentials and user IDs often play a major role. But now, big data technologies like Hadoop are helping to prevent unauthorized access, both behind the corporate firewall and in public coffers. Technological advances are a two-way street when it comes to crime on the Internet. A gang of Read more…

Big Data Outlier Detection, for Fun and Profit

Sep 30, 2014 |

As we discussed in the first part of this series, how you handle data outliers can determine whether your big data project ends with a bang or flames out in failure. But before you even decide what to do with outliers, you need to be able to detect them. That is easier said than done. Because they can mean different things at different times, outliers can be extremely challenging to deal with in a big data context. On the one Read more…

Hadoop Data Virtualization from Cask Now Open Source

Sep 25, 2014 |

Continuuity, a big data startup that seeks to drive complexity out of Hadoop by virtualizating data and applications, today announced that it’s changing its name to Cask and making its software open source. The company also open sourced a streaming engine now named Tigon, and announced the hiring of former Intel executive as COO. Former Facebook engineer Jonathan Gray co-founded Continuuity with former Yahoo engineer Nitin Motgi about three years ago to address the challenges they saw enveloping the Hadoop Read more…

Hortonworks Hatches a Roadmap to Improve Apache Spark

Sep 24, 2014 |

Hortonworks today issued a broad and detailed roadmap outlining the investment it would like to see made to Apache Spark, the in-memory processing framework that has become one of Hadoop’s most popular subprojects. The plan focuses on improving how Spark runs with YARN, enabling monitoring and management of Spark, and ensuring that Spark plays nicely with Hive and other Hadoop engines. In the blog piece, titled “An investment in Apache Spark for the Enterprise,” Hortonworks director of product management Vinay Read more…

How Baidu Uses Deep Learning to Drive Success on the Web

Sep 22, 2014 |

The Chinese Web giant Baidu is investing heavily in deep learning technologies as it seeks to drive intelligence from big data using high performance computing (HPC). From speech and facial recognition to language transaction and Web search, Baidu relies on deep learning and artificial intelligence technologies to improve a range of customer-facing applications. Dr. Ren Wu, a distinguished scientist at Baidu’s Deep Learning Institute (IDL), discussed Baidu’s use of deep learning technologies in a keynote presentation at Tabor Communication’s recent Read more…

News In Brief

States Seek to Backstop U.S. Data Privacy Laws

Sep 25, 2014 |

California is poised to become the first state to restrict the use of student data by third-party technology vendors. Two student data privacy bills were sent to California Gov. Jerry Brown earlier this month, including one that prescribes privacy guidelines for contracts between school districts and technology vendors. Along with contracts, California State Assembly Bill 1584 covers the privacy of student records and digital storage services along with educational software that could be used with data analysis tools for marketing Read more…

Concerns About Big Data Abuses Grow

Sep 18, 2014 |

The tension between the rise of big data and concerns over privacy and fairness continues to mount as federal regulators convened this week to ponder whether big data is a “tool for inclusion or exclusion.” That was the title of a Sept. 15 Federal Trade Commission workshop examining the impact of big data on U.S. consumers, particularly the poor and underserved. “A growing number of companies are increasingly using big data analytics techniques to categorize consumers and make predictions about Read more…

Poll: SAS Use Surges for Data Mining

Aug 26, 2014 |

A recent poll querying data scientists on which programming and statistics languages they used in 2014 for analytics, data mining and data science found that four main languages dominated. The data mining community web site KDnuggets reported earlier this month that respondents identified R, Python, SAS and SQL (in that order) as a preferred programming language. Fully 91 percent of respondents used one of the four languages. The R programming language led the way, cited by 49 percent of respondents Read more…

Selfies Spawn Photo Analytics

Aug 21, 2014 |

An estimated 1.8 billion photos are shared on social media each day. Current “social listening tools” can’t analyze all this data. Enter Ditto Labs Inc., which touts its proprietary “visual recognition engine” that locates brand names in social media posts, then turns the information over to marketing geniuses. It’s the brave new world of “photo analytics,” according to Ditto, that uses its recognition engine to scan photos for corporate logos that might hint at how consumers “interact” with products and Read more…

U.S. Enlists Big Data to Fight Securities Fraud

Aug 19, 2014 |

The U.S. Securities and Exchange Commission (SEC) moved to reform its enforcement practices after missing the Bernard Madoff Ponzi scheme and other financial fraud by creating an Office of Market Intelligence in 2010. The office was intended to make use of new intelligence tools like big data analysis to improve securities enforcement. So far, the results of the SEC efforts have been mixed, according to the author of a recent law review article focusing on how the office has leveraged Read more…

This Just In

Intel and Mitsubishi Collaborate

Sep 30, 2014 |

TOKYO, Japan, Sept. 30 — Intel Corporation and Mitsubishi Electric Corporation today announced a new collaboration to develop next-generation factory automation (FA) systems with Internet of Things (IoT) technologies and a pilot program at Intel’s backend manufacturing facility in Malaysia. The pilot demonstrates the benefits of IoT in a factory setting with a focus on delivering productivity enhancement through innovative functions, such as predictive failure, by combining Intel’s expertise developing solutions for IoT and Mitsubishi Electric’s “e-F@ctory” automation capabilities. Intel realized Read more…

SAS to Host Two Big Data Events in October

Sep 24, 2014 |

CARY, N.C., Sept. 24 — From data visualization to cybersecurity, marketing analytics to the Internet of Things, the sources and uses of data are rapidly evolving. This October in Las Vegas, decision makers and data scientists from around the world will gather at two events to explore and share how critical technologies — data mining, data visualization, Hadoop, forecasting and more — create value from big data. And they’ll examine big data threats too, including cyber-attacks and fraud. The Premier Read more…

RapidMiner World Conference Concludes

Aug 25, 2014 |

BOSTON, Mass., Aug. 25 — Pioneering predictive analytics leader RapidMiner last week concluded its RapidMiner World conference, which brought together over 100 RapidMiner users and other data analytics industry experts from around the globe. The four-day conference, which took place from August 18 – 21 in Boston, explored the latest in predictive analytics, data mining, and the future of RapidMiner. With practical use cases and industry discussions, RapidMiner World attracted a wide-range of participants, ranging in skill-level and company role. In this inaugural U.S. Read more…

ProfitBricks Accelerating Big Data Projects with SDN and Infiniband

Apr 2, 2014 |

ProfitBricks, the price/performance leader in Cloud Computing IaaS, is now helping big data systems integrators like Altoros bring the power of the cloud to the high demand of its customers. Thanks to second generation technologies like software-defined networking (SDN) and InfiniBand that are at the heart of the ProfitBricks cloud, Altoros customers can easily deploy Hadoop clusters for both temporary projects and continuous big data analysis programs.

Dell Acquires StatSoft

Mar 24, 2014 |

Dell today announced the acquisition of StatSoft, a leading provider of advanced analytics solutions that deliver a wide range of data mining, predictive analytics and data visualization capabilities. StatSoft combines comprehensive statistical analysis with advanced analytics to help organizations better understand their businesses, predict change, increase agility and control critical systems.