Follow Datanami:

Tag: data cleansing

ICIJ Turns to Big Data Tech to Unravel FinCEN Files

Sep 25, 2020 |

Unraveling financial crimes like money laundering is a notoriously difficult task, especially when criminals purposely cover their tracks. It gets a little easier when you have advanced tools, such as text analytics, machine learning, and a graph database, which is what the International Consortium of Investigative Journalists (ICIJ) used with its latest investigation, dubbed the FinCEN Files. Read more…

Data Prep Still Dominates Data Scientists’ Time, Survey Finds

Jul 6, 2020 |

Data scientists spend about 45% of their time on data preparation tasks, including loading and cleaning data, according to a survey of data scientists conducted by Anaconda. The company also analyzed the gap between what data scientists learn as students, and what the enterprises demand. Read more…

Syncsort Doubles Down on Data Quality with Pitney Bowes Buy

Dec 6, 2019 |

Here’s a stat to ponder: With its $700-million acquisition of Pitney Bowes’ software and data business now complete, Syncsort becomes the second biggest vendor in the data quality space, per 2018 figures from IDC. Read more…

The Anatomy of AI: Understanding Data Processing Tasks

Aug 6, 2019 |

So you’re collecting lots of data with the intention to automate decision-making through the strategic use of machine learning. That’s great! But as your data scientists and data engineers quickly realize, building a production AI system is a lot easier said than done, and there are many steps to master before you get that ML magic. Read more…

Big Data Meltdown: How Unclean, Unlabeled, and Poorly Managed Data Dooms AI

Jun 13, 2019 |

We may be living in the fourth industrial age and on cusp of huge advances in automation powered by AI. But according to the latest data, our great future will be less rosy if enterprises don’t start doing something about one thing in particular: Read more…

Data Management: Still a Major Obstacle to AI Success

May 22, 2019 |

Data is the lifeblood of AI. Without good data, machine learning algorithms have no way to determine a normal distribution of activities, occurrences, or events. However, only about one in five businesses have data that’s fit for AI and is being used for that purpose, according a new report from Figure Eight. Read more…

Self-Service Data Preparation – At Scale or Sampling?

Nov 26, 2018 |

The phrase “data is the new oil” has become the favorite business transformation cliché of the past 10 years. The truth is that data in its raw form is about as useful for decision making as oil is for propelling a car. Read more…

The Seven Sins of Data Prep

May 21, 2018 |

Data preparation is often considered a necessary precursor to the “real” work found in visualizing or analyzing data, but this framing sells data prep short. The ways in which we cleanse and shape data for downstream use have significant bearing on our final analytic output, and cutting corners on data prep can run up a huge cost for companies. Read more…

The Role of Self-Service Data Preparation in Analytics Modernization

Oct 5, 2017 |

It’s no secret that data is playing an increasingly important role in not only today’s business environment but also within our society as a whole. In a May 2017 article, The Economist laid out why data has overtaken oil as the world’s most valuable resource. Read more…

Carts & Horses: Why You Need to Focus on Data First

Jun 19, 2017 |

Like most of us, I love shiny new objects and learning about how successful companies are building them into their operations. Google’s use of Neural Nets for Translate? Read more…

Do NOT follow this link or you will be banned from the site!