Follow Datanami:


Whitepapers Datanami's white paper database contains reports from the leading thought-leaders and idea generators in the Datanami industry.

Immuta Data Engineering Survey: 2021 Impact Report

Source: Immuta
Release Date: Jan 8, 2021

We’re on the verge of a perfect storm for modern data access governance. As organizations lean into data and the cloud, new research shows that the majority will adopt multiple cloud compute technologies within the next two years. Read more…

Enterprise Architect’s Guide: 4 Top Strategies for Automating and Accelerating Your Data Pipeline

Source: Qlik
Release Date: Dec 11, 2020

The explosion in data, the vast array of new capabilities, and the dramatic increase in demands have changed how data needs to be moved, stored, processed and analyzed. But new architectures like data warehouses and lakes are creating additional bottlenecks within IT, because many existing processes are labor-intensive and insufficient. Read more…

The Seven Tenets of Scalable Data Unification

Source: Tamr
Release Date: Dec 4, 2020

Michael Stonebraker, A.M. Turing Award winner, believes real digital transformation must start with clean, accurate, consolidated data sets. His database management strategies are driving major changes at GE, Thomson Reuters, and Toyota. Read more…

The Guide to External Data for Better User Experiences in Financial Services

Source: Explorium
Release Date: Dec 1, 2020

For the financial sector, Know Your Customer (KYC) processes are a vital — and unavoidable — part of doing business today. With fraud, identity theft, and money laundering increasing and evolving all the time, you need watertight ways to predict risk. Read more…

Responsible Machine Learning: Actionable Strategies for Mitigating Risks & Driving Adoption

Source: H2O
Release Date: Oct 22, 2020

Like other powerful technologies, AI and machine learning present significant opportunities. To reap the full benefits of ML, organizations must also mitigate the considerable risks it presents. This report outlines a set of actionable best practices for people, processes, and technology that can enable organizations to innovate with ML in a responsible manner. Read more…

Making Alternative Credit Scores the Norm: How to Create a New Scoring Model

Source: Explorium
Release Date: Oct 22, 2020

The way we currently measure potential borrowers’ and other customers’ creditworthiness is broken. The credit scores that have become the gold standard are narrow, opaque, and easily manipulated measures that reward financial risk-taking and ignore responsible behaviors. Read more…

Analyze-then-Store: The Journey to Continuous Intelligence

Source: Sponsored Content by Swim
Release Date: Oct 7, 2020

“Analyze-then-Store: The Journey to Continuous Intelligence” is a technical eBook intended for data architects and anyone else interested in learning how to design modern real-time data analytics and continuous intelligence solutions. Read more…

Guide to Maximum Data Lake Value

Source: Qlik
Release Date: Sep 28, 2020

In this whitepaper, Eckerson Group discusses how to get maximum value from data lakes and how Qlik’s Data Integration Platform helps businesses get the most value out of their data lakes quickly, accurately, and with the agility to respond to shifting business needs. Read more…

The Essential Guide to Feature Selection

Source: Explorium
Release Date: Sep 4, 2020

Feature selection is a key step in building powerful and interpretable machine learning models, but it’s also one of the easiest to get wrong. The wrong features will give you inaccurate answers and may impact your ML models’ efficiency in ways you can’t predict. Read more…

How to Improve Your Training Data for Vastly Better Machine Learning

Source: Explorium
Release Date: Aug 13, 2020

Your machine learning models are only as good as the data you’re using to train and test them. So, how can you improve your datasets? This guide breaks down simple strategies to acquire better data and quick approaches and methods to fine-tune and manipulate your existing data will get you better testing results and insights. Read more…