Crazy Idea No. 46: Making Big Data Beneficial for All
Now here’s a crazy idea: What if the data we all generate on a day to day basis benefited us, instead of the companies that collect it? It may sound nuts at first, but some AI experts see a future in which people hold full control over their data and smart digital assistants infused with AI work to protect and monetize a person’s individual’s data for his or her benefit.
This vision of a more equitable big data world is one that’s held by Sri Ambati. The H2O.ai founder and CEO sees a day not too far in the future in which people are empowered to control their own data as an asset, and even to profit directly from their data, which is something that only a handful of individuals are currently able to do.
“Today, whether we want it or not, our data is stored on giant social networks,” Ambati tells Datanami. “Our clicks are essentially stolen away and leave a fingerprint of who we are digitally. In that sense, we don’t have ownership. We’ve just kind of given carte blanche ownership to the companies with the largest Internet presence, if you will.”
The FAANG companies (Facebook, Amazon, Apple, Netflix, Google) have the biggest and most sophisticated data collection and AI systems ever built. As the big data wave has grown over the past decade, so have the FAANG company’s fortunes. In fact, their collective valuations have skyrocketed by nearly 20x over the past 10 years to more than $5 trillion.
But with a few changes in how people approach data, those data-driven benefits can begin accruing to the individual data owners instead of the tech giants.
Smart Digital Assistant
In Ambati’s vision, each individual will have a kind of digital assistant that can help make the best use of his or her data. These trusted digital assistants–think of an accountant fused with a stock broker mixed with a password management and identity protection–would use AI to play the role of “chief digital officer” for the individual’s existence, both online and in the physical world.
“In the same way that every company has a legal partner, every company has a trading partner, every company has a financial ecosystem – [we could] bring it down all the way to the individual,” Ambati says. It would be “a vault for your data, an investment partner for you to use your data effectively and get the most value for who you are.”
The primary directive of these digital assistants would be to protect the individual’s data. While physical assets can be replaced in this world, losing one’s data is “not a recoverable act. It’s not a reversable situation,” Ambati says. “And so it makes more sense to protect your data as if it were valuable, because it is valuable. It’s valuable not just to you, but to the world.”
The digital assistant’s first step would be to track down all the places where an individual’s data is being collected or used on the Internet. “That’s the beginning–tracking down and making sure we can get a bill of materials for all the assets that one has left on the Internet, just like we have ways to track down asset in the physical world,” he says.
A Marketplace for Data
Once the data is secured, the digital assistant would help individuals monetize that data on the open market. This could involve selling data to the highest bidder, or giving it away, if they so choose.
“It leads to an ecosystem of service providers who are going to basically provide good guidance as well as APIs on how to monetize the digital assets,” Ambati says. “I think the whole concept of commodity trading will need to happen for data. That whole exchange and marketplace is missing.”
Individuals will also gain the ability to barter with their data, and demand services in exchange for their data. There is some of that going on today, but individuals are typically not aware that it’s happening, unless they take the time to read the End User License Agreement (EULA) that accompanies most online services.
“Today you need an e-store, and there is no leverage. It’s not owned by anyone, other than the who created it,” Ambati says. “But once that leverage shifts a little, you get a free months cell phone bill, for example, for having them use a portion of the data. But today there is no such thing.”
Blockchain technologies could be employed to ensure the sanctity of individuals’ data and the security of transactions. Cybercriminals would be attracted to this new paradigm, of course, but Ambati says maintaining openness would allow marketplaces with the best track records on security to succeed.
“The exchanges with the most security are the ones you’re going to use and trade on,” he says. “The ones with the most liquidity are the ones you’re going to trade on.”
A Personal Data Gold Mine
Having all of one’s data readily available for analysis could also yield some interesting insights that today are primarily being leveraged by corporations with the goal of selling more stuff or getting people to look at ads.
For example, the digital assistant could recommend the best time to drive to the grocery store based on traffic patterns. One’s patterns of consumption–including what food one eats, what music one listens to, and what books one reads—could be analyzed and recommendations generated on behalf of the individual, not the corporation. Digital assistants could negotiate on one’s behalf, getting better rates on car insurance, or better communication bundles.
“Lots of these data sets are now locked in different corporations,” Ambati says. “Eventually the integration of all of that information for our own personal usage, and ability to deploy that data over a group of communities–all those pieces are value added services that will lead to creating that economy.”
Today’s smartphones have more storage capacity and processing power than the most powerful supercomputers did 30 years ago. If we can leverage the phones to collect and analyze the data exhaust generated by individuals, we have the ability to create rich models that describe, inform, and predict elements of our lives at a whole new level.
“There are billions of these devices that truly democratizes the ability to both save, but beyond that, to mine and use it for your own benefit,” Ambati says. “Data is a capture of our experience and the ability to share experiences and tell stories–that’s a very human endeavor. And I think if you can start building the right toolsets that are at the fingers of every smartphone owner, they should be able to use that toolset and start building a small but significant growing economy of data owners.”
This vision of beneficial ownership of data is already a reality, but only for a small handful of individuals, Ambati says. “It already happens for the top influencers on some social networks. We just have to democratize that for everyone,” he says. “The system should start working on your behalf.”
A Change for the Better
So what needs to happen for this vision to become reality? Why would the FAANG companies (plus Microsoft) willingly give up control of this rich natural resource (data) when it has been so good to them?
According to Ambati, people will naturally gravitate to this new mode once macro economic-social conditions make them more favorable.
“The alternatives haven’t been created. That’s kind of the reason why they’re not fighting the big companies today,” he says. “They [the FAANG companies] were startups 20 year ago…so the way capitalism works, it’s going to naturally make this transition happen [eventually].”
TikTok presents an interesting example. While the social media property may not be the best example of a company that’s responsibly collecting data (it’s been accused of funneling private data to the Chinese government), it is a compelling example of a company that come out of nowhere to disrupt established giants. This phenomenon will continue.
New data regulations will help, according to Ambati, who credits the new California Consumer Privacy Act (CCPA) as a good step to enabling citizens to take ownership of their own data.
With an open market and favorable regulation, new data services that value individuals’ rights should naturally come to being. These new data services could be startups, or they could be offshoots of existing offerings.
For example, perhaps banks will evolve to provide security and monetization of data. “Historically, rural and urban citizens could go to the bank, put their most expensive jewelry in the vault, and the banks were regulated,” Ambati says. “A modern bank would potentially offer ways to protect your data as a service for having a checking account with them, and create asset mangers for you who trade and show results for your data.”
A New Data Paradigm
However, the cards are currently not in the favor of the citizens. The FAANG companies (plus Microsoft) hold most of the data today, and benefit enormously from today’s advertising economy. After all, consumers on the Web are the product; advertising buyers are the real customers.
“The ad economy, the casino, is owned by a few players. What we want to do is democratize that and allow you to be at the center of that, a strong player in that economy, with the ability to rise up to the top and not be beholden to the ill-gotten gains that the casinos has,” Ambati says.
“I wouldn’t put any of the current netizens, or the digital warlords, on the side of playing well, because they don’t have to play well,” he continues. “There’s no economic advantage to doing that.”
Regulation has helped. GDPR got us in the habit of accepting cookies, and “now it’s legal to collect and track us,” he says. CCPA gets us closer to the digital ownership mindset, but that alone is not enough, Ambati says.
“It’s got to be innovation.”
August 4, 2020
- cnvrg.io AI OS Delivers Accelerated ML Workloads with Support of NVIDIA A100 Multi-Instance GPU
- Domo Enhances its COVID-19 Global Tracker with Google and Apple Mobility Trends
- Quantum ActiveScale Software Verified as Veeam Ready Object Solution
- Striim Expands Cloud Support with the Release of Version 3.10.1
- Azure NetApp Files Now Available to Government Agencies in Microsoft Azure
- Duality Technologies, NumFOCUS to Develop Privacy-Preserving Analysis Platform
- KDD 2020 Announces Special Diversity & Inclusion Track
- Micro Focus Announces the General Availability of ArcSight 2020
- Baffle Unveils New Tableau Integration to Protect Data Analytics Pipeline
- Cyxtera Launches AI/ML Compute-as-a-Service Powered by NVIDIA DGX A100 Systems
August 3, 2020
- Wejo Announces £10M in New Funds to Support Next Stage of Growth
- Kyber Data Science Announces $10M Series A Financing Round
July 31, 2020
- Quantida Launches Complete Data Management and Analytics Solution
- New Tool to Guide Decisions on Social Distancing Uses Hospital Data and Emphasizes Protecting the Vulnerable
- Acrisure Acquires Tulco’s AI Insurance Business
- Talend Named a Leader in Gartner Magic Quadrant for Data Quality Solutions
- Hasura Releases Hasura Cloud
July 30, 2020
- Qumulo’s Cloud File Offerings Now Available in AWS Marketplace for AWS GovCloud (US)
- Entel Chooses OmniSci to Improve Customer Satisfaction
- USAF Selects Tamr to Accelerate Aircraft Configuration Evaluations with Data Mastering Solutions
Most Read Features
- Big Data File Formats Demystified
- How to Build a Better Machine Learning Pipeline
- Is Python Strangling R to Death?
- How COVID-19 Is Impacting the Market for Data Jobs
- Is Hadoop Officially Dead?
- To Centralize or Not to Centralize Your Data–That Is the Question
- What’s the Difference Between AI, ML, Deep Learning, and Active Learning?
- Understanding Your Options for Stream Processing Frameworks
- Tracking the Spread of Coronavirus with Graph Databases
- Hacking AI: Exposing Vulnerabilities in Machine Learning
- More Features…
Most Read News In Brief
- Researchers Explore Link Between American Individualism and Poor COVID-19 Response
- Left for Dead, R Surges Again
- Data Prep Still Dominates Data Scientists’ Time, Survey Finds
- HPE Acquires MapR
- Why Gartner Dropped Big Data Off the Hype Curve
- Global DataSphere to Hit 175 Zettabytes by 2025, IDC Says
- Bitnine Looks to Scale PostgreSQL
- War Unfolding for Control of Elasticsearch
- Spark 3.0 Brings Big SQL Speed-Up, Better Python Hooks
- IBM Brings Back a Netezza, Attacks Yellowbrick
- More News In Brief…
Most Read This Just In
- UBS Launches Big Data Shareholder Activism Tool
- FortressIQ Launches Adaptive Computer Vision-Based Firewall for Data Privacy
- KNIME Analytics Platform 4.2 is Now Available
- Hazelcast, Sorint Expand Partnership to Address In-Memory Computing Adoption
- Privacera Raises $13.5M in Series A Funding
- Syniti Acquires Virtyx Technologies
- BP Invests $5M in Geospatial Analytics Software Company Satelytics
- Qubole Expands Integration with Informatica’s AI-driven Data Engineering Integration
- TileDB Closes $15M Series A to Expand its First Universal Data Engine
- MariaDB Platform X5 Adds New Distributed SQL
- More This Just In…