May 11, 2020

An Enterprise Guide to a Secure Data Science Pipeline

Sponsored Content by Anaconda

Open source is the backbone driving digital innovation (Gartner, 2019). It’s crucial to many of today’s leading-edge digital fields, including data science and machine learning. No single technology vendor can outmatch the pace of innovation the open-source data science community maintains. Thousands of open-source Python, R, and Conda packages provide data science practitioners with the building blocks they need to create models and applications using predictive analytics, natural language processing, robotics, and other cutting-edge tools.

These open-source tools are powerful, and they are essential for differentiation in a future where organizations must adopt AI to remain viable. But, there’s one thing many enterprise data science teams are missing: security protocols. In many organizations, there simply are no security protocols or governance tools for open-source software (OSS) use in data science. A lack of security protocols exposes the organization to overlooked defects and vulnerabilities, not to mention potential licensing and intellectual property issues.

In some organizations, DevOps teams have already adopted security protocols related to their use of OSS. DevOps uses open-source building blocks to accelerate their workflows and build applications, but generally they do so within a framework of security and governance to protect their work and enterprise infrastructure. Enterprise data scientists also use OSS tools and packages all the time. But, they use OSS without this safety net, putting the organization and customer data at risk. In some cases, DevOps teams may catch vulnerabilities in data science models when they attempt to put them in production. But, this means valuable data science team effort was wasted building a model that will never see the light of day.

When data scientists don’t monitor for potential threats, vulnerabilities inevitably creep into models over time. Data science leaders must step up and collaborate with IT and security leaders to take charge of their open-source data science and ML pipelines. Together, these leaders can increase the flow of innovative models to production while safeguarding against technical and legal risk.

Just Like all Software, Open Source Carries Risk

Companies tend to choose OSS over proprietary software because it offers more choice, support flexibility, transparency, and unmatched innovation.

More Choice

The open-source community provides a veritable candy store of tools and libraries to work with — there’s no need to get tied down to any single vendor. Try new tools, choose only the best of the best (or the ones that fit your needs best), with minimal hoops to jump through.

Support Flexibility

With proprietary software, support is generally bundled in by the vendor and available either through the original license or for an extra fee. The software vendor offers what it offers, take it or leave it. With OSS, you have multiple options among support providers — including community support, third-party vendors, and hiring in-house staff to support your open-source components.

Transparency

The source code of any OSS is viewable and fixable by anyone with the know-how to do so. Organizations using open-source software can verify its security themselves (or use an outside provider for verification). The source code in proprietary software, on the other hand, is usually only viewable and editable by a few internal people.

Unmatched Innovation

Data science and machine learning have a deep history with OSS, going back to the Apache Hadoop data-processing framework, which started a wave of open-source advances that’s still going strong. The top ML libraries, deep learning tools, and visual processing tools all came out of the open-source community. No single proprietary vendor can match its depth and breadth of Innovation.

Read more and get the complete guide here.

Only registered users may comment. Register using the form below.

Check off newsletters you would like to receive*
- HPCwire
- EnterpriseTech
- Datanami
- Technology Conferences & Events
- Advanced Computing Job Bank
- Technology Product Showcase
Email*
Name*
First Last
Organization*
Job Function*
Industry*
Country*
City*
State*
Province*
- Please check here to receive valuable email offers from Datanami on behalf of our select partners.

An Enterprise Guide to a Secure Data Science Pipeline

Just Like all Software, Open Source Carries Risk

More Choice

Support Flexibility

Transparency

Unmatched Innovation

Join the discussion Cancel reply

Only registered users may comment. Register using the form below.

April 22, 2024

April 19, 2024

April 18, 2024

Sponsored Partner Content

Get your Data AI Ready – Celebrate One Year of Deep Dish Data Virtual Series!

Supercharge Your Data Lake with Spark 3.3

Learn How to Build a Custom Chatbot Using a RAG Workflow in Minutes [Hands-on Demo]

Overcome ETL Bottlenecks with Metadata-driven Integration for the AI Era [Free Guide]

Gartner® Hype Cycle™ for Analytics and Business Intelligence 2023

The Art of Mastering Data Quality for AI and Analytics

Leading Solution Providers

Tabor Network

Sponsored Whitepapers

Building an Operational Data Warehouse for Real-time Analytics

Can You Use Kafka as a Database?

Sponsored Multimedia

The Power of DataOps: Bring Automation to Life
No Comments

Tactical Steps for Cloud Migration
No Comments

Immuta Data Access Platform
No Comments

Data Mesh: Fact or Fiction?
No Comments

Contributors

Featured Events

Call & Contact Center Expo

AI & Big Data Expo North America 2024

AI Hardware & Edge AI Summit 2024

CDAO Government 2024

An Enterprise Guide to a Secure Data Science Pipeline

Just Like all Software, Open Source Carries Risk

More Choice

Support Flexibility

Transparency

Unmatched Innovation

Join the discussion Cancel reply

Only registered users may comment. Register using the form below.

April 22, 2024

April 19, 2024

April 18, 2024

Most Read Features

Most Read News In Brief

Most Read This Just In

Sponsored Partner Content

Leading Solution Providers

Tabor Network

Sponsored Whitepapers

Sponsored Multimedia

Contributors

Featured Events

Share

Copy short link