One of the most daunting tasks facing Data Scientists, in the USA and globally, is rarely discussed in commercial research circles: getting their data into the Cloud and convincing Information Security to configure Cloud-based tools to allow access to critical data.
We all know the benefits of the Cloud are many, including less expensive ways to store data, the scalability of big data platforms, and an advanced tool kit with many AI and machine learning tools (Data and Machine Learning Operations).
Risk and Information Security professionals still have many concerns about Cloud data security, including whether they have adequate staffing and configuration expertise to make broad-scale use of Cloud data a reality. There is often confusion about the current software stack and the scope of its capabilities (on-premises versus Cloud), further complicating the maze of issues surrounding the use of data in the Cloud.
Chief Data Officers should be joined at the hip with Chief Technology Officers and Information Security professionals to enable the migration of analytics data to the Cloud and the configuration of access tools on the chosen platform (e.g., MS Azure or AWS).
The Challenge:
– Chief Risk Officers have fears of data breaches. However, many may not be familiar with the latest cloud data security protocols and technologies that allow for creating highly secure data zones.
– Information Security departments are often woefully understaffed versus other parts of IT. They lack the skillsets and talent to configure newer tools and to set up the security protocols and controls needed for getting data to the Cloud and allowing access.
– Other IT departments may similarly lack configuration and API skills for secure cloud tool integration.
– Vendors are focused on matching the business problem to the software solution but may neglect to mention the level of skills needed for proper deployment.
Recommended solutions:
– Bring your Risk and Compliance teams into software or technology purchase decisions early.
– Collaborate with Information Security to map the implementation journey before proceeding with large scale cloud based data science deployments.
– Before buying new Cloud based big data platforms or ML and Data Ops tools, create a Data Science and Cloud Data Literacy program. Most data literacy programs focus on data quality and governance but lack a focus on data science and Cloud big data tools. Such a program should include curriculum or learning paths for Cloud data security.
– Ensure the business case for new Cloud tools includes the total cost of ownership, including the best fit-for-purpose resource costs for the first five years post deployment. This is a journey, and the right skill sets are essential at every stage.
– Consider whether your market has the right talent pool to run your selected Cloud ecosystem. Ensure Information Security has the right skillsets to configure and deploy the tools.
– Vendors will provide rapid cycle training, but the business must understand these programs and how quickly its team members will come up to speed.
– Understand what Anonymization and Generative AI solutions are available, as these can also help protect data in the Cloud.
– Create an Identity Graph and keying system of standard identifiers to help protect data and ensure restricted data types are secured.
– Establish a Cloud Data Risk Committee and map all potential risks and the controls to be implemented across all functional areas (e.g., Information Security, Data Science, and Technology) to understand the unique issues related to AI/Analytics in the Cloud.
– Data scientists must map out their use cases, explaining the analytics and data matching to be performed:
o Transaction analysis
o Network analysis
o Sentiment analysis
o Risk analysis
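As one illustration of the keying and anonymization recommendations above, restricted identifiers can be replaced with keyed tokens before data is moved to the Cloud. The following is a minimal Python sketch under stated assumptions, not a definitive implementation: it assumes HMAC-SHA256 as the keying method, and the names pseudonymize and protect_record, the sample record, and the demo key are all hypothetical. Strictly speaking this is pseudonymization rather than full anonymization, and in practice the secret key would be held in a key management service, not in code.

```python
import hashlib
import hmac


def pseudonymize(identifier: str, secret_key: bytes) -> str:
    """Return a keyed token for an identifier (hypothetical helper)."""
    # Normalize so that trivially different spellings map to the same token.
    normalized = identifier.strip().lower()
    return hmac.new(secret_key, normalized.encode("utf-8"), hashlib.sha256).hexdigest()


def protect_record(record: dict, restricted_fields: set, secret_key: bytes) -> dict:
    """Replace restricted fields with tokens; leave other fields untouched."""
    return {
        field: pseudonymize(str(value), secret_key) if field in restricted_fields else value
        for field, value in record.items()
    }


# Illustrative data only; the field names are assumptions.
record = {"customer_id": "C-1001", "email": "Jane@Example.com", "segment": "retail"}
safe_record = protect_record(record, {"customer_id", "email"}, b"demo-key-not-for-production")
```

Because the same identifier and key always produce the same token, data sets can still be matched on the token for the use cases listed above without exposing the raw value.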
I hope this article helps frame some of the issues and recommended solutions. I look forward to your thoughts and any additions you might have.
Tony Branda
I’m the Chief Data and Analytics Officer for Ahli United Bank. Previously, I was CDAO for ASB Bank Limited — a wholly-owned subsidiary of Commonwealth Bank of Australia, CAO for Citibank North America, and CDO for Embrace Home Loans. I was also a Clinical Professor of Marketing and Customer Intelligence at Pace University’s Lubin Graduate School of Business for nine years. I’m published in the Journal of Marketing Analytics and CIO/MIT Sloan Magazine. My business expertise includes Artificial Intelligence (AI), business analytics, data science and engineering, CRM, marketing analytics, and customer intelligence as a business strategy. I hold an M.B.A and a Ph.D. in Marketing from Pace University. I’m the founder of the Analytics Hall of Fame, a rewards and online recognition community.