The document discusses the need for data scientists to develop stronger privacy skills and knowledge. It notes that while most data scientists accept the importance of privacy, the profession overall lacks in-depth understanding of privacy principles and regulations. As data science tools are used for more high-risk applications and public concern over data privacy grows, it is essential for data scientists to learn about privacy challenges and solutions. The document provides a knowledge map to help data scientists and organizations assess privacy skill levels and identify training needs for different roles.
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
IAPP - Skills For Minimizing Privacy Risk in Data Science Product and Service Development
1. FEATURES
A DETAILED PRIVACY
KNOWLEDGE MAP
FOR DATA SCIENCE
DATA SCIENCE
A Professional's Guide to Essential Privacy Knowledge
Skills For Minimizing Privacy Risk in Data Science Product and Service Development
2. Iapp.org
Data science has unlimited potential as a business and scientific tool.
All it needs to fulfill its promise is, obviously, data.
That often means personal data collected online and by personal
devices. Intimate facts and details sent over public networks, merged
into bigger and bigger data sets, and used to peer deeper into
consumer behavior – exactly what consumers are worried about.
Legislators and regulatory officials have responded by steadily giving
consumers more legal rights to restrict the collection and use of their
data. Data science could be one monumental scandal away from a
plunge in opt-in rates that chokes off critical information flows.
“As we throw more technology and processing power at answering
questions, the potential for harm has risen. If consumers lack trust,
there will be an increase in requests for data deletion and
rectification,” said Aurelie Pols, founder of Aurelie Pols and
Associates, a digital strategy consultancy. With backgrounds in data
analytics and privacy, she sees mounting privacy challenges for the
data science profession.
“For example, there is growing concern about predictive analytics, and
not enough discussion about false positives and negatives. When I use
data to predict if you prefer a banana milkshake or a strawberry
yogurt, there are no serious consequences. If I get nine out of ten
right, that’s fine. But if law enforcement is using data science to
determine guilt or innocence, and someone could get a 10-year jail
sentence, the certainty has to be beyond 99.9 percent. The stakes are
getting much higher and the pendulum is swinging back toward
privacy,” Pols said.
The best way for data scientists to head off a full-on consumer
backlash is to show they can be trusted to use sensitive information
responsibly. To do that, they have to learn privacy.
3. GETTING PAST ‘NO’
Although consumer unease drives personal data regulation,
consumers are not the first audience data scientists have to win over.
Data scientists don’t usually get data directly from consumers. They
have to ask customer-facing departments like sales and customer
service for permission to access personal data. Then they need IT
departments to grant that access.
Departments that collect and control data are also responsible for
meeting legal and regulatory privacy requirements. They are
answerable to executives, customers and regulators for privacy and
security incidents. They are unlikely to share sensitive data with data
scientists who have little or no working knowledge of privacy,
according to Katharine Jarmul, principal data scientist at global
software consultancy Thoughtworks.
“The key to avoiding ‘no’ conversations with people who control data
is awareness of privacy technology and law. If data scientists
understand privacy, they can get into back-and-forth conversations
about data access rather than just hitting a wall. Then they can turn
‘no’s’ into ‘maybes’ and ‘yesses,’” Jarmul said. “There is a lot of upside
for data scientists who understand privacy. If they can go into
conversations with data controllers with recommendations about
privacy technologies and techniques, it will help them access data
they have been refused in the past.”
International Association of Privacy Professionals
MAP YOUR
PRIVACY SKILLS
The knowledge map “Data Science
Pros: Build Your Privacy Muscle”
ranks privacy skills as “need to
know,” “should know,” “good to
know,” and “non-essential” by
position. Individual data scientists
can use it to discover which privacy
training they need in their
positions. Directors, managers and
executives can map out
organization-wide privacy learning
strategies and plan targeted
training for their staffs.
Visit iapp.org/training
for more information.
Privacy can be an exciting part of
data science – an interesting
technical problem to solve. It’s not
just a stodgy, boring issue for lawyers.
Katherine Jarmul,
Principal data scientist, Thoughtworks
“
”
4. DATA SCIENCE’S PRIVACY DEFICIT
In a profession fascinated by mathematical and technical challenges, it is not surprising to find
attitudes toward a seemingly peripheral subject like privacy ranging from acceptance to hostility. The
former is more common. The latter is primarily among those who see privacy as an obstacle to tapping
new data sources, or who have had bad experiences with other departments, according to Jarmul.
Across the profession, however, most data scientists accept the practical and ethical need to
understand privacy, according to Rebecca Weiss, former director of data science at Mozilla. The
problem is that most data scientists lack working knowledge of data privacy principles.
“I’ve never met a data scientist who is cavalier about privacy. But when it comes to deeper knowledge
and skills, we are not there yet,” Weiss said. “Considering the impact of laws like the GDPR and CCPA,
and the global outlook for more privacy regulation, as a profession we need more knowledge. If
compliance requirements change and that affects how you have to manage your data, you have to
recognize that something has changed and respond.”
Data scientists do not have to be privacy experts any more than lawyers need to be technology
experts. But just as lawyers involved in product development should understand relevant technology
principles, so should data scientists need to know laws and regulations that apply to their roles. A data
modeler, for example, needs only a passing knowledge of law and policy but should be well-schooled in
privacy by design. Consult the accompanying knowledge map to see which privacy skills align with
specific data science roles.
With data science’s role in business growing and the public’s skepticism rising, creating top-to-bottom
privacy skills and knowledge in data science departments is essential.
“Companies that are dealing with user-level information and haven’t factored privacy into their
long-term planning are in danger. Especially medium and small companies, because of the fines they
face,” Weiss said. “If you are a data scientist asking other parts of the company for access to their
data, they will ask you what you’re going to use their data for, and what you’ve done to ensure privacy
in your products.”
International Association of Privacy Professionals
Considering the impact of laws like the GDPR
and CCPA, and the global outlook for more
privacy regulation, as a profession we need
more knowledge.
Rebecca Weiss
Former director of data science, Mozilla
“
”
6. iapp.org
LEARNING PRIVACY
The current generation of data scientists haven’t had many opportunities to learn privacy. It is only starting to
make its way into college curriculums, so most data scientists with privacy skills are self- taught, Jarmul said.
Data science organizations that want to raise their privacy IQs will have to train their people themselves.
Fortunately, the profession is comprised of eager learners who like to burnish their credentials with training
and certifications.
“I am seeing a lot more data scientists with privacy-related certifications, coursework and self-training. It helps
prepare them to manage the laws and regulations that affect their work,” Jarmul said. “There are a lot of
regulations focused on data sovereignty, rights to explanations and data lineage that are relevant to data
science. Data scientists who understand the regulations can talk to data controllers realistically about how to
use sensitive data.”
Convincing data scientists that privacy training is worth their time is getting easier with each well-publicized
privacy breach, according to Pols.
“In my first year of teaching ethics at a university in Spain, students would ask why we had to talk about
privacy. They didn’t care,” Pols said. “Over the years, as there have been more and more privacy breach issues,
interest grew steadily. When Cambridge Analytica happened, that interest became concern. They wanted to
learn about legal and technological tools and how to work with other stakeholders to minimize harm.”
There are opportunities for privacy-savvy data scientists at all sizes of organizations, but particularly small and
medium-sized companies, Weiss said. Lacking the resources of large corporations, they can’t compete for
people with experience in data science and privacy. Privacy training creates openings for younger data
scientists just breaking into the field. It also gives experienced data scientists even more mobility in a market
desperate for their skills.
BENEFITS OVER HARM
Data science is capable of enormous societal good. It is an economic growth engine and a powerful tool for
public health, scientific discovery, and technological innovation. It is also at risk of developing a reputation as
a threat to personal data privacy.
The most effective way to re-focus attention on data science’s benefits is for data scientists to embrace
privacy as an ethical and professional obligation. Data scientists must be trained to anticipate the privacy
implications of new algorithms and applications and proactively offer solutions. They must question the
provenance of shared data, and whether they can legally and ethically process it the way they want. Such
knowledge, underpinning a sincere commitment to protecting personal privacy, will help ensure a steady flow
of new data that enables data scientists to do what they want most: find answers to the hardest questions.
“Privacy can be an exciting part of data science – an interesting technical problem to solve,” Jarmul said. “It’s
not just a stodgy, boring issue for lawyers.”