Slides for breakout session given at 2020 Pluralsight LIVE online conference. Description:
In September 2019, McKinsey called for the rise of the in-house “analytics academy” to up-skill employees’ data literacy and prepare organizations for the changes from artificial intelligence and automation. In this session, I will share tips for designing effective data analytics training programs acquired from working with leading technology and development organizations. Attendees will gain a step-by-step blueprint for building a data analytics academy, making the case to management, to identifying candidates and topics, to measuring its effectiveness.
In September 2019, leading consulting firm, McKinsey, called for the rise of the in-house “analytics academy” to up-skill employees’ data literacy and prepare organizations for the changes from artificial intelligence and automation.
In this talk, George Mount will share tips acquired for designing effective data analytics training programs after over a year of developing technical content professionally. Attendees will learn how to hire and work with subject matter experts, develop effective technical content, and evaluate a program’s effectiveness.
George hopes to bridge the gap between content developers, subject matter experts, and technical writers in developing successful data analytics content and programs.
Learning & Development professionals and instructional designers will also gain tips for building the analytics academy that is essential to their organizations.
Takeaways: Attendees will have concrete steps to take in building their organization’s analytics academy.
Building a Data Academy: Presentation to Pittsburgh Chapter, Association for ...George Mount
In an information age, data powers business; it’s moved from a desirable to an essential workforce skillset. Tasked with the effort to build a data-driven culture and strategy, talent development leaders may rush to hire throngs of data scientists to implement cutting-edge artificial intelligence (AI) techniques. But the most data-savvy cultures take time and are built from within, by upskilling current talent.
In this presentation, Stringfest Analytics CEO George Mount will offer a roadmap for organizations to develop data training academies. He’ll explain the dangers of rushing data strategy or attempting to hire away the problem and provide concrete steps to get started with the process. Attendees will pick up techniques to sketch, implement and ultimately measure the impact of their data academy.
George Mount is the founder and CEO of Stringfest Analytics, a consulting firm specializing in analytics education. He has worked with leading bootcamps, learning providers, and practice organizations to help individuals excel at analytics. The author of the O’Reilly Media book, Advancing into Analytics: From Excel to Python and R, he also blogs on analytics and data education at stringfestanalytics.com.
George holds a bachelor’s degree in economics from Hillsdale College and master’s degrees in finance and information systems from Case Western Reserve University. He resides in Cleveland, Ohio.
It’s time to open-source your data tools and processesGeorge Mount
When I took an empirical finance in graduate school [redacted] years ago, the course was conducted in SAS. I’ve never used SAS since: it’s too expensive!
That same course today is conducted in Python, which is a free- and open-source tool. Even the starched-collar, highly-regulated finance industry has opened up to open-source tools. They’ve become a part of many organization data strategies.
But I propose that companies go a step further by not just consuming open-source data tools, but producing them too. Here’s an example of what I mean.
Data is private and proprietary, but that doesn’t mean the tools used to analyze that data should be.
Most organizations have already moved from expensive proprietary tools like SAS to free, open-source ones like R. The advanced organizations, however, have even moved to producing open-source tools, not just consuming them.
It’s another angle to consider in the quest for data up-skilling: cultivate top talent, release solid tools for free, and watch more top talent take notice.
The big HRIS players continue to consolidate limiting options and innovation, but there are tons of startups popping up everywhere... who are they and what do they do? Find out here.
Sometimes when you are starting on an idea for a project you dont know where or how to start. This is a tried and tested strategy that gets you going. From inspiration to organization, tools to knowledge, all you need to know to build the next great app.
In September 2019, leading consulting firm, McKinsey, called for the rise of the in-house “analytics academy” to up-skill employees’ data literacy and prepare organizations for the changes from artificial intelligence and automation.
In this talk, George Mount will share tips acquired for designing effective data analytics training programs after over a year of developing technical content professionally. Attendees will learn how to hire and work with subject matter experts, develop effective technical content, and evaluate a program’s effectiveness.
George hopes to bridge the gap between content developers, subject matter experts, and technical writers in developing successful data analytics content and programs.
Learning & Development professionals and instructional designers will also gain tips for building the analytics academy that is essential to their organizations.
Takeaways: Attendees will have concrete steps to take in building their organization’s analytics academy.
Building a Data Academy: Presentation to Pittsburgh Chapter, Association for ...George Mount
In an information age, data powers business; it’s moved from a desirable to an essential workforce skillset. Tasked with the effort to build a data-driven culture and strategy, talent development leaders may rush to hire throngs of data scientists to implement cutting-edge artificial intelligence (AI) techniques. But the most data-savvy cultures take time and are built from within, by upskilling current talent.
In this presentation, Stringfest Analytics CEO George Mount will offer a roadmap for organizations to develop data training academies. He’ll explain the dangers of rushing data strategy or attempting to hire away the problem and provide concrete steps to get started with the process. Attendees will pick up techniques to sketch, implement and ultimately measure the impact of their data academy.
George Mount is the founder and CEO of Stringfest Analytics, a consulting firm specializing in analytics education. He has worked with leading bootcamps, learning providers, and practice organizations to help individuals excel at analytics. The author of the O’Reilly Media book, Advancing into Analytics: From Excel to Python and R, he also blogs on analytics and data education at stringfestanalytics.com.
George holds a bachelor’s degree in economics from Hillsdale College and master’s degrees in finance and information systems from Case Western Reserve University. He resides in Cleveland, Ohio.
It’s time to open-source your data tools and processesGeorge Mount
When I took an empirical finance in graduate school [redacted] years ago, the course was conducted in SAS. I’ve never used SAS since: it’s too expensive!
That same course today is conducted in Python, which is a free- and open-source tool. Even the starched-collar, highly-regulated finance industry has opened up to open-source tools. They’ve become a part of many organization data strategies.
But I propose that companies go a step further by not just consuming open-source data tools, but producing them too. Here’s an example of what I mean.
Data is private and proprietary, but that doesn’t mean the tools used to analyze that data should be.
Most organizations have already moved from expensive proprietary tools like SAS to free, open-source ones like R. The advanced organizations, however, have even moved to producing open-source tools, not just consuming them.
It’s another angle to consider in the quest for data up-skilling: cultivate top talent, release solid tools for free, and watch more top talent take notice.
The big HRIS players continue to consolidate limiting options and innovation, but there are tons of startups popping up everywhere... who are they and what do they do? Find out here.
Sometimes when you are starting on an idea for a project you dont know where or how to start. This is a tried and tested strategy that gets you going. From inspiration to organization, tools to knowledge, all you need to know to build the next great app.
Beyond Fast: How to fund, design, build, and monetize your ambitious software...Originate
"Beyond Fast: How to fund, design, build, and monetize your ambitious software products and start-ups -- faster" presented by Rob Meadows, CEO of Originate at ICMA 2014.
Staying Competitive in Data Analytics: Analyze Boulder 20140903Richard Hackathorn
Presentation to Analyze Boulder on Sept 3 2014 by the Data Detectives of Boulder (https://www.linkedin.com/groups?home=&gid=6525462). Sharing our experiences over the past 3 years with MOOCs, Kaggle, etc.
Si è tornato a parlare molto di Machine Learning negli ultimi anni. Grazie anche al fatto che è possibile oggi processare enormi moli di dati in tempi (relativamente) veloci questa parte dell'informatica sta vivendo una seconda giovinezza.
In questa sessione vedremo cos'è il machine learning, quali sono le diverse casistiche tecniche e funzionali in cui può essere usato ed inizieremo a "giocare" con i dati per vedere fin dove possiamo spingerci, usando strumenti On-Premise e quindi spostandoci poi sull'offerta Azure Machine Learning dove, una volta fatta propria la teoria, si possono realizzare soluzioni estremamente complesse in modo molto visuale, oppure integrandosi con R ed IPython e sfruttare la scalabilità di Azure per avere performance ottimali. Il tutto senza dimenticare che gli algoritmi così ottenuti possono essere facilmente integrati nelle nostre applicazioni semplicemente invocando un web service.
A detailed look at how TYE (TiE Youth Entrepreneurs) Oregon runs its innovation and entrepreneurship program for high school students. Schedules, timelines, goals, tools included. Presented at TYE Global competition 2016 in Portland, OR.
Warren Buffet would often think of companies as castles with a competitive moat protecting the business. Products or companies that figure out how to build and leverage differentiated data assets will be best positioned to win their respective markets. This talk describes the properties of a good data moat, why it matters, and how to go about building them within your organization.
Selling xAPI / Getting Buy-in for TorranceLearning Download May 2016TorranceLearning
In this presentation Art Werkenthin of RISC and Megan Torrance of TorranceLearning discuss ways to address the concerns of the C-Suite, management, learners, IT, the L&D team and your vendors for an xAPI implementation.
Blogging Effectively about Coding (WordCamp Denver 2020)George Mount
Technical blogging has meant everything to my career. My clients have come from the content I produce.
I also see technical blogging as a way to build my community’s knowledge base and make learning social. Some of my most popular blog posts come from a friend asking about how to do something in Excel.
Full talk description below:
The WordPress community thrives when we share our technical expertise with each other. But, we’ve likely all met someone who is technically brilliant but struggles as a teacher.
In this talk, I will share my tips and tricks for developing effective technical instruction from over six years of blogging and online content creation. While my background is in data analytics, users in web development, technical writing and other fields will benefit from a discussion of blogging effectively about tech (particularly code).
Not only will attendees have a path forward for developing solid technical blog posts, I will lay out the win-win that this skill will have on their careers and the community as a whole.
Demo guide: The central limit theorem, visualized in ExcelGeorge Mount
Excel is not just a powerful tool for doing data, but for learning it: spreadsheets provide an unparalleled opportunity to look at the data, and watch it take shape before your eyes.
MailChimp data scientist John Foreman stated the advantage of learning data analysis with Excel like this: “You get to look at the data every step of the way, building confidence while learning the tricks of the trade.”
That’s why my demo guide on the central limit theorem, one of the most important statistical principles, is based not in a statistical programming language, but Excel.
What’s more, my example isn’t based on some sophisticated dataset or decontextualized random numbers, but the real-world example of a roulette wheel.
And, while the example of a roulette wheel may come from real life, the central limit theorm that it illustrates seems almost magical.
More Related Content
Similar to Building the Data Academy (Pluralsight LIVE presentation)
Beyond Fast: How to fund, design, build, and monetize your ambitious software...Originate
"Beyond Fast: How to fund, design, build, and monetize your ambitious software products and start-ups -- faster" presented by Rob Meadows, CEO of Originate at ICMA 2014.
Staying Competitive in Data Analytics: Analyze Boulder 20140903Richard Hackathorn
Presentation to Analyze Boulder on Sept 3 2014 by the Data Detectives of Boulder (https://www.linkedin.com/groups?home=&gid=6525462). Sharing our experiences over the past 3 years with MOOCs, Kaggle, etc.
Si è tornato a parlare molto di Machine Learning negli ultimi anni. Grazie anche al fatto che è possibile oggi processare enormi moli di dati in tempi (relativamente) veloci questa parte dell'informatica sta vivendo una seconda giovinezza.
In questa sessione vedremo cos'è il machine learning, quali sono le diverse casistiche tecniche e funzionali in cui può essere usato ed inizieremo a "giocare" con i dati per vedere fin dove possiamo spingerci, usando strumenti On-Premise e quindi spostandoci poi sull'offerta Azure Machine Learning dove, una volta fatta propria la teoria, si possono realizzare soluzioni estremamente complesse in modo molto visuale, oppure integrandosi con R ed IPython e sfruttare la scalabilità di Azure per avere performance ottimali. Il tutto senza dimenticare che gli algoritmi così ottenuti possono essere facilmente integrati nelle nostre applicazioni semplicemente invocando un web service.
A detailed look at how TYE (TiE Youth Entrepreneurs) Oregon runs its innovation and entrepreneurship program for high school students. Schedules, timelines, goals, tools included. Presented at TYE Global competition 2016 in Portland, OR.
Warren Buffet would often think of companies as castles with a competitive moat protecting the business. Products or companies that figure out how to build and leverage differentiated data assets will be best positioned to win their respective markets. This talk describes the properties of a good data moat, why it matters, and how to go about building them within your organization.
Selling xAPI / Getting Buy-in for TorranceLearning Download May 2016TorranceLearning
In this presentation Art Werkenthin of RISC and Megan Torrance of TorranceLearning discuss ways to address the concerns of the C-Suite, management, learners, IT, the L&D team and your vendors for an xAPI implementation.
Blogging Effectively about Coding (WordCamp Denver 2020)George Mount
Technical blogging has meant everything to my career. My clients have come from the content I produce.
I also see technical blogging as a way to build my community’s knowledge base and make learning social. Some of my most popular blog posts come from a friend asking about how to do something in Excel.
Full talk description below:
The WordPress community thrives when we share our technical expertise with each other. But, we’ve likely all met someone who is technically brilliant but struggles as a teacher.
In this talk, I will share my tips and tricks for developing effective technical instruction from over six years of blogging and online content creation. While my background is in data analytics, users in web development, technical writing and other fields will benefit from a discussion of blogging effectively about tech (particularly code).
Not only will attendees have a path forward for developing solid technical blog posts, I will lay out the win-win that this skill will have on their careers and the community as a whole.
Demo guide: The central limit theorem, visualized in ExcelGeorge Mount
Excel is not just a powerful tool for doing data, but for learning it: spreadsheets provide an unparalleled opportunity to look at the data, and watch it take shape before your eyes.
MailChimp data scientist John Foreman stated the advantage of learning data analysis with Excel like this: “You get to look at the data every step of the way, building confidence while learning the tricks of the trade.”
That’s why my demo guide on the central limit theorem, one of the most important statistical principles, is based not in a statistical programming language, but Excel.
What’s more, my example isn’t based on some sophisticated dataset or decontextualized random numbers, but the real-world example of a roulette wheel.
And, while the example of a roulette wheel may come from real life, the central limit theorm that it illustrates seems almost magical.
In solving analytics problems with software, the poor craftsman blames his tools... but at the same time, if they only tool one has is a hammer, everything looks like a nail.
I have identified four categories of applications that an analyst should have at their disposal: spreadsheets, databases, BI/dashboard platforms and programming languages.
None of these is "better" than another: they are all slices of the same stack. The superior analyst has the wisdom to choose the right set of tools for the right task.
"Blended learning" has become a trendy concept in education, but what is it, and what does it look like in the context of professional data education & data academies?
I provide a brief overview of the concept of blended learning and propose how to apply it to data education using four delivery methods: in-person learning, online study, work-based projects and hackathons.
Teaching coding: What is pair programming?George Mount
Pair programming is an exercise that's more or less what it sounds like, which is programming in pairs.
In this framework, a “driver” and a “navigator” trade off on roles. The driver is behind the "wheel," or keyboard, pushing ahead toward the destination. The navigator is in the co-pilot position, making sure the driver stays on course.
Pair programming is a very controversional practice in tech, but for the purposes of learning it can be a great occasional aide, because it makes students think out loud about their work and helps with giving and receiving feedback -- a huge part of professional success in general, not just for coding.
Hiring data scientists doesn't make a data cultureGeorge Mount
Some organizations try to kick-start their data capacities by hiring loads of highly-trained data scientists or investing in machine learning infrastructures.
This will not be a successful strategy because it surpasses necessary steps in building a data culture. Organizations are better off up-skilling the talent they have.
Building the data academy: Measuring ROIGeorge Mount
Most executives won’t get too excited about building something as grandiose-sounding as a “data academy,” but the alternative is even less feasible. A data-upskilled workforce is essential to compete, and building a data academy to get there is the highest-ROI strategy.
Ultimately, you’ll see that education is the most effective strategy for the necessary improvement of your organizaton’s data capacities.
Building your data academy: Assessing candidate skillsGeorge Mount
Analytics is a rapidly evolving field of high business value, so your organization needs a robust up-skilling strategy. I call the type of institutionalized data education platform organizations need a “data academy.”
Enrollment in the academy is a good time to take stock of candidate skills: after all, the nature of the education depends on the current skill levels of its students. So, what are some ways to assess the data skills of your candidates in relation to what your organization needs?
Take the time to assess your candidate data skills and motivations and you’ll be surprised at the talent you’ve got in house. The data academy will cultivate that talent.
Teaching coding: What is a faded example?George Mount
Imagine a faded example as a sort of fill-in-the-blank challenge, but with code. You can start students with a nearly completed block of code, but ask them to fill in some blanks.
Over time, you can ask for more and more blanks until the student is essentially coding the whole problem. This arrangement is often known as “scaffolding.”
The benefits of faded examples
This approach helps students work from example to example, learning progressively more about the code as they advance. It introduces an element of interactivity and debugging into the exercise in a way that starting off a blank file does not.
YouTube is not a data upskilling strategyGeorge Mount
Don’t get me wrong, there is some mind-bending educational content on YouTube. It’s a great place to learn.
But, if you are counting on your employees to use YouTube (or really any autodidactic medium) for their data upskilling, you are setting up your company for failure. Here’s why.
Five myths about learning data analyticsGeorge Mount
Myth 1: "I need to go to school for that:
Myth 2: "I won't like it because I don't like math"
Myth 3: "I don't need it to do my job"
Myth 4: "I *only* need it to do my job"
Myth 5: "I'm not the right 'type'"
Fact: You can learn data analytics.
Slides presented at the Greater Cleveland R User Group on July 31st, 2019.
This will be an introduction to the survey and measure development process with emphasis on applications in R. We will first explore and visualize self-report survey data, then walk through exploratory factor analysis. Finally, we will formally test our measurement model and use it to predict outside variables of interest.
It's not what you know, it's how you show.George Mount
-Are you an artist?
-What is ‘hierarchying?’
-Have you bought into the ‘biker bar fallacy?’
We will discuss these and more. Ultimately, this talk challenges professionals to consider themselves as artists, not workers, and what this distinction means for effective networking.
Slides presented at the Greater Cleveland R User Meetup group on the statistical concept of mediation using the lavaan package for structural equation modeling.
Hey, Analyst! Learn Some Content MarketingGeorge Mount
This is a preview of my forthcoming course on networking for analysts. Learn more at georgejmount.com/networkingcourse
In a freelance-based economy, we will all have to sell our skills through content marketing -- even "data nerds." Here are some of the key lessons I've taken from content marketing, and how analysts can use it to build a stronger network.
This is a preview of my course on networking for analysts. Learn more at georgejmount.com/networkingcourse.
I was introduced to the concept of the "social employee" by Mark and Cheryl Burgess, who argue that companies can build a strong brand through the content and creation of their own employees.
I took this concept to heart and became a "social analyst," getting the attention of peers and influencers. Here is why even chart monkeys should strive to be social.
Excel for Healthcare: Intro to Data VisualizationGeorge Mount
Healthcare produces massive amounts of data. Use data visualization to make meaning of it all.
This is an introduction to some of the most powerful data visualization principles I've learned as a healthcare analyst.
In the course we will cover examples of how to visualize healthcare data using Microsoft Excel. Learn more at georgejmount.com/excelforhealthcare
Data analytics is changing business. The response to prepare for this demand is an emphasis on STEM over a classical liberal arts education.
But what if liberal arts is foundational to good use of data?
I speak with Matt Walje, Project Manager, trends analysis at Oceans Beyond Piracy, on how the liberal arts can give you an edge to quantitative analysis.
Data analytics is changing business. The response to prepare for this demand is an emphasis on STEM over a classical liberal arts education.
But what if liberal arts can give an edge to data science? I argue that data science is just another form of science, and that science has always had a hallowed position in the liberal arts.
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Subhajit Sahu
Abstract — Levelwise PageRank is an alternative method of PageRank computation which decomposes the input graph into a directed acyclic block-graph of strongly connected components, and processes them in topological order, one level at a time. This enables calculation for ranks in a distributed fashion without per-iteration communication, unlike the standard method where all vertices are processed in each iteration. It however comes with a precondition of the absence of dead ends in the input graph. Here, the native non-distributed performance of Levelwise PageRank was compared against Monolithic PageRank on a CPU as well as a GPU. To ensure a fair comparison, Monolithic PageRank was also performed on a graph where vertices were split by components. Results indicate that Levelwise PageRank is about as fast as Monolithic PageRank on the CPU, but quite a bit slower on the GPU. Slowdown on the GPU is likely caused by a large submission of small workloads, and expected to be non-issue when the computation is performed on massive graphs.
Adjusting primitives for graph : SHORT REPORT / NOTESSubhajit Sahu
Graph algorithms, like PageRank Compressed Sparse Row (CSR) is an adjacency-list based graph representation that is
Multiply with different modes (map)
1. Performance of sequential execution based vs OpenMP based vector multiply.
2. Comparing various launch configs for CUDA based vector multiply.
Sum with different storage types (reduce)
1. Performance of vector element sum using float vs bfloat16 as the storage type.
Sum with different modes (reduce)
1. Performance of sequential execution based vs OpenMP based vector element sum.
2. Performance of memcpy vs in-place based CUDA based vector element sum.
3. Comparing various launch configs for CUDA based vector element sum (memcpy).
4. Comparing various launch configs for CUDA based vector element sum (in-place).
Sum with in-place strategies of CUDA mode (reduce)
1. Comparing various launch configs for CUDA based vector element sum (in-place).
As Europe's leading economic powerhouse and the fourth-largest hashtag#economy globally, Germany stands at the forefront of innovation and industrial might. Renowned for its precision engineering and high-tech sectors, Germany's economic structure is heavily supported by a robust service industry, accounting for approximately 68% of its GDP. This economic clout and strategic geopolitical stance position Germany as a focal point in the global cyber threat landscape.
In the face of escalating global tensions, particularly those emanating from geopolitical disputes with nations like hashtag#Russia and hashtag#China, hashtag#Germany has witnessed a significant uptick in targeted cyber operations. Our analysis indicates a marked increase in hashtag#cyberattack sophistication aimed at critical infrastructure and key industrial sectors. These attacks range from ransomware campaigns to hashtag#AdvancedPersistentThreats (hashtag#APTs), threatening national security and business integrity.
🔑 Key findings include:
🔍 Increased frequency and complexity of cyber threats.
🔍 Escalation of state-sponsored and criminally motivated cyber operations.
🔍 Active dark web exchanges of malicious tools and tactics.
Our comprehensive report delves into these challenges, using a blend of open-source and proprietary data collection techniques. By monitoring activity on critical networks and analyzing attack patterns, our team provides a detailed overview of the threats facing German entities.
This report aims to equip stakeholders across public and private sectors with the knowledge to enhance their defensive strategies, reduce exposure to cyber risks, and reinforce Germany's resilience against cyber threats.
Show drafts
volume_up
Empowering the Data Analytics Ecosystem: A Laser Focus on Value
The data analytics ecosystem thrives when every component functions at its peak, unlocking the true potential of data. Here's a laser focus on key areas for an empowered ecosystem:
1. Democratize Access, Not Data:
Granular Access Controls: Provide users with self-service tools tailored to their specific needs, preventing data overload and misuse.
Data Catalogs: Implement robust data catalogs for easy discovery and understanding of available data sources.
2. Foster Collaboration with Clear Roles:
Data Mesh Architecture: Break down data silos by creating a distributed data ownership model with clear ownership and responsibilities.
Collaborative Workspaces: Utilize interactive platforms where data scientists, analysts, and domain experts can work seamlessly together.
3. Leverage Advanced Analytics Strategically:
AI-powered Automation: Automate repetitive tasks like data cleaning and feature engineering, freeing up data talent for higher-level analysis.
Right-Tool Selection: Strategically choose the most effective advanced analytics techniques (e.g., AI, ML) based on specific business problems.
4. Prioritize Data Quality with Automation:
Automated Data Validation: Implement automated data quality checks to identify and rectify errors at the source, minimizing downstream issues.
Data Lineage Tracking: Track the flow of data throughout the ecosystem, ensuring transparency and facilitating root cause analysis for errors.
5. Cultivate a Data-Driven Mindset:
Metrics-Driven Performance Management: Align KPIs and performance metrics with data-driven insights to ensure actionable decision making.
Data Storytelling Workshops: Equip stakeholders with the skills to translate complex data findings into compelling narratives that drive action.
Benefits of a Precise Ecosystem:
Sharpened Focus: Precise access and clear roles ensure everyone works with the most relevant data, maximizing efficiency.
Actionable Insights: Strategic analytics and automated quality checks lead to more reliable and actionable data insights.
Continuous Improvement: Data-driven performance management fosters a culture of learning and continuous improvement.
Sustainable Growth: Empowered by data, organizations can make informed decisions to drive sustainable growth and innovation.
By focusing on these precise actions, organizations can create an empowered data analytics ecosystem that delivers real value by driving data-driven decisions and maximizing the return on their data investment.
I’m going to use “data academy” as the concept here, of course it’s catchier than data training program so that’s one plus, but I also think it as a more systematic “analytics training programs.” but something more involved, more institutionalized. Just catchier and also more holistic.
So I’ll focus on four elements here –
First, how to sell it, understanding the market forces that are at work and what’s at stake.
Then, designing it, focusing on what the learner needs and expects and delivering that.
Next, implementing it, so we will look at some more nuts and bolts concepts in developing analytics training programs, things that are specific to data analytics itself.
Finally, evaluating it, how can we measure the kind of ROI we are going to get from this program?
Here are a couple of screenshots I have found recently about this catchy idea of a “data academy.” You will see that these are both coming from banking in the UK and yes knowledge intensive industries like banking are definitely the right candidates for this kind of stuff.
But you are likely going to see this idea of a “data academy” in the news quite a lot across industries.
[Animation] we can even think about Amazon’s upskilling initiative as a data academy, look here how nearly all these roles are data-intensive, some even have data right in the name! So it’s coming.
So I mentioned this in my talk description as another one of these “market moving” events in this idea of a data academy. So you would think hey, McKinsey knows what they are talking about, let’s just pull that up everybody and I am going to stop talking! Well, not so fast.
I would recommend this article, they have a lot to say, but their perspective is a little different than what I am going for. They are a little off the mark from my own experience and from looking at the market what seems to be the demand.
One of the problems of over-emphasizing the AI bit of this is we start to think that all this data stuff is magic, and we start to put the cart before the horse.
So we talk about all this AI taking over the world stuff and yes it is certainly a game changer but let’s be honest for a lot of organizations your tech capacities are stretched to be able to print something. Very easy to get into the hand-waving about how this data analytics academy is supposed to go, but we need to start where we are at.
So what are some of these real-life pain points for data analytics?
And to demonstrate why I believe this is likely, here are some figures. I like this idea of a data breadline, where you have suppliers of data intelligence and consumers of data intelligence . The issue with this model is that data needs to become baked in (pun, again) to the organization.
What I am not saying is that everyone has to be the next AI engineer of their company, that’s kind of the opposite point I want to make.
What I am saying is that we are still using data hand to mouth. It’s seen as a scarce good of which some people are producers and some people are consumers. You have to wait your turn to get data and you need to be qualified to get data. And by the time you get that data it’s going to be day-old (ha) and backward looking.
So we see hints of these symptoms here in these results, in this survey asking about building a data culture and using data as a business asset. We just aren’t there yet.
Ok so let’s say that we have been watching the market, we are seeing our competitors building data academies, now we need to think about how to get this going in our organization. Knowing most employers, they can’t get too excited about building an academy, that sounds like a lot of work, so they may come up with some “hacks” to upskill your data analysts, and I mean that in the creative sense. So what are some of these objections going to be.
Let me know in the chat what you think! What are the blockers? Have you experienced them? What do you anticipate them to be?
Here’s a common what I call “talent friction” in building a data team, where you are nowhere near to being ready for a hardcore data science team, but you think the easiest way to do it is just to hire a bunch of them.
This is going to be very expensive and end in failure, and high turnover. Once your data talent sees that you are really not ready for advanced analytics, they will leave.
This is why data scientists have a fairly high turnover.
I am using broad-strokes “tech talent” here to fill in for workers who are comfortable with building automation and conducting analysis but maybe aren’t at the level to do more advanced analytics.
Now this seems like an opposite strategy but it’s kind of the same, the idea being to hire away the problem. You as an organization know you need to do more with data so you hire less-experienced tech talent to do it. Because you do know that hiring a truckload of PhDs is not a good idea.The problem with this strategy is that this talent, if they are smart enough to tame your pipelines, will be smart enough to leave your organization, so again it’s high turnover. So you can hire very experienced tech talent and they’ll leave, or less experienced tech talent and they’ll leave.
I’m calling this talent “motion” because it’s the opposite of friction! You want a way to bootstrap your organization to success and this is the way.
Hiring talent is going to lead to turnover. Up-skilling the talent that you have will help you automate processes with the people that understand the business the best, which is important.
Using this data academy is like bringing the bootcamp to your own organization, versus having people teach themselves and then leave. It’s a complex thing and there’s a lot to do, but poor data capacities is a big problem that you can’t delegate away, it’s really a cultural problem, and having this institutionalized bootcamp is the best way to build it.
So now that we have set the stage, defined the problem statement and the pain points, are there any questions before we proceed to actually designing this thing?
Ok so let’s say that we have made data-backed claims to some of these objections. And our powers-that-be, probably a CTO or someone like that, is on board.
We are doing data analytics education right, so when it comes to assessing our learners we want something rigorous? Well yes and no, we want a blend of assessment tools.
So let’s start with some more quantitative ways to do this.
Self-rating: This is self evident and easy but subjective. Everyone’s rating of their Excel skills on a scale of 1-5 is going to be different.
Time estimates: Not to be too crass but time is money and we are in the face of automation so if we can really get a sense of how long things are taking. Where is the problem in terms of loss of manpower?
Learning assessments – This seems like it would be the most objective of them all, right? Just ask some multiple choice questions about specific topics, get a pretty cut and dry take on where people are with data analytics.
However as you probably know from your own work, problems don’t come to data analysts in the form of multiple choice questions. So maybe our assessment might take on a more qualitative kind of understanding, to get inside the minds of where people are when working with data. Let’s see what some of those things might be.
Experiences – really take the time to understand, where are the pain points that they are seeing? What would people consider doing with their data if they knew how? What is holding them back and what can change with training?
Emotions – Data analytics is scary as is working with lots of data. There is a lot of impostor syndrome in these technical fields. Many are constantly afraid of making mistakes and just feel like they have lost a sense of control over their work.
Goals – Related to the last point, what are the motivations? What do learners hope to achieve?
For these next few slides I want to focus on more of the nuts and bolts of building this academy in terms of what media we are going to use? How are we going to reach our learners? How are we going to engage them? So let’s start with that last question there, it probably all stems from there.
How can we make sure that this stuff actually sticks? How are we going to get buy in from the students themselves? We’ve talked before about understanding the motivations of these data analysts.
A lot of it has to do with productivity, people don’t like spending all their time cleaning data as we’ve talked about, they want to automate things. So really keep this as a primary motivator, that this will boost productivity and make work more enjoyable.
And this is something that often gets overlooked -- we may not think this matters but it does. With the right training students will really start to see data analysis as a mode of inquiry, a means of creativity. I really think that if we can get people to see the art and the creativity in what they are doing with spreadsheets that is one of the best things we can do to plant data literacy into an organization.
I really believe that data literacy is like “literary” literacy in that, when done right, people start to see it as a fun way to engage with their world.
I think this is a somewhat unique trait for data analytics training programs. In high school I worked in a grocery store and I remember learning to nail the right order in which to put the pieces of food on the skewer: meat, pepper, onion, meat, pepper, etc… . That knowledge didn’t have much application outside the grocery store but learning how to analyze data is not like that. You really want to approach it less as a straight recipe as a mode of inquiry.
Now let’s look at how this information will be delivered and yes it’s easy to just use “blended learning” as a solve-all but let’s dig a bit into that could actually mean for teaching data analytics. It really does mean a virtuous cycle of theory and practice where students are able to be lifted out of their everyday work and learn something new which they can apply back to their jobs.
We should have some in-person sessions I think that this is really great for providing a distraction free zone for people to start to understand and really begin to articulate questions and communicate their knowledge. Learning should be social! And it should to some extent be done in a distinct place.
Then there is plenty of online study and there is a lot of educational technology on assessing data analytics skills, things like interactive coding exercises and so forth which we will talk a bit about here soon.
Where things can really get interesting is by blending the academy’s curriculum with real-world applications. This can be great, employees can begin to look on some problems they have seen in their workflows with fresh eyes as a student and begin to propose some alternatives, there is a sense of immunity that people get when they are students that allows them to speak more candidly about their work, and of course this is definitely a cultural thing, we want to get managers on board and so forth but some of these considerations are not unique to data analytics so I am not focusing too much on them.
Finally given some of the pain points and scenarios that come out of such sessions a hackathon can be an opportunity to really crack open some of these things. This can be a very democratizing force.
Any questions on our tour of data analytics teaching techniques?
I tried to focus on things that I thought were unique to data analytics and might be unfamiliar to you, I know there are TONS of resources out there on training and instructional design so I wanted to really tailor this to things specific to data analytics.
13 mins.
OK great, so we have buy-in, everybody’s happy with providing the academy, enrolling, etc. Now how can we actually establish some kind of ROI for this?
There are certainly some ways to quantify the ROI of a data academy from the revenue side in terms of offering more compelling and reliable services to customers, however I am going to focus on the expense side as I think this can be made a little more concrete and immediate.
Probably the simplest victory to declare is with time savings. I have personally seen and been involved with creating solutions that used to take hours and have been reduced into fully automated processes that kick off at the click of a button. That is pretty indisputably a return on investment because ideally that is going to free up your analysts to do more productive stuff.
These next kinds of benefits may come further down the road but are still pretty concrete. First, if you really start to enable these processes and this culture, you will begin to disband the data breadline and that will mean that your data teams will be happier with what they are doing, they will stick around longer.
There will also be more morale, as people get to do what they were trained for.
Last but not least you can begin to lower the bar for data analytics. I really believe that with a data academy like this, it’s going to be a more mutually beneficial solution to upskill than to hire the new grad who can satisfy everybody’s craving for the data breadline until the kid burns out.
2 minutes
Before we part ways, I wanted to share some recommended resources for you, we can exchange contact information, and if you have any final questions, that’s awesome.
This is a great introductory course on building data literacy in an organization instructed by Jordan Morrow who is a renowned expert on the topic. It will be an overarching vision of what data literacy looks like and why it matters, which complements nicely what we’ve focused on here, which is institutionalizing that knowledge into a data academy.
Another place that has been catching on to the idea of a data academy is the public sector, in particular GovEx which is kind of a think-tank consultancy at Johns Hopkins to help cities use data and evidence for their administration. This is a nice little worksheet they have on how to get started building a data academy.
Last but not least, this isn’t a book but I do want to let you know about the resource library I have on my site. This will give you access to a series of syllabuses with things like learning objectives, lesson descriptions and learning exercises. I also have some pedagogical tools like whitepapers, checklists and I hope to include things like slide decks and games here too. So sign up for access.
I have the learning objectives and high level agenda on the first page and then go into the descriptions, time estimates and assessments on the next page.
Thanks for coming! Feel free to contact me anytime, find me on LinkedIn, I also write frequently on this stuff so check out my website too.
Thanks for coming! Feel free to contact me anytime, find me on LinkedIn, I also write frequently on this stuff so check out my website too.
2 mins