Tietojohtamisen perusteet kalvosarjan 1 osa: Mitä on tietojohtaminen? Tässä kalvossa käydään läpi tietojohtamisen peruskäsitteitä ja sen eri osa-alueita.
Tietojohtaminen ja tiedolla johtaminenMiia Kosonen
Esitys yhteisömanagerikurssilla, Mikkeli 27.8.2015. Pikakatsaus siihen, mitä ovat tietojohtaminen ja tiedolla johtaminen, ja miten ne eroavat toisistaan? Tieto ja oppiminen yhteisömanagereiden työn näkökulmasta.
Tutustuminen data-analytiikan ja big datan maailmaanJari Jussila
Tutustuminen data-analytiikan ja big datan maailmaan. Valikoitua sisältöä Edutech Data ja analytiikka liiketoiminnan kehittämisessä koulutuspäivästä. Kouluttajina Pasi Hellsten & Jari Jussila. @EdutechTUT #Data4BizTraining
Data Stewardship - Retour d'expérience de Sarenza sur la façon de piloter un ...Jean-Pierre Riehl
Le Data Steward devient incontournable dans un système de self-service BI. Mais quel est réellement son travail avec Power BI ?
Dans cette session, nous nous mettrons dans la peau du Data Steward pendant 45 minutes et verrons comment il dompte Power BI. Gestion des sources, du Data Catalog, des questions Q&A, de la sécurité, du Data Refresh, de l'usage... Autant de sujets qui impactent le quotidien du Data Steward. Lors de cette session, vous aurez le témoignage d'un vrai Data Steward avec le retour d'expérience de Sarenza.
Tietojohtamisen perusteet kalvosarjan 1 osa: Mitä on tietojohtaminen? Tässä kalvossa käydään läpi tietojohtamisen peruskäsitteitä ja sen eri osa-alueita.
Tietojohtaminen ja tiedolla johtaminenMiia Kosonen
Esitys yhteisömanagerikurssilla, Mikkeli 27.8.2015. Pikakatsaus siihen, mitä ovat tietojohtaminen ja tiedolla johtaminen, ja miten ne eroavat toisistaan? Tieto ja oppiminen yhteisömanagereiden työn näkökulmasta.
Tutustuminen data-analytiikan ja big datan maailmaanJari Jussila
Tutustuminen data-analytiikan ja big datan maailmaan. Valikoitua sisältöä Edutech Data ja analytiikka liiketoiminnan kehittämisessä koulutuspäivästä. Kouluttajina Pasi Hellsten & Jari Jussila. @EdutechTUT #Data4BizTraining
Data Stewardship - Retour d'expérience de Sarenza sur la façon de piloter un ...Jean-Pierre Riehl
Le Data Steward devient incontournable dans un système de self-service BI. Mais quel est réellement son travail avec Power BI ?
Dans cette session, nous nous mettrons dans la peau du Data Steward pendant 45 minutes et verrons comment il dompte Power BI. Gestion des sources, du Data Catalog, des questions Q&A, de la sécurité, du Data Refresh, de l'usage... Autant de sujets qui impactent le quotidien du Data Steward. Lors de cette session, vous aurez le témoignage d'un vrai Data Steward avec le retour d'expérience de Sarenza.
Apresentação sobre Data Mesh e a relação com os Programas de Governança de Dados ministrada por Bergson Lopes na segunda edição do Evento Manhã com Dados.
Metadata is hotter than ever, according to a number of recent DATAVERSITY surveys. More and more organizations are realizing that in order to drive business value from data, robust metadata is needed to gain the necessary context and lineage around key data assets. At the same time, industry regulations are driving the need for better transparency and understanding of information.
While metadata has been managed for decades, new strategies & approaches have been developed to support the ever-evolving data landscape, and provide more innovative ways to drive business value from metadata. This webinar will provide an overview of metadata strategies & technologies available to today’s organization, and provide insights into building successful business strategies for metadata adoption & use.
Slides: Taking an Active Approach to Data GovernanceDATAVERSITY
A Look at How Riot Games Implemented Non-Invasive Data Governance
Riot Games created and runs “League of Legends,” the world’s most-played PC game and most viewed eSport — and is now transforming to become a multi-title publisher. To keep pace with this transformation and support a growing player base of millions, Riot Games is taking a page from Bob Seiner’s book, “Non-Invasive Data Governance: The Path of Least Resistance and Greatest Success” and leveraging the Alation Data Catalog to help guide accurate, well-governed analysis.
Bob Seiner will join Riot Games’ Chris Kudelka, Technical Product Manager, and Michael Leslie, Senior Data Governance Architect, and Alation’s John Wills, VP of Professional Service, for an inside look at Data Governance at one of the world’s leading gaming companies.
Join this webinar to learn:
• How Riot Games is implementing Non-Invasive Data Governance
• How this new approach to Data Governance helps to drive the business
• How the Alation Data Catalog helps Riot Games create the foundation for guiding accurate, well-governed data use
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...DATAVERSITY
Change is hard, especially in response to negative stimuli or what is perceived as negative stimuli. So organizations need to reframe how they think about data privacy, security and governance, treating them as value centers to 1) ensure enterprise data can flow where it needs to, 2) prevent – not just react – to internal and external threats, and 3) comply with data privacy and security regulations.
Working together, these roles can accelerate faster access to approved, relevant and higher quality data – and that means more successful use cases, faster speed to insights, and better business outcomes. However, both new information and tools are required to make the shift from defense to offense, reducing data drama while increasing its value.
Join us for this panel discussion with experts in these fields as they discuss:
- Recent research about where data privacy, security and governance stand
- The most valuable enterprise data use cases
- The common obstacles to data value creation
- New approaches to data privacy, security and governance
- Their advice on how to shift from a reactive to resilient mindset/culture/organization
You’ll be educated, entertained and inspired by this panel and their expertise in using the data trifecta to innovate more often, operate more efficiently, and differentiate more strategically.
Building a Data Strategy – Practical Steps for Aligning with Business GoalsDATAVERSITY
Developing a Data Strategy for your organization can seem like a daunting task – but it’s worth the effort. Getting your Data Strategy right can provide significant value, as data drives many of the key initiatives in today’s marketplace, from digital transformation to marketing, customer centricity, population health, and more. This webinar will help demystify Data Strategy and its relationship to Data Architecture and will provide concrete, practical ways to get started.
Tietojohtamisen perusteet kalvosarjan 3. osa: Tiedon tasot ja lajit. Tässä kalvossa käydään läpi tiedon eri tasoja ja lajeja, sekä niiden vaikutusta tiedon käsittelyyn ja tulkintaan.
Delta Lake delivers reliability, security and performance to data lakes. Join this session to learn how customers have achieved 48x faster data processing, leading to 50% faster time to insight after implementing Delta Lake. You’ll also learn how Delta Lake provides the perfect foundation for a cost-effective, highly scalable lakehouse architecture.
Metadata management is critical for organizations looking to understand the context, definition and lineage of key data assets. Data models play a key role in metadata management, as many of the key structural and business definitions are stored within the models themselves. Can data models replace traditional metadata solutions? Or should they integrate with larger metadata management tools & initiatives?
Join this webinar to discuss opportunities and challenges around:
How data modeling fits within a larger metadata management landscape
When can data modeling provide “just enough” metadata management
Key data modeling artifacts for metadata
Organization, Roles & Implementation Considerations
Creating a clearly articulated data strategy—a roadmap of technology-driven capability investments prioritized to deliver value—helps ensure from the get-go that you are focusing on the right things, so that your work with data has a business impact. In this presentation, the experts at Silicon Valley Data Science share their approach for crafting an actionable and flexible data strategy to maximize business value.
Activate Data Governance Using the Data CatalogDATAVERSITY
Data Governance programs depend on the activation of data stewards that are held formally accountable for how they manage data. The data catalog is a critical tool to enable your stewards to contribute and interact with an inventory of metadata about the data definition, production, and usage. This interaction is active Data Governance in the truest sense of the word.
In this RWDG webinar, Bob Seiner will share tips and techniques focused on activating your data stewards through a data catalog. Data Governance programs that involve stewards in daily activities are more likely to demonstrate value from their data-intensive investments.
Bob will address the following in this webinar:
- A comparison of active and passive Data Governance
- What it means to have an active Data Governance program
- How a data catalog tool can be used to activate data stewards
- The role a data catalog plays in Data Governance
- The metadata in the data catalog will not govern itself
Metadata is hotter than ever, according a number of recent DATAVERSITY surveys. More and more organizations are realizing that in order to drive business value from data, robust metadata is needed to gain the necessary context and lineage around key data assets. At the same time, industry regulations are driving the need for better transparency and understanding of information.
While metadata has been managed for decades, new strategies & approaches have been developed to support the ever-evolving data landscape, and provide more innovative ways to drive business value from metadata. This webinar will provide an overview of metadata strategies & technologies available to today’s organization, and provide insights into building successful business strategies for metadata adoption & use.
Data Architecture Strategies: Data Architecture for Digital TransformationDATAVERSITY
MDM, data quality, data architecture, and more. At the same time, combining these foundational data management approaches with other innovative techniques can help drive organizational change as well as technological transformation. This webinar will provide practical steps for creating a data foundation for effective digital transformation.
You had a strategy. You were executing it. You were then side-swiped by COVID, spending countless cycles blocking and tackling. It is now time to step back onto your path.
CCG is holding a workshop to help you update your roadmap and get your team back on track and review how Microsoft Azure Solutions can be leveraged to build a strong foundation for governed data insights.
Building Data Quality pipelines with Apache Spark and Delta LakeDatabricks
Technical Leads and Databricks Champions Darren Fuller & Sandy May will give a fast paced view of how they have productionised Data Quality Pipelines across multiple enterprise customers. Their vision to empower business decisions on data remediation actions and self healing of Data Pipelines led them to build a library of Data Quality rule templates and accompanying reporting Data Model and PowerBI reports.
With the drive for more and more intelligence driven from the Lake and less from the Warehouse, also known as the Lakehouse pattern, Data Quality at the Lake layer becomes pivotal. Tools like Delta Lake become building blocks for Data Quality with Schema protection and simple column checking, however, for larger customers they often do not go far enough. Notebooks will be shown in quick fire demos how Spark can be leverage at point of Staging or Curation to apply rules over data.
Expect to see simple rules such as Net sales = Gross sales + Tax, or values existing with in a list. As well as complex rules such as validation of statistical distributions and complex pattern matching. Ending with a quick view into future work in the realm of Data Compliance for PII data with generations of rules using regex patterns and Machine Learning rules based on transfer learning.
The first step towards understanding data assets’ impact on your organization is understanding what those assets mean for each other. Metadata – literally, data about data – is a practice area required by good systems development, and yet is also perhaps the most mislabeled and misunderstood Data Management practice. Understanding metadata and its associated technologies as more than just straightforward technological tools can provide powerful insight into the efficiency of organizational practices and enable you to combine practices into sophisticated techniques supporting larger and more complex business initiatives. Program learning objectives include:
- Understanding how to leverage metadata practices in support of business strategy
- Discuss foundational metadata concepts
- Guiding principles for and lessons previously learned from metadata and its practical uses applied strategy
Metadata strategies include:
- Metadata is a gerund so don’t try to treat it as a noun
- Metadata is the language of Data Governance
- Treat glossaries/repositories as capabilities, not technology
Big data architectures and the data lakeJames Serra
With so many new technologies it can get confusing on the best approach to building a big data architecture. The data lake is a great new concept, usually built in Hadoop, but what exactly is it and how does it fit in? In this presentation I'll discuss the four most common patterns in big data production implementations, the top-down vs bottoms-up approach to analytics, and how you can use a data lake and a RDBMS data warehouse together. We will go into detail on the characteristics of a data lake and its benefits, and how you still need to perform the same data governance tasks in a data lake as you do in a data warehouse. Come to this presentation to make sure your data lake does not turn into a data swamp!
RWDG Webinar: Data Steward Definition and Other Data Governance RolesDATAVERSITY
The role of the Data Steward is critical to the success of a Data Governance program. There are several approaches to Stewardship including assigning people to be Data Stewards, identify existing Data Stewards and recognizing Data Stewards according to their relationship to the data they define, produce and use. However Stewards are only one of several Data Governance roles that must be considered.
In this month’s RWDG webinar, Bob Seiner will discuss several approaches to defining the role of the Data Steward as well as the other roles necessary for Data Governance program success. Data Governance roles must include operational, tactical, strategic and supporting levels of responsibilities. Spend an hour with Bob where he will share a customize-able Operating Model of Data Governance roles and responsibilities.
In this webinar, Bob will discuss:
• Several approaches to defining Data Stewards and Stewardship
• How to select the Stewardship approach that is right for you
• Different levels of Stewards required for a successful program
• An Operating Model of DG Roles that can be molded to fit in any culture
• Why the approach to defining DG roles can make or break the program
Data Governance and Data Science to Improve Data QualityDATAVERSITY
Data Science uses systematic methods, algorithms, and systems to extract knowledge and insights from structured and unstructured data. Data Science requires high-quality data that is trusted by the organization and data scientists. Many organizations focus their Data Governance programs on improving Data Quality results. These three concepts (governance, science, and quality) seem to be made for each other.
In this RWDG webinar, Bob Seiner and his special guest will discuss how the people focusing on Data Governance and Data Science must work together to improve the level of confidence the organization has in its most critical data assets. Heavy investments are being made in Data Science but not so much for Data Governance. Bob will talk about how Data Governance and Data Science must work together to improve Data Quality.
Analytiikka liiketoiminnassa esitys Edutech Big Data ja data-analytiikka liiketoiminnan kehittämisessä 1.4.2014. Lisätietoa kokonaisuudesta: http://www.bit.ly/ebd2014
Apresentação sobre Data Mesh e a relação com os Programas de Governança de Dados ministrada por Bergson Lopes na segunda edição do Evento Manhã com Dados.
Metadata is hotter than ever, according to a number of recent DATAVERSITY surveys. More and more organizations are realizing that in order to drive business value from data, robust metadata is needed to gain the necessary context and lineage around key data assets. At the same time, industry regulations are driving the need for better transparency and understanding of information.
While metadata has been managed for decades, new strategies & approaches have been developed to support the ever-evolving data landscape, and provide more innovative ways to drive business value from metadata. This webinar will provide an overview of metadata strategies & technologies available to today’s organization, and provide insights into building successful business strategies for metadata adoption & use.
Slides: Taking an Active Approach to Data GovernanceDATAVERSITY
A Look at How Riot Games Implemented Non-Invasive Data Governance
Riot Games created and runs “League of Legends,” the world’s most-played PC game and most viewed eSport — and is now transforming to become a multi-title publisher. To keep pace with this transformation and support a growing player base of millions, Riot Games is taking a page from Bob Seiner’s book, “Non-Invasive Data Governance: The Path of Least Resistance and Greatest Success” and leveraging the Alation Data Catalog to help guide accurate, well-governed analysis.
Bob Seiner will join Riot Games’ Chris Kudelka, Technical Product Manager, and Michael Leslie, Senior Data Governance Architect, and Alation’s John Wills, VP of Professional Service, for an inside look at Data Governance at one of the world’s leading gaming companies.
Join this webinar to learn:
• How Riot Games is implementing Non-Invasive Data Governance
• How this new approach to Data Governance helps to drive the business
• How the Alation Data Catalog helps Riot Games create the foundation for guiding accurate, well-governed data use
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...DATAVERSITY
Change is hard, especially in response to negative stimuli or what is perceived as negative stimuli. So organizations need to reframe how they think about data privacy, security and governance, treating them as value centers to 1) ensure enterprise data can flow where it needs to, 2) prevent – not just react – to internal and external threats, and 3) comply with data privacy and security regulations.
Working together, these roles can accelerate faster access to approved, relevant and higher quality data – and that means more successful use cases, faster speed to insights, and better business outcomes. However, both new information and tools are required to make the shift from defense to offense, reducing data drama while increasing its value.
Join us for this panel discussion with experts in these fields as they discuss:
- Recent research about where data privacy, security and governance stand
- The most valuable enterprise data use cases
- The common obstacles to data value creation
- New approaches to data privacy, security and governance
- Their advice on how to shift from a reactive to resilient mindset/culture/organization
You’ll be educated, entertained and inspired by this panel and their expertise in using the data trifecta to innovate more often, operate more efficiently, and differentiate more strategically.
Building a Data Strategy – Practical Steps for Aligning with Business GoalsDATAVERSITY
Developing a Data Strategy for your organization can seem like a daunting task – but it’s worth the effort. Getting your Data Strategy right can provide significant value, as data drives many of the key initiatives in today’s marketplace, from digital transformation to marketing, customer centricity, population health, and more. This webinar will help demystify Data Strategy and its relationship to Data Architecture and will provide concrete, practical ways to get started.
Tietojohtamisen perusteet kalvosarjan 3. osa: Tiedon tasot ja lajit. Tässä kalvossa käydään läpi tiedon eri tasoja ja lajeja, sekä niiden vaikutusta tiedon käsittelyyn ja tulkintaan.
Delta Lake delivers reliability, security and performance to data lakes. Join this session to learn how customers have achieved 48x faster data processing, leading to 50% faster time to insight after implementing Delta Lake. You’ll also learn how Delta Lake provides the perfect foundation for a cost-effective, highly scalable lakehouse architecture.
Metadata management is critical for organizations looking to understand the context, definition and lineage of key data assets. Data models play a key role in metadata management, as many of the key structural and business definitions are stored within the models themselves. Can data models replace traditional metadata solutions? Or should they integrate with larger metadata management tools & initiatives?
Join this webinar to discuss opportunities and challenges around:
How data modeling fits within a larger metadata management landscape
When can data modeling provide “just enough” metadata management
Key data modeling artifacts for metadata
Organization, Roles & Implementation Considerations
Creating a clearly articulated data strategy—a roadmap of technology-driven capability investments prioritized to deliver value—helps ensure from the get-go that you are focusing on the right things, so that your work with data has a business impact. In this presentation, the experts at Silicon Valley Data Science share their approach for crafting an actionable and flexible data strategy to maximize business value.
Activate Data Governance Using the Data CatalogDATAVERSITY
Data Governance programs depend on the activation of data stewards that are held formally accountable for how they manage data. The data catalog is a critical tool to enable your stewards to contribute and interact with an inventory of metadata about the data definition, production, and usage. This interaction is active Data Governance in the truest sense of the word.
In this RWDG webinar, Bob Seiner will share tips and techniques focused on activating your data stewards through a data catalog. Data Governance programs that involve stewards in daily activities are more likely to demonstrate value from their data-intensive investments.
Bob will address the following in this webinar:
- A comparison of active and passive Data Governance
- What it means to have an active Data Governance program
- How a data catalog tool can be used to activate data stewards
- The role a data catalog plays in Data Governance
- The metadata in the data catalog will not govern itself
Metadata is hotter than ever, according a number of recent DATAVERSITY surveys. More and more organizations are realizing that in order to drive business value from data, robust metadata is needed to gain the necessary context and lineage around key data assets. At the same time, industry regulations are driving the need for better transparency and understanding of information.
While metadata has been managed for decades, new strategies & approaches have been developed to support the ever-evolving data landscape, and provide more innovative ways to drive business value from metadata. This webinar will provide an overview of metadata strategies & technologies available to today’s organization, and provide insights into building successful business strategies for metadata adoption & use.
Data Architecture Strategies: Data Architecture for Digital TransformationDATAVERSITY
MDM, data quality, data architecture, and more. At the same time, combining these foundational data management approaches with other innovative techniques can help drive organizational change as well as technological transformation. This webinar will provide practical steps for creating a data foundation for effective digital transformation.
You had a strategy. You were executing it. You were then side-swiped by COVID, spending countless cycles blocking and tackling. It is now time to step back onto your path.
CCG is holding a workshop to help you update your roadmap and get your team back on track and review how Microsoft Azure Solutions can be leveraged to build a strong foundation for governed data insights.
Building Data Quality pipelines with Apache Spark and Delta LakeDatabricks
Technical Leads and Databricks Champions Darren Fuller & Sandy May will give a fast paced view of how they have productionised Data Quality Pipelines across multiple enterprise customers. Their vision to empower business decisions on data remediation actions and self healing of Data Pipelines led them to build a library of Data Quality rule templates and accompanying reporting Data Model and PowerBI reports.
With the drive for more and more intelligence driven from the Lake and less from the Warehouse, also known as the Lakehouse pattern, Data Quality at the Lake layer becomes pivotal. Tools like Delta Lake become building blocks for Data Quality with Schema protection and simple column checking, however, for larger customers they often do not go far enough. Notebooks will be shown in quick fire demos how Spark can be leverage at point of Staging or Curation to apply rules over data.
Expect to see simple rules such as Net sales = Gross sales + Tax, or values existing with in a list. As well as complex rules such as validation of statistical distributions and complex pattern matching. Ending with a quick view into future work in the realm of Data Compliance for PII data with generations of rules using regex patterns and Machine Learning rules based on transfer learning.
The first step towards understanding data assets’ impact on your organization is understanding what those assets mean for each other. Metadata – literally, data about data – is a practice area required by good systems development, and yet is also perhaps the most mislabeled and misunderstood Data Management practice. Understanding metadata and its associated technologies as more than just straightforward technological tools can provide powerful insight into the efficiency of organizational practices and enable you to combine practices into sophisticated techniques supporting larger and more complex business initiatives. Program learning objectives include:
- Understanding how to leverage metadata practices in support of business strategy
- Discuss foundational metadata concepts
- Guiding principles for and lessons previously learned from metadata and its practical uses applied strategy
Metadata strategies include:
- Metadata is a gerund so don’t try to treat it as a noun
- Metadata is the language of Data Governance
- Treat glossaries/repositories as capabilities, not technology
Big data architectures and the data lakeJames Serra
With so many new technologies it can get confusing on the best approach to building a big data architecture. The data lake is a great new concept, usually built in Hadoop, but what exactly is it and how does it fit in? In this presentation I'll discuss the four most common patterns in big data production implementations, the top-down vs bottoms-up approach to analytics, and how you can use a data lake and a RDBMS data warehouse together. We will go into detail on the characteristics of a data lake and its benefits, and how you still need to perform the same data governance tasks in a data lake as you do in a data warehouse. Come to this presentation to make sure your data lake does not turn into a data swamp!
RWDG Webinar: Data Steward Definition and Other Data Governance RolesDATAVERSITY
The role of the Data Steward is critical to the success of a Data Governance program. There are several approaches to Stewardship including assigning people to be Data Stewards, identify existing Data Stewards and recognizing Data Stewards according to their relationship to the data they define, produce and use. However Stewards are only one of several Data Governance roles that must be considered.
In this month’s RWDG webinar, Bob Seiner will discuss several approaches to defining the role of the Data Steward as well as the other roles necessary for Data Governance program success. Data Governance roles must include operational, tactical, strategic and supporting levels of responsibilities. Spend an hour with Bob where he will share a customize-able Operating Model of Data Governance roles and responsibilities.
In this webinar, Bob will discuss:
• Several approaches to defining Data Stewards and Stewardship
• How to select the Stewardship approach that is right for you
• Different levels of Stewards required for a successful program
• An Operating Model of DG Roles that can be molded to fit in any culture
• Why the approach to defining DG roles can make or break the program
Data Governance and Data Science to Improve Data QualityDATAVERSITY
Data Science uses systematic methods, algorithms, and systems to extract knowledge and insights from structured and unstructured data. Data Science requires high-quality data that is trusted by the organization and data scientists. Many organizations focus their Data Governance programs on improving Data Quality results. These three concepts (governance, science, and quality) seem to be made for each other.
In this RWDG webinar, Bob Seiner and his special guest will discuss how the people focusing on Data Governance and Data Science must work together to improve the level of confidence the organization has in its most critical data assets. Heavy investments are being made in Data Science but not so much for Data Governance. Bob will talk about how Data Governance and Data Science must work together to improve Data Quality.
Analytiikka liiketoiminnassa esitys Edutech Big Data ja data-analytiikka liiketoiminnan kehittämisessä 1.4.2014. Lisätietoa kokonaisuudesta: http://www.bit.ly/ebd2014
Miten Master Dataa voi käyttää apuna Big Datan hyödyntämisessä? Mitä uutta Big Data voi tarjota Master Datalle?
- Käytä master dataa laadukkaan ja standardoidun tiedon lähteenä
- Sovella master datan hallinnan periaatteita Big Dataan
- Elinkaari
- Omistajuus
- Hyödynnä samoja data quality työkaluja ja periaatteita
- Määritä laatukriteerit
- Estä huonolaatuisen datan pääsy prosessiin
- Käytä samoja työkaluja
- Valitse sellaiset työkalut, jotka pystyvät käsittelemään isoja määriä ei-rakenteista dataa
Big datan ja analytiikkamaailman käsitteiden läpikäyminenJari Jussila
Big datan ja analytiikkamaailman käsitteiden läpikäyminen. Kooste esityksestä 12.3.2014 Edutech Big Data ja data-analytiikka liiketoiminnan kehittämisessä,
Moduuli 1: Big Data nyt ja tulevaisuudessa.
Vierailuluento Tampereen Teknillisen Yliopiston kurssilla - Tietohallinto ja sen johtaminen.
Esityksen sisältöä:
- ICT strategia osana liiketomintastrategiaa
- ICT organisaation missio
- ICT hallintomalli (strategia, roadmap,
projekti-/hankeportfolio, sovellus-/prosessialuetaso)
- ICT roolit ja vastuut
- ICT peruspalvelujen ulkoistuskokemuksia
- Case : Tiedolla johtaminen/raportointi - kehityshanke
- Case : Digitaalisten palveluiden kehitys : Ennustavan
huollon/laitemonitoroinnin IIoT kehityshanke
- Tietohallinto PK-yrityksessä - hyvät ja huonot puolet
Helsingin ekonomien Business Performance Management -seminaarissa 4.10.2012 pidetty esitys tietoperustaisen suorituskyvyn johtamisen kehittymisestä organisaatioissa
Suunnittelemme maailmanluokan digitaalisia ratkaisuja, jotka ilahduttavat asiakastanne ja tehostavat liiketoimintaanne. Konseptoimme ja suunnittelemme juuri teidän tarpeisiinne sopivan ratkaisun, jotta tekisitte oikeat teknologia- ja toimittajavalinnat. Ohjaamme teknistä toteutusta, jotta saatte varmasti sitä, mitä tarvitsette. Huolehdimme jalkautuksen
suunnittelusta, jotta hankkeistanne seuraa konkreettista hyötyä.
Olemme joukko ratkaisumuotoilijoita, joiden suunnittelutyön jälki näkyy digitaalisissa palveluissa ympäri maailman. Teemme työtämme riippumattomina teknologioiden tai työkalujen toimittajista, joten pystymme suunnittelemaan ratkaisut täysin asiakkaidemme liiketoiminnan lähtökohdista ja heille parhaalla tavalla.
Lue lisää meistä ja asiakkaistamme: http://www.talentbase.fi/
Teknologiatoimittajien yhteinen arvolupaus ja liiketoimintamallit, Pasi Suomi, Kim Kaustell ja Liisa Pesonen, Luke. Teknologiapäivä – Arvo ja liiketoiminta digitaalisessa maataloudessa -webinaari, 13.1.2021.
HAMK Design Factory data-analytiikkaprojekti ja palvelumuotoilu.pdfHAMK Design Factory
HAMK Design Factory data-analytiikkaprojekti ja palvelumuotoilu - Case LHJ Group. Esitys Hämeen kauppakamarin osaamisvaliokunnan kokouksessa 26.4.2022.
RUN-EU Super Week, Design Factory Bootcamp, Building Team Culture workshop prepared by Päivi Oinonen from Aalto Design Factory, fasilitated by Jari Jussila HAMK Design Factory and Eric Voigt Future Design Factory.
2. www.hamk.fi
Opintojakson osaamistavoitteet
•Opiskelija tuntee erilaisia data-analytiikan menetelmiä, työkaluja ja
ohjelmistoja.
•Opiskelija ymmärtää millä tavoin tiedolla
johtamisen lähestymistapoja ja data-analytiikan menetelmiä voidaan
hyödyntää liiketoiminnan kehittämisessä.
•Opiskelija tunnistaa erilaisia analytiikassa hyödynnettäviä data- ja
informaatiolähteitä ja niiden käyttömahdollisuuksia liiketoiminnassa.
•Opiskelija osaa PowerBI käytön perusteet.
3. www.hamk.fi
Mitä osaamista tiedolla johtamisen
ammattilaisilta odotetaan?
Substantive
Expertise
Machine
Learning
Traditional
Research
Traditional
Software
DATA SCIENCE
Lähde: Data Science Venn diagrammit (Conway 2010 & Geringer 2014)
4. www.hamk.fi
Rakenna opintojakson aikana oma osaamismatriisi tiedolla
johtamisen
Tieto- ja viestintätekninen
osaaminen
Biotalous, liiketoiminta,
kestävän kehityksen
osaaminen
Matemaattinen osaaminen
- - -
- - -
- - -
- - -
- - -
6. www.hamk.fi
Data-analytiikan menetelmät, työkalut ja
ohjelmistot
Lähde: Sacha et al. 2014
esim. Tableau, QlikView, Qlik Sense, Power BI (BI-työkalut)
D3.js, Highcharts, R, Revolution R, Google Fusion Tables (ohjelmointi/tee-se-itse)
esim. SAS, SPSS, Alteryx, Revolution R,
R, Knime, RapidMiner
esim.
Talend,
Datameer,
Pandas,
Anaconda
7. www.hamk.fi
Tiedolla johtamisen perinteinen työnkulku
Lähde: Mohanty et al., (2013)
Liiketoimintakysymys:
”Kuinka saavutan markkinoinnin
tavoitteet eri digitaalisissa
kanavissa?”
Liiketoimintapäätös:
”Määritä kanavakohtaiset
asiakassegmentit ja kohdista
markkinointi analyysin tuloksien
mukaisesti”
Liiketoiminta-
vastaava
Liiketoiminta-
ja
data-analyytikko
Tunnista datalähteet:
CRM, tietovarasto, Excel
sosiaalinen media, jne.
Yhdistä ja kokoa
data:
Hae ja yhdistä data
eri lähteistä
Siivoa ja rikasta:
ETL, datan siivous,
datan rikastaminen
Lisää ja
kontekstualisoi:
Sijainti, demografia,
segmentointi
Rakenna analytiikan
työnkulku:
Rakenna ympäristö,
mallinna, analysoi data,
ennusta
Analyysi:
Tarkastele tuloksia ja iteroi
Esitä ja sovita:
Visualisoi, rakenna applikaatio tai
kerro tarina
IT
/
datahallinnan
asiantuntija
8. www.hamk.fi
Datatietelijöiden työnkulku
Liiketoimintakysymys:
”Kuinka saavutan markkinoinnin
tavoitteet eri digitaalisissa
kanavissa?”
Liiketoimintapäätös:
”Määritä kanavakohtaiset
asiakassegmentit ja kohdista
markkinointi analyysin tuloksien
mukaisesti”
Liiketoiminta-
vastaava
Datatieteilijä
Tunnista datalähteet:
CRM, tietovarasto, Excel
sosiaalinen media, jne.
Yhdistä ja kokoa data:
Hae ja yhdistä data eri lähteistä
Siivoa ja rikasta:
ETL, datan siivous, datan
rikastaminen
Lisää ja kontekstualisoi:
Sijainti, demografia, segmentointi
Rakenna analytiikan työnkulku:
Rakenna ympäristö, mallinna,
analysoi data, ennusta
Analysoi:
Tarkastele tuloksia ja iteroi
Esitä ja sovita:
Visualisoi, rakenna applikaatio tai
kerro tarina
Syötä
dataa
Visualisoi
Mallinna
Lähde: Mohanty et al., (2013)
13. www.hamk.fi
Tiedonlouhinnan standardimalli
CRISP-DM referenssimallin tehtävät
Liiketoiminnan
ymmärtäminen
Datan
ymmärtäminen
Datan
valmistelu
Mallinnus,
tiedonlouhinta
Arviointi Tulosten
julkaisu
Määritä
liiketoiminnan
tavoitteet
Tee tilannearvio
Määritä
tiedonlouhinnan
tavoitteet
Laadi
projektisuunnitelma
Kerää alustava
data
Kuvaa data
Tutki dataa
Varmista datan
laatu
Datasetin
kuvaus
Datan valinta
Datan siivous
Datan
rakentaminen
Datan
integrointi
Datan
formatointi
Valitse mallinnus
menetelmä
Suunnittele koe
Rakenna malli
Arvioi mallia
Arvioi tuloksia
Arvioi ja
varmista
tiedonlouhinta
prosessin laatu
Määrittele
seuraavat
stepit
Suunnittele
mallin julkaisu
(esim.
verkkosivuna)
asiakkaalle
Suunnittele
mallin ylläpito
Laadi
loppuraportti
Arvioi projekti
Lähde: The CRISP-DM Model (Shearer 2000), ks. myös CRISP-DM 1.0 Step-by-step data
mining guide: https://the-modeling-agency.com/crisp-dm.pdf
14. www.hamk.fi
Datan käyttö ja analysointi; terminologiaa
Termi Ajanjakso Kuvaus
Päätöksenteon tuki
[Decision Support Systems]
1970-1985 Hyödynnetään data-analyysiä
tukemaan päätöksentekoa
Johdon tukijärjestelmät
[Executive Support/Information
Systems]
1980-1990 Fokus data-analyysissä
ylemmän johdon tueksi
Kuutioiden mallinnus
[Online Analytical Processing,
OLAP]
1990-2000 Ohjelmistoja
multidimensionaalisten
datataulujen analysointiin
Liiketoimintatiedon hallinta
[Business Intelligence]
1989-2005 Työkaluja tukemaan
datalähtöistä päätöksentekoa,
painopiste raportoinnissa
Analytiikka
[Analytics]
2005-2010 Fokus tilastollisessa ja
matemaattisessa analyysissä
päätöksenteon tueksi
Massadata
[Big Data]
2010- Fokus erittäin isossa,
monimuotoisessa ja nopeasti
liikkuvassa datassa
Lähde: Big Data at Work, Davenport, 2014
15. www.hamk.fi
Päätöksenteon tukijärjestelmät ja johdon
tietojärjestelmät ”1970-1990”
• Säästetään kustannuksia, optimoidaan tiettyä toimintaa,..
• Tuotannonohjaus (Material Requirements Planning)
• Laajempi tuotannon ohjaus (Manufacturing Resources Planning, MRP II)
• Toiminnanohjaus (Enterprise Resource Planning)
• Erilaiset toimintokohtaiset järjestelmät
→ Raportteja ja katsauksia
Liiketoiminnan odotukset IT:lle
(Tallon & Kraemer 2007)
17. www.hamk.fi
OLAP-kuutioiden perusidea
• OLAP-tekniikka poikkeaa perinteisestä tilastoinnista ja
raportoinnista siten, että käyttäjä voi vaihtaa näkökulmaa
tiedontarpeensa mukaan.
• Esimerkiksi analyytikko voi lähteä
1) tarkastelemaan tuoteryhmien myyntiä maittain,
2) filtteröidä aineisto tiettyihin maihin
3) joista löytää jotain esimerkiksi myyntitavoitteista poikkeavaa.
Tämän jälkeen
4) porautua esimerkiksi myyntiin tietyissä kaupungeissa.
5) Vaihtaa näkökulmaa liikevoittoon tuoteryhmittäin, jne.
”1980-1990”
23. www.hamk.fi
Analytiikan eri muodot
Analyysi
Informaatio
Mitä on
tapahtumassa?
Miksi se tapahtui?
Mitä tulee
todennäköisesti
tapahtumaan?
Mitä minun pitäisi
tehdä sille?
Prediktiivinen analytiikka
Diagnostiivinen analytiikka
Deskriptiivinen analytiikka Preskriptiivinen analytiikka
Lähde: Gartner
27. www.hamk.fi
Preskriptiivinen analytiikka
• Suosittelujärjestelmät, esim. Amazonin
automaattiset tuotesuositukset
• Vakuutusalalla esimerkiksi
ajoneuvovakuutusten hinnoittelussa pyritään
tunnistamaan tekijät, jotka ennustavat, mitä
tietylle kuljettaja-ajoneuvo-yhdistelmälle tulee
ensi vuonna tapahtumaan. Käytännössä
jokaiselle asiakkaalle tulee eri hinta. (Timo
Ahvonen, Vakuutusyhtiö Fennia)
29. www.hamk.fi
Big Data ”2010-”
Volyymi Vauhti Variaatio Varmuus
Data at Rest Data in Motion Data in Many Forms Data in Doubt
Terabittejä olemassa
olevaa dataa
prosessoitavaksi
Striimattua dataa, nopea
vasteaika millisekunneista
sekunteihin
Strukturoitua, ei-
strukturoitua, ja
semistrukturoitua dataa
Epävarmuutta liittyen
datan epätäydellisyyteen,
puutteellisuuteen, tai
virheellisyyteen
Lähde: Breuker 2014; Laney, Meta Group 2001 (3V:tä)
30. www.hamk.fi
Big Data transaktioista interaktioihin
WEB
BIG DATA
Lähde: mukaillen Yli-Pietilä & Backman 2013; Valli & Ahlgren 2013
ERP
CRM
ostotiedot
maksutiedot
segmentointi
tarjoustiedot
asiakaskohtaamiset
tukikontaktit
weblogit
tarjoushistoria A/B testaaminen
Dynaaminen
hinnoittelu
Hakukonemarkkinointi
ja optimointi
Mainosverkostot
Käyttäytymispohjainen
kohdentaminen
Dynaamiset
funnellit
Sentimentti
Ulkopuolinen demografia
Kuvat ja videot
Puheen muuttaminen tekstiksi
Feedit
Anturi/sensoridata
Tuote / palvelu logit
SMS/MMS
Sosiaaliset verkostot
Sosiaalinen media
Käyttäjien luoma sisältö
Mobiilidata
Klikkivirran analyysi
Sijaintidata