The document provides an overview of Thorhildur Jetzek's background and career. It summarizes her educational qualifications including a Ph.D. in Information Technology Management from CBS in 2015. It also lists some of her past roles working as an economist, IT consultant, and in various positions at CBS where she is currently a postdoctoral fellow. The document then discusses CBS' ranking and focus on industry collaboration through projects like industrial Ph.D. programs and crowdsourcing competitions for students.
In this presentation, let's have a look at What is Data Science and it's applications. We discussed most common use cases of Data Science.
I presented this at LSPE-IN meetup happened on 10th March 2018 at Walmart Global Technology Services.
Big data, new epistemologies and paradigm shiftsrobkitchin
This presentation examines how the availability of Big Data, coupled with new data analytics, challenges established epistemologies across the sciences, social sciences and humanities, and assesses the extent to which they are engendering paradigm shifts across multiple disciplines.
About
Evolution of Data, Data Science , Business Analytics, Applications, AI, ML, DL, Data science – Relationship, Tools for Data Science, Life cycle of data science with case study,
Algorithms for Data Science, Data Science Research Areas,
Future of Data Science.
In this presentation, let's have a look at What is Data Science and it's applications. We discussed most common use cases of Data Science.
I presented this at LSPE-IN meetup happened on 10th March 2018 at Walmart Global Technology Services.
Big data, new epistemologies and paradigm shiftsrobkitchin
This presentation examines how the availability of Big Data, coupled with new data analytics, challenges established epistemologies across the sciences, social sciences and humanities, and assesses the extent to which they are engendering paradigm shifts across multiple disciplines.
About
Evolution of Data, Data Science , Business Analytics, Applications, AI, ML, DL, Data science – Relationship, Tools for Data Science, Life cycle of data science with case study,
Algorithms for Data Science, Data Science Research Areas,
Future of Data Science.
Hawaii International Conference on Systems Sciences 2017. There are many opportunities for academics to submit papers for presentation at this very important conference which has sessions on Cognitive, Analytics, Big Data and much more. Haluk Demirkan, U Washington and Sergey Belov, IBM University Relations CEEMA made this presentation at Cognitive Systems Institute Speaker Series call on March 10, 2016.
Diffusion of Big Data and Analytics in Developing Countriestheijes
The purpose of this study is to shed light on the capabilities for storing, analysing and sharing big data in developing countries. The study takes an in-depth look at adoption of big data as a technological innovation, as well as the adoption issues for Big Data, its availability and access. The paper presents a review of academic literature, policy documents from international agencies and reports from industry in order to assess the diffusion and adoption of big data innovation in developing countries. The study was broadened by a Google Scholar search for relevant literature where the combinations of the following key words were used big data and analytics, developing countries, and diffusion of Innovations. Diffusion of innovations can greatly accelerate adoption and utilization of Big Data, even though there are challenges faced by developing countries which limit capability and utilization of these technologies effectively. The paper presents the Innovations Diffusions Theoretical framework for the study of Big Data innovation adoption in developing countries. The study concludes that the diffusion theory concepts provide an effective mechanism for policy leaders in developing countries to maximize adoption of Big Data innovations, and can also be used in informing policy implementers on how to increase adoption rates for Big Data.
Data Analytics with R, Contents and Course materials, PPT contents. Developed by K K Singh, RGUKT Nuzvid.
Contents:
Introduction to Data, Information and Data Analytics,
Types of Variables,
Types of Analytics
Life cycle of data analytics.
Data Science is a wonderful technology that has applications in almost every field. Let's learn the basics of this domain on 16th March at (time).
Agenda
1. What is Data Science? How is it different from ML, DL, and AI
2. Why is this skill in demand?
3. What are some popular applications of Data Science
4. Popular tools and frameworks used in Data Science
Hawaii International Conference on Systems Sciences 2017. There are many opportunities for academics to submit papers for presentation at this very important conference which has sessions on Cognitive, Analytics, Big Data and much more. Haluk Demirkan, U Washington and Sergey Belov, IBM University Relations CEEMA made this presentation at Cognitive Systems Institute Speaker Series call on March 10, 2016.
Diffusion of Big Data and Analytics in Developing Countriestheijes
The purpose of this study is to shed light on the capabilities for storing, analysing and sharing big data in developing countries. The study takes an in-depth look at adoption of big data as a technological innovation, as well as the adoption issues for Big Data, its availability and access. The paper presents a review of academic literature, policy documents from international agencies and reports from industry in order to assess the diffusion and adoption of big data innovation in developing countries. The study was broadened by a Google Scholar search for relevant literature where the combinations of the following key words were used big data and analytics, developing countries, and diffusion of Innovations. Diffusion of innovations can greatly accelerate adoption and utilization of Big Data, even though there are challenges faced by developing countries which limit capability and utilization of these technologies effectively. The paper presents the Innovations Diffusions Theoretical framework for the study of Big Data innovation adoption in developing countries. The study concludes that the diffusion theory concepts provide an effective mechanism for policy leaders in developing countries to maximize adoption of Big Data innovations, and can also be used in informing policy implementers on how to increase adoption rates for Big Data.
Data Analytics with R, Contents and Course materials, PPT contents. Developed by K K Singh, RGUKT Nuzvid.
Contents:
Introduction to Data, Information and Data Analytics,
Types of Variables,
Types of Analytics
Life cycle of data analytics.
Data Science is a wonderful technology that has applications in almost every field. Let's learn the basics of this domain on 16th March at (time).
Agenda
1. What is Data Science? How is it different from ML, DL, and AI
2. Why is this skill in demand?
3. What are some popular applications of Data Science
4. Popular tools and frameworks used in Data Science
In this presentation, Venkatesh introduces IoT and associated trends. His interest area lies in analytics of data obtained through sensors. Some of his ideas include predicting mean sea level based on Oxygen levels, Intelligent transport systems etc.
Bigger and Better: Employing a Holistic Strategy for Big Data toward a Strong...IT Network marcus evans
Bigger and Better: Employing a Holistic Strategy for Big Data toward a Strong Value-Adding Proposition
by Patrick Hadley, Australian Bureau of Statistics at the Australian CIO Summit 2014
In the third part of the workshop series Smart Policies for Data, we will focus on two central building blocks – interoperability and balanced data sharing.
The presentations of the event:
- Szymon Lewandowski, DG CONNECT, European Commission
- Marko Turpeinen, CEO, 1001 Lakes
- Lars Nagel, CEO, International Data Spaces Association
an introductory course for Librarians on using Big Data and Data Science applications on the field of Library Science. The course is a 2 hour course module for basic fundamentals of applying DS work.
Data science, Know as data-driven science, is also an interdisciplinary field of scientific methods, processes, algorithms, and systems to extract knowledge or insights from data in various forms, either structured or unstructured, similar to data mining.
Big Data is one of the emerging areas in today's technological world. In this socially active world, data is growing at a tremendous pace of 2.5 quintillion bytes a day roughly that is only set to increase over the coming years.
Here is a guide for all beginners who express interest in this new field - Big Data.
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactDr. Sunil Kr. Pandey
This is my presentation on the Topic "Data Science - An emerging Stream of Science with its Spreading Reach & Impact". I have compiled and collected different statistics and data from different sources. This may be useful for students and those who might be interested in this field of Study.
Presentation: Study: #Big Data in #Austria, Mario Meir-Huber, Big Data Leader Eastern Europe, Teradata GmbH & Martin Köhler, Austrian Institute of Technology, AIT (AT), at the European Data Economy Workshop taking place back to back to SEMANTiCS2015 on 15 September 2015 in Vienna.
Similar to Big data presentation for University of Reykjavik, Iceland, March 22 (20)
A short summary of my PhD dissertation where I explore and analyze how value is generated from open data. Digital data have unique features that make them fundamentally different from most other resources. This is especially true if data are open and will impact how value is generated and evaluated. This calls for new value generation models which have implications for how governments operate, how businesses are run and can explain why new intermediaries are currently disrupting most markets.
Buy Verified PayPal Account | Buy Google 5 Star Reviewsusawebmarket
Buy Verified PayPal Account
Looking to buy verified PayPal accounts? Discover 7 expert tips for safely purchasing a verified PayPal account in 2024. Ensure security and reliability for your transactions.
PayPal Services Features-
🟢 Email Access
🟢 Bank Added
🟢 Card Verified
🟢 Full SSN Provided
🟢 Phone Number Access
🟢 Driving License Copy
🟢 Fasted Delivery
Client Satisfaction is Our First priority. Our services is very appropriate to buy. We assume that the first-rate way to purchase our offerings is to order on the website. If you have any worry in our cooperation usually You can order us on Skype or Telegram.
24/7 Hours Reply/Please Contact
usawebmarketEmail: support@usawebmarket.com
Skype: usawebmarket
Telegram: @usawebmarket
WhatsApp: +1(218) 203-5951
USA WEB MARKET is the Best Verified PayPal, Payoneer, Cash App, Skrill, Neteller, Stripe Account and SEO, SMM Service provider.100%Satisfection granted.100% replacement Granted.
Improving profitability for small businessBen Wann
In this comprehensive presentation, we will explore strategies and practical tips for enhancing profitability in small businesses. Tailored to meet the unique challenges faced by small enterprises, this session covers various aspects that directly impact the bottom line. Attendees will learn how to optimize operational efficiency, manage expenses, and increase revenue through innovative marketing and customer engagement techniques.
Falcon stands out as a top-tier P2P Invoice Discounting platform in India, bridging esteemed blue-chip companies and eager investors. Our goal is to transform the investment landscape in India by establishing a comprehensive destination for borrowers and investors with diverse profiles and needs, all while minimizing risk. What sets Falcon apart is the elimination of intermediaries such as commercial banks and depository institutions, allowing investors to enjoy higher yields.
RMD24 | Debunking the non-endemic revenue myth Marvin Vacquier Droop | First ...BBPMedia1
Marvin neemt je in deze presentatie mee in de voordelen van non-endemic advertising op retail media netwerken. Hij brengt ook de uitdagingen in beeld die de markt op dit moment heeft op het gebied van retail media voor niet-leveranciers.
Retail media wordt gezien als het nieuwe advertising-medium en ook mediabureaus richten massaal retail media-afdelingen op. Merken die niet in de betreffende winkel liggen staan ook nog niet in de rij om op de retail media netwerken te adverteren. Marvin belicht de uitdagingen die er zijn om echt aansluiting te vinden op die markt van non-endemic advertising.
"𝑩𝑬𝑮𝑼𝑵 𝑾𝑰𝑻𝑯 𝑻𝑱 𝑰𝑺 𝑯𝑨𝑳𝑭 𝑫𝑶𝑵𝑬"
𝐓𝐉 𝐂𝐨𝐦𝐬 (𝐓𝐉 𝐂𝐨𝐦𝐦𝐮𝐧𝐢𝐜𝐚𝐭𝐢𝐨𝐧𝐬) is a professional event agency that includes experts in the event-organizing market in Vietnam, Korea, and ASEAN countries. We provide unlimited types of events from Music concerts, Fan meetings, and Culture festivals to Corporate events, Internal company events, Golf tournaments, MICE events, and Exhibitions.
𝐓𝐉 𝐂𝐨𝐦𝐬 provides unlimited package services including such as Event organizing, Event planning, Event production, Manpower, PR marketing, Design 2D/3D, VIP protocols, Interpreter agency, etc.
Sports events - Golf competitions/billiards competitions/company sports events: dynamic and challenging
⭐ 𝐅𝐞𝐚𝐭𝐮𝐫𝐞𝐝 𝐩𝐫𝐨𝐣𝐞𝐜𝐭𝐬:
➢ 2024 BAEKHYUN [Lonsdaleite] IN HO CHI MINH
➢ SUPER JUNIOR-L.S.S. THE SHOW : Th3ee Guys in HO CHI MINH
➢FreenBecky 1st Fan Meeting in Vietnam
➢CHILDREN ART EXHIBITION 2024: BEYOND BARRIERS
➢ WOW K-Music Festival 2023
➢ Winner [CROSS] Tour in HCM
➢ Super Show 9 in HCM with Super Junior
➢ HCMC - Gyeongsangbuk-do Culture and Tourism Festival
➢ Korean Vietnam Partnership - Fair with LG
➢ Korean President visits Samsung Electronics R&D Center
➢ Vietnam Food Expo with Lotte Wellfood
"𝐄𝐯𝐞𝐫𝐲 𝐞𝐯𝐞𝐧𝐭 𝐢𝐬 𝐚 𝐬𝐭𝐨𝐫𝐲, 𝐚 𝐬𝐩𝐞𝐜𝐢𝐚𝐥 𝐣𝐨𝐮𝐫𝐧𝐞𝐲. 𝐖𝐞 𝐚𝐥𝐰𝐚𝐲𝐬 𝐛𝐞𝐥𝐢𝐞𝐯𝐞 𝐭𝐡𝐚𝐭 𝐬𝐡𝐨𝐫𝐭𝐥𝐲 𝐲𝐨𝐮 𝐰𝐢𝐥𝐥 𝐛𝐞 𝐚 𝐩𝐚𝐫𝐭 𝐨𝐟 𝐨𝐮𝐫 𝐬𝐭𝐨𝐫𝐢𝐞𝐬."
Premium MEAN Stack Development Solutions for Modern BusinessesSynapseIndia
Stay ahead of the curve with our premium MEAN Stack Development Solutions. Our expert developers utilize MongoDB, Express.js, AngularJS, and Node.js to create modern and responsive web applications. Trust us for cutting-edge solutions that drive your business growth and success.
Know more: https://www.synapseindia.com/technology/mean-stack-development-company.html
LA HUG - Video Testimonials with Chynna Morgan - June 2024Lital Barkan
Have you ever heard that user-generated content or video testimonials can take your brand to the next level? We will explore how you can effectively use video testimonials to leverage and boost your sales, content strategy, and increase your CRM data.🤯
We will dig deeper into:
1. How to capture video testimonials that convert from your audience 🎥
2. How to leverage your testimonials to boost your sales 💲
3. How you can capture more CRM data to understand your audience better through video testimonials. 📊
Tata Group Dials Taiwan for Its Chipmaking Ambition in Gujarat’s DholeraAvirahi City Dholera
The Tata Group, a titan of Indian industry, is making waves with its advanced talks with Taiwanese chipmakers Powerchip Semiconductor Manufacturing Corporation (PSMC) and UMC Group. The goal? Establishing a cutting-edge semiconductor fabrication unit (fab) in Dholera, Gujarat. This isn’t just any project; it’s a potential game changer for India’s chipmaking aspirations and a boon for investors seeking promising residential projects in dholera sir.
Visit : https://www.avirahi.com/blog/tata-group-dials-taiwan-for-its-chipmaking-ambition-in-gujarats-dholera/
Attending a job Interview for B1 and B2 Englsih learnersErika906060
It is a sample of an interview for a business english class for pre-intermediate and intermediate english students with emphasis on the speking ability.
RMD24 | Retail media: hoe zet je dit in als je geen AH of Unilever bent? Heid...BBPMedia1
Grote partijen zijn al een tijdje onderweg met retail media. Ondertussen worden in dit domein ook de kansen zichtbaar voor andere spelers in de markt. Maar met die kansen ontstaan ook vragen: Zelf retail media worden of erop adverteren? In welke fase van de funnel past het en hoe integreer je het in een mediaplan? Wat is nu precies het verschil met marketplaces en Programmatic ads? In dit half uur beslechten we de dilemma's en krijg je antwoorden op wanneer het voor jou tijd is om de volgende stap te zetten.
Discover the innovative and creative projects that highlight my journey throu...dylandmeas
Discover the innovative and creative projects that highlight my journey through Full Sail University. Below, you’ll find a collection of my work showcasing my skills and expertise in digital marketing, event planning, and media production.
Skye Residences | Extended Stay Residences Near Toronto Airportmarketingjdass
Experience unparalleled EXTENDED STAY and comfort at Skye Residences located just minutes from Toronto Airport. Discover sophisticated accommodations tailored for discerning travelers.
Website Link :
https://skyeresidences.com/
https://skyeresidences.com/about-us/
https://skyeresidences.com/gallery/
https://skyeresidences.com/rooms/
https://skyeresidences.com/near-by-attractions/
https://skyeresidences.com/commute/
https://skyeresidences.com/contact/
https://skyeresidences.com/queen-suite-with-sofa-bed/
https://skyeresidences.com/queen-suite-with-sofa-bed-and-balcony/
https://skyeresidences.com/queen-suite-with-sofa-bed-accessible/
https://skyeresidences.com/2-bedroom-deluxe-queen-suite-with-sofa-bed/
https://skyeresidences.com/2-bedroom-deluxe-king-queen-suite-with-sofa-bed/
https://skyeresidences.com/2-bedroom-deluxe-queen-suite-with-sofa-bed-accessible/
#Skye Residences Etobicoke, #Skye Residences Near Toronto Airport, #Skye Residences Toronto, #Skye Hotel Toronto, #Skye Hotel Near Toronto Airport, #Hotel Near Toronto Airport, #Near Toronto Airport Accommodation, #Suites Near Toronto Airport, #Etobicoke Suites Near Airport, #Hotel Near Toronto Pearson International Airport, #Toronto Airport Suite Rentals, #Pearson Airport Hotel Suites
Digital Transformation and IT Strategy Toolkit and TemplatesAurelien Domont, MBA
This Digital Transformation and IT Strategy Toolkit was created by ex-McKinsey, Deloitte and BCG Management Consultants, after more than 5,000 hours of work. It is considered the world's best & most comprehensive Digital Transformation and IT Strategy Toolkit. It includes all the Frameworks, Best Practices & Templates required to successfully undertake the Digital Transformation of your organization and define a robust IT Strategy.
Editable Toolkit to help you reuse our content: 700 Powerpoint slides | 35 Excel sheets | 84 minutes of Video training
This PowerPoint presentation is only a small preview of our Toolkits. For more details, visit www.domontconsulting.com
2. • Stúdent frá Eðlisfræðibraut I í MR 1991
• B.Sc. in Economics 1994
• M.Sc. in Economics 1998
• Ph.D. in Information Technology Management 2015
• Have worked as a economist, IT consultant, assistant
professor, project manager, program manager, director,
industrial PhD and now postdoctoral researcher.
• Have always focused on use of technology
Who am I?
Traditional career
My career
@Thorhildur Jetzek CBS 2|
3. • High ranking
– 2nd in Europe (behind LSE) & 22 world-wide
• Focus on collaboration with industry
– Industrial PhD (my PhD contract was at KMD)
– Engaged scholarship & collaborative research
(current project of mine sponsored by industry)
– Crowdsourcing events:
• Student competition where CBS students got access to
anonymized data on 100.000 customers of Danske Bank
and socio-economic data from KMD as well as data from
Danske bank´s public Facebook wall
• Financial prices (DKK 75.000 1st price)
@Thorhildur Jetzek CBS 3|
6. Rise of Digitization
An average decline of almost 40% a year in the cost
per gigabyte of consumer hard disk drive from 1998
(OECD, 2013).
38% yearly
decrease in the
cost of shifting one
bit per second
since 1995 (OECD,
2013).
More than 30 million interconnected sensors are now
deployed worldwide, in areas such as security, health
care, transport systems or energy control systems,
and their numbers are growing by around 30% a year
(McKinsey, 2011).
6 billion people
have cellphones
30 billion pieces of
content are shared
on Facebook every
month
2002: The year when the amount of information
stored digitally surpasses non-digital information!
@Thorhildur Jetzek CBS 6|
7. Changes...
Forbes highlights
• IT in the boardroom: Digital strategies
• Changing business models - platforms
• Big data and analytics
• Lacking skills: EU estimates 160% increase in demand for
Big Data specialists between 2013-2020 to 346,000 new
jobs
IDC predicts
• Market for big data analysis services over $16 billion in 2014,
growing six times faster than the entire IT industry
• Cloud-based big data and analytics will grow three times faster
than spending for on premise solutions in 2015
@Thorhildur Jetzek CBS 7|
8. Global open access
FLOSS – Free/Open Source
Software
“…people´s pursuit of visible
carrots is at times interrupted by
the larger quest for the invisible
gold at the end of the rainbow.”
(von Krogh et al., 2012a, p. 671)
• Collaborative projects: Wikipedia, Human Genome Project,
Open.Nasa.gov
• Open NGO data: http://data.worldbank.org/ (and multitude of
similar)
• Open Government Data: http://data.gov / http://data.gov.uk
(and 300 others)
• Open company data (open API´s): Facebook, Twitter, LinkedIn
• Platforms: CouchSurfing.com; NeighbourGoods.net
@Thorhildur Jetzek CBS 8|
9. What is BIG data?
• The jury is still out
– Davenport: New technologies – software and
infrastructure plus the data itself
– Forrester defines Big Data as “techniques and
technologies that make handling data at extreme
scale affordable”
– McKinsey (2011): “Big data” refers to datasets
whose size is beyond the ability of typical
database software tools to capture, store,
manage, and analyze.
@Thorhildur Jetzek CBS 9|
11. Dimensions of big data: 4 V‘s
Source: IBM, http://www.ibmbigdatahub.com/infographic/four-vs-big-data
12. Utilization of data
Source: @PetteriA: http://www.slideshare.net/petterialahuhta/alahuhta-big-
dataandanalytics24sep2014
@Thorhildur Jetzek CBS 12|
14. Liquid open data
@HildaJetzek
Liquidity – reflects
ability to link and stream
data across systems
Openness – reflects ability to
use data outside of
organizational boundaries
Liquid dataIlliquid data
Closed data
Open data
Liquid closed data:
Data are effectively reused
across a variety of systems
within a single organization
Illiquid (silo’ed) closed data:
Data are stored where they
originate and not reused
Illiquid (silo’ed) open data :
Data are used outside of
organizational boundaries but
offer limited potential for
automation or coupling of data
Liquid open data:
Data are used outside of
organizational boundaries
and easily coupled with
other data and integrated
across systems
Combining internal
and external data for
improved insights
Internally
shared data
Most data
within
organizations
Many open
government data
initiatives
15. How do we identify opennes?
@HildaJetzek
Dimension Affordance Explanation
Openness
Strategic Availability Data are open to all by default
Economic Affordability
Data are free or charged for at maximum at
marginal cost of reproduction
Legal Reusability Data are published with open licenses
Liquidity
Conceptual Interoperability
Semantics and syntax are clear, data models
and metadata are published, use of
standard identifiers
Technical
Usability
Data are of high quality, published in
machine readable and standard formats,
using contextual metadata
Discoverability
Data are easily found through central portals
or published with searchable metadata or
using linked data semantics
Accessibility
Data are easily downloadable or ”query-
able” through APIs
16. Binary or continuous?
• Data are not just open or closed, or liquid
or illiquid – a continuous range
• Classification useful for strategy purposes
– A part of an organization’s data need to be
liquid across the company (customer master)
– Other data could be open but illiquid (financial
statement)
– Some data are liquid and open (genomics data,
geospatial data)
@Thorhildur Jetzek CBS 16|
17. Highlights
• Why do we have so much data?
• What are the underlying societal changes
we need to be aware of?
• Why has openness become so popular?
• Does it make sense to make more use of
data, even if it is expensive to re-think
how we handle data in the company?
@Thorhildur Jetzek CBS 17|
19. Machine-generated big data
• Sensors/IoT devices
– From car navigation systems, smart meters,
unmanned security systems, sensors etc.
@Thorhildur Jetzek CBS 19|
20. • Social data
– Sources: Social media websites, blog
sites, product reviews, search results
– Unstructured,
natural language
• Data from mobile phones
People-generated big data
– Most commonly geolocation
– for example used to analyze
traffic or movement of people
or to do geo-tagging
Source: Waze
https://www.waze.com/@Thorhildur Jetzek CBS 20|
21. Measurement data
• Nature/Environment
– Sources: Measurements, such as meteorological,
atmospheric and pollution Big Data
• Geospatial data
Source: @Vishy Iyer, UT
https://news.utexas.edu/2012/09/28/cracking
-the-genetic-code-of-brain-tumors
Cracking the Genetic Code of Brain Tumors
• Lifeforms
– Sources: Genetic
sequencing,
patient databases
@Thorhildur Jetzek CBS 21|
23. Structure of data
• How do we define structured data?
– Very often referred to as data in structured relational
databases
– Known datamodel, identities and tabular formats
(columns and rows)
– Still, a lot of (big) data analytics tools/packages want
tabular formats
• R uses data-frames
• Tableu wants a tabular format
• SAS/SPSS use a tabular format
@Thorhildur Jetzek CBS 23|
25. Semi-structured data
• Typically data such as XML or JSON
– Nested, not tabular but a known
structure all the same
– Could be applied to
text-files such as logs
– Can be transformed into
a tabular structure (with
many empty cells)
@Thorhildur Jetzek CBS 25|
29. Standard analytics
• Data analytics can take many different
forms
• Common forms of data analytics include:
– Static reporting: Annual reports – quarterly
reports etc.)
– Dynamic reporting: Business intelligence,
ability to choose columns and rows and
reorganize data into a format that makes
sense to user
– Simple analysis: sums, filtering, pivot tables,
max and min values, averages etc.
@Thorhildur Jetzek CBS 29|
30. Visual analytics
• To explore and understand data by visualizing
• Most people have an easier time understanding a
chart than t-values or large numeric matrices
• Can range from „traditional“ bar charts or lines to
word clouds (highlights most used words by
making them bigger), heatmaps, placing items on
geographic maps, use of treemaps, bubble
diagrams etc.
• Visual analytics is (like all statistics really) a
combination of art and science. It is difficult to tell
a good story with one picture, but a very powerful
tool if you succceed!
@Thorhildur Jetzek CBS 30|
31. Helps us understand
Source: SAS http://www.sas.com/en_nz/software/business-intelligence/visual-analytics.html
32. Basic analytics
• Use of a bit more advanced statistics using Excel
or Excel add-ins or tools such as SAS or SPSS.
• Correlational/regression analysis.
– Could be used to see if there are any interesting
correlations or explanations which can add to
company decision making
– Can be used for forecasting, for example if there is a
great weather forecast (external data), sales of
icecream are predicted to go up 15% => stock up on
icecream
– Be aware of the uncertainty in such models. This is
not the truth!
– Be aware of spurious correlations:
http://tylervigen.com/spurious-correlations
@Thorhildur Jetzek CBS 32|
33. Basic analytics
• Time series analysis
– Used to predict the future
– Makes use of historical data and looks for
trends in the data
– Seasonal changes, growth etc.
– Use of statistical methods like moving
averages to figure out long term trends
34. Advanced analytics
• There are many more less used statistical
methods for quantitative data (numbers)
– Dimension reduction: Search for any natural
clusters in the measurements (columns) that
help us identify composite variables
– Cluster analysis: Search for any natural clusters
in the data (rows). For instance, marketers like
to cluster the general population of consumers
into market segments with different buying
behaviors
– Social network analysis: Clusters and
relationships. Which groups on Facebook are
likely to connect?
35. Advanced analytics
• Structural equation modelling (SEM):
• Simultaneously estimate multiple equations
(multivariate)
• Estimate variables and paths (relationships)
• Can be based on covariance (CB-SEM) or multiple
regression (PLS-SEM)
• Confirmatory factor analysis based on similar technique
but without estimating the paths
Variable (typically
based on more
than one measures
to reduce risk of
measurment bias)
Path – to understand nature
of relationship@Thorhildur Jetzek 35|
38. Artificial intelligence
• Neural network analysis:
– A computer program modeled after the human
brain and can identify patterns in a similar way that
we do
– This technique is
particularly useful if you
have a large amount of
data, which can reveal
subtle patterns you haven’t
found or modelled ex ante
@Thorhildur Jetzek CBS 38|
39. Machine learning
• Machine learning can use many different
algorithms
– Machine learning can use supervised, semi-
supervised or unsupervised learning
processes
– Despite the fancy connotation, some
machine learning algorithms are not that
complex
– Of course they can also be very complex
(Google‘s self driving car, IBM Watson‘s
chess playing algorithm)
@Thorhildur Jetzek CBS 39|
41. Data mining
• Data mining:
– A process of extracting value from large quantities
of unstructured data, including text, images, voice
and video. Includes pattern recognition, tagging
and annotation
– Data mining can really increase the value of the
data
• Sentiment analysis:
– Seeks to extract subjective opinion or sentiment
from text, video or audio data
– The basic aim is to determine the attitude of an
individual or group regarding a particular topic or
overall context
– Used to understand stakeholder opinion
@Thorhildur Jetzek CBS 41|
42. Text analysis
Source: Zimmerman, C., Stein, M. K., Hardt, D., & Vatrapu, R. (2015). Emergence of Things Felt.
In Proceedings of the Thirty Sixth International Conference on Information Systems. ICIS 2015
Own analysis based on Twitter data, query all tweeds including "open data"
OR opengovdata OR opengov in March 2012/13/14, total of 100k rows
@Thorhildur Jetzek CBS 42|
44. Data storage (no-SQL)
• Hadoop: open-source software
framework for distributed
storage of very large datasets
on computer clusters
• Cloudera: An enterprise solution to help
businesses manage their Hadoop ecosystem
• MongoDB: It’s good for managing data that
changes frequently or data that is unstructured
or semi-structured
• Apache Cassandra: Data replication, scalability
and performance
@Thorhildur Jetzek CBS 44|
45. Middleware:
Data integration and management
• Talend: Master Data Management (MDM) offering,
which combines real-time data, applications, and process
integration with embedded data quality and stewardship
• Pentaho: A Comprehensive data integration and business
analytics platform, incl. embedded analysis
• Splunk: Monitor, search and analyze massive streams of
machine data
• InfoSphere Master Data Management: Helps link
unstructured content from external sources to the
golden record for that enhanced 360-degree view
@Thorhildur Jetzek CBS 45|
46. (Visual) Analytics
• Many of the middleware solutions reach into this space and vice
versa – most of these tools have data integration possibilities and
the others offer some analytics
• Tableau: Has focused on integration to various
data-sources (incl. Hadoop) and easy
visualization of data – very easy to use
• Qlik: Very robust, offering options to create
very nice dashboards, but has a bit steeper
learning curve
• IBM Watson Analytics: Can use natural
language to ask questions that are
„translated“ into a query
@Thorhildur Jetzek CBS 46|
47. Advanced Analytics
• SPSS – tabular data only and narrow
capabilities but relatively easy to use
• SAS – dynamic (a programming language)
and a lot of options for analysis
• Matlab – a lot of flexibility for doing own
programming
• R (open source) – can apply packages but
you still have to do a lot of manual labour
(code). Many options
• For structural equation modelling: specific
packages such as SmartPls or Amos
@Thorhildur Jetzek CBS 47|
49. Economics of data
• I use the economists approach and view
data (of any size) as a resource
• Specific features of digital data
– Low marginal costs – easy to distribute and
reuse
– Can be used for many different things
– Value mostly from downstream activities
From an economic perspective, it
makes sense to reuse data as much
as possible
@Thorhildur Jetzek CBS 49|
50. Data accounting
• Data as a resource
– What data do we have
– Where do they originate from
– Where are they stored, who is responsible
– Are they sensitive or can they be opened for resuse
– Are the streaming or static
– Are they mission critical or less important
– Are we using them optimally?
– Do we have the right skills
– Do we have the right tools
• We know about our human resources, machines,
buildings, cars, production parts... Now we must
have the same knowledge about data
@Thorhildur Jetzek CBS 50|
51. ...capitalizing on the benefits of digitization needs to
be a strategic imperative
Value generation
@Thorhildur Jetzek CBS 51|
52. Business strategy
• Consider the competitive advantage
offered by your own data
• Consider the potential value from using other
available external data
• Consider costs and benefits
• Consider what other companies are doing (not
necessarily in the same industry)
Sometimes it makes sense to reuse data internally
Sometimes it makes sense to fetch and use external
data
Sometimes it makes sense to share own data
@Thorhildur Jetzek CBS 52|
54. Model with 2-sided markets
@HildaJetzek
Soft infrastructure
Sustainable
value
Paying side
Buying and selling
goods and services
Non-paying side
Sharing relevant
content
Cost of high-
speed networks
Openness of data
Societal level impact
(MSPs)
Intermediaries
Information
sharing +
market
mechanisms
= Synergy
Effectiveness of data
and privacy protection
frameworks
Ease of reaching a
skilled workforce
Motivation
AbilityBasic requirements
Resource
Digital leadership of
government
Opportunity
Societal level structures
MSPs = Multi Sided Platforms
@Thorhildur Jetzek CBS 54|
56. Examples of use
• Better understand and target customers
• Understand and optimize business
processes
• Improving health
• Smart cities
• Improving sports performance
@Thorhildur Jetzek CBS 56|
62. Smart cities
• Improve security and save money
• Analyze traffic -> implement automatic
traffic controls
@Thorhildur Jetzek CBS 62|
63. Improving sports performance
Prozone analyses over 750,000 data points per game, 300 GPS data points
per training session and 110 data points per injury to create the most
comprehensive injury database in sport. By analyzing over 36 million data
points per club/per season we aim to identify the subtle patterns in a player’s
performance that predispose them to increased relative risk of injury
allowing club to act to prevent injury.
@Thorhildur Jetzek CBS 63|