This is the presentation I presented at superweek.in (Mumbai - Jan 2017) http://superweek.in/. In the field of data management and analytics, one statement we often get to hear is - "The data is not accurate". This presentation is based on this statement, and I explore scenarios that might lead to such statements, reasons and remedies, expectation management (with various functional teams) and business definitions.
The Digital Marketing tech stack is expending every day. With so many tools at our fingertips we have the potential to accomplish more than ever before. But hidden issues with the people, processes, and technology in your dept can lead too lost time, lost opportunity, and invalid decisions. Using real world examples from leading companies around the globe, we'll teach you how to identify gaps in your data landscape, find solutions, and turn them into opportunities using resources you already have.
Seven Common Redesign Pitfalls... and How to Avoid ThemBeacon
When redesigning a website the focus tends to be centered on overhauling the aesthetic appearance. The goal tends to revolve around developing a modern-day look that delivers great user experience, and rightly so.
There are only TWO substantial phases in a life of a digital service: either they are being built or they are being optimized. How to improve your UX with digital analytics?
Ask the Expert – Best Practices for a High Performing SiteDynatrace
Featuring Ben Rushlo, Dynatrace VP of Synthetic and UEM Services
Hear from Ben as he discusses key factors in building and maintain a high performing web site. Ben and his team of performance experts help hundreds of Dynatrace customers each year optimize their sites and he will share some of the common themes and best practices, including:
• What is fast enough?
• Typical “pain points” for slow performance
• Top 10 common performance mistakes
• The unique challenges of responsive design
The Digital Marketing tech stack is expending every day. With so many tools at our fingertips we have the potential to accomplish more than ever before. But hidden issues with the people, processes, and technology in your dept can lead too lost time, lost opportunity, and invalid decisions. Using real world examples from leading companies around the globe, we'll teach you how to identify gaps in your data landscape, find solutions, and turn them into opportunities using resources you already have.
Seven Common Redesign Pitfalls... and How to Avoid ThemBeacon
When redesigning a website the focus tends to be centered on overhauling the aesthetic appearance. The goal tends to revolve around developing a modern-day look that delivers great user experience, and rightly so.
There are only TWO substantial phases in a life of a digital service: either they are being built or they are being optimized. How to improve your UX with digital analytics?
Ask the Expert – Best Practices for a High Performing SiteDynatrace
Featuring Ben Rushlo, Dynatrace VP of Synthetic and UEM Services
Hear from Ben as he discusses key factors in building and maintain a high performing web site. Ben and his team of performance experts help hundreds of Dynatrace customers each year optimize their sites and he will share some of the common themes and best practices, including:
• What is fast enough?
• Typical “pain points” for slow performance
• Top 10 common performance mistakes
• The unique challenges of responsive design
Learn about the CDO Advisors Primed Analytic Process. This is our data science methodology that was created after years of practical experience. Learn more at www.primed-ap.com. This presentation is from the Global Predictive Analytic Conference April 3rd, 2018.
How to Improve Your Data Collection Using Mobile PhonesSocialCops
Join Nishant, data collection expert at SocialCops, as he runs you through the different types of data collection and shows you how to save time, money and resources by switching to mobile data collection.
In this presentation, you'll learn:
• About different data collection techniques
• How to collect information using mobile forms
• How to manage your data collection project
Future-Proof Your Streaming Analytics Architecture- StreamAnalytix WebinarImpetus Technologies
Future-Proof Your Streaming Analytics Architecture- StreamAnalytix Webinar
View the webcast on http://bit.ly/1HFD8YR
The speakers from Forrester and Impetus talk about the options and optimal architecture to incorporate real-time insights into your apps that provisions benefitting from future innovation also.
Moving from data to insights: How to effectively drive business decisions & g...Cloudera, Inc.
Firms have become obsessed with data. But the key to competitive advantage is not just more or bigger data or big data technology, it is finding actionable insights from all the data as well as embedding insight in processes and applications. This requires a change in your approach - modernized architecture and embedding insights and data in you business decisions It also requires a change in how your people work systematically to find, test and implement insights. In this webinar, Forrester Vice President and Principal Analyst Brian Hopkins will present results from two years of research into these ideas and recommend to attendees how they can get the most out of their data and analytics to drive effective business decisions and gain competitive advantage.
Analytics and user experience. Alessandro TREZZILouise Chaussade
By Alessandro TREZZI, UX Designer
When designing a web application, what's the screen resolution you should take into account? Is it worth making your application compatible with an older browser? A customer is insisting on adding a very expensive functionality: is it worth developing it? How many people are accessing a specific page from a mobile device?
Goal of this talk is to show how analytics can be part of design strategy to identify Ux problems and solutions.
The New Self-Service Analytics - Going Beyond the ToolsKatherine Gabriel
In today’s business climate, using data to make quick decisions is a common ask across organizations. To fulfill such asks business users want more, faster, and better access to data and analytic tools. IT wants to balance this need for speed with the responsibility to protect the data assets from security, privacy, and quality risks. A common solution to this scenario is self-service BI or self-service analytics. Chances are you are already using self-service BI in some way, shape, or form or have heard a pitch from an analytic tool vendor!
Self-service BI has been around for several decades and yet business users keep asking for more and more. Has self-service BI failed to deliver on its promise? Is it time to revisit what self-service really means? How can business and IT work together to achieve better decision-making outcomes for their organization?
We cover:
• How to demystify what self-service analytics means
• New trends driving the self-service analytics evolution
• Best practices and lessons learned from real-life examples
• Recommendations for making progress within your organization
Advance your self-service journey.
Software Project Management Presentation FinalMinhas Kamal
Software Project Management- ResearchColab
Presented in 4th year of Bachelor of Science in Software Engineering (BSSE) course at Institute of Information Technology, University of Dhaka (IIT, DU).
Gain a Holistic View of your Customer's JourneyPlatfora
Today, companies are capturing information about customers at every touchpoint, but the reality is that most companies are working with siloed marketing data because they’re using disparate tools to track online, offline, web, social, mobile, and advertising data.
In this presentation, Rod Fontecilla, VP of Application Modernization at Unisys, explains how his team uses Platfora to analyze, interact and understand data to drive customer success at Unisys.
Rod will highlight three specific Unisys use cases of Platfora, one of which involved a timely text survey sentiment analysis that produced insights enabling a course correction in favor of improved customer satisfaction.
Using Web Data to Drive Revenue and Reduce CostsConnotate
This presentation is designed to help companies strengthen their competitive advantage by leveraging publicly available Web sources.
Entrepreneurs, global industry leaders and enterprises of all sizes are turning Web data into lucrative opportunities – creating new revenue-generating products, reducing costs and re-engineering workflows to optimize pricing, streamline reporting, ensure compliance, engage interactively with clients and more.
This presentation uses a variety of success stories to illustrate ways in which businesses can use Web data to drive revenue and streamline operations.
Making sense of analytics for documentation pagesPronovix
As content producers, we invest considerable time and effort in developing, packaging, and delivering content that we think our users need. After publishing the content, we hope that users find our content useful. And we often wonder how users really navigate and consume our content. Web page analytics can help us gauge the information needs of our customers, assess their content consumption behavior, and find opportunities to improve our content and how we deliver it.
Kumar explores the basics of web analytics, pitfalls of relying too much on web analytics for important decisions, the typical web analytics process, and he will share some guidelines for interpreting web analytics numbers.
Forget Big Data. It's All About Smart DataAlan McSweeney
This proposes an initial smart data framework and structure to allow the nuggets of value contained in the deluge of largely irrelevant and useless data to be isolated and extracted. It enables your organisation to ask the questions to understand where it should be in terms of its data state and profile and what it should do to achieve the desired skills level across the competency areas of the framework.
Every organisation operates within a data landscape with multiple sources of data relating to its activities that is acquired, transported, stored, processed, retained, analysed and managed. Interactions across the data landscape generate primary data. When you extend the range of possible interactions business processes outside the organisation you generate a lot more data.
Smart data means being:
• Smart in what data to collect, validate and transform
• Smart in how data is stored, managed, operated and used
• Smart in taking actions based on results of data analysis including organisation structures, roles, devolution and delegation of decision-making, processes and automation
• Smart in being realistic, pragmatic and even skeptical about what can be achieved and knowing what value can be derived and how to maximise value obtained
• Smart in defining an achievable, benefits-lead strategy integrated with the needs business and in its implementation
• Smart in selecting the channels and interactions to include – smart data use cases
Smart data competency areas comprise a complete set of required skills and abilities to design, implement and operate an appropriate smart data programme.
How to Use Apricot Software to Improve Data QualityJeffrey Haguewood
Data quality and data health in Apricot are top priorities, but where should you start? Managing data health in Apricot includes a mix of proactive and reactive strategies. In addition, there are a set of tactics you can use to efficiently clean up your data when needed.
Check out this SlideShare to learn the strategies, tools, and tips to improve the quality and health of your Apricot database. Learn how to assess data quality and take action.
Watch the recorded webinar: https://youtu.be/Zvvl1B0216M
Sidekick Solutions is an independent software consulting firm, specializing in Apricot software. We help new and existing Apricot users set up, streamline, and make the most of Apricot software with a range of professional services for implementation, reporting, consulting, data migration, and database audit/cleanup. We make Apricot easier to use and more capable for our clients, yielding higher return on investment in their Apricot software license.
Whether you are evaluating Apricot software for purchase, are a new user about to start implementation, or are an existing user with an existing system, designing and managing an Apricot database takes a dose of creativity, and it helps to have a few ideas and use cases as examples. To help you get started, let’s explore ten ways to configure Social Solutions Apricot software.
Check out this SlideShare to learn how to operate programs, deliver services, engage with constituents, and generate reports in Apricot. Learn how other organizations are using Apricot and apply the concepts in your database.
Watch the recorded webinar: https://youtu.be/TgWy9UVtD38
Sidekick Solutions is an independent software consulting firm, specializing in Apricot software. We help new and existing Apricot users set up, streamline, and make the most of Apricot software with a range of professional services for implementation, reporting, consulting, data migration, and database audit/cleanup. We make Apricot easier to use and more capable for our clients, yielding higher return on investment in their Apricot software license.
Learn about the CDO Advisors Primed Analytic Process. This is our data science methodology that was created after years of practical experience. Learn more at www.primed-ap.com. This presentation is from the Global Predictive Analytic Conference April 3rd, 2018.
How to Improve Your Data Collection Using Mobile PhonesSocialCops
Join Nishant, data collection expert at SocialCops, as he runs you through the different types of data collection and shows you how to save time, money and resources by switching to mobile data collection.
In this presentation, you'll learn:
• About different data collection techniques
• How to collect information using mobile forms
• How to manage your data collection project
Future-Proof Your Streaming Analytics Architecture- StreamAnalytix WebinarImpetus Technologies
Future-Proof Your Streaming Analytics Architecture- StreamAnalytix Webinar
View the webcast on http://bit.ly/1HFD8YR
The speakers from Forrester and Impetus talk about the options and optimal architecture to incorporate real-time insights into your apps that provisions benefitting from future innovation also.
Moving from data to insights: How to effectively drive business decisions & g...Cloudera, Inc.
Firms have become obsessed with data. But the key to competitive advantage is not just more or bigger data or big data technology, it is finding actionable insights from all the data as well as embedding insight in processes and applications. This requires a change in your approach - modernized architecture and embedding insights and data in you business decisions It also requires a change in how your people work systematically to find, test and implement insights. In this webinar, Forrester Vice President and Principal Analyst Brian Hopkins will present results from two years of research into these ideas and recommend to attendees how they can get the most out of their data and analytics to drive effective business decisions and gain competitive advantage.
Analytics and user experience. Alessandro TREZZILouise Chaussade
By Alessandro TREZZI, UX Designer
When designing a web application, what's the screen resolution you should take into account? Is it worth making your application compatible with an older browser? A customer is insisting on adding a very expensive functionality: is it worth developing it? How many people are accessing a specific page from a mobile device?
Goal of this talk is to show how analytics can be part of design strategy to identify Ux problems and solutions.
The New Self-Service Analytics - Going Beyond the ToolsKatherine Gabriel
In today’s business climate, using data to make quick decisions is a common ask across organizations. To fulfill such asks business users want more, faster, and better access to data and analytic tools. IT wants to balance this need for speed with the responsibility to protect the data assets from security, privacy, and quality risks. A common solution to this scenario is self-service BI or self-service analytics. Chances are you are already using self-service BI in some way, shape, or form or have heard a pitch from an analytic tool vendor!
Self-service BI has been around for several decades and yet business users keep asking for more and more. Has self-service BI failed to deliver on its promise? Is it time to revisit what self-service really means? How can business and IT work together to achieve better decision-making outcomes for their organization?
We cover:
• How to demystify what self-service analytics means
• New trends driving the self-service analytics evolution
• Best practices and lessons learned from real-life examples
• Recommendations for making progress within your organization
Advance your self-service journey.
Software Project Management Presentation FinalMinhas Kamal
Software Project Management- ResearchColab
Presented in 4th year of Bachelor of Science in Software Engineering (BSSE) course at Institute of Information Technology, University of Dhaka (IIT, DU).
Gain a Holistic View of your Customer's JourneyPlatfora
Today, companies are capturing information about customers at every touchpoint, but the reality is that most companies are working with siloed marketing data because they’re using disparate tools to track online, offline, web, social, mobile, and advertising data.
In this presentation, Rod Fontecilla, VP of Application Modernization at Unisys, explains how his team uses Platfora to analyze, interact and understand data to drive customer success at Unisys.
Rod will highlight three specific Unisys use cases of Platfora, one of which involved a timely text survey sentiment analysis that produced insights enabling a course correction in favor of improved customer satisfaction.
Using Web Data to Drive Revenue and Reduce CostsConnotate
This presentation is designed to help companies strengthen their competitive advantage by leveraging publicly available Web sources.
Entrepreneurs, global industry leaders and enterprises of all sizes are turning Web data into lucrative opportunities – creating new revenue-generating products, reducing costs and re-engineering workflows to optimize pricing, streamline reporting, ensure compliance, engage interactively with clients and more.
This presentation uses a variety of success stories to illustrate ways in which businesses can use Web data to drive revenue and streamline operations.
Making sense of analytics for documentation pagesPronovix
As content producers, we invest considerable time and effort in developing, packaging, and delivering content that we think our users need. After publishing the content, we hope that users find our content useful. And we often wonder how users really navigate and consume our content. Web page analytics can help us gauge the information needs of our customers, assess their content consumption behavior, and find opportunities to improve our content and how we deliver it.
Kumar explores the basics of web analytics, pitfalls of relying too much on web analytics for important decisions, the typical web analytics process, and he will share some guidelines for interpreting web analytics numbers.
Forget Big Data. It's All About Smart DataAlan McSweeney
This proposes an initial smart data framework and structure to allow the nuggets of value contained in the deluge of largely irrelevant and useless data to be isolated and extracted. It enables your organisation to ask the questions to understand where it should be in terms of its data state and profile and what it should do to achieve the desired skills level across the competency areas of the framework.
Every organisation operates within a data landscape with multiple sources of data relating to its activities that is acquired, transported, stored, processed, retained, analysed and managed. Interactions across the data landscape generate primary data. When you extend the range of possible interactions business processes outside the organisation you generate a lot more data.
Smart data means being:
• Smart in what data to collect, validate and transform
• Smart in how data is stored, managed, operated and used
• Smart in taking actions based on results of data analysis including organisation structures, roles, devolution and delegation of decision-making, processes and automation
• Smart in being realistic, pragmatic and even skeptical about what can be achieved and knowing what value can be derived and how to maximise value obtained
• Smart in defining an achievable, benefits-lead strategy integrated with the needs business and in its implementation
• Smart in selecting the channels and interactions to include – smart data use cases
Smart data competency areas comprise a complete set of required skills and abilities to design, implement and operate an appropriate smart data programme.
How to Use Apricot Software to Improve Data QualityJeffrey Haguewood
Data quality and data health in Apricot are top priorities, but where should you start? Managing data health in Apricot includes a mix of proactive and reactive strategies. In addition, there are a set of tactics you can use to efficiently clean up your data when needed.
Check out this SlideShare to learn the strategies, tools, and tips to improve the quality and health of your Apricot database. Learn how to assess data quality and take action.
Watch the recorded webinar: https://youtu.be/Zvvl1B0216M
Sidekick Solutions is an independent software consulting firm, specializing in Apricot software. We help new and existing Apricot users set up, streamline, and make the most of Apricot software with a range of professional services for implementation, reporting, consulting, data migration, and database audit/cleanup. We make Apricot easier to use and more capable for our clients, yielding higher return on investment in their Apricot software license.
Whether you are evaluating Apricot software for purchase, are a new user about to start implementation, or are an existing user with an existing system, designing and managing an Apricot database takes a dose of creativity, and it helps to have a few ideas and use cases as examples. To help you get started, let’s explore ten ways to configure Social Solutions Apricot software.
Check out this SlideShare to learn how to operate programs, deliver services, engage with constituents, and generate reports in Apricot. Learn how other organizations are using Apricot and apply the concepts in your database.
Watch the recorded webinar: https://youtu.be/TgWy9UVtD38
Sidekick Solutions is an independent software consulting firm, specializing in Apricot software. We help new and existing Apricot users set up, streamline, and make the most of Apricot software with a range of professional services for implementation, reporting, consulting, data migration, and database audit/cleanup. We make Apricot easier to use and more capable for our clients, yielding higher return on investment in their Apricot software license.
Similar to Data is never accurate - and that's OK! (20)
Analysis insight about a Flyball dog competition team's performanceroli9797
Insight of my analysis about a Flyball dog competition team's last year performance. Find more: https://github.com/rolandnagy-ds/flyball_race_analysis/tree/main
Techniques to optimize the pagerank algorithm usually fall in two categories. One is to try reducing the work per iteration, and the other is to try reducing the number of iterations. These goals are often at odds with one another. Skipping computation on vertices which have already converged has the potential to save iteration time. Skipping in-identical vertices, with the same in-links, helps reduce duplicate computations and thus could help reduce iteration time. Road networks often have chains which can be short-circuited before pagerank computation to improve performance. Final ranks of chain nodes can be easily calculated. This could reduce both the iteration time, and the number of iterations. If a graph has no dangling nodes, pagerank of each strongly connected component can be computed in topological order. This could help reduce the iteration time, no. of iterations, and also enable multi-iteration concurrency in pagerank computation. The combination of all of the above methods is the STICD algorithm. [sticd] For dynamic graphs, unchanged components whose ranks are unaffected can be skipped altogether.
Adjusting OpenMP PageRank : SHORT REPORT / NOTESSubhajit Sahu
For massive graphs that fit in RAM, but not in GPU memory, it is possible to take
advantage of a shared memory system with multiple CPUs, each with multiple cores, to
accelerate pagerank computation. If the NUMA architecture of the system is properly taken
into account with good vertex partitioning, the speedup can be significant. To take steps in
this direction, experiments are conducted to implement pagerank in OpenMP using two
different approaches, uniform and hybrid. The uniform approach runs all primitives required
for pagerank in OpenMP mode (with multiple threads). On the other hand, the hybrid
approach runs certain primitives in sequential mode (i.e., sumAt, multiply).
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
Round table discussion of vector databases, unstructured data, ai, big data, real-time, robots and Milvus.
A lively discussion with NJ Gen AI Meetup Lead, Prasad and Procure.FYI's Co-Found
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
Discussion on Vector Databases, Unstructured Data and AI
https://www.meetup.com/unstructured-data-meetup-new-york/
This meetup is for people working in unstructured data. Speakers will come present about related topics such as vector databases, LLMs, and managing data at scale. The intended audience of this group includes roles like machine learning engineers, data scientists, data engineers, software engineers, and PMs.This meetup was formerly Milvus Meetup, and is sponsored by Zilliz maintainers of Milvus.
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdfEnterprise Wired
In this guide, we'll explore the key considerations and features to look for when choosing a Trusted analytics platform that meets your organization's needs and delivers actionable intelligence you can trust.
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...John Andrews
SlideShare Description for "Chatty Kathy - UNC Bootcamp Final Project Presentation"
Title: Chatty Kathy: Enhancing Physical Activity Among Older Adults
Description:
Discover how Chatty Kathy, an innovative project developed at the UNC Bootcamp, aims to tackle the challenge of low physical activity among older adults. Our AI-driven solution uses peer interaction to boost and sustain exercise levels, significantly improving health outcomes. This presentation covers our problem statement, the rationale behind Chatty Kathy, synthetic data and persona creation, model performance metrics, a visual demonstration of the project, and potential future developments. Join us for an insightful Q&A session to explore the potential of this groundbreaking project.
Project Team: Jay Requarth, Jana Avery, John Andrews, Dr. Dick Davis II, Nee Buntoum, Nam Yeongjin & Mat Nicholas
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Data is never accurate - and that's OK!
1. “Errors using inadequate data are much less than those using no data at all”
- Charles Babbage
The data is never accurate – and that’s OK!
By: Chanpreet Singh| twitter.com/chanpreets
2. Part 1: Why is data
not accurate?
Part 2: How to
ensure data sanity?
Part 3: How do we
work with this kind
of data?
Jan 13, 2017 Chanpreet 2
4. Part 1: Why is data
not accurate?
Jan 13, 2017 Chanpreet 4
5. What is the meaning of Accurate
Towards care
• (especially of information, measurements,
or predictions) correct in all details; exact.
• “Done with care”
• Digital measurement - ‘care’ unfortunately
is not enough
Jan 13, 2017 Chanpreet 5
6. Why would data not be Accurate
Factors responsible
• Digital Analytics programs do not
collect exact number of hits or queries
• Designed to give ‘valuable insights’
• Digital Analytics is not an Audit
• Inherent uncertainties
• Human nature
Jan 13, 2017 Chanpreet 6
7. Not designed to collect exact hits or
queries
JS limitations
• Different
layout engines
render JS
differently
Jan 13, 2017 Chanpreet 7
Layout Engine Browser
Trident
Internet Explorer for Windows
Maxthon
Netscape 8.1
Tasman Internet Explorer 5 for Macintosh
Gecko
Firefox
Netscape 6 and later
Mozilla
Camino
K-Meleon
SeaMonkey
Epiphany 2.20 and before
Galeon
KHTML Konqueror
Tkhtml Html Viewer
Layout
Engine
Browser
WebKit
Safari
Chrome
iCab 4 and later
Epiphany 2.26 and
later
Maxthon 3
OmniWeb
Shiira
Midori
Presto Opera
iCab iCab 3 and before
Tkhtml Html Viewer
8. Not designed to collect exact hits or
queries
Cookies
• Cookies
blocked or
disabled by
users
Jan 13, 2017 Chanpreet 8
9. Not designed to collect exact hits or
queries
Cookies
• Cookie
deletion
Jan 13, 2017 Chanpreet 9
10. Not designed to collect exact hits or
queries
<noscript>
• <noscript> tag
Jan 13, 2017 Chanpreet 10
11. Insights through % of data collected
Digital landscape
• Digital landscape is ever changing
• Multiple platforms
• Basic functional mobile phones
• Networks, CDNs and other systems
• Considerations of Outliers and Bots
• Sample data to extrapolate performance
Jan 13, 2017 Chanpreet 11
12. Not an Audit
Audit involves
• Series of checks and balances
• Accuracy
• Standardized metrics and
methodology
• Consistency of process
• Transparency of results
Jan 13, 2017 Chanpreet 12
14. Human Nature
Human factor at play
• Humans think differently
• Big organization, big teams, multiple functions
• Cross functional/ team dependencies
• Dependencies on partners
• Lack of proper understanding
Jan 13, 2017 Chanpreet 14
23. Part 2: How to
ensure data sanity?
Jan 13, 2017 Chanpreet 23
24. Awareness
Be Aware of differences
• Discrepancies vs Differences
• Different tools - different data
• Sampling
• Purpose of tools
• Metrics used
Jan 13, 2017 Chanpreet 24
25. Different tools different data – GA vs
AdWords
Metrics usage and nature differs
• Clicks vs Sessions
• AdWords filters invalid clicks, Analytics shows
all data
• Landing page might redirect to another
• Users browser preferences
• Users return during the lifetime of campaign
• Users return to your site via bookmarks
Jan 13, 2017 Chanpreet 25
26. Different tools different data – GA vs
DFP
Metrics usage and nature differs
• DFP counts ad clicks at the source; GA counts pageviews or
sessions when a user hits the site
• Metrics are counted at different points in the click-referral cycle
Jan 13, 2017 Chanpreet 26
27. Different tools different data –
Real time GA vs Another (Chartbeat)
Common to use multiple tools
• Chartbeat checks in for visitors every
few secs.
• GA checks in once for visitors every
5 mins.
• GA > Chartbeat = concurrent users
are high, but not staying long
• GA < Chartbeat = most users stay
longer than 5 mins.
Jan 13, 2017 Chanpreet 27
28. Different tools different data – GA vs
ComScore
Competition reporting
• ComScore
• Eliminates/ignores machine generated traffic
• Measures people
• Eliminates visits from multiple browsers,
multiple screens
• Extrapolates numbers based on a panel users
• 3 second rule
Jan 13, 2017 Chanpreet 28
29. Limitations within the tool – GA vs GA
GA Sampling
• GA sampling and limitations
• Standard vs Ad-Hoc reports
• GA Standard vs Premium
• High-cardinality dimensions
• Report Query limit - 1 million rows of data
• Data limits - single day/ multi day processed tables
• Multi channel funnel report > 1 million conversions
• Flow visualization report more than 100k sessions
Jan 13, 2017 Chanpreet 29
30. Part 3: How do we
work with this kind
of data?
Jan 13, 2017 Chanpreet 30
31. Best attempt to ensure better data
Plan> Implement> configure> report> Interpret
• Coding
• Admin Panel
• Product
• Interface usage and reporting
• Interpretation
Jan 13, 2017 Chanpreet 31
32. Ensuring correct implementation - Code
Coding
• Updated version (UA)
• Correct UAID
• Correct domain/ multi domain coding
• Ensure code is not Commented out
• Check for code Breakages
• Customizations (session duration, session end)
• Placement of code
• Duplication of code
• Conflict with another code (other tool), events
Jan 13, 2017 Chanpreet 32
34. Ensuring correct implementation -
Product
Product Sanity (Website/ Mobile Apps)
• URL/ Screen pattern and duplications
• UTM tagging
• Conflict with tech and tools
• Version control
• Audience platforms - Desktop vs Mobile
• Cache management/ deletion/
exclusions
• Bots/ malware
• Geo location of audience and network
• Firewalls
Jan 13, 2017 Chanpreet 34
35. Ensuring correct Reporting
Interface usage and reporting
• Report/ data pull/ interpretation
• Sampling
• Time period/ timelines
• Data influencing Events
• Data Cleansing - Incomplete/ inaccurate data
• Interpretations Differ
• Blind to our own biases
• Critical points that might influence another outcome
• Working with Limitations
Jan 13, 2017 Chanpreet 35
36. Same data – different interpretation
Interpretation matters
• Same data can yield wildly different
results
• Redundant data stored in multiple
systems and in different formats
• Data transformation is also important
Jan 13, 2017 Chanpreet 36
38. Thank you
Reach me twitter/@chanpreets or
email: chanpreet.mus@gmail.com
Jan 13, 2017 Chanpreet 38
Editor's Notes
The layout engine is the part of the browser that actually interprets the HTML and draws stuff on the screen
It handles the execution of Javascript code and provides the document object model and event models that the Javascript code interacts with. There are several different layout engines around, and most of them are used in many different browsers.
&lt;noscript&gt; tag logic which will still capture minimal data
The &lt;noscript&gt; tag defines an alternate content for users that have disabled scripts in their browser or have a browser that doesn&apos;t support script.
Some mobile phones do not have the technical capabilities to be tracked by web analytics programs
Series of checks and balances, beyond simply quality control, that tests data for accuracy
Standardized metrics and methodology, consistency of process, and transparency of results
No tracking code no data/ less then optimal implementation – leading to poor and misleading data quality
HTML programmers forget to embed tracking code
JS system can slow down pages if the tracking server is bogged down - so some developers may exclude/ remove it
No tracking code no data/ less then optimal implementation – leading to poor and misleading data quality
HTML programmers forget to embed tracking code
JS system can slow down pages if the tracking server is bogged down - so some developers may exclude/ remove it
Users browser preferences set to prevent Analytics codes from firing
instead of cookies
panel users tracked via plugins, beacons or tracking cookies
While many companies realize the importance of normalizing the data to minimize data redundancy, transformation is also important