Why Analytics and to what level - By Novoniel Deb
It's a very simple representation of how the benefits of Analytics can affect the business growth, even though the business is performing without it.
See my other articles attached in my profile as well.
Big Data LDN 2018: FIGHTING DATA CHAOS: CONNECTING USERS TO DATA AT SCALE - Matt Stubbs
Date: 13th November 2018
Location: Self-Service Analytics
Theatre Time: 12:30 - 13:00
Speaker: Joel McKelvey
Organisation: Looker
About: Companies that use data well are more efficient, effective, and profitable. Unfortunately, most organizations struggle to keep up with the changing supply of data — and the growing business demands for that data. The key is to connect data supply to data users in a way that scales, supports existing workflows, and serves as a foundation for the future.
This session will explore how to bring data to users where and when they need it without sacrificing data governance or unified metrics. This session will also present proven ways to build a data foundation for your organisation that can support future changes in both data supply and data demand.
Specifically, attendees will discover:
• The key considerations to driving the most value from data, including: self-service, governance, custom interfaces, modeling, and connections to existing business systems.
• How to provide users access to data in a way that naturally fits in their existing workflows and allows users to take immediate action.
• How companies like Deliveroo and King extract critical business insights from growing data and deliver those insights to their business users.
Six steps to leveraging location for the Canadian insurance industry - DMTI Spatial
The key to minimizing your risk and improving your profitability lies in leveraging location throughout the value chain. There are 6 steps to leveraging high precision location throughout your various business processes and systems to reduce risk, increase profitability and improve customer, agent, broker and underwriter satisfaction.
Frank Bien, CEO of Looker, along with Amazon, Google and other data disrupters, discusses how innovators are deeply integrating analytics into every aspect of their businesses, from mobile to warehouse to cloud.
Frank shares Looker’s vision for the future of business intelligence and data analytics and reveals pivotal product and partnership updates.
Data Stack Considerations: Build vs. Buy at Tout - Looker
In this webinar we see why and how Tout – a fast-growing video platform – made the jump to Looker and Treasure Data, and solved their data stack issues.
Slides cover:
-The pitfalls of in-house solutions
-The importance of companywide access to all data
-The best way to optimize your data pipeline
-The facts behind buying vs. building analytics infrastructure
Hear Tomasz Tunguz and Frank Bien discuss their new book, Winning with Data, and offer their unique perspective to help you:
- Understand the positive impact a data culture can have on your company
- Utilize data to optimize every aspect of your business
- Learn how other companies are getting more from their data
"Data Informed vs Data Driven" by Casper Sermsuksan (Kulina) - Tech in Asia ID
Casper is currently the Head of Product & Growth at Kulina, an online food subscription service in Jakarta. Casper is responsible for driving product management and growth initiatives as well as leading marketing efforts. Previously, he led the product marketing teams at Product Madness in San Francisco. During his tenure at Product Madness, he helped the company's top app, Heart of Vegas, achieve a record $200M in annual revenue. Outside of his day-to-day work, he advises corporations and startups on product and growth, and writes frequently on Startup Grind, Mind the Product & Muzli. He graduated with a business degree from the University of Southern California in Los Angeles.
***
This slide was shared at Tech in Asia Product Development Conference 2017 (PDC'17) on 9-10 August 2017.
Get more insightful updates from TIA by subscribing at techin.asia/updateselalu
Infotools transforms your consumer research data into a marketing knowledge bank so you can investigate, visualize and tell stories from your data. We work hard to give you space to think.
Panelists from a large company, a small company and a software consulting firm will share insights on how their companies are tackling the arena of Big Data and how to leverage a variety of data sources for strategic decision-making.
Enterprise Data World Webinars: Metadata Management – Getting Off On The Righ... - DATAVERSITY
One of the great things about EDW is that each year new people join the Data Management community, and bring new perspectives to the challenges we all work with. At the same time, there are some questions that are re-raised every year, as the pool of talent is refreshed! One is "How should I start with metadata management?"
Initial metadata management projects often founder because they are wrongly scoped, wrongly structured, poorly justified or based on unverified assumptions. Drawing on many years of experience, Ian Rowlands, ASG's VP of Product Management for metadata solutions, will walk through the areas that have to be considered to establish an environment for metadata success.
The focus will be on:
Justifying metadata management
Managing your stakeholders
Scoping a project
Marketing your solution
Managing for sustainability
Working with vendors
This webinar will preview the upcoming talk at Enterprise Data World 2014 Conference & Expo.
“Don’t worry about people stealing an idea. If it’s original, you will have to ram it down their throats.” Howard Aiken, Founder of Harvard’s Computing Science Program.
Data is moving so fast these days, and there is a shift whereby people are paying for value, not technology. This is where cloud computing comes in: it is very empowering, because anyone with an internet connection can access it. With Power BI in the cloud, small businesses are liberated with the ability to use the same tools and techniques to explore ideas as larger organisations.
In this session, we will look at the Power BI components and tools available in the cloud, including the Power BI Admin Center, Power Query, Power Pivot, Power View and Power Map. We will look at how using them can accelerate ideas and help to clarify decisions and, related to this, discuss the roles within IT and the business in relation to these tools. We will also look at business puzzles versus business mysteries, a definition evoked by Malcolm Gladwell (Blink, Outliers), in relation to Power BI.
“Out there in some garage is an entrepreneur who’s forging a bullet with your company’s name on it,” said Gary Hamel, a management guru. With Power BI, let’s see how you can translate your ideas into a message that people can see, using cloud as an empowerment tool.
This presentation highlights the factors that are critical for the success of a Data Analytics initiative. Questions like how one should go about analyzing data and why data analytics initiatives go wrong are answered in this presentation.
Product Deep Dive: Getting the Most out of Scout Reports - Scout RFP
Data is everywhere, but what sets Scout customers apart is that they know what to do with it. Through Scout Reports, you can unlock the ability to understand, inform, and support your recommendations to the business with defensible data and produce a hard-hitting impact to the bottom line. And you’ll help your team become more data-aware in the process.
7 Pillars Of Data Strategy - High Five 2018 - Evan Levy
Data and analytics companies are seemingly everywhere, many claiming to be panaceas for all your data woes. As much as we like to rely on technology, the human factor is still the most important part of the equation. Clear strategy and focused leadership are required to make transformative, meaningful change in data culture, influencing capture, analysis, and presentation of your company’s data.
Help Me, Help You: Supporting Your Data - Data Con LA
Data Con LA 2020
Description
Understand the data product lifecycle and ensure your data is set up for success
In order to get the most out of your data team, understanding the infrastructure needs at every step of the data product lifecycle is imperative. In my presentation we'll cover:
- Collect the Right Data: Collect what you want in the future not where you are now
- Silo to Warehouse: Consolidating disparate data sources and establish source of truth
- Setting Your Team Up for Success: Development Platform and DataOps
- Don't Forget to A.I.M. - Thinking about product adoption, implementation, and monitoring
- So What? - Tracking impact and making the case for more data
Speaker
Kisa Brostrom, boodleAI, Vice President of Data
With the expertise of our CEO, we've put together a webinar about MVP readiness. If you're low on time, budget, and resources, build a lean solution. A minimum viable product has enough design and development to launch within a shorter time frame. Not only do you save time and money, you'll be able to make iterations and versions post-launch.
See how to prepare for an MVP with Ali Allage, the CEO of Boost Labs.
For more about MVPs, contact us!
Data is the lifeblood of just about every organization and functional area today. As businesses struggle to come to grips with the data flood, it is even more critical to focus on data as an asset that directly supports business imperatives as other organizational assets do. Organizations across most industries attempt to address data opportunities (e.g. Big Data) and data challenges (e.g. data quality) to enhance business unit performance. Unfortunately however, the results of these efforts frequently fall far below expectations due to haphazard approaches. Overall, poor organizational data management capabilities are the root cause of many of these failures. This webinar covers three lessons (illustrated by examples), which will help you to establish realistic OM plans and expectations, and help demonstrate the value of such actions to both internal and external decision makers.
Takeaways:
Organizational thinking must change: Value-added data management practices must be considered and included as a vital part of your business strategy.
Walk before you run with data focused initiatives: Understand and implement necessary data management prerequisites as a foundation, then build upon that foundation.
There are no silver bullets: Tools alone are not the answer. Specifying business requirements, business practices and data governance are almost always more important.
Product-thinking is making a big impact in the data world with the rise of Data Products, Data Product Managers, data mesh, and treating “Data as a Product.” But Honest, No-BS: What is a Data Product? And what key questions should we ask ourselves while developing them? Tim Gasper (VP of Product, data.world), will walk through the Data Product ABCs as a way to make treating data as a product way simpler: Accountability, Boundaries, Contracts and Expectations, Downstream Consumers, and Explicit Knowledge.
In order to deal with customers expecting a seamless omnichannel experience, increased regulations and speed with which innovative fin-techs enter the market, ING has formulated a customer centric strategy based on data and analytics.
Last year we talked about how ING developed a new architecture, the ING Data Lake, and how, in parallel, the Big Data paradigm based on Hadoop appeared within ING and was mapped onto the Data Lake architecture to make sure Hadoop is leveraged to the maximum.
This year we want to tell you how the international working group helped realize the advanced analytic pattern on the ING private cloud, without prior management approval.
This presentation will discuss the community strategy, how to stay under the radar, how to surface when actual content is strong enough to force change, open issues and the private cloud challenges ING is dealing with. Join us in this ride from community idea through architecture to private cloud implementation with some organizational challenges along the way.
Data is the lifeblood of just about every organization and functional area today. As businesses struggle to cope with the data flood, it is even more critical to focus on data as an asset that directly supports business imperatives. Organizations across most industries attempt to address data opportunities (e.g. Big Data) and data challenges (e.g. data quality) to enhance business unit performance. Unfortunately, the results of these efforts frequently fall far below expectations due to haphazard approaches. Overall, poor organizational data management capabilities are the root cause of many of these failures. This webinar covers three lessons (illustrated by examples), which will help you to establish realistic expectations, and help demonstrate the value of this process to both internal and external decision makers.
Northern New England Tableau User Group (TUG) May 2024 - patrickdtherriault
Join us live in Portland or over the wire for networking and two fantastic presentations! Data viz freelancer Desireé Abbott will demonstrate how adding interactivity to your dashboards will delight and spark curiosity in your users. Then, Charlotte Taft & Laurie Rugemer will reprise their TC24 presentation on the keys to building a successful analytics team.
Northern New England TUG May 2024 - Abbott, Taft, Rugemer - patrickdtherriault
Join us live in Portland or over the wire for networking and two fantastic presentations! Data viz freelancer Desireé Abbott will demonstrate how adding interactivity to your dashboards will delight and spark curiosity in your users. Then, Charlotte Taft & Laurie Rugemer will reprise their TC24 presentation on the keys to building a successful analytics team.
Building trust and leaving comfort zones in organization... - Data IQ Argentina
How Sanofi built business trust and left its comfort zones, and how, by generating greater trust through Analytics and an understanding of the business data, its salespeople became the best business analysts.
The Support Job That Doesn’t Suck: Why Taking Care of Your People is Your Tic... - Empower by Guru
Looker is one of the fastest growing SaaS companies in recent history. Throughout that breakneck growth, there’s been one constant: elevating the support function so that it is seen as more than just “the end of the line”. From early stages, to the awkward teenage years, to building a full blown global support team, Margaret will break down the three keys to success at each stage of scale: hiring, org structure, and data.
Speaker: Margaret Rosas, VP of Customer Care at Looker
How Guided Data Discovery Leads Users to Better Data Insights - Denodo
Watch full webinar here: https://bit.ly/3t5GUgr
The road to better data insights depends not only on data accessibility but, even more, on a guided approach to data discovery. The data landscape is vast and has become even harder to navigate. The ability to access your data is no longer sufficient; we need systems to guide us to the right data and, more importantly, to enable us to ask better questions. We rely on guiding principles in our daily lives when we shop online, watch movies, and much more. The same concepts apply to our enterprise data. Join our session to learn how Denodo can provide the guidance and guardrails for better data insights.
Data is the lifeblood of just about every organization and functional area today. As businesses struggle to come to grips with the data flood, it is even more critical to focus on data as an asset that directly supports business imperatives as other organizational assets do. Organizations across most industries attempt to address data opportunities (e.g. Big Data) and data challenges (e.g. data quality) to enhance business unit performance. Unfortunately however, the results of these efforts frequently fall far below expectations due to haphazard approaches. Overall, poor organizational data management capabilities are the root cause of many of these failures. This webinar covers three lessons (illustrated by examples), which will help you to establish realistic OM plans and expectations, and help demonstrate the value of such actions to both internal and external decision makers.
Check out more of our webinars here: http://www.datablueprint.com/resource-center/webinar-schedule/
In this presentation, Expert Decisions Inc. highlights the latest in product innovation and how software tools can help optimize the planning and delivery processes.
Ironside's VP of Strategy & Innovation, Greg Bonnette, delivered a presentation on "How to Build a Winning Strategy for Data & Analytics" to provide a framework for data-driven decision making.
Adjusting OpenMP PageRank : SHORT REPORT / NOTES - Subhajit Sahu
For massive graphs that fit in RAM, but not in GPU memory, it is possible to take advantage of a shared memory system with multiple CPUs, each with multiple cores, to accelerate pagerank computation. If the NUMA architecture of the system is properly taken into account with good vertex partitioning, the speedup can be significant. To take steps in this direction, experiments are conducted to implement pagerank in OpenMP using two different approaches, uniform and hybrid. The uniform approach runs all primitives required for pagerank in OpenMP mode (with multiple threads). On the other hand, the hybrid approach runs certain primitives in sequential mode (i.e., sumAt, multiply).
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
Round table discussion of vector databases, unstructured data, AI, big data, real-time, robots and Milvus.
A lively discussion with NJ Gen AI Meetup Lead, Prasad and Procure.FYI's Co-Found
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23... - John Andrews
SlideShare Description for "Chatty Kathy - UNC Bootcamp Final Project Presentation"
Title: Chatty Kathy: Enhancing Physical Activity Among Older Adults
Description:
Discover how Chatty Kathy, an innovative project developed at the UNC Bootcamp, aims to tackle the challenge of low physical activity among older adults. Our AI-driven solution uses peer interaction to boost and sustain exercise levels, significantly improving health outcomes. This presentation covers our problem statement, the rationale behind Chatty Kathy, synthetic data and persona creation, model performance metrics, a visual demonstration of the project, and potential future developments. Join us for an insightful Q&A session to explore the potential of this groundbreaking project.
Project Team: Jay Requarth, Jana Avery, John Andrews, Dr. Dick Davis II, Nee Buntoum, Nam Yeongjin & Mat Nicholas
Learn SQL from basic queries to Advance queries - manishkhaire30
Dive into the world of data analysis with our comprehensive guide on mastering SQL! This presentation offers a practical approach to learning SQL, focusing on real-world applications and hands-on practice. Whether you're a beginner or looking to sharpen your skills, this guide provides the tools you need to extract, analyze, and interpret data effectively.
Key Highlights:
Foundations of SQL: Understand the basics of SQL, including data retrieval, filtering, and aggregation.
Advanced Queries: Learn to craft complex queries to uncover deep insights from your data.
Data Trends and Patterns: Discover how to identify and interpret trends and patterns in your datasets.
Practical Examples: Follow step-by-step examples to apply SQL techniques in real-world scenarios.
Actionable Insights: Gain the skills to derive actionable insights that drive informed decision-making.
Join us on this journey to enhance your data analysis capabilities and unlock the full potential of SQL. Perfect for data enthusiasts, analysts, and anyone eager to harness the power of data!
#DataAnalysis #SQL #LearningSQL #DataInsights #DataScience #Analytics
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
Discussion on Vector Databases, Unstructured Data and AI
https://www.meetup.com/unstructured-data-meetup-new-york/
This meetup is for people working in unstructured data. Speakers will come present about related topics such as vector databases, LLMs, and managing data at scale. The intended audience of this group includes roles like machine learning engineers, data scientists, data engineers, software engineers, and PMs. This meetup was formerly Milvus Meetup, and is sponsored by Zilliz, maintainers of Milvus.
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake - Walaa Eldin Moustafa
Dynamic policy enforcement is becoming an increasingly important topic in today’s world where data privacy and compliance are a top priority for companies, individuals, and regulators alike. In these slides, we discuss how LinkedIn implements a powerful dynamic policy enforcement engine, called ViewShift, and integrates it within its data lake. We show the query engine architecture and how catalog implementations can automatically route table resolutions to compliance-enforcing SQL views. Such views have a set of very interesting properties: (1) They are auto-generated from declarative data annotations. (2) They respect user-level consent and preferences. (3) They are context-aware, encoding a different set of transformations for different use cases. (4) They are portable; while the SQL logic is only implemented in one SQL dialect, it is accessible in all engines.
#SQL #Views #Privacy #Compliance #DataLake
Analysis insight about a Flyball dog competition team's performance - roli9797
Insights from my analysis of a Flyball dog competition team's performance over the last year. Find more: https://github.com/rolandnagy-ds/flyball_race_analysis/tree/main
2. Agenda
• Data at Hulu
• What about CONTENT Data?
• Use Cases
• Engineering Challenges
• What about SNOWFLAKE?
• How is it helping us?
• How is it helping our stakeholders?
• Recommendations in Snowflake
4. 10 Million Events/Minute
It all starts with the viewer...
35 PB of Data
>15M Unique Devices per Day
Data Marts
Analytics
Third Party
Targeting
Metrics
Data Apps
5. Data at Hulu
by user
by day
by hour
by device
by state
by Hulu product
by content
7. Content Acquisition
How has the overall performance of my content partners been?
How many users are engaging with Kids content?
What other genres are Anime users interested in?
What is the expected performance of this new title?
How many users first engaged with any sports network within their first day of signup?
8. Content Marketing
Was my content promoted to the right people?
Why did my content not perform the way I expected it to?
Was the content efficient in attracting new subscribers to the service?
What is the average age of users watching The Handmaid’s Tale?
9. Content Finance & Planning
Did I pay the right price for this season?
Should I buy more of this content or am I saturated?
What is the average cost per hour of dramas?
10. Engineering Challenges
Data Analysts
Data Scientists
Data Engineers
Data Modeling Challenges
Data Availability Challenges
DATA GOVERNANCE
DATA QUALITY
FIND OPTIMAL AND CREATIVE SOLUTIONS SO OUR DATA PIPELINES SCALE ALONG WITH THE GROWTH OF OUR DATA
11. Recommendations to overcome Data Modeling challenges
● Data Profiling
● Understand our data
● Partition/Clustering
● Consistency in our designs
● Flexibility vs Performance
● Capacity planning
● Data Governance
12. Recommendations to overcome Data Availability challenges
● Technology Selection
● Be Data Driven
● Data Quality
● Query plans
14. How is Snowflake helping us
● Transactional capabilities
● Data Warehouse principles
● Parallel Execution
● Multi-dimensional cubes
15. How is Snowflake helping our Stakeholders
● PARTNERS
○ Getting their data on time
○ Taking advantage of Snowflake’s Data Sharing capability (future)
● HULUGANS
○ We know there’s a cost associated to it, but Snowflake allows our Analysts to generate insights in the blink of an eye
○ Centralized place to get data
Snowflake is Reliable
16. Recommendations in Snowflake
● Cluster your tables
○ When creating big tables
○ Check Query Profile for Pruning
● Pick the right warehouse
○ Query Time: warehouse size
○ Concurrency: Clusters
○ As you need
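The "Check Query Profile for Pruning" point can be read as comparing how many micro-partitions a query scanned against how many the table holds. The following is only a rough Python sketch of that comparison, not something taken from the deck: the function names, the 0.5 threshold and the sample numbers are hypothetical, and the two inputs are assumed to be the "partitions scanned" and "partitions total" figures read off the Query Profile by hand.

```python
def pruned_fraction(partitions_scanned: int, partitions_total: int) -> float:
    """Return the share of micro-partitions the query skipped (0.0 to 1.0)."""
    if partitions_total <= 0:
        # No partitions reported: treat as nothing pruned rather than divide by zero.
        return 0.0
    return 1.0 - partitions_scanned / partitions_total


def needs_clustering_review(partitions_scanned: int,
                            partitions_total: int,
                            threshold: float = 0.5) -> bool:
    """Flag a query whose pruned share is below a (hypothetical) threshold."""
    return pruned_fraction(partitions_scanned, partitions_total) < threshold


if __name__ == "__main__":
    # Hypothetical figures, as if copied from two Query Profiles.
    print(pruned_fraction(25, 100))          # query that skipped most partitions
    print(needs_clustering_review(90, 100))  # query that scanned nearly everything
```

A query on a big table that is flagged here is a candidate for the "Cluster your tables" recommendation; the threshold is a judgment call, not a Snowflake default.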
17. Recommendations in Snowflake (cont’d)
● Make use of Query History
○ Duration: Wait vs Execution
○ Bytes scanned: Local vs Remote
● Leverage query_history table
○ snowflake.account_usage.query_history
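One possible way to act on the Query History points is to split each query's duration into wait and execution, and its bytes scanned into local and remote. The sketch below is illustrative Python, not the presenters' tooling. It assumes rows have already been fetched from snowflake.account_usage.query_history with the columns named in the SQL string, and it treats percentage_scanned_from_cache as the "local" share, which is an interpretation of the slide rather than something it states. The class name, helper names, thresholds and sample rows are all hypothetical.

```python
from dataclasses import dataclass
from typing import List, Tuple

# Query text only; running it requires a Snowflake connection, which this sketch omits.
QUERY_HISTORY_SQL = """
SELECT query_id,
       queued_overload_time + queued_provisioning_time AS queued_ms,
       execution_time AS execution_ms,
       bytes_scanned,
       percentage_scanned_from_cache AS cache_fraction
FROM snowflake.account_usage.query_history
WHERE start_time >= DATEADD(day, -7, CURRENT_TIMESTAMP())
"""


@dataclass
class QueryStats:
    query_id: str
    queued_ms: int         # time spent waiting for warehouse capacity
    execution_ms: int      # time spent actually executing
    bytes_scanned: int
    cache_fraction: float  # assumed range 0.0 to 1.0, share read from local cache


def wait_share(q: QueryStats) -> float:
    """Fraction of (queued + execution) time that was spent waiting."""
    total = q.queued_ms + q.execution_ms
    return q.queued_ms / total if total else 0.0


def local_remote_bytes(q: QueryStats) -> Tuple[int, int]:
    """Split bytes scanned into (local cache, remote storage)."""
    local = int(q.bytes_scanned * q.cache_fraction)
    return local, q.bytes_scanned - local


def diagnose(q: QueryStats,
             wait_threshold: float = 0.5,
             remote_threshold: float = 0.8) -> List[str]:
    """Return hints that mirror the deck's warehouse and clustering advice."""
    hints = []
    if wait_share(q) > wait_threshold:
        hints.append("mostly waiting: a concurrency issue, consider more clusters")
    _, remote = local_remote_bytes(q)
    if q.bytes_scanned and remote / q.bytes_scanned > remote_threshold:
        hints.append("mostly remote reads: check pruning and clustering")
    return hints


if __name__ == "__main__":
    # Hypothetical rows standing in for real query history results.
    for row in (QueryStats("q1", 9000, 1000, 1000, 0.0),
                QueryStats("q2", 0, 1000, 1000, 0.25)):
        print(row.query_id, diagnose(row))
```

Keeping the SQL separate from the analysis means the same helpers can be applied to figures copied by hand from the Query History page.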