SlideShare a Scribd company logo
Praktisi Mengajar:
Workflow Analysis
Study:
Basic Database
Workflow Analysis
Business
Understanding
Business
Understanding
Business Understanding
Why Needs Business Understanding?
GOALS
• Memilih variabel kunci yang menjadi target
model yang kita bangun serta metrik-metriknya
adalah indikator kesuksesan proyek kita
• Mengidentifikasi sumber data relevan yang
aksesnya tersedia / perlu didapatkan aksesnya
Intro
Step
Suggest Recommendation
Rekomendasi yang diberikan
berdasarkan hasil analisis
Get Insight
Mengambil kesimpulan dari
informasi yang didapatkan
Define Business Problem
Memahami permasalahan
yang terjadi
Problem Discovery
Memilih metrics yang dapat
menyelesaikan permasalahan
Review Case
Brazilian E-Commerce Olist
● Business Problem:
○ Bagaimana kondisi e-commerce olist?
● Problem Discovery:
○ GMV: Gross Merchant Value
○ Total Order
○ Total Buyers
○ Retention Rate, etc
● Insight?
● Recommendation?
Review Case
Design Data Mart
Table Order
order_id
order_date
customer_id
order_status
Table Order Items
order_id
product_id
price
freight_value
Table Customer
customer_id
customer_city
Metrics Transaction
order_id
total_product
total_price
total_logistic _cost
Transform
Data Mart Transaction
order_id
order_date
customer_id
order_status
customer_city
total_product
total_price
total_logistic_cost
Join
Join
Why Data Mart?
● Gather Information
● Efficient
● Support to explore
data
Review Case
Query Data Mart
WITH
metrics_transaction AS (
SELECT
order_id,
COUNT(DISTINCT product_id) total_product,
SUM(price) total_product_price,
SUM(freight_value) total_logistic_cost
FROM
`database-385606.Latihan1.order_items`-- custom with your dataset
GROUP BY
1 )
SELECT
a.*,
c.customer_city,
c.customer_unique_id,
COALESCE(b.total_product,0) total_product,
COALESCE(b.total_product_price,0) total_product_price,
COALESCE(total_logistic_cost,0) total_logistic_cost
FROM
`database-385606.Latihan1.order` a -- custom with your dataset
LEFT JOIN
metrics_transaction b
ON
a.order_id=b.order_id
LEFT JOIN (
SELECT
DISTINCT customer_id,customer_unique_id
customer_city
FROM
`database-385606.Latihan1.cust`) c -- custom with your dataset
ON
a.customer_id=c.customer_id
CTE
Sub Query
Review Case
Create Data Mart
● View Table:
○ Table yang dibentuk menggunakan query dan
membutuhkan waktu dalam running query untuk
menampilkan result
● Fact Table:
○ Table yang dibentuk menggunakan query dan hasil akan
ditampilkan berupa tabel fisik
Review Case
View Table
1. Click Save > Save view
2. Setup Project and Dataset then fill name of table
3. Save
1
2
3
Review Case
Fact Table
1. Add query with:
create or replace table loyal-surfer-321414.Testing.data_mart
as
1. Run
Review Case
Hashing
Proses menghasilkan fixed-size output, dari variable-sized input yang dilakukan melalui
penggunaan rumus matematika yang dikenal sebagai hash function. Setiap aset kripto
menggunakan berbagai algoritma hashing yang berbeda untuk membuat berbagai jenis kode
hash – algoritma ini bertugas untuk menghasilkan alfanumerik acak
Contoh:
reference
Review Case
Step 1 - Check Definition
(Not Yet Metadata)
SELECT
order_status,
MAX(order_purchase_timestamp)
purchase_date,
MAX(order_approved_at) approve_date,
MAX(order_delivered_carrier_date)
pickup_date_by_courir,
MAX(order_delivered_customer_date)
delivered_date
FROM
`loyal-surfer-
321414.Testing.data_mart_purchase` --
custom with your dataset
GROUP BY
1
Metadata:
data that provides information about other data but not the content of the data, such as the text
of a message or the image itself. There are many distinct types of metadata,
including: Descriptive metadata – the descriptive information about a resource. It is used for
discovery and identification.
Review Case
Step 2 - Check Basic Statistics
SELECT
min(partition_columns) min_date
,max(partition_columns) max_date
,count(distinct order_id) total_order
,count(distinct customer_unique_id) total_buyers
,round(avg(total_product),2) avg_products
,round(avg(total_product_price),2) avg_gmv
,approx_quantiles(total_product_price, 100)[offset(50)] median_gmv
,STDDEV(total_product_price) std_gmv
,round(avg(total_logistic_cost),2) avg_logistic_cost
,approx_quantiles(total_logistic_cost, 100)[offset(50)] median_logistic_cost
,STDDEV(total_logistic_cost) std_logistic_cost
FROM
`loyal-surfer-321414.Testing.data_mart_purchase`
where order_status='delivered'
statistics values
min_date 2016-09-15
max_date 2018-08-29
total_order 96478
total_buyers 93358
avg_products 1.04
avg_gmv 137.04
median_gmv 86.57
std_gmv 209.05
avg_logistic_cost 22.79
median_logistic_cost 17.17
std_logistic_cost 21.56
Review Case
Step 2 - Check Basic Statistics
SELECT
FORMAT_DATETIME("%Y-%m-01", partition_columns) month_key
,sum(total_product_price) total_gmv
FROM
`loyal-surfer-321414.Testing.data_mart_purchase`
where order_status='delivered'
group by 1
order by 1
month_key total_gmv
2016-09-01 135
2016-10-01 40325
2016-12-01 11
2017-01-01 111798
2017-02-01 234223
2017-03-01 359199
2017-04-01 340670
2017-05-01 489338
2017-06-01 421923
2017-07-01 481605
2017-08-01 554700
2017-09-01 607400
2017-10-01 648248
2017-11-01 987765
2017-12-01 726033
Review Case
Hands On
Step 3 - Deep Dive with our assumptions
What Should we do?
Review Case
Hands On
Step 3 - Deep Dive with our assumptions
WITH
summary_boxplot AS (
SELECT
APPROX_QUANTILES(total_product_price, 4)[
OFFSET
(2)] q2_gmv,
APPROX_QUANTILES(total_product_price, 4)[
OFFSET
(3)] - APPROX_QUANTILES(total_product_price, 4)[
OFFSET
(1)] iqr
FROM
`loyal-surfer-321414.Testing.data_mart_purchase_remake`
WHERE
order_status='delivered' ),
summary_batas AS (
SELECT
q2_gmv+1.5*iqr batas_atas,
q2_gmv-1.5*iqr batas_bawah
FROM
summary_boxplot ),
fact_tagging_anomaly AS (
SELECT
*,
Review Case
Hands On
Step 3 - Deep Dive with our assumptions
Review Case
Hands On
Step 3 - Bonus Retention Rate with Cohort
Analysis
Query Cohort Analysis
reference
Review Case
Hands On
Step 4 - Summary
1. …
2. …
3. …
4. …
Review Case
Hands On
Coba
1. Lengkapi slide 17 untuk bikin 3 kelompok berdasarkan pendekatan boxplot (didalam
whisker pembelanjaan normal, pembelanjaan outlier dan pembelanjaan extreme)
2. Bikin summary estimasi interval untuk rata-rata gmv dan kasih tagging mana transaksi
yang diluar LCL dan UCL
3. Bandingkan taggingan menggunakan boxplot dan pendekatan distribusi normal,
simpulkan hasilnya
4. Lengkapi slide 20 untuk summary yang bisa diambil dari data olist
Praktisi Mengajar:
Workflow Analysis
Study:
Basic Database

More Related Content

Similar to Praktisi Mengajar - Workflow Analysis.pptx

Introduction to Dimesional Modelling
Introduction to Dimesional ModellingIntroduction to Dimesional Modelling
Introduction to Dimesional Modelling
Ashish Chandwani
 
Database Management Systems Lab manual (KR20) CSE.pdf
Database Management Systems Lab manual (KR20) CSE.pdfDatabase Management Systems Lab manual (KR20) CSE.pdf
Database Management Systems Lab manual (KR20) CSE.pdf
Anvesh71
 
111Assignment Learning ObjectivesBSIS 105Assignment 3Purc.docx
111Assignment Learning ObjectivesBSIS 105Assignment 3Purc.docx111Assignment Learning ObjectivesBSIS 105Assignment 3Purc.docx
111Assignment Learning ObjectivesBSIS 105Assignment 3Purc.docx
hyacinthshackley2629
 
Introduction to Data Warehouse
Introduction to Data WarehouseIntroduction to Data Warehouse
Introduction to Data Warehouse
Shanthi Mukkavilli
 
SSAS R2 and SharePoint 2010 – Business Intelligence
SSAS R2 and SharePoint 2010 – Business IntelligenceSSAS R2 and SharePoint 2010 – Business Intelligence
SSAS R2 and SharePoint 2010 – Business Intelligence
Slava Kokaev
 
Modelado Dimensional 4 Etapas
Modelado Dimensional 4 EtapasModelado Dimensional 4 Etapas
Modelado Dimensional 4 Etapas
Roberto Espinosa
 

Similar to Praktisi Mengajar - Workflow Analysis.pptx (20)

Data warehouse logical design
Data warehouse logical designData warehouse logical design
Data warehouse logical design
 
Introduction to Dimesional Modelling
Introduction to Dimesional ModellingIntroduction to Dimesional Modelling
Introduction to Dimesional Modelling
 
Data Analytics Business Intelligence
Data Analytics Business IntelligenceData Analytics Business Intelligence
Data Analytics Business Intelligence
 
Enhancing Dashboard Visuals with Multi-Dimensional Expressions (MDX)
Enhancing Dashboard Visuals with Multi-Dimensional Expressions (MDX)Enhancing Dashboard Visuals with Multi-Dimensional Expressions (MDX)
Enhancing Dashboard Visuals with Multi-Dimensional Expressions (MDX)
 
Data Warehousing and Data Mining
Data Warehousing and Data MiningData Warehousing and Data Mining
Data Warehousing and Data Mining
 
Modelado Dimensional 4 etapas.ppt
Modelado Dimensional 4 etapas.pptModelado Dimensional 4 etapas.ppt
Modelado Dimensional 4 etapas.ppt
 
SAP MM - Master Data.pptx
SAP MM - Master Data.pptxSAP MM - Master Data.pptx
SAP MM - Master Data.pptx
 
Database Management Systems Lab manual (KR20) CSE.pdf
Database Management Systems Lab manual (KR20) CSE.pdfDatabase Management Systems Lab manual (KR20) CSE.pdf
Database Management Systems Lab manual (KR20) CSE.pdf
 
INFORMATICA EASY LEARNING ONLINE TRAINING
INFORMATICA EASY LEARNING ONLINE TRAININGINFORMATICA EASY LEARNING ONLINE TRAINING
INFORMATICA EASY LEARNING ONLINE TRAINING
 
111Assignment Learning ObjectivesBSIS 105Assignment 3Purc.docx
111Assignment Learning ObjectivesBSIS 105Assignment 3Purc.docx111Assignment Learning ObjectivesBSIS 105Assignment 3Purc.docx
111Assignment Learning ObjectivesBSIS 105Assignment 3Purc.docx
 
(Lecture 4)Slowly Changing Dimensions.pdf
(Lecture 4)Slowly Changing Dimensions.pdf(Lecture 4)Slowly Changing Dimensions.pdf
(Lecture 4)Slowly Changing Dimensions.pdf
 
Master data in mm
Master data in mm Master data in mm
Master data in mm
 
Business Intelligence and decision support system
Business Intelligence and decision support system Business Intelligence and decision support system
Business Intelligence and decision support system
 
Introduction to Data Warehouse
Introduction to Data WarehouseIntroduction to Data Warehouse
Introduction to Data Warehouse
 
Data Warehousing
Data WarehousingData Warehousing
Data Warehousing
 
SSAS R2 and SharePoint 2010 – Business Intelligence
SSAS R2 and SharePoint 2010 – Business IntelligenceSSAS R2 and SharePoint 2010 – Business Intelligence
SSAS R2 and SharePoint 2010 – Business Intelligence
 
01 intro qa
01 intro qa01 intro qa
01 intro qa
 
Modelado Dimensional 4 Etapas
Modelado Dimensional 4 EtapasModelado Dimensional 4 Etapas
Modelado Dimensional 4 Etapas
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Become BI Architect with 1KEY Agile BI Suite - OLAP
Become BI Architect with 1KEY Agile BI Suite - OLAPBecome BI Architect with 1KEY Agile BI Suite - OLAP
Become BI Architect with 1KEY Agile BI Suite - OLAP
 

Recently uploaded

Industrial Training Report- AKTU Industrial Training Report
Industrial Training Report- AKTU Industrial Training ReportIndustrial Training Report- AKTU Industrial Training Report
Industrial Training Report- AKTU Industrial Training Report
Avinash Rai
 

Recently uploaded (20)

NCERT Solutions Power Sharing Class 10 Notes pdf
NCERT Solutions Power Sharing Class 10 Notes pdfNCERT Solutions Power Sharing Class 10 Notes pdf
NCERT Solutions Power Sharing Class 10 Notes pdf
 
How to Manage Notification Preferences in the Odoo 17
How to Manage Notification Preferences in the Odoo 17How to Manage Notification Preferences in the Odoo 17
How to Manage Notification Preferences in the Odoo 17
 
How to Break the cycle of negative Thoughts
How to Break the cycle of negative ThoughtsHow to Break the cycle of negative Thoughts
How to Break the cycle of negative Thoughts
 
Pragya Champions Chalice 2024 Prelims & Finals Q/A set, General Quiz
Pragya Champions Chalice 2024 Prelims & Finals Q/A set, General QuizPragya Champions Chalice 2024 Prelims & Finals Q/A set, General Quiz
Pragya Champions Chalice 2024 Prelims & Finals Q/A set, General Quiz
 
The Benefits and Challenges of Open Educational Resources
The Benefits and Challenges of Open Educational ResourcesThe Benefits and Challenges of Open Educational Resources
The Benefits and Challenges of Open Educational Resources
 
Industrial Training Report- AKTU Industrial Training Report
Industrial Training Report- AKTU Industrial Training ReportIndustrial Training Report- AKTU Industrial Training Report
Industrial Training Report- AKTU Industrial Training Report
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
 
Incoming and Outgoing Shipments in 2 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 2 STEPS Using Odoo 17Incoming and Outgoing Shipments in 2 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 2 STEPS Using Odoo 17
 
Basic Civil Engg Notes_Chapter-6_Environment Pollution & Engineering
Basic Civil Engg Notes_Chapter-6_Environment Pollution & EngineeringBasic Civil Engg Notes_Chapter-6_Environment Pollution & Engineering
Basic Civil Engg Notes_Chapter-6_Environment Pollution & Engineering
 
Gyanartha SciBizTech Quiz slideshare.pptx
Gyanartha SciBizTech Quiz slideshare.pptxGyanartha SciBizTech Quiz slideshare.pptx
Gyanartha SciBizTech Quiz slideshare.pptx
 
The impact of social media on mental health and well-being has been a topic o...
The impact of social media on mental health and well-being has been a topic o...The impact of social media on mental health and well-being has been a topic o...
The impact of social media on mental health and well-being has been a topic o...
 
[GDSC YCCE] Build with AI Online Presentation
[GDSC YCCE] Build with AI Online Presentation[GDSC YCCE] Build with AI Online Presentation
[GDSC YCCE] Build with AI Online Presentation
 
An Overview of the Odoo 17 Discuss App.pptx
An Overview of the Odoo 17 Discuss App.pptxAn Overview of the Odoo 17 Discuss App.pptx
An Overview of the Odoo 17 Discuss App.pptx
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
Advances in production technology of Grapes.pdf
Advances in production technology of Grapes.pdfAdvances in production technology of Grapes.pdf
Advances in production technology of Grapes.pdf
 
Open Educational Resources Primer PowerPoint
Open Educational Resources Primer PowerPointOpen Educational Resources Primer PowerPoint
Open Educational Resources Primer PowerPoint
 
Morse OER Some Benefits and Challenges.pptx
Morse OER Some Benefits and Challenges.pptxMorse OER Some Benefits and Challenges.pptx
Morse OER Some Benefits and Challenges.pptx
 
Salient features of Environment protection Act 1986.pptx
Salient features of Environment protection Act 1986.pptxSalient features of Environment protection Act 1986.pptx
Salient features of Environment protection Act 1986.pptx
 
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptxStudents, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
 
Jose-Rizal-and-Philippine-Nationalism-National-Symbol-2.pptx
Jose-Rizal-and-Philippine-Nationalism-National-Symbol-2.pptxJose-Rizal-and-Philippine-Nationalism-National-Symbol-2.pptx
Jose-Rizal-and-Philippine-Nationalism-National-Symbol-2.pptx
 

Praktisi Mengajar - Workflow Analysis.pptx

  • 3. Business Understanding Business Understanding Why Needs Business Understanding? GOALS • Memilih variabel kunci yang menjadi target model yang kita bangun serta metrik-metriknya adalah indikator kesuksesan proyek kita • Mengidentifikasi sumber data relevan yang aksesnya tersedia / perlu didapatkan aksesnya
  • 4. Intro Step Suggest Recommendation Rekomendasi yang diberikan berdasarkan hasil analisis Get Insight Mengambil kesimpulan dari informasi yang didapatkan Define Business Problem Memahami permasalahan yang terjadi Problem Discovery Memilih metrics yang dapat menyelesaikan permasalahan
  • 5. Review Case Brazilian E-Commerce Olist ● Business Problem: ○ Bagaimana kondisi e-commerce olist? ● Problem Discovery: ○ GMV: Gross Merchant Value ○ Total Order ○ Total Buyers ○ Retention Rate, etc ● Insight? ● Recommendation?
  • 6. Review Case Design Data Mart Table Order order_id order_date customer_id order_status Table Order Items order_id product_id price freight_value Table Customer customer_id customer_city Metrics Transaction order_id total_product total_price total_logistic _cost Transform Data Mart Transaction order_id order_date customer_id order_status customer_city total_product total_price total_logistic_cost Join Join Why Data Mart? ● Gather Information ● Efficient ● Support to explore data
  • 7. Review Case Query Data Mart WITH metrics_transaction AS ( SELECT order_id, COUNT(DISTINCT product_id) total_product, SUM(price) total_product_price, SUM(freight_value) total_logistic_cost FROM `database-385606.Latihan1.order_items`-- custom with your dataset GROUP BY 1 ) SELECT a.*, c.customer_city, c.customer_unique_id, COALESCE(b.total_product,0) total_product, COALESCE(b.total_product_price,0) total_product_price, COALESCE(total_logistic_cost,0) total_logistic_cost FROM `database-385606.Latihan1.order` a -- custom with your dataset LEFT JOIN metrics_transaction b ON a.order_id=b.order_id LEFT JOIN ( SELECT DISTINCT customer_id,customer_unique_id customer_city FROM `database-385606.Latihan1.cust`) c -- custom with your dataset ON a.customer_id=c.customer_id CTE Sub Query
  • 8. Review Case Create Data Mart ● View Table: ○ Table yang dibentuk menggunakan query dan membutuhkan waktu dalam running query untuk menampilkan result ● Fact Table: ○ Table yang dibentuk menggunakan query dan hasil akan ditampilkan berupa tabel fisik
  • 9. Review Case View Table 1. Click Save > Save view 2. Setup Project and Dataset then fill name of table 3. Save 1 2 3
  • 10. Review Case Fact Table 1. Add query with: create or replace table loyal-surfer-321414.Testing.data_mart as 1. Run
  • 11. Review Case Hashing Proses menghasilkan fixed-size output, dari variable-sized input yang dilakukan melalui penggunaan rumus matematika yang dikenal sebagai hash function. Setiap aset kripto menggunakan berbagai algoritma hashing yang berbeda untuk membuat berbagai jenis kode hash – algoritma ini bertugas untuk menghasilkan alfanumerik acak Contoh: reference
  • 12. Review Case Step 1 - Check Definition (Not Yet Metadata) SELECT order_status, MAX(order_purchase_timestamp) purchase_date, MAX(order_approved_at) approve_date, MAX(order_delivered_carrier_date) pickup_date_by_courir, MAX(order_delivered_customer_date) delivered_date FROM `loyal-surfer- 321414.Testing.data_mart_purchase` -- custom with your dataset GROUP BY 1 Metadata: data that provides information about other data but not the content of the data, such as the text of a message or the image itself. There are many distinct types of metadata, including: Descriptive metadata – the descriptive information about a resource. It is used for discovery and identification.
  • 13. Review Case Step 2 - Check Basic Statistics SELECT min(partition_columns) min_date ,max(partition_columns) max_date ,count(distinct order_id) total_order ,count(distinct customer_unique_id) total_buyers ,round(avg(total_product),2) avg_products ,round(avg(total_product_price),2) avg_gmv ,approx_quantiles(total_product_price, 100)[offset(50)] median_gmv ,STDDEV(total_product_price) std_gmv ,round(avg(total_logistic_cost),2) avg_logistic_cost ,approx_quantiles(total_logistic_cost, 100)[offset(50)] median_logistic_cost ,STDDEV(total_logistic_cost) std_logistic_cost FROM `loyal-surfer-321414.Testing.data_mart_purchase` where order_status='delivered' statistics values min_date 2016-09-15 max_date 2018-08-29 total_order 96478 total_buyers 93358 avg_products 1.04 avg_gmv 137.04 median_gmv 86.57 std_gmv 209.05 avg_logistic_cost 22.79 median_logistic_cost 17.17 std_logistic_cost 21.56
  • 14. Review Case Step 2 - Check Basic Statistics SELECT FORMAT_DATETIME("%Y-%m-01", partition_columns) month_key ,sum(total_product_price) total_gmv FROM `loyal-surfer-321414.Testing.data_mart_purchase` where order_status='delivered' group by 1 order by 1 month_key total_gmv 2016-09-01 135 2016-10-01 40325 2016-12-01 11 2017-01-01 111798 2017-02-01 234223 2017-03-01 359199 2017-04-01 340670 2017-05-01 489338 2017-06-01 421923 2017-07-01 481605 2017-08-01 554700 2017-09-01 607400 2017-10-01 648248 2017-11-01 987765 2017-12-01 726033
  • 15. Review Case Hands On Step 3 - Deep Dive with our assumptions What Should we do?
  • 16. Review Case Hands On Step 3 - Deep Dive with our assumptions WITH summary_boxplot AS ( SELECT APPROX_QUANTILES(total_product_price, 4)[ OFFSET (2)] q2_gmv, APPROX_QUANTILES(total_product_price, 4)[ OFFSET (3)] - APPROX_QUANTILES(total_product_price, 4)[ OFFSET (1)] iqr FROM `loyal-surfer-321414.Testing.data_mart_purchase_remake` WHERE order_status='delivered' ), summary_batas AS ( SELECT q2_gmv+1.5*iqr batas_atas, q2_gmv-1.5*iqr batas_bawah FROM summary_boxplot ), fact_tagging_anomaly AS ( SELECT *,
  • 17. Review Case Hands On Step 3 - Deep Dive with our assumptions
  • 18. Review Case Hands On Step 3 - Bonus Retention Rate with Cohort Analysis Query Cohort Analysis reference
  • 19. Review Case Hands On Step 4 - Summary 1. … 2. … 3. … 4. …
  • 20. Review Case Hands On Coba 1. Lengkapi slide 17 untuk bikin 3 kelompok berdasarkan pendekatan boxplot (didalam whisker pembelanjaan normal, pembelanjaan outlier dan pembelanjaan extreme) 2. Bikin summary estimasi interval untuk rata-rata gmv dan kasih tagging mana transaksi yang diluar LCL dan UCL 3. Bandingkan taggingan menggunakan boxplot dan pendekatan distribusi normal, simpulkan hasilnya 4. Lengkapi slide 20 untuk summary yang bisa diambil dari data olist