SlideShare a Scribd company logo
1 of 17
SEBD 2021
Conversational OLAP
(discussion paper)
Matteo Francia, Enrico Gallinucci, Matteo Golfarelli
University of Bologna, Italy
29th Italian Symposium on Advanced Database Systems (SEBD 2021)
SEBD 2021
Motivation
Goal: query multidimensional cubes through natural language
Natural language enables analytics in hand-free scenarios [1]
- Augmented reality or with smart assistants
OLAP is based on standard operators [2]
- No help in query construction and natural language disambiguation
We introduce COOL (COnversational OLap) [3]
Matteo Francia – University of Bologna 2
Introduction
[1] Matteo Francia, Matteo Golfarelli, Stefano Rizzi: A-BI+: A framework for Augmented Business Intelligence. Information Systems. (2020)
[2] Panos Vassiliadis, Patrick Marcel, Stefano Rizzi: Beyond roll-up's and drill-down's: An intentional analytics model to reinvent OLAP. Information Systems. (2019)
[3] Matteo Francia, Enrico Gallinucci, Matteo Golfarelli: COOL: A Framework for Conversational OLAP. Information Systems. (2021)
SEBD 2021
COOL: architecture
Matteo Francia – University of Bologna 3
COOL:
overview
Automatic
KB feeding
Manual KB
enrichment KB
DW
Metadata
& values
Synonyms
Offline
Online
Synonyms
Ontology
SEBD 2021
COOL: architecture
Matteo Francia – University of Bologna 4
COOL:
overview
Speech-
to-Text
OLAP
operator
Full query
Disambiguation
& Enhancement
Execution &
Visualization
Automatic
KB feeding
Manual KB
enrichment
Raw
text
Annotated
parse forest
Parse
tree
Metadata
& values
Synonyms
Log
Interpretation
Offline
Online
Synonyms
Ontology
SQL
generation
SQL
Sales by
Customer and
Month
Parse tree
Statistics
KB
DW
SEBD 2021
Robustness: given a text T, we allow several mappings
- E.g., by matching each n-gram to a set of similar entities from the KB
T = «return the average sales for the product NY in each region»
M1 = avg, UnitSales, where, Product, =, New York, group by, Region
M2 = avg, UnitSales, where, Product, =, New York, group by, Regin
⟨GPSJ⟩ ::= ⟨MC⟩⟨GC⟩⟨SC⟩
⟨MC⟩ ::= (⟨Agg⟩⟨Mea⟩ | ⟨Cnt⟩⟨Fct⟩)+
⟨GC⟩ ::= “𝑔𝑟𝑜𝑢𝑝 𝑏𝑦” ⟨Attr⟩+
⟨SC⟩ ::= “𝑤ℎ𝑒𝑟𝑒” ⟨SCA⟩
⟨SCA⟩ ::= ⟨SCN⟩ “𝑎𝑛𝑑” ⟨SCA⟩ | ⟨SCN⟩
⟨SCN⟩ ::= “𝑛𝑜𝑡” ⟨SSC⟩ | ⟨SSC⟩
⟨SSC⟩ ::= ⟨Attr⟩⟨Cop⟩⟨Val⟩ | ⟨Attr⟩⟨Val⟩ | ⟨Val⟩
⟨Cop⟩ ::= “=” | “<>” | “>” | “<” | “≥” | “≤”
⟨Agg⟩ ::= “𝑠𝑢𝑚” | “𝑎𝑣𝑔” | “𝑚𝑖𝑛” | “𝑚𝑎𝑥”
⟨Cnt⟩ ::= “𝑐𝑜𝑢𝑛𝑡” | “𝑐𝑜𝑢𝑛𝑡 𝑑𝑖𝑠𝑡𝑖𝑛𝑐𝑡”
⟨Fct⟩ ::= Domain-specific facts
⟨Mea⟩ ::= Domain-specific measures
⟨Attr⟩ ::= Domain-specific attributes
⟨Val⟩ ::= Domain-specific values
COOL: interpretation
Matteo Francia – University of Bologna 5
COOL:
interpretation
M1 = avg, UnitSales, where, Product, =, New York, group by, Region
Mea
Agg “where”
MC SC
GPSJ
SCA
SCN
SSC
Val
Cop
Attr “group by”Attr
GC
T = «return the average sales for the product NY in each region»
SEBD 2021
COOL: ambiguities
Not all syntactically-correct clauses
are "valid"
- E.g., New York is not a Product
- Annotate it
- Ask a question for each ambiguity
Matteo Francia – University of Bologna 6
M1 = avg, UnitSales, where, Product, =, New York, group by, Region
Mea
Agg “where”
MC SC
GPSJ
SCA
SCN
SSC
Val
Cop
Attr “group by”Attr
GC
AVM
COOL:
interpretation
T = «return the average sales for the product NY in each region»
New York is a not a
product, could you pick
a product among ...?
SEBD 2021
COOL: ambiguities
Some clauses could be excluded
from the GPSJ query
- Annotate them for (possible) addiction
Matteo Francia – University of Bologna 7
COOL:
interpretation
M2 = avg, UnitSales, where, Product, =, New York, group by, Regin
Mea
Agg “where” “group by”
MC SC
GPSJ
Val
SCA
SCN
SSC
SCA
SCN
SSC
Val
SC
Attr Cop
AVM
unparsed
T = «return the average sales for the product NY in each region»
Do you want to add
the selection predicate
"Regin"?
SEBD 2021
COOL: scoring function
Return the forest with the highest score
Matteo Francia – University of Bologna 8
COOL:
interpretation
M1 = avg, UnitSales, where, Product, =, New York, group by, Region
Mea
Agg “where”
MC SC
GPSJ
SCA
SCN
SSC
Val
Cop
Attr “group by”Attr
GC
AVM
M2 = avg, UnitSales, where, Product, =, New York, group by, Regin
Mea
Agg “where” “group by”
MC SC
GPSJ
Val
SCA
SCN
SSC
SCA
SCN
SSC
Val
SC
Attr Cop
AVM
unparsed
Score(PFM1) Score(PFM2)
Score(M1) Score(M2)
Score(PFM1) > Score(PFM2)
SEBD 2021
COOL: scoring function
Also, use the score for pruning purpose
- Sort all the mappings by descending score Score(M)
- First, parse the mapping with the highest Score(M)
- Then, parse only the mappings s.t. Score(M) > Score(PFM)
Matteo Francia – University of Bologna 9
COOL:
interpretation
M1 = avg, UnitSales, where, Product, =, New York, group by, Region
Mea
Agg “where”
MC SC
GPSJ
SCA
SCN
SSC
Val
Cop
Attr “group by”Attr
GC
AVM
M3 = avg, UnitSales, where, Product, =, New York
Score(PFM1)
Score(M3)
Score(M1)
SEBD 2021
Experimental Evaluation
Top-𝑘 accuracy by varying the
similarity 𝛼 to build the mappings
- Real-world dataset from [1]
- Accuracy is stable wrt to 𝑘 (up to 94%)
- 𝛼 depends on the inaccuracies in the text
Matteo Francia – University of Bologna 10
Results
[1] K. Drushku, J. Aligon, N. Labroche, P. Marcel, V. Peralta, Interest-based recommendations for business intelligence users, Inf. Syst. 86 (2019)
SEBD 2021
User Evaluation
40 users with heterogeneous OLAP skills
- Asked to translate (Italian) analytic goals into English
- Users provided good feedback on the interface...
- ... as well as on the interpretation accuracy
Matteo Francia – University of Bologna 11
Results
Full Query OLAP operator
OLAP Familiarity Accuracy Time (s) Accuracy Time (s)
Low 0.91 141 0.86 102
High 0.91 97 0.92 71
SEBD 2021
In
Action!
Matteo Francia, Enrico Gallinucci, Matteo Golfarelli: Conversational OLAP in Action!
EDBT (Best demo award). (2021)
SEBD 2021
In
Action!
Matteo Francia, Enrico Gallinucci, Matteo Golfarelli: Conversational OLAP in Action!
EDBT (Best demo award). (2021)
SEBD 2021
In
Action!
Matteo Francia, Enrico Gallinucci, Matteo Golfarelli: Conversational OLAP in Action!
EDBT (Best demo award). (2021)
SEBD 2021
In
Action!
Matteo Francia, Enrico Gallinucci, Matteo Golfarelli: Conversational OLAP in Action!
EDBT (Best demo award). (2021)
SEBD 2021
In
Action!
Matteo Francia, Enrico Gallinucci, Matteo Golfarelli: Conversational OLAP in Action!
EDBT (Best demo award). (2021)
SEBD 2021
Questions?
Matteo Francia – University of Bologna 17
Thank you.
Full paper:
Matteo Francia, Enrico Gallinucci, Matteo Golfarelli:
COOL: A framework for conversational OLAP.
Information Systems. (2021)
Best demo award:
Matteo Francia, Enrico Gallinucci, Matteo Golfarelli:
Conversational OLAP in Action!
EDBT. (2021)

More Related Content

Similar to [SEBD2021] Conversational OLAP

Transforming deep into transformers – a computer vision approach
Transforming deep into transformers – a computer vision approachTransforming deep into transformers – a computer vision approach
Transforming deep into transformers – a computer vision approachFerdin Joe John Joseph PhD
 
Real Time Sign Language Translation Using Tensor Flow Object Detection
Real Time Sign Language Translation Using Tensor Flow Object DetectionReal Time Sign Language Translation Using Tensor Flow Object Detection
Real Time Sign Language Translation Using Tensor Flow Object DetectionIRJET Journal
 
CORE final workshop introduction
CORE final workshop introductionCORE final workshop introduction
CORE final workshop introductionCarlo Vaccari
 
On Execution Platforms for Large-Scale Aggregate Computing
On Execution Platforms for Large-Scale Aggregate ComputingOn Execution Platforms for Large-Scale Aggregate Computing
On Execution Platforms for Large-Scale Aggregate ComputingRoberto Casadei
 
Textual information analysis for the integration of different data repositories
Textual information analysis for the integration of different data repositoriesTextual information analysis for the integration of different data repositories
Textual information analysis for the integration of different data repositoriescarloamati
 
Simpda 2014 - A living story: measuring quality of developments in a large in...
Simpda 2014 - A living story: measuring quality of developments in a large in...Simpda 2014 - A living story: measuring quality of developments in a large in...
Simpda 2014 - A living story: measuring quality of developments in a large in...SpagoWorld
 
Automated Program Repair, Distinguished lecture at MPI-SWS
Automated Program Repair, Distinguished lecture at MPI-SWSAutomated Program Repair, Distinguished lecture at MPI-SWS
Automated Program Repair, Distinguished lecture at MPI-SWSAbhik Roychoudhury
 
[PhDThesis2021] - Augmenting the knowledge pyramid with unconventional data a...
[PhDThesis2021] - Augmenting the knowledge pyramid with unconventional data a...[PhDThesis2021] - Augmenting the knowledge pyramid with unconventional data a...
[PhDThesis2021] - Augmenting the knowledge pyramid with unconventional data a...University of Bologna
 
Customer Segmentation with R - Deep Dive into flexclust
Customer Segmentation with R - Deep Dive into flexclustCustomer Segmentation with R - Deep Dive into flexclust
Customer Segmentation with R - Deep Dive into flexclustJim Porzak
 
Computational Fields meet Augmented Reality: Perspectives and Challenges
Computational Fields meet Augmented Reality: Perspectives and ChallengesComputational Fields meet Augmented Reality: Perspectives and Challenges
Computational Fields meet Augmented Reality: Perspectives and ChallengesDanilo Pianini
 
Research and activity report
Research and activity reportResearch and activity report
Research and activity reportMarco Cagnazzo
 
“An Industry Standard Performance Benchmark Suite for Machine Learning,” a Pr...
“An Industry Standard Performance Benchmark Suite for Machine Learning,” a Pr...“An Industry Standard Performance Benchmark Suite for Machine Learning,” a Pr...
“An Industry Standard Performance Benchmark Suite for Machine Learning,” a Pr...Edge AI and Vision Alliance
 
vtu data structures lab manual bcs304 pdf
vtu data structures lab manual bcs304 pdfvtu data structures lab manual bcs304 pdf
vtu data structures lab manual bcs304 pdfLPSChandana
 
Capella Days 2021 | Enhancing CubeSat design through ARCADIA and Capella: a c...
Capella Days 2021 | Enhancing CubeSat design through ARCADIA and Capella: a c...Capella Days 2021 | Enhancing CubeSat design through ARCADIA and Capella: a c...
Capella Days 2021 | Enhancing CubeSat design through ARCADIA and Capella: a c...Obeo
 
Predictive Model and Record Description with Segmented Sensitivity Analysis (...
Predictive Model and Record Description with Segmented Sensitivity Analysis (...Predictive Model and Record Description with Segmented Sensitivity Analysis (...
Predictive Model and Record Description with Segmented Sensitivity Analysis (...Greg Makowski
 
Human in the Loop AI for Building Knowledge Bases
Human in the Loop AI for Building Knowledge Bases Human in the Loop AI for Building Knowledge Bases
Human in the Loop AI for Building Knowledge Bases Yunyao Li
 
PredictionIO - Building Applications That Predict User Behavior Through Big D...
PredictionIO - Building Applications That Predict User Behavior Through Big D...PredictionIO - Building Applications That Predict User Behavior Through Big D...
PredictionIO - Building Applications That Predict User Behavior Through Big D...predictionio
 
Proceedings Scholar Metrics : NOTES
Proceedings Scholar Metrics : NOTESProceedings Scholar Metrics : NOTES
Proceedings Scholar Metrics : NOTESSubhajit Sahu
 

Similar to [SEBD2021] Conversational OLAP (20)

Transforming deep into transformers – a computer vision approach
Transforming deep into transformers – a computer vision approachTransforming deep into transformers – a computer vision approach
Transforming deep into transformers – a computer vision approach
 
Real Time Sign Language Translation Using Tensor Flow Object Detection
Real Time Sign Language Translation Using Tensor Flow Object DetectionReal Time Sign Language Translation Using Tensor Flow Object Detection
Real Time Sign Language Translation Using Tensor Flow Object Detection
 
CORE final workshop introduction
CORE final workshop introductionCORE final workshop introduction
CORE final workshop introduction
 
Tutorial_Caf_2020_etool.pdf
Tutorial_Caf_2020_etool.pdfTutorial_Caf_2020_etool.pdf
Tutorial_Caf_2020_etool.pdf
 
On Execution Platforms for Large-Scale Aggregate Computing
On Execution Platforms for Large-Scale Aggregate ComputingOn Execution Platforms for Large-Scale Aggregate Computing
On Execution Platforms for Large-Scale Aggregate Computing
 
Textual information analysis for the integration of different data repositories
Textual information analysis for the integration of different data repositoriesTextual information analysis for the integration of different data repositories
Textual information analysis for the integration of different data repositories
 
Simpda 2014 - A living story: measuring quality of developments in a large in...
Simpda 2014 - A living story: measuring quality of developments in a large in...Simpda 2014 - A living story: measuring quality of developments in a large in...
Simpda 2014 - A living story: measuring quality of developments in a large in...
 
Automated Program Repair, Distinguished lecture at MPI-SWS
Automated Program Repair, Distinguished lecture at MPI-SWSAutomated Program Repair, Distinguished lecture at MPI-SWS
Automated Program Repair, Distinguished lecture at MPI-SWS
 
Ws2001 sessione8 cibella_tuoto
Ws2001 sessione8 cibella_tuotoWs2001 sessione8 cibella_tuoto
Ws2001 sessione8 cibella_tuoto
 
[PhDThesis2021] - Augmenting the knowledge pyramid with unconventional data a...
[PhDThesis2021] - Augmenting the knowledge pyramid with unconventional data a...[PhDThesis2021] - Augmenting the knowledge pyramid with unconventional data a...
[PhDThesis2021] - Augmenting the knowledge pyramid with unconventional data a...
 
Customer Segmentation with R - Deep Dive into flexclust
Customer Segmentation with R - Deep Dive into flexclustCustomer Segmentation with R - Deep Dive into flexclust
Customer Segmentation with R - Deep Dive into flexclust
 
Computational Fields meet Augmented Reality: Perspectives and Challenges
Computational Fields meet Augmented Reality: Perspectives and ChallengesComputational Fields meet Augmented Reality: Perspectives and Challenges
Computational Fields meet Augmented Reality: Perspectives and Challenges
 
Research and activity report
Research and activity reportResearch and activity report
Research and activity report
 
“An Industry Standard Performance Benchmark Suite for Machine Learning,” a Pr...
“An Industry Standard Performance Benchmark Suite for Machine Learning,” a Pr...“An Industry Standard Performance Benchmark Suite for Machine Learning,” a Pr...
“An Industry Standard Performance Benchmark Suite for Machine Learning,” a Pr...
 
vtu data structures lab manual bcs304 pdf
vtu data structures lab manual bcs304 pdfvtu data structures lab manual bcs304 pdf
vtu data structures lab manual bcs304 pdf
 
Capella Days 2021 | Enhancing CubeSat design through ARCADIA and Capella: a c...
Capella Days 2021 | Enhancing CubeSat design through ARCADIA and Capella: a c...Capella Days 2021 | Enhancing CubeSat design through ARCADIA and Capella: a c...
Capella Days 2021 | Enhancing CubeSat design through ARCADIA and Capella: a c...
 
Predictive Model and Record Description with Segmented Sensitivity Analysis (...
Predictive Model and Record Description with Segmented Sensitivity Analysis (...Predictive Model and Record Description with Segmented Sensitivity Analysis (...
Predictive Model and Record Description with Segmented Sensitivity Analysis (...
 
Human in the Loop AI for Building Knowledge Bases
Human in the Loop AI for Building Knowledge Bases Human in the Loop AI for Building Knowledge Bases
Human in the Loop AI for Building Knowledge Bases
 
PredictionIO - Building Applications That Predict User Behavior Through Big D...
PredictionIO - Building Applications That Predict User Behavior Through Big D...PredictionIO - Building Applications That Predict User Behavior Through Big D...
PredictionIO - Building Applications That Predict User Behavior Through Big D...
 
Proceedings Scholar Metrics : NOTES
Proceedings Scholar Metrics : NOTESProceedings Scholar Metrics : NOTES
Proceedings Scholar Metrics : NOTES
 

Recently uploaded

dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Delhi Call girls
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
Digi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptxDigi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptxTanveerAhmed817946
 
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...shivangimorya083
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 

Recently uploaded (20)

dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data Analyst
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
Digi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptxDigi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptx
 
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 

[SEBD2021] Conversational OLAP

  • 1. SEBD 2021 Conversational OLAP (discussion paper) Matteo Francia, Enrico Gallinucci, Matteo Golfarelli University of Bologna, Italy 29th Italian Symposium on Advanced Database Systems (SEBD 2021)
  • 2. SEBD 2021 Motivation Goal: query multidimensional cubes through natural language Natural language enables analytics in hand-free scenarios [1] - Augmented reality or with smart assistants OLAP is based on standard operators [2] - No help in query construction and natural language disambiguation We introduce COOL (COnversational OLap) [3] Matteo Francia – University of Bologna 2 Introduction [1] Matteo Francia, Matteo Golfarelli, Stefano Rizzi: A-BI+: A framework for Augmented Business Intelligence. Information Systems. (2020) [2] Panos Vassiliadis, Patrick Marcel, Stefano Rizzi: Beyond roll-up's and drill-down's: An intentional analytics model to reinvent OLAP. Information Systems. (2019) [3] Matteo Francia, Enrico Gallinucci, Matteo Golfarelli: COOL: A Framework for Conversational OLAP. Information Systems. (2021)
  • 3. SEBD 2021 COOL: architecture Matteo Francia – University of Bologna 3 COOL: overview Automatic KB feeding Manual KB enrichment KB DW Metadata & values Synonyms Offline Online Synonyms Ontology
  • 4. SEBD 2021 COOL: architecture Matteo Francia – University of Bologna 4 COOL: overview Speech- to-Text OLAP operator Full query Disambiguation & Enhancement Execution & Visualization Automatic KB feeding Manual KB enrichment Raw text Annotated parse forest Parse tree Metadata & values Synonyms Log Interpretation Offline Online Synonyms Ontology SQL generation SQL Sales by Customer and Month Parse tree Statistics KB DW
  • 5. SEBD 2021 Robustness: given a text T, we allow several mappings - E.g., by matching each n-gram to a set of similar entities from the KB T = «return the average sales for the product NY in each region» M1 = avg, UnitSales, where, Product, =, New York, group by, Region M2 = avg, UnitSales, where, Product, =, New York, group by, Regin ⟨GPSJ⟩ ::= ⟨MC⟩⟨GC⟩⟨SC⟩ ⟨MC⟩ ::= (⟨Agg⟩⟨Mea⟩ | ⟨Cnt⟩⟨Fct⟩)+ ⟨GC⟩ ::= “𝑔𝑟𝑜𝑢𝑝 𝑏𝑦” ⟨Attr⟩+ ⟨SC⟩ ::= “𝑤ℎ𝑒𝑟𝑒” ⟨SCA⟩ ⟨SCA⟩ ::= ⟨SCN⟩ “𝑎𝑛𝑑” ⟨SCA⟩ | ⟨SCN⟩ ⟨SCN⟩ ::= “𝑛𝑜𝑡” ⟨SSC⟩ | ⟨SSC⟩ ⟨SSC⟩ ::= ⟨Attr⟩⟨Cop⟩⟨Val⟩ | ⟨Attr⟩⟨Val⟩ | ⟨Val⟩ ⟨Cop⟩ ::= “=” | “<>” | “>” | “<” | “≥” | “≤” ⟨Agg⟩ ::= “𝑠𝑢𝑚” | “𝑎𝑣𝑔” | “𝑚𝑖𝑛” | “𝑚𝑎𝑥” ⟨Cnt⟩ ::= “𝑐𝑜𝑢𝑛𝑡” | “𝑐𝑜𝑢𝑛𝑡 𝑑𝑖𝑠𝑡𝑖𝑛𝑐𝑡” ⟨Fct⟩ ::= Domain-specific facts ⟨Mea⟩ ::= Domain-specific measures ⟨Attr⟩ ::= Domain-specific attributes ⟨Val⟩ ::= Domain-specific values COOL: interpretation Matteo Francia – University of Bologna 5 COOL: interpretation M1 = avg, UnitSales, where, Product, =, New York, group by, Region Mea Agg “where” MC SC GPSJ SCA SCN SSC Val Cop Attr “group by”Attr GC T = «return the average sales for the product NY in each region»
  • 6. SEBD 2021 COOL: ambiguities Not all syntactically-correct clauses are "valid" - E.g., New York is not a Product - Annotate it - Ask a question for each ambiguity Matteo Francia – University of Bologna 6 M1 = avg, UnitSales, where, Product, =, New York, group by, Region Mea Agg “where” MC SC GPSJ SCA SCN SSC Val Cop Attr “group by”Attr GC AVM COOL: interpretation T = «return the average sales for the product NY in each region» New York is a not a product, could you pick a product among ...?
  • 7. SEBD 2021 COOL: ambiguities Some clauses could be excluded from the GPSJ query - Annotate them for (possible) addiction Matteo Francia – University of Bologna 7 COOL: interpretation M2 = avg, UnitSales, where, Product, =, New York, group by, Regin Mea Agg “where” “group by” MC SC GPSJ Val SCA SCN SSC SCA SCN SSC Val SC Attr Cop AVM unparsed T = «return the average sales for the product NY in each region» Do you want to add the selection predicate "Regin"?
  • 8. SEBD 2021 COOL: scoring function Return the forest with the highest score Matteo Francia – University of Bologna 8 COOL: interpretation M1 = avg, UnitSales, where, Product, =, New York, group by, Region Mea Agg “where” MC SC GPSJ SCA SCN SSC Val Cop Attr “group by”Attr GC AVM M2 = avg, UnitSales, where, Product, =, New York, group by, Regin Mea Agg “where” “group by” MC SC GPSJ Val SCA SCN SSC SCA SCN SSC Val SC Attr Cop AVM unparsed Score(PFM1) Score(PFM2) Score(M1) Score(M2) Score(PFM1) > Score(PFM2)
  • 9. SEBD 2021 COOL: scoring function Also, use the score for pruning purpose - Sort all the mappings by descending score Score(M) - First, parse the mapping with the highest Score(M) - Then, parse only the mappings s.t. Score(M) > Score(PFM) Matteo Francia – University of Bologna 9 COOL: interpretation M1 = avg, UnitSales, where, Product, =, New York, group by, Region Mea Agg “where” MC SC GPSJ SCA SCN SSC Val Cop Attr “group by”Attr GC AVM M3 = avg, UnitSales, where, Product, =, New York Score(PFM1) Score(M3) Score(M1)
  • 10. SEBD 2021 Experimental Evaluation Top-𝑘 accuracy by varying the similarity 𝛼 to build the mappings - Real-world dataset from [1] - Accuracy is stable wrt to 𝑘 (up to 94%) - 𝛼 depends on the inaccuracies in the text Matteo Francia – University of Bologna 10 Results [1] K. Drushku, J. Aligon, N. Labroche, P. Marcel, V. Peralta, Interest-based recommendations for business intelligence users, Inf. Syst. 86 (2019)
  • 11. SEBD 2021 User Evaluation 40 users with heterogeneous OLAP skills - Asked to translate (Italian) analytic goals into English - Users provided good feedback on the interface... - ... as well as on the interpretation accuracy Matteo Francia – University of Bologna 11 Results Full Query OLAP operator OLAP Familiarity Accuracy Time (s) Accuracy Time (s) Low 0.91 141 0.86 102 High 0.91 97 0.92 71
  • 12. SEBD 2021 In Action! Matteo Francia, Enrico Gallinucci, Matteo Golfarelli: Conversational OLAP in Action! EDBT (Best demo award). (2021)
  • 13. SEBD 2021 In Action! Matteo Francia, Enrico Gallinucci, Matteo Golfarelli: Conversational OLAP in Action! EDBT (Best demo award). (2021)
  • 14. SEBD 2021 In Action! Matteo Francia, Enrico Gallinucci, Matteo Golfarelli: Conversational OLAP in Action! EDBT (Best demo award). (2021)
  • 15. SEBD 2021 In Action! Matteo Francia, Enrico Gallinucci, Matteo Golfarelli: Conversational OLAP in Action! EDBT (Best demo award). (2021)
  • 16. SEBD 2021 In Action! Matteo Francia, Enrico Gallinucci, Matteo Golfarelli: Conversational OLAP in Action! EDBT (Best demo award). (2021)
  • 17. SEBD 2021 Questions? Matteo Francia – University of Bologna 17 Thank you. Full paper: Matteo Francia, Enrico Gallinucci, Matteo Golfarelli: COOL: A framework for conversational OLAP. Information Systems. (2021) Best demo award: Matteo Francia, Enrico Gallinucci, Matteo Golfarelli: Conversational OLAP in Action! EDBT. (2021)

Editor's Notes

  1. DIFF: [17] returns tuples that maximize difference between cells of a cube given as input Profile user exploration to recommend which unvisited parts of the cube RELAXoperator allows toverify whether a pattern observed at a certain level of detail ispresent at a coarser level of detail too [19] Alternative operators have also been proposed in theCinecubes method [7,8]. The goal of this effort is to facilitateautomated reporting, given an original OLAP query as input.To achieve this purpose two operators (expressed asacts) areproposed, namely, (a)put-in-context, i.e., compare the result ofthe original query to query results over similar, sibling values;and (b)give-details, where drill-downs of the original query’sgroupers are performed.
  2. DIFF: [17] returns tuples that maximize difference between cells of a cube given as input Profile user exploration to recommend which unvisited parts of the cube RELAXoperator allows toverify whether a pattern observed at a certain level of detail ispresent at a coarser level of detail too [19] Alternative operators have also been proposed in theCinecubes method [7,8]. The goal of this effort is to facilitateautomated reporting, given an original OLAP query as input.To achieve this purpose two operators (expressed asacts) areproposed, namely, (a)put-in-context, i.e., compare the result ofthe original query to query results over similar, sibling values;and (b)give-details, where drill-downs of the original query’sgroupers are performed.