Beer analytics
using Kibana and
Elasticsearch
tweet: #ndcROI
“most bang for the bucks
product” #ndcoslo
You can win!
Christoffer Vig
works at
http://blog.comperiosearch.com
babadofar
Norwegian government-owned alcohol
monopoly.
Sells beverages above 4,7 % alcohol
Open data
http://www.vinmonopolet.no
Vinmonopolet CSV file
CSV -> Elasticsearch
Logstash
Logstash config
Elasticsearch
output
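
The Logstash steps named on these slides (read CSV columns, drop the first line, fix the decimal separator, convert fields to float) can be sketched outside Logstash in plain Python. This is a minimal illustration, not the config used in the talk: the semicolon delimiter, the `Varenavn` column and the sample rows are assumptions, while `Literpris` and `Alkohol` are the field names used later in the scripted field.

```python
import csv
import io

# Hypothetical sample in the shape the talk describes: a header line,
# delimiter-separated columns and decimal commas. The real file has more columns.
SAMPLE = (
    "Varenavn;Literpris;Alkohol\n"
    "Example Stout;120,50;9,00\n"
    "Example Pils;89,90;4,80\n"
)

COLUMNS = ["Varenavn", "Literpris", "Alkohol"]  # assumed column subset
FLOAT_FIELDS = ("Literpris", "Alkohol")


def parse_products(text, delimiter=";"):
    """Turn CSV text into a list of dicts ready to be indexed as JSON documents."""
    reader = csv.reader(io.StringIO(text), delimiter=delimiter)
    next(reader)  # drop the first (header) line
    docs = []
    for row in reader:
        doc = dict(zip(COLUMNS, row))
        for field in FLOAT_FIELDS:
            # fix the decimal comma, then convert the field to float
            doc[field] = float(doc[field].replace(",", "."))
        docs.append(doc)
    return docs


docs = parse_products(SAMPLE)
```

Each resulting dict could then be sent to Elasticsearch as one JSON document; that step is left out here because it depends on the client and cluster setup.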
Demo time!
#ndcroi
Discover
Visualize
Bitterness in beer
View details
Use cases
Boss is buying
Last call
Gourmand customer dinner
Foreign hipster visitors
...
Boss is buying
Last call
Price per alcohol unit
pricePrAlcohol
floor(doc['Literpris'].value/doc['Alkohol'].value)
pricePrAlcohol = price per 1 alcohol unit
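
The scripted field above divides the price per litre by the alcohol percentage and rounds down. A minimal Python sketch of the same arithmetic follows; the guard against a zero alcohol value is an addition for illustration and is not part of the original script.

```python
import math


def price_pr_alcohol(literpris, alkohol):
    """Price per litre divided by alcohol percentage, rounded down.

    Returns None when the alcohol value is zero or negative
    (this guard is an assumption, not in the original script).
    """
    if alkohol <= 0:
        return None
    return math.floor(literpris / alkohol)


cheap_and_strong = price_pr_alcohol(120.0, 8.0)
pricey_and_weak = price_pr_alcohol(89.9, 4.8)
```

A lower value means more alcohol for the money, which is what the "Boss is buying" and "Last call" use cases sort on.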
Gourmand
significant terms
Belgian beer significant terms
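
As a rough sketch of what a significant terms request might look like, the function below builds a search body using the Elasticsearch `significant_terms` aggregation. The field name `lukt_smak` and the size of 8 come from the speaker notes; the query text `Belgia`, the aggregation name `unusual_terms` and the use of a `query_string` query are assumptions for illustration, not the exact request behind the slides.

```python
import json


def significant_terms_request(query_text, field="lukt_smak", size=8):
    """Build a search body that asks for terms unusually common in the matching docs."""
    return {
        "size": 0,  # only the aggregation is of interest, not the hits
        "query": {"query_string": {"query": query_text}},
        "aggs": {
            "unusual_terms": {  # hypothetical aggregation name
                "significant_terms": {"field": field, "size": size}
            }
        },
    }


body = significant_terms_request("Belgia")  # assumed query text
print(json.dumps(body, indent=2))
```

The body would be sent to the `_search` endpoint of the index holding the product catalogue.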
Optimal ROI product
#ndc-roi
Resources
vagrant ELK box
https://github.com/comperiosearch/vagrant-elk-box
code for this talk
https://github.com/babadofar/bbuzz_code
… Thank you!

Editor's Notes

  1. I am going to show how you can use Kibana 4 to create some cool visualizations. The visualizations are built on open data from the Norwegian alcohol monopoly, Vinmonopolet ("the Wine Monopoly"), focusing on the beer part of their catalogue. The invention of bread and beer has been argued to be responsible for humanity's ability to develop technology and build civilization (Wikipedia). Agenda: a short intro to Elasticsearch and Vinmonopolet, how I got the data into Kibana, then demo use cases. I am going to show how you can use Kibana to answer questions.
  2. Comperio is a search consultancy company. 2004: FAST -> 2008 -> SharePoint, Norch, FAST, Elasticsearch, Solr, Neo4j, machine learning... What's so fun about search engines? The difference between a search engine and a database: a search engine has a human being as its end user, a database is technical. Creating good search solutions involves both deeply technical issues and human issues: what is a good search result?
  3. How it all fits together: Elastic is the company behind the development of the open source projects Logstash, Elasticsearch, Kibana and more. Elasticsearch is the main product; it grew out of Compass (2004), which aimed at developer usability for Lucene. Lucene: 1999 (who used Google in 1999?). Demand for scalability led to Elasticsearch in 2010. Logstash: log processing tool with general inputs, outputs and filters. Kibana 4: latest generation of Kibana, with support for aggregations; d3.js, angular.js.
  4. All beverages with an alcohol content higher than 4,75% are sold by Vinmonopolet (max 60%). Regulated opening hours. High tax, taxed by alcohol content. Queues at 15:00 on Saturdays, etc. Pre-party/after-party culture. Beer below 4,8% is sold in grocery stores. -20 -18. Restaurants and pubs may have other products not sold at Vinmonopolet (so the list does not include all alcohol available in Norway).
  5. Vinmonopolet product listing: look at all the nice metadata: color, freshness, bitterness, fullness, depth.
  6. Elasticsearch is a search engine. Period. No crawler, no connectors. Put data into it with the JSON REST API.
  7. iconv: fix the encoding of the file. csv: columns. Drop first line. Fix decimals. Convert fields to float. Output to Elasticsearch. Template.
  8. simple search listing
  9. Vagrant ELK box at GitHub. Use of the Discover tab. Questions on next slide.
  10. Discover tab: search and filter, select fields, sort by fields, save searches. URL? Select Varetype: Øl. Add filter. Search for Stout. Select Bitterhet, show field stats, Visualize. TF-IDF.
  11. How can Lucene be so fast and effective at looking up search results? Documents are converted into an inverted index: terms and their frequencies. Lucene term dictionary: a dictionary containing all of the terms used in all of the indexed fields of all of the documents. The dictionary also contains the number of documents which contain the term, and pointers to the term's frequency and proximity data.
  12. How can Lucene be so fast and effective at looking up search results? Documents are converted into an inverted index: terms and their frequencies. Lucene term dictionary: a dictionary containing all of the terms used in all of the indexed fields of all of the documents. The dictionary also contains the number of documents which contain the term, and pointers to the term's frequency and proximity data.
  13. How can we create a scoring algorithm? We have a query and documents; what's the best way to rank them? Use term frequency: count the number of occurrences of each term and add them up. Docs with lots of matching terms come up at no. 1 (this prefers long documents). #1 has "stout" 7 times. #2 has "imperial" 2 times and "stout" 4 times. #3 has "Russian" 3 times, "Imperial" once and "Stout" once.
  14. https://www.elastic.co/guide/en/elasticsearch/guide/current/scoring-theory.html https://www.elastic.co/guide/en/elasticsearch/guide/current/practical-scoring-function.html term frequency (tf) = count of term in document; document frequency (df) = number of docs containing the term; inverse document frequency (idf) = log(count of docs / df); tf-idf = tf * idf. The illustration is simplified!
  15. Top 20 bitterness. X axis: bitterness numbers. Y axis: count of products with this bitterness. Query: stout.
  16. add sig terms???
  17. Add number of countries. Add Varetype. Add alcohol range.
  18. Top 8 unusual terms in lukt_smak: brødbakst, syrlig, balsamico, gjær, rosin, anslag, kirsebær, eik.
  19. https://www.elastic.co/downloads
  20. illustrations by @eklem
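
The simplified scoring described in notes 13 and 14 can be sketched in a few lines of Python. This is an illustration of the stated formulas only (tf * log(N / df), summed over query terms), not Lucene's actual scoring function; the whitespace tokenizer and the four sample documents are assumptions made up for the example.

```python
import math
from collections import Counter


def tokenize(text):
    """Very naive analysis: lowercase and split on whitespace."""
    return text.lower().split()


def build_index(docs):
    """Return per-document term counts and the document frequency of each term."""
    term_counts = [Counter(tokenize(d)) for d in docs]
    doc_freq = Counter()
    for counts in term_counts:
        doc_freq.update(counts.keys())  # each document counts once per term
    return term_counts, doc_freq


def score(query, term_counts, doc_freq):
    """Sum tf * idf over the query terms for every document."""
    n_docs = len(term_counts)
    scores = []
    for counts in term_counts:
        total = 0.0
        for term in tokenize(query):
            df = doc_freq.get(term, 0)
            if df:
                total += counts[term] * math.log(n_docs / df)
        scores.append(total)
    return scores


# Hypothetical mini catalogue
docs = [
    "stout stout stout porter",
    "imperial stout ale",
    "russian imperial stout",
    "pale ale lager",
]
term_counts, doc_freq = build_index(docs)
scores = score("russian imperial stout", term_counts, doc_freq)
```

With plain term counting the first document would tie for the top spot because it repeats "stout"; with the idf weight the rarer terms "russian" and "imperial" count for more, so the third document ranks first.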