SlideShare a Scribd company logo
Visitors in Focus –
How Data
Analysis Can
Optimize the
Museum
Experience
Bachelor Project 2019/20
Internal Presentation · Revised for publication
Laura Holz, Selina Reinhard, Leon Schmidt,
Georg Tennigkeit, Christoph Thiede, Tom Wollnik
These slides were originally presented at HPI
on 2019-07-21 to the chair of Information
Systems headed by Prof. Dr. Felix Naumann.
Data Collection Integration Visualization
Social Media
Tickets,
Bookings
Reviews
Project Structure
These slides were originally presented at HPI
on 2019-07-21 to the chair of Information
Systems headed by Prof. Dr. Felix Naumann.
Example Power BI dashboard for Instagram
posts published by the museum.
Example Power BI dashboard for Instagram
posts published by the museum.
Inter alia, it allows to compare the
engagements and impressions of posts
depending on the hashtag they were
attributed with.
System Architecture
Christoph Thiede
2020-07-21 5
For more information on the system
architecture, see our GitHub repository:
https://github.com/Museum-Barberini-
gGmbH/Barberini-Analytics
Data Sources
2020-07-21 6
MUSEUM SYSTEM SOCIAL MEDIA RATING PLATFORMS
All supported data sources:
DOCUMENTATION.md#data-sources.
Data Source: go~mus
REST API
Excel Reports
Web Scraper
2020-07-21 7
CUSTOMER
S
BOOKINGS
EXHIBITIONS
ORDERS
All go~mus sources: DOCUMENTATION.md
Tech Stack
2020-07-21 8
Ubuntu VM
Power BI Online
Backend
Frontend
Python
PostgreSQL
Docker
GitLab Enterprise
requests
Google
API
Client
beautiful
soup
lxml
⋯
pandas
psycopg2
GitLab CI Runner
GitLab
Runner
for Windows
Power BI
Desktop
Image
Magick
.NET
Standard
Windows 10 VM
Encapsulate + automate all dependencies.
Complete system architecture:
DOCUMENTATION.md
Pipeline Orchestration
2020-07-21 9
Task
output(): Target
requires(): Task[]
run()
CsvToDb
copy()
rows()
read_csv(file)
DataPreparationTask
ensure_foreign_keys()
condense_time_series()
DailyEntriesToDb
ExtractDailyEntries
FetchGomusReport
gomus_report.xlsx gomus.daily_entries
daily_entries.csv
requires
requires
Pipeline documentation:
DOCUMENTATION.md#
data-mining-pipeline
Pipeline Orchestration – Example (Success)
2020-07-21 10
PostsToDb
Pipeline documentation:
DOCUMENTATION.md#
data-mining-pipeline
Pipeline Orchestration – Example (Failure)
2020-07-21 11
PostsToDb
FetchTwitter
Pipeline documentation:
DOCUMENTATION.md#
data-mining-pipeline
Database: Post Schema
post
social_media_post
fb_post
fb_comment
ig_post
tweet
museum_review
google_maps_review
app_review
appstore_review
gplay_review
2020-07-21 12
Complete database schema is defined in the
repository (scripts/migrations).
Database: Migration System
2020-07-21 13
migration_001.sql
migration_002.py
migration_000.sql
migration_003.sql
migration_004.sh
migration_005.sql
$MIGRATION_VERSION=005
$ ./migrate.sh
Applying 'migration_005.sql' ...
Done.
ALTER TABLE tweet
ADD COLUMN post_date TIMESTAMP;
Migration system documentation:
DOCUMENTATION.md#migration-system
Continuous
Integration
2020-07-21 14
Build
Unit tests
Minimal mining pipeline
Visualization tests
Code Analysis
CI details:
DOCUMENTATION.md#
continuous-integration
References
• Repository: https://github.com/Museum-Barberini-
gGmbH/Barberini-Analytics
• go~mus API documentation: https://giantmonkey.github.io/gomus-
api-doc/public_api.html
• Luigi framework documentation: https://luigi.readthedocs.io/
• Pandas framework: https://pandas.pydata.org/
• Power BI website: https://powerbi.microsoft.com/en-us/
• PostgreSQL: https://www.postgresql.org/
2020-07-21 15

More Related Content

What's hot

Power BI Desktop | Power BI Tutorial | Power BI Training | Edureka
Power BI Desktop | Power BI Tutorial | Power BI Training | EdurekaPower BI Desktop | Power BI Tutorial | Power BI Training | Edureka
Power BI Desktop | Power BI Tutorial | Power BI Training | Edureka
Edureka!
 
International Data Spaces: Data Sovereignty for Business Model Innovation
International Data Spaces: Data Sovereignty for Business Model InnovationInternational Data Spaces: Data Sovereignty for Business Model Innovation
International Data Spaces: Data Sovereignty for Business Model Innovation
Boris Otto
 
SAP S/4HANA Service - Ein Überblick
 SAP S/4HANA Service - Ein Überblick SAP S/4HANA Service - Ein Überblick
SAP S/4HANA Service - Ein Überblick
IBsolution GmbH
 
Pentaho | Data Integration & Report designer
Pentaho | Data Integration & Report designerPentaho | Data Integration & Report designer
Pentaho | Data Integration & Report designer
Hamdi Hmidi
 
Jena based implementation of a iso 11179 meta data registry
Jena based implementation of a iso 11179 meta data registryJena based implementation of a iso 11179 meta data registry
Jena based implementation of a iso 11179 meta data registry
A. Anil Sinaci
 
Selecting Data Management Tools - A practical approach
Selecting Data Management Tools - A practical approachSelecting Data Management Tools - A practical approach
Selecting Data Management Tools - A practical approach
Christopher Bradley
 
Power BI - 概要と 新しい機能など
Power BI - 概要と 新しい機能などPower BI - 概要と 新しい機能など
Power BI - 概要と 新しい機能など
Takeshi Kagata
 
SharePoint Online へのアクセスを制限しよう
SharePoint Online へのアクセスを制限しようSharePoint Online へのアクセスを制限しよう
SharePoint Online へのアクセスを制限しよう
Hirofumi Ota
 
Big Data: Architecture and Performance Considerations in Logical Data Lakes
Big Data: Architecture and Performance Considerations in Logical Data LakesBig Data: Architecture and Performance Considerations in Logical Data Lakes
Big Data: Architecture and Performance Considerations in Logical Data Lakes
Denodo
 
Power BI Single Page Applications Boise Code Camp 2017
Power BI Single Page Applications Boise Code Camp 2017Power BI Single Page Applications Boise Code Camp 2017
Power BI Single Page Applications Boise Code Camp 2017
Stuart
 
Power BI のいろいろな活用パターン
Power BI のいろいろな活用パターンPower BI のいろいろな活用パターン
Power BI のいろいろな活用パターン
Yugo Shimizu
 
Rda step by step
Rda   step by stepRda   step by step
Rda step by step
Phani Kumar
 
情報の伝え方が変わる!Power BIでレポートを作ってみよう(前半戦)
情報の伝え方が変わる!Power BIでレポートを作ってみよう(前半戦)情報の伝え方が変わる!Power BIでレポートを作ってみよう(前半戦)
情報の伝え方が変わる!Power BIでレポートを作ってみよう(前半戦)
Yugo Shimizu
 
MDM Architecture - SAP
MDM Architecture - SAPMDM Architecture - SAP
MDM Architecture - SAP
Capgemini
 
20100430 introduction to business objects data services
20100430 introduction to business objects data services20100430 introduction to business objects data services
20100430 introduction to business objects data services
Junhyun Song
 

What's hot (15)

Power BI Desktop | Power BI Tutorial | Power BI Training | Edureka
Power BI Desktop | Power BI Tutorial | Power BI Training | EdurekaPower BI Desktop | Power BI Tutorial | Power BI Training | Edureka
Power BI Desktop | Power BI Tutorial | Power BI Training | Edureka
 
International Data Spaces: Data Sovereignty for Business Model Innovation
International Data Spaces: Data Sovereignty for Business Model InnovationInternational Data Spaces: Data Sovereignty for Business Model Innovation
International Data Spaces: Data Sovereignty for Business Model Innovation
 
SAP S/4HANA Service - Ein Überblick
 SAP S/4HANA Service - Ein Überblick SAP S/4HANA Service - Ein Überblick
SAP S/4HANA Service - Ein Überblick
 
Pentaho | Data Integration & Report designer
Pentaho | Data Integration & Report designerPentaho | Data Integration & Report designer
Pentaho | Data Integration & Report designer
 
Jena based implementation of a iso 11179 meta data registry
Jena based implementation of a iso 11179 meta data registryJena based implementation of a iso 11179 meta data registry
Jena based implementation of a iso 11179 meta data registry
 
Selecting Data Management Tools - A practical approach
Selecting Data Management Tools - A practical approachSelecting Data Management Tools - A practical approach
Selecting Data Management Tools - A practical approach
 
Power BI - 概要と 新しい機能など
Power BI - 概要と 新しい機能などPower BI - 概要と 新しい機能など
Power BI - 概要と 新しい機能など
 
SharePoint Online へのアクセスを制限しよう
SharePoint Online へのアクセスを制限しようSharePoint Online へのアクセスを制限しよう
SharePoint Online へのアクセスを制限しよう
 
Big Data: Architecture and Performance Considerations in Logical Data Lakes
Big Data: Architecture and Performance Considerations in Logical Data LakesBig Data: Architecture and Performance Considerations in Logical Data Lakes
Big Data: Architecture and Performance Considerations in Logical Data Lakes
 
Power BI Single Page Applications Boise Code Camp 2017
Power BI Single Page Applications Boise Code Camp 2017Power BI Single Page Applications Boise Code Camp 2017
Power BI Single Page Applications Boise Code Camp 2017
 
Power BI のいろいろな活用パターン
Power BI のいろいろな活用パターンPower BI のいろいろな活用パターン
Power BI のいろいろな活用パターン
 
Rda step by step
Rda   step by stepRda   step by step
Rda step by step
 
情報の伝え方が変わる!Power BIでレポートを作ってみよう(前半戦)
情報の伝え方が変わる!Power BIでレポートを作ってみよう(前半戦)情報の伝え方が変わる!Power BIでレポートを作ってみよう(前半戦)
情報の伝え方が変わる!Power BIでレポートを作ってみよう(前半戦)
 
MDM Architecture - SAP
MDM Architecture - SAPMDM Architecture - SAP
MDM Architecture - SAP
 
20100430 introduction to business objects data services
20100430 introduction to business objects data services20100430 introduction to business objects data services
20100430 introduction to business objects data services
 

Similar to Barberini Analytics - System Architecture

Pixelache 110311-Hintikka-Kari-A-Open-data-Network-esthetics
Pixelache 110311-Hintikka-Kari-A-Open-data-Network-estheticsPixelache 110311-Hintikka-Kari-A-Open-data-Network-esthetics
Pixelache 110311-Hintikka-Kari-A-Open-data-Network-esthetics
Kari A. Hintikka
 
Alexia Meyermann: Building a research infrastructure for educational studies ...
Alexia Meyermann: Building a research infrastructure for educational studies ...Alexia Meyermann: Building a research infrastructure for educational studies ...
Alexia Meyermann: Building a research infrastructure for educational studies ...
DIPF | Leibniz-Institut für Bildungsforschung und Bildungsinformation
 
10 years of IBM Connections
10 years of IBM Connections10 years of IBM Connections
10 years of IBM Connections
LetsConnect
 
Estermann Linked Data Ecosystem for Heritage Data - 29 Feb 2020
Estermann Linked Data Ecosystem for Heritage Data - 29 Feb 2020Estermann Linked Data Ecosystem for Heritage Data - 29 Feb 2020
Estermann Linked Data Ecosystem for Heritage Data - 29 Feb 2020
Beat Estermann
 
Value creation, value flows and liability over virtualised resources
Value creation, value flows and liability over virtualised resourcesValue creation, value flows and liability over virtualised resources
Value creation, value flows and liability over virtualised resources
ictseserv
 
Analyzing Social Media with Digital Methods. Possibilities, Requirements, and...
Analyzing Social Media with Digital Methods. Possibilities, Requirements, and...Analyzing Social Media with Digital Methods. Possibilities, Requirements, and...
Analyzing Social Media with Digital Methods. Possibilities, Requirements, and...
Bernhard Rieder
 
Linked Data in Production: Moving Beyond Ontologies
Linked Data in Production: Moving Beyond OntologiesLinked Data in Production: Moving Beyond Ontologies
Linked Data in Production: Moving Beyond Ontologies
David Newbury
 
Observations on Annotations – From Computational Linguistics and the World Wi...
Observations on Annotations – From Computational Linguistics and the World Wi...Observations on Annotations – From Computational Linguistics and the World Wi...
Observations on Annotations – From Computational Linguistics and the World Wi...
Georg Rehm
 
WP8 Dissemination and Exploitation
WP8 Dissemination and ExploitationWP8 Dissemination and Exploitation
WP8 Dissemination and Exploitation
INSEMTIVES project
 
Data Management Plans for SNSF Grants
Data Management Plans for SNSF GrantsData Management Plans for SNSF Grants
Data Management Plans for SNSF Grants
ETH-Bibliothek
 
Robust Expert Finding in Web-Based Community Information Systems
Robust Expert Finding in Web-Based Community Information SystemsRobust Expert Finding in Web-Based Community Information Systems
Robust Expert Finding in Web-Based Community Information Systems
Ralf Klamma
 
Holographic Data Visualization - M. Valoriani & A. Musone
Holographic Data Visualization - M. Valoriani & A. MusoneHolographic Data Visualization - M. Valoriani & A. Musone
Holographic Data Visualization - M. Valoriani & A. Musone
Data Driven Innovation
 
Innovation through data capitalisation
Innovation through data capitalisationInnovation through data capitalisation
Innovation through data capitalisation
Joanne Jacobs
 
HADOOP based Recommendation Algorithm for Micro-video URL
HADOOP based Recommendation Algorithm for Micro-video URLHADOOP based Recommendation Algorithm for Micro-video URL
HADOOP based Recommendation Algorithm for Micro-video URL
dbpublications
 
Layerized and temporally - Digital+humanities 2010 @MIT Medialab
Layerized and temporally - Digital+humanities 2010 @MIT MedialabLayerized and temporally - Digital+humanities 2010 @MIT Medialab
Layerized and temporally - Digital+humanities 2010 @MIT Medialab
Samuel Huron
 
open data for an open future.pptx
open data for an open future.pptxopen data for an open future.pptx
open data for an open future.pptx
Fach- und Koordinationsstelle OGD, Kanton Zürich
 
Understanding everyday users’ perception of socio-technical issues through s...
Understanding everyday users’ perception of  socio-technical issues through s...Understanding everyday users’ perception of  socio-technical issues through s...
Understanding everyday users’ perception of socio-technical issues through s...
Ahreum lee
 
KiWi – A flexible platform for semantic social media applications
KiWi – A flexible platform for semantic social media applicationsKiWi – A flexible platform for semantic social media applications
KiWi – A flexible platform for semantic social media applications
Kiwi Community
 
2021_07_01 «Learning Informatics as Inspiration for Learning Analytics».
2021_07_01 «Learning Informatics as Inspiration for Learning Analytics».2021_07_01 «Learning Informatics as Inspiration for Learning Analytics».
2021_07_01 «Learning Informatics as Inspiration for Learning Analytics».
eMadrid network
 
Introduction - Lecture 1 - Advanced Topics in Information Systems (4016792ENR)
Introduction - Lecture 1 - Advanced Topics in Information Systems (4016792ENR)Introduction - Lecture 1 - Advanced Topics in Information Systems (4016792ENR)
Introduction - Lecture 1 - Advanced Topics in Information Systems (4016792ENR)
Beat Signer
 

Similar to Barberini Analytics - System Architecture (20)

Pixelache 110311-Hintikka-Kari-A-Open-data-Network-esthetics
Pixelache 110311-Hintikka-Kari-A-Open-data-Network-estheticsPixelache 110311-Hintikka-Kari-A-Open-data-Network-esthetics
Pixelache 110311-Hintikka-Kari-A-Open-data-Network-esthetics
 
Alexia Meyermann: Building a research infrastructure for educational studies ...
Alexia Meyermann: Building a research infrastructure for educational studies ...Alexia Meyermann: Building a research infrastructure for educational studies ...
Alexia Meyermann: Building a research infrastructure for educational studies ...
 
10 years of IBM Connections
10 years of IBM Connections10 years of IBM Connections
10 years of IBM Connections
 
Estermann Linked Data Ecosystem for Heritage Data - 29 Feb 2020
Estermann Linked Data Ecosystem for Heritage Data - 29 Feb 2020Estermann Linked Data Ecosystem for Heritage Data - 29 Feb 2020
Estermann Linked Data Ecosystem for Heritage Data - 29 Feb 2020
 
Value creation, value flows and liability over virtualised resources
Value creation, value flows and liability over virtualised resourcesValue creation, value flows and liability over virtualised resources
Value creation, value flows and liability over virtualised resources
 
Analyzing Social Media with Digital Methods. Possibilities, Requirements, and...
Analyzing Social Media with Digital Methods. Possibilities, Requirements, and...Analyzing Social Media with Digital Methods. Possibilities, Requirements, and...
Analyzing Social Media with Digital Methods. Possibilities, Requirements, and...
 
Linked Data in Production: Moving Beyond Ontologies
Linked Data in Production: Moving Beyond OntologiesLinked Data in Production: Moving Beyond Ontologies
Linked Data in Production: Moving Beyond Ontologies
 
Observations on Annotations – From Computational Linguistics and the World Wi...
Observations on Annotations – From Computational Linguistics and the World Wi...Observations on Annotations – From Computational Linguistics and the World Wi...
Observations on Annotations – From Computational Linguistics and the World Wi...
 
WP8 Dissemination and Exploitation
WP8 Dissemination and ExploitationWP8 Dissemination and Exploitation
WP8 Dissemination and Exploitation
 
Data Management Plans for SNSF Grants
Data Management Plans for SNSF GrantsData Management Plans for SNSF Grants
Data Management Plans for SNSF Grants
 
Robust Expert Finding in Web-Based Community Information Systems
Robust Expert Finding in Web-Based Community Information SystemsRobust Expert Finding in Web-Based Community Information Systems
Robust Expert Finding in Web-Based Community Information Systems
 
Holographic Data Visualization - M. Valoriani & A. Musone
Holographic Data Visualization - M. Valoriani & A. MusoneHolographic Data Visualization - M. Valoriani & A. Musone
Holographic Data Visualization - M. Valoriani & A. Musone
 
Innovation through data capitalisation
Innovation through data capitalisationInnovation through data capitalisation
Innovation through data capitalisation
 
HADOOP based Recommendation Algorithm for Micro-video URL
HADOOP based Recommendation Algorithm for Micro-video URLHADOOP based Recommendation Algorithm for Micro-video URL
HADOOP based Recommendation Algorithm for Micro-video URL
 
Layerized and temporally - Digital+humanities 2010 @MIT Medialab
Layerized and temporally - Digital+humanities 2010 @MIT MedialabLayerized and temporally - Digital+humanities 2010 @MIT Medialab
Layerized and temporally - Digital+humanities 2010 @MIT Medialab
 
open data for an open future.pptx
open data for an open future.pptxopen data for an open future.pptx
open data for an open future.pptx
 
Understanding everyday users’ perception of socio-technical issues through s...
Understanding everyday users’ perception of  socio-technical issues through s...Understanding everyday users’ perception of  socio-technical issues through s...
Understanding everyday users’ perception of socio-technical issues through s...
 
KiWi – A flexible platform for semantic social media applications
KiWi – A flexible platform for semantic social media applicationsKiWi – A flexible platform for semantic social media applications
KiWi – A flexible platform for semantic social media applications
 
2021_07_01 «Learning Informatics as Inspiration for Learning Analytics».
2021_07_01 «Learning Informatics as Inspiration for Learning Analytics».2021_07_01 «Learning Informatics as Inspiration for Learning Analytics».
2021_07_01 «Learning Informatics as Inspiration for Learning Analytics».
 
Introduction - Lecture 1 - Advanced Topics in Information Systems (4016792ENR)
Introduction - Lecture 1 - Advanced Topics in Information Systems (4016792ENR)Introduction - Lecture 1 - Advanced Topics in Information Systems (4016792ENR)
Introduction - Lecture 1 - Advanced Topics in Information Systems (4016792ENR)
 

Recently uploaded

"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota
Fwdays
 
Northern Engraving | Modern Metal Trim, Nameplates and Appliance Panels
Northern Engraving | Modern Metal Trim, Nameplates and Appliance PanelsNorthern Engraving | Modern Metal Trim, Nameplates and Appliance Panels
Northern Engraving | Modern Metal Trim, Nameplates and Appliance Panels
Northern Engraving
 
GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
Javier Junquera
 
Call Girls Chandigarh🔥7023059433🔥Agency Profile Escorts in Chandigarh Availab...
Call Girls Chandigarh🔥7023059433🔥Agency Profile Escorts in Chandigarh Availab...Call Girls Chandigarh🔥7023059433🔥Agency Profile Escorts in Chandigarh Availab...
Call Girls Chandigarh🔥7023059433🔥Agency Profile Escorts in Chandigarh Availab...
manji sharman06
 
Essentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation ParametersEssentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation Parameters
Safe Software
 
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
Fwdays
 
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving
 
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge GraphGraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
Neo4j
 
Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
c5vrf27qcz
 
AppSec PNW: Android and iOS Application Security with MobSF
AppSec PNW: Android and iOS Application Security with MobSFAppSec PNW: Android and iOS Application Security with MobSF
AppSec PNW: Android and iOS Application Security with MobSF
Ajin Abraham
 
MySQL InnoDB Storage Engine: Deep Dive - Mydbops
MySQL InnoDB Storage Engine: Deep Dive - MydbopsMySQL InnoDB Storage Engine: Deep Dive - Mydbops
MySQL InnoDB Storage Engine: Deep Dive - Mydbops
Mydbops
 
Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |
AstuteBusiness
 
ScyllaDB Tablets: Rethinking Replication
ScyllaDB Tablets: Rethinking ReplicationScyllaDB Tablets: Rethinking Replication
ScyllaDB Tablets: Rethinking Replication
ScyllaDB
 
Getting the Most Out of ScyllaDB Monitoring: ShareChat's Tips
Getting the Most Out of ScyllaDB Monitoring: ShareChat's TipsGetting the Most Out of ScyllaDB Monitoring: ShareChat's Tips
Getting the Most Out of ScyllaDB Monitoring: ShareChat's Tips
ScyllaDB
 
AWS Certified Solutions Architect Associate (SAA-C03)
AWS Certified Solutions Architect Associate (SAA-C03)AWS Certified Solutions Architect Associate (SAA-C03)
AWS Certified Solutions Architect Associate (SAA-C03)
HarpalGohil4
 
"NATO Hackathon Winner: AI-Powered Drug Search", Taras Kloba
"NATO Hackathon Winner: AI-Powered Drug Search",  Taras Kloba"NATO Hackathon Winner: AI-Powered Drug Search",  Taras Kloba
"NATO Hackathon Winner: AI-Powered Drug Search", Taras Kloba
Fwdays
 
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
AlexanderRichford
 
The Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptxThe Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptx
operationspcvita
 
Demystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through StorytellingDemystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through Storytelling
Enterprise Knowledge
 
inQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
inQuba Webinar Mastering Customer Journey Management with Dr Graham HillinQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
inQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
LizaNolte
 

Recently uploaded (20)

"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota
 
Northern Engraving | Modern Metal Trim, Nameplates and Appliance Panels
Northern Engraving | Modern Metal Trim, Nameplates and Appliance PanelsNorthern Engraving | Modern Metal Trim, Nameplates and Appliance Panels
Northern Engraving | Modern Metal Trim, Nameplates and Appliance Panels
 
GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
 
Call Girls Chandigarh🔥7023059433🔥Agency Profile Escorts in Chandigarh Availab...
Call Girls Chandigarh🔥7023059433🔥Agency Profile Escorts in Chandigarh Availab...Call Girls Chandigarh🔥7023059433🔥Agency Profile Escorts in Chandigarh Availab...
Call Girls Chandigarh🔥7023059433🔥Agency Profile Escorts in Chandigarh Availab...
 
Essentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation ParametersEssentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation Parameters
 
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
 
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
 
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge GraphGraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
 
Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
 
AppSec PNW: Android and iOS Application Security with MobSF
AppSec PNW: Android and iOS Application Security with MobSFAppSec PNW: Android and iOS Application Security with MobSF
AppSec PNW: Android and iOS Application Security with MobSF
 
MySQL InnoDB Storage Engine: Deep Dive - Mydbops
MySQL InnoDB Storage Engine: Deep Dive - MydbopsMySQL InnoDB Storage Engine: Deep Dive - Mydbops
MySQL InnoDB Storage Engine: Deep Dive - Mydbops
 
Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |
 
ScyllaDB Tablets: Rethinking Replication
ScyllaDB Tablets: Rethinking ReplicationScyllaDB Tablets: Rethinking Replication
ScyllaDB Tablets: Rethinking Replication
 
Getting the Most Out of ScyllaDB Monitoring: ShareChat's Tips
Getting the Most Out of ScyllaDB Monitoring: ShareChat's TipsGetting the Most Out of ScyllaDB Monitoring: ShareChat's Tips
Getting the Most Out of ScyllaDB Monitoring: ShareChat's Tips
 
AWS Certified Solutions Architect Associate (SAA-C03)
AWS Certified Solutions Architect Associate (SAA-C03)AWS Certified Solutions Architect Associate (SAA-C03)
AWS Certified Solutions Architect Associate (SAA-C03)
 
"NATO Hackathon Winner: AI-Powered Drug Search", Taras Kloba
"NATO Hackathon Winner: AI-Powered Drug Search",  Taras Kloba"NATO Hackathon Winner: AI-Powered Drug Search",  Taras Kloba
"NATO Hackathon Winner: AI-Powered Drug Search", Taras Kloba
 
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
 
The Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptxThe Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptx
 
Demystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through StorytellingDemystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through Storytelling
 
inQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
inQuba Webinar Mastering Customer Journey Management with Dr Graham HillinQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
inQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
 

Barberini Analytics - System Architecture

  • 1. Visitors in Focus – How Data Analysis Can Optimize the Museum Experience Bachelor Project 2019/20 Internal Presentation · Revised for publication Laura Holz, Selina Reinhard, Leon Schmidt, Georg Tennigkeit, Christoph Thiede, Tom Wollnik These slides were originally presented at HPI on 2019-07-21 to the chair of Information Systems headed by Prof. Dr. Felix Naumann.
  • 2. Data Collection Integration Visualization Social Media Tickets, Bookings Reviews Project Structure These slides were originally presented at HPI on 2019-07-21 to the chair of Information Systems headed by Prof. Dr. Felix Naumann.
  • 3. Example Power BI dashboard for Instagram posts published by the museum.
  • 4. Example Power BI dashboard for Instagram posts published by the museum. Inter alia, it allows to compare the engagements and impressions of posts depending on the hashtag they were attributed with.
  • 5. System Architecture Christoph Thiede 2020-07-21 5 For more information on the system architecture, see our GitHub repository: https://github.com/Museum-Barberini- gGmbH/Barberini-Analytics
  • 6. Data Sources 2020-07-21 6 MUSEUM SYSTEM SOCIAL MEDIA RATING PLATFORMS All supported data sources: DOCUMENTATION.md#data-sources.
  • 7. Data Source: go~mus REST API Excel Reports Web Scraper 2020-07-21 7 CUSTOMER S BOOKINGS EXHIBITIONS ORDERS All go~mus sources: DOCUMENTATION.md
  • 8. Tech Stack 2020-07-21 8 Ubuntu VM Power BI Online Backend Frontend Python PostgreSQL Docker GitLab Enterprise requests Google API Client beautiful soup lxml ⋯ pandas psycopg2 GitLab CI Runner GitLab Runner for Windows Power BI Desktop Image Magick .NET Standard Windows 10 VM Encapsulate + automate all dependencies. Complete system architecture: DOCUMENTATION.md
  • 9. Pipeline Orchestration 2020-07-21 9 Task output(): Target requires(): Task[] run() CsvToDb copy() rows() read_csv(file) DataPreparationTask ensure_foreign_keys() condense_time_series() DailyEntriesToDb ExtractDailyEntries FetchGomusReport gomus_report.xlsx gomus.daily_entries daily_entries.csv requires requires Pipeline documentation: DOCUMENTATION.md# data-mining-pipeline
  • 10. Pipeline Orchestration – Example (Success) 2020-07-21 10 PostsToDb Pipeline documentation: DOCUMENTATION.md# data-mining-pipeline
  • 11. Pipeline Orchestration – Example (Failure) 2020-07-21 11 PostsToDb FetchTwitter Pipeline documentation: DOCUMENTATION.md# data-mining-pipeline
  • 13. Database: Migration System 2020-07-21 13 migration_001.sql migration_002.py migration_000.sql migration_003.sql migration_004.sh migration_005.sql $MIGRATION_VERSION=005 $ ./migrate.sh Applying 'migration_005.sql' ... Done. ALTER TABLE tweet ADD COLUMN post_date TIMESTAMP; Migration system documentation: DOCUMENTATION.md#migration-system
  • 14. Continuous Integration 2020-07-21 14 Build Unit tests Minimal mining pipeline Visualization tests Code Analysis CI details: DOCUMENTATION.md# continuous-integration
  • 15. References • Repository: https://github.com/Museum-Barberini- gGmbH/Barberini-Analytics • go~mus API documentation: https://giantmonkey.github.io/gomus- api-doc/public_api.html • Luigi framework documentation: https://luigi.readthedocs.io/ • Pandas framework: https://pandas.pydata.org/ • Power BI website: https://powerbi.microsoft.com/en-us/ • PostgreSQL: https://www.postgresql.org/ 2020-07-21 15

Editor's Notes

  1. These slides were originally presented at HPI on 2019-07-21 to the chair of Information Systems headed by Prof. Dr. Felix Naumann.
  2. Example Power BI dashboard for Instagram posts published by the museum.
  3. Example Power BI dashboard for Instagram posts published by the museum. Inter alia, it allows to compare the engagements and impressions of posts depending on the hashtag they were attributed with.
  4. For more information on the system architecture, see our GitHub repository: https://github.com/Museum-Barberini-gGmbH/Barberini-Analytics
  5. For more information on all supported data sources, see DOCUMENTATION.md#data-sources.
  6. For more information on gomus sources, see DOCUMENTATION.md.
  7. Motivation for containerization: encapsulate + automate all dependencies. For more information on the system architecture, see DOCUMENTATION.md.
  8. For more information on the pipeline, see DOCUMENTATION.md#data-mining-pipeline.
  9. For more information on the pipeline, see DOCUMENTATION.md#data-mining-pipeline.
  10. For more information on the pipeline, see DOCUMENTATION.md#data-mining-pipeline.
  11. For the complete database schema, see all migration scripts in the repository (scripts/migrations).
  12. Fore more information on the migration system, see DOCUMENTATION.md#migration-system.
  13. For more information on the CI, see DOCUMENTATION.md#continuous-integration.
  14. For more information on the system architecture, see our GitHub repository: https://github.com/Museum-Barberini-gGmbH/Barberini-Analytics