SlideShare a Scribd company logo
1 of 24
Download to read offline
HOW PEOPLE TALK ABOUT HEALTH?
Nicola Procopio 25-01-2022
Senior Data Scientist
Nicola Procopio
About me
nicola.procopio@healthwareinternational.com
Contacts
Background
Community
https://www.slideshare.net/NicolaProcopio
https://it.linkedin.com/in/nicolaprocopio
https://github.com/nickprock
Actually @
Mission
We play at the intersection of science,
creativity, boundless curiosity, and our
understanding of human needs.
That’s how we design
transformational healthcare
experiences that engage, simplify and
empower people’s lives.
We are digital natives and
multi-talented coders, connected and
passionate to learn and innovate.
Our mission is to design and develop
successful solutions and digital
products.
The full-service healthcare agency
of Healthware Group
Summary
1. Google Flu Trends (2009)
2. Online Clustering Algorithm on Twitter Data (2016)
3. Observational Study on an Online Decision Support
System Data (2021)
Google Flu Trends
● Started in 2008
● It provided estimates of influenza activity for more than
25 countries.
“Google web search queries can be used to accurately
estimate influenza-like illness percentages in each of the
nine public health regions of the United States” [1]
Google Flu Trends
● Google Flu Trends stopped publishing current estimates
on 9 August 2015.
HealthS-Tweet
● "Health Surveillance through Twitter" is a specialized
version for the healthcare domain of a previous work
about Online Clustering for Topic Detection[3]
● Incremental clustering
● Able to detect low-frequency topics
● not based on word frequencies, but on similarity of the
words and hashtag used
● Tweets posted in USA from September 2015 to April 2016
HealthS-Tweet: Data
● tweet → tw: (id, u, l, tf, fv)
○ id: tweet id
○ u: user id
○ l: location [Lat.; Lon.]
○ tf: creation time
○ fv: (wu, wb, hu, hb)
■ wu, wb: word unigram, word bigram
■ hu, hb: hashtag unigram, hashtag bigram
● Topic Summary → S like a tweet extended with (ht, t0, tc, fvt)
○ ht: health topic, the label
○ t0, tc: creation time, last update time
○ fvt: (fv, ff)
■ fv: analogous to the tweet fv
■ ff (fwu, fwb, fhu, fhb): the frequencies associate to unigram and bigram
HealthS-Tweet: Algo
tw_1
…
tw_n-1
tw_n
tw_2
tw_3
tw_4
update S
HT_1
similarity
HealthS-Tweet: Algo
tw_1
…
tw_n-1
tw_n
tw_2
tw_3
tw_4
HT_2
update S
HT_1
similarity
life-span
HealthS-Tweet: Algo
tw_1
…
tw_n-1
tw_n
tw_2
tw_3
tw_4
HT_2
update S
HT_1
similarity
life span
HT_3
HealthS-Tweet: Results
● outperform traditional topic modeling
● the topic are very different (spanning
from seasonal disease to common
disease)
● future developments*:
○ HT sentiments
○ improve the labeling about HT
○ automatic hyperparameters tuning
Online DSS
A digital Italian health tech
startup, Paginemediche,
developed a
noncommercial, online
DSS with a chat user
interface early 2020:
VISITAMI
This study aimed to compare the trend in online DSS sessions with that of COVID-19 cases
reported by the national health surveillance system in Italy, from February 2020 to March
2021.[4]
Online DSS: dataset
● 75557 sessions
○ 65207 were sessions by symptomatic users
○ 19062 were by contacts of individuals with
COVID-19
Online DSS: Analysis
● To perform the comparison, we first used moving averages to display the curves.
○ K was set at a value of 7, as data from the national surveillance system were
published weekly.
● We then scaled each time series in a range between 0 and 1
On the scaled Time Series:
● We first applied Symbolic Aggregate approXimation (SAX) encoding
● Then, we calculated the Hamming distance between SAX Strings
● To verify that the online DSS anticipated the trends observed in notified cases, we shifted
its time series 1 week ahead.
Online DSS: Analysis
SAX Encoding
Symbolic Aggregate approXimation Encoding:
● developed by Keogh and Lin in 2002
● transforms time series in sequence of symbols
● robust for missing values
● unsupervised
Adolphe Sax saxophone creator.
Correlation(SAX, Sax) = 0.00
Data Preparation
Il SAX Encoding needs data organized as follows:
● for each row a time series
● for each column a timestep
● standardized data
protezione civile’s data about Covid-19
Piecewise Aggregate
Approximation
Idea
“SAX encoding is a method used to simplify time series by symbolically
representing periods, the data becomes much smaller and easier to deal with,
while still capturing its important aspects.”
A Time Series Y = [Y₁, Y₂, …,Yn] can be reduced in a sequence X = [X₁, X₂, …, Xm] with m≤n
using:
special cases:
● m=n return the original series
● m=1 return only one value, the average of the original series
Piecewise Aggregate
Approximation
Important, the setting of
hyperparameter w, the time
window.
It controls the number of
segments, if len(TS)/w isn’t
integer round up so as not to
lose information.
SAX String
Creation of the SAX Strings:
● choose how many levels
● choose the bounds
● labeling periods returned by PAA
Online DSS: Results
● The series "user with contacts" was more consistent with the trend in confirmed
cases than "user with symptoms"
● Throughout the period,
after applying the 1-week shift,
the Hamming distance
improved from 0.49 to 0.46
● July - December 2020,
after applying the 1-week shift,
the Hamming distance
improved from 0.16 to 0.08
I haven’t a “Conclusion”
Thank You!
Let’s Chat!
References
1. Ginsberg, Jeremy, et al. "Detecting influenza epidemics using search engine query data." Nature 457.7232
(2009): 1012-1014.
2. Comito, Carmela, Clara Pizzuti, and Nicola Procopio. "How people talk about health? Detecting health topics
from Twitter streams." Proceedings of the International Conference on Big Data and Internet of Thing. 2017.
3. C. Comito, C. Pizzuti and N. Procopio, "Online Clustering for Topic Detection in Social Data Streams," 2016 IEEE
28th International Conference on Tools with Artificial Intelligence (ICTAI), 2016, pp. 362-369, doi:
10.1109/ICTAI.2016.0062.
4. Tozzi A, Gesualdo F, Urbani E, Sbenaglia A, Ascione R, Procopio N, Croci I, Rizzo C. “Digital Surveillance Through
an Online Decision Support Tool for COVID-19 Over One Year of the Pandemic in Italy: Observational Study”, J
Med Internet Res 2021;23(8):e29556.
5. “Individuare Pattern con il SAX Encoding”. IAML Blog

More Related Content

Similar to How people talk about health?

Data Science Innovations : Democratisation of Data and Data Science
Data Science Innovations : Democratisation of Data and Data Science  Data Science Innovations : Democratisation of Data and Data Science
Data Science Innovations : Democratisation of Data and Data Science suresh sood
 
Data Sets as Facilitator for new Products and Services for Universities
Data Sets as Facilitator for new Products and Services for UniversitiesData Sets as Facilitator for new Products and Services for Universities
Data Sets as Facilitator for new Products and Services for UniversitiesHendrik Drachsler
 
Personalized health knowledge graph ckg workshop - iswc 2018 (2)
Personalized health knowledge graph   ckg workshop - iswc 2018 (2)Personalized health knowledge graph   ckg workshop - iswc 2018 (2)
Personalized health knowledge graph ckg workshop - iswc 2018 (2)Amélie Gyrard
 
Learning to Classify Users in Online Interaction Networks
Learning to Classify Users in Online Interaction NetworksLearning to Classify Users in Online Interaction Networks
Learning to Classify Users in Online Interaction NetworksSymeon Papadopoulos
 
What's up at Kno.e.sis?
What's up at Kno.e.sis? What's up at Kno.e.sis?
What's up at Kno.e.sis? Amit Sheth
 
Rapid biomedical search
Rapid biomedical search Rapid biomedical search
Rapid biomedical search petermurrayrust
 
A Customisable Pipeline for Continuously Harvesting Socially-Minded Twitter U...
A Customisable Pipeline for Continuously Harvesting Socially-Minded Twitter U...A Customisable Pipeline for Continuously Harvesting Socially-Minded Twitter U...
A Customisable Pipeline for Continuously Harvesting Socially-Minded Twitter U...Paolo Missier
 
Digitalization Capacity for Knowledge Acquisition-Learning from Health Monito...
Digitalization Capacity for Knowledge Acquisition-Learning from Health Monito...Digitalization Capacity for Knowledge Acquisition-Learning from Health Monito...
Digitalization Capacity for Knowledge Acquisition-Learning from Health Monito...shengjing 孙胜晶
 
A Priori Relevance Based On Quality and Diversity Of Social Signals
A Priori Relevance Based On Quality and Diversity Of Social SignalsA Priori Relevance Based On Quality and Diversity Of Social Signals
A Priori Relevance Based On Quality and Diversity Of Social SignalsIsmail BADACHE
 
Extracting interesting concepts from large-scale textual data
Extracting interesting concepts from large-scale textual dataExtracting interesting concepts from large-scale textual data
Extracting interesting concepts from large-scale textual dataVasileios Lampos
 
Building Effective Visualization Shiny WVF
Building Effective Visualization Shiny WVFBuilding Effective Visualization Shiny WVF
Building Effective Visualization Shiny WVFOlga Scrivner
 
Networks, Deep Learning (and COVID-19)
Networks, Deep Learning (and COVID-19)Networks, Deep Learning (and COVID-19)
Networks, Deep Learning (and COVID-19)tm1966
 
thesis_presentation_v1 shorter split
thesis_presentation_v1 shorter splitthesis_presentation_v1 shorter split
thesis_presentation_v1 shorter splitMiklas Njor
 
AI in between online and offline discourse - and what has ChatGPT to do with ...
AI in between online and offline discourse - and what has ChatGPT to do with ...AI in between online and offline discourse - and what has ChatGPT to do with ...
AI in between online and offline discourse - and what has ChatGPT to do with ...Stefan Dietze
 
Kno.e.sis Approach to Impactful Research & Training for Exceptional Careers
Kno.e.sis Approach to Impactful Research & Training for Exceptional CareersKno.e.sis Approach to Impactful Research & Training for Exceptional Careers
Kno.e.sis Approach to Impactful Research & Training for Exceptional CareersAmit Sheth
 
Examples of Applied Semantic Technologies: Application of Semantic Sensor Net...
Examples of Applied Semantic Technologies: Application of Semantic Sensor Net...Examples of Applied Semantic Technologies: Application of Semantic Sensor Net...
Examples of Applied Semantic Technologies: Application of Semantic Sensor Net...Artificial Intelligence Institute at UofSC
 
From Text to Data to the World: The Future of Knowledge Graphs
From Text to Data to the World: The Future of Knowledge GraphsFrom Text to Data to the World: The Future of Knowledge Graphs
From Text to Data to the World: The Future of Knowledge GraphsPaul Groth
 

Similar to How people talk about health? (20)

Data Science Innovations : Democratisation of Data and Data Science
Data Science Innovations : Democratisation of Data and Data Science  Data Science Innovations : Democratisation of Data and Data Science
Data Science Innovations : Democratisation of Data and Data Science
 
Data Sets as Facilitator for new Products and Services for Universities
Data Sets as Facilitator for new Products and Services for UniversitiesData Sets as Facilitator for new Products and Services for Universities
Data Sets as Facilitator for new Products and Services for Universities
 
Personalized health knowledge graph ckg workshop - iswc 2018 (2)
Personalized health knowledge graph   ckg workshop - iswc 2018 (2)Personalized health knowledge graph   ckg workshop - iswc 2018 (2)
Personalized health knowledge graph ckg workshop - iswc 2018 (2)
 
Learning to Classify Users in Online Interaction Networks
Learning to Classify Users in Online Interaction NetworksLearning to Classify Users in Online Interaction Networks
Learning to Classify Users in Online Interaction Networks
 
What's up at Kno.e.sis?
What's up at Kno.e.sis? What's up at Kno.e.sis?
What's up at Kno.e.sis?
 
Rapid biomedical search
Rapid biomedical search Rapid biomedical search
Rapid biomedical search
 
A Customisable Pipeline for Continuously Harvesting Socially-Minded Twitter U...
A Customisable Pipeline for Continuously Harvesting Socially-Minded Twitter U...A Customisable Pipeline for Continuously Harvesting Socially-Minded Twitter U...
A Customisable Pipeline for Continuously Harvesting Socially-Minded Twitter U...
 
Digitalization Capacity for Knowledge Acquisition-Learning from Health Monito...
Digitalization Capacity for Knowledge Acquisition-Learning from Health Monito...Digitalization Capacity for Knowledge Acquisition-Learning from Health Monito...
Digitalization Capacity for Knowledge Acquisition-Learning from Health Monito...
 
A Priori Relevance Based On Quality and Diversity Of Social Signals
A Priori Relevance Based On Quality and Diversity Of Social SignalsA Priori Relevance Based On Quality and Diversity Of Social Signals
A Priori Relevance Based On Quality and Diversity Of Social Signals
 
Extracting interesting concepts from large-scale textual data
Extracting interesting concepts from large-scale textual dataExtracting interesting concepts from large-scale textual data
Extracting interesting concepts from large-scale textual data
 
Building Effective Visualization Shiny WVF
Building Effective Visualization Shiny WVFBuilding Effective Visualization Shiny WVF
Building Effective Visualization Shiny WVF
 
Networks, Deep Learning (and COVID-19)
Networks, Deep Learning (and COVID-19)Networks, Deep Learning (and COVID-19)
Networks, Deep Learning (and COVID-19)
 
thesis_presentation_v1 shorter split
thesis_presentation_v1 shorter splitthesis_presentation_v1 shorter split
thesis_presentation_v1 shorter split
 
AI in between online and offline discourse - and what has ChatGPT to do with ...
AI in between online and offline discourse - and what has ChatGPT to do with ...AI in between online and offline discourse - and what has ChatGPT to do with ...
AI in between online and offline discourse - and what has ChatGPT to do with ...
 
Kno.e.sis Approach to Impactful Research & Training for Exceptional Careers
Kno.e.sis Approach to Impactful Research & Training for Exceptional CareersKno.e.sis Approach to Impactful Research & Training for Exceptional Careers
Kno.e.sis Approach to Impactful Research & Training for Exceptional Careers
 
Data and science
Data and scienceData and science
Data and science
 
Sensors1(1)
Sensors1(1)Sensors1(1)
Sensors1(1)
 
Big datafordevelopment un-globalpulsejune2012
Big datafordevelopment un-globalpulsejune2012Big datafordevelopment un-globalpulsejune2012
Big datafordevelopment un-globalpulsejune2012
 
Examples of Applied Semantic Technologies: Application of Semantic Sensor Net...
Examples of Applied Semantic Technologies: Application of Semantic Sensor Net...Examples of Applied Semantic Technologies: Application of Semantic Sensor Net...
Examples of Applied Semantic Technologies: Application of Semantic Sensor Net...
 
From Text to Data to the World: The Future of Knowledge Graphs
From Text to Data to the World: The Future of Knowledge GraphsFrom Text to Data to the World: The Future of Knowledge Graphs
From Text to Data to the World: The Future of Knowledge Graphs
 

More from Nicola Procopio

Who are the top influencers and what characterizes them?
Who are the top influencers and what characterizes them?Who are the top influencers and what characterizes them?
Who are the top influencers and what characterizes them?Nicola Procopio
 
Algoritmi non supervisionati per time series
Algoritmi non supervisionati per time seriesAlgoritmi non supervisionati per time series
Algoritmi non supervisionati per time seriesNicola Procopio
 
Modelli di Durata: un'analisi sull'utilizzo del portale Web dell'Università d...
Modelli di Durata: un'analisi sull'utilizzo del portale Web dell'Università d...Modelli di Durata: un'analisi sull'utilizzo del portale Web dell'Università d...
Modelli di Durata: un'analisi sull'utilizzo del portale Web dell'Università d...Nicola Procopio
 
Create R package with RStudio
Create R package with RStudioCreate R package with RStudio
Create R package with RStudioNicola Procopio
 
Breve introduzione alle CNN
Breve introduzione alle CNNBreve introduzione alle CNN
Breve introduzione alle CNNNicola Procopio
 
Integris - The Company, The People, The Solutions
Integris - The Company, The People, The SolutionsIntegris - The Company, The People, The Solutions
Integris - The Company, The People, The SolutionsNicola Procopio
 
Introduzione all'Intelligenza Artificiale. Cos'è e come impatta sulle nostre ...
Introduzione all'Intelligenza Artificiale. Cos'è e come impatta sulle nostre ...Introduzione all'Intelligenza Artificiale. Cos'è e come impatta sulle nostre ...
Introduzione all'Intelligenza Artificiale. Cos'è e come impatta sulle nostre ...Nicola Procopio
 
Studiare Statistica all'UniCal
Studiare Statistica all'UniCalStudiare Statistica all'UniCal
Studiare Statistica all'UniCalNicola Procopio
 
Data Culture live @ TaG: uMap
Data Culture live @ TaG: uMapData Culture live @ TaG: uMap
Data Culture live @ TaG: uMapNicola Procopio
 
Data Culture live @ TaG: osm for beginners
Data Culture live @ TaG: osm for beginnersData Culture live @ TaG: osm for beginners
Data Culture live @ TaG: osm for beginnersNicola Procopio
 

More from Nicola Procopio (13)

Who are the top influencers and what characterizes them?
Who are the top influencers and what characterizes them?Who are the top influencers and what characterizes them?
Who are the top influencers and what characterizes them?
 
Algoritmi non supervisionati per time series
Algoritmi non supervisionati per time seriesAlgoritmi non supervisionati per time series
Algoritmi non supervisionati per time series
 
Modelli di Durata: un'analisi sull'utilizzo del portale Web dell'Università d...
Modelli di Durata: un'analisi sull'utilizzo del portale Web dell'Università d...Modelli di Durata: un'analisi sull'utilizzo del portale Web dell'Università d...
Modelli di Durata: un'analisi sull'utilizzo del portale Web dell'Università d...
 
Create R package with RStudio
Create R package with RStudioCreate R package with RStudio
Create R package with RStudio
 
Breve introduzione alle CNN
Breve introduzione alle CNNBreve introduzione alle CNN
Breve introduzione alle CNN
 
Integris - The Company, The People, The Solutions
Integris - The Company, The People, The SolutionsIntegris - The Company, The People, The Solutions
Integris - The Company, The People, The Solutions
 
Introduzione all'Intelligenza Artificiale. Cos'è e come impatta sulle nostre ...
Introduzione all'Intelligenza Artificiale. Cos'è e come impatta sulle nostre ...Introduzione all'Intelligenza Artificiale. Cos'è e come impatta sulle nostre ...
Introduzione all'Intelligenza Artificiale. Cos'è e come impatta sulle nostre ...
 
Studiare Statistica all'UniCal
Studiare Statistica all'UniCalStudiare Statistica all'UniCal
Studiare Statistica all'UniCal
 
Presentazione Tetris
Presentazione TetrisPresentazione Tetris
Presentazione Tetris
 
Pitch TechGarage
Pitch TechGaragePitch TechGarage
Pitch TechGarage
 
Pitch Barcamper
Pitch BarcamperPitch Barcamper
Pitch Barcamper
 
Data Culture live @ TaG: uMap
Data Culture live @ TaG: uMapData Culture live @ TaG: uMap
Data Culture live @ TaG: uMap
 
Data Culture live @ TaG: osm for beginners
Data Culture live @ TaG: osm for beginnersData Culture live @ TaG: osm for beginners
Data Culture live @ TaG: osm for beginners
 

Recently uploaded

Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...gajnagarg
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...nirzagarg
 
Case Study 4 Where the cry of rebellion happen?
Case Study 4 Where the cry of rebellion happen?Case Study 4 Where the cry of rebellion happen?
Case Study 4 Where the cry of rebellion happen?RemarkSemacio
 
Lake Town / Independent Kolkata Call Girls Phone No 8005736733 Elite Escort S...
Lake Town / Independent Kolkata Call Girls Phone No 8005736733 Elite Escort S...Lake Town / Independent Kolkata Call Girls Phone No 8005736733 Elite Escort S...
Lake Town / Independent Kolkata Call Girls Phone No 8005736733 Elite Escort S...HyderabadDolls
 
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...HyderabadDolls
 
Introduction to Statistics Presentation.pptx
Introduction to Statistics Presentation.pptxIntroduction to Statistics Presentation.pptx
Introduction to Statistics Presentation.pptxAniqa Zai
 
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...gajnagarg
 
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptxRESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptxronsairoathenadugay
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...nirzagarg
 
Predictive Precipitation: Advanced Rain Forecasting Techniques
Predictive Precipitation: Advanced Rain Forecasting TechniquesPredictive Precipitation: Advanced Rain Forecasting Techniques
Predictive Precipitation: Advanced Rain Forecasting TechniquesBoston Institute of Analytics
 
Oral Sex Call Girls Kashmiri Gate Delhi Just Call 👉👉 📞 8448380779 Top Class C...
Oral Sex Call Girls Kashmiri Gate Delhi Just Call 👉👉 📞 8448380779 Top Class C...Oral Sex Call Girls Kashmiri Gate Delhi Just Call 👉👉 📞 8448380779 Top Class C...
Oral Sex Call Girls Kashmiri Gate Delhi Just Call 👉👉 📞 8448380779 Top Class C...Delhi Call girls
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...nirzagarg
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Klinik kandungan
 
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabiaahmedjiabur940
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...gajnagarg
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNKTimothy Spann
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...nirzagarg
 
Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...
Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...
Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...HyderabadDolls
 
Dubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls DubaiDubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls Dubaikojalkojal131
 

Recently uploaded (20)

Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
 
Case Study 4 Where the cry of rebellion happen?
Case Study 4 Where the cry of rebellion happen?Case Study 4 Where the cry of rebellion happen?
Case Study 4 Where the cry of rebellion happen?
 
Abortion pills in Doha {{ QATAR }} +966572737505) Get Cytotec
Abortion pills in Doha {{ QATAR }} +966572737505) Get CytotecAbortion pills in Doha {{ QATAR }} +966572737505) Get Cytotec
Abortion pills in Doha {{ QATAR }} +966572737505) Get Cytotec
 
Lake Town / Independent Kolkata Call Girls Phone No 8005736733 Elite Escort S...
Lake Town / Independent Kolkata Call Girls Phone No 8005736733 Elite Escort S...Lake Town / Independent Kolkata Call Girls Phone No 8005736733 Elite Escort S...
Lake Town / Independent Kolkata Call Girls Phone No 8005736733 Elite Escort S...
 
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
 
Introduction to Statistics Presentation.pptx
Introduction to Statistics Presentation.pptxIntroduction to Statistics Presentation.pptx
Introduction to Statistics Presentation.pptx
 
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
 
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptxRESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
 
Predictive Precipitation: Advanced Rain Forecasting Techniques
Predictive Precipitation: Advanced Rain Forecasting TechniquesPredictive Precipitation: Advanced Rain Forecasting Techniques
Predictive Precipitation: Advanced Rain Forecasting Techniques
 
Oral Sex Call Girls Kashmiri Gate Delhi Just Call 👉👉 📞 8448380779 Top Class C...
Oral Sex Call Girls Kashmiri Gate Delhi Just Call 👉👉 📞 8448380779 Top Class C...Oral Sex Call Girls Kashmiri Gate Delhi Just Call 👉👉 📞 8448380779 Top Class C...
Oral Sex Call Girls Kashmiri Gate Delhi Just Call 👉👉 📞 8448380779 Top Class C...
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
 
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
 
Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...
Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...
Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...
 
Dubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls DubaiDubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls Dubai
 

How people talk about health?

  • 1. HOW PEOPLE TALK ABOUT HEALTH? Nicola Procopio 25-01-2022
  • 2. Senior Data Scientist Nicola Procopio About me nicola.procopio@healthwareinternational.com Contacts Background Community https://www.slideshare.net/NicolaProcopio https://it.linkedin.com/in/nicolaprocopio https://github.com/nickprock Actually @
  • 3. Mission We play at the intersection of science, creativity, boundless curiosity, and our understanding of human needs. That’s how we design transformational healthcare experiences that engage, simplify and empower people’s lives. We are digital natives and multi-talented coders, connected and passionate to learn and innovate. Our mission is to design and develop successful solutions and digital products. The full-service healthcare agency of Healthware Group
  • 4. Summary 1. Google Flu Trends (2009) 2. Online Clustering Algorithm on Twitter Data (2016) 3. Observational Study on an Online Decision Support System Data (2021)
  • 5. Google Flu Trends ● Started in 2008 ● It provided estimates of influenza activity for more than 25 countries. “Google web search queries can be used to accurately estimate influenza-like illness percentages in each of the nine public health regions of the United States” [1]
  • 6. Google Flu Trends ● Google Flu Trends stopped publishing current estimates on 9 August 2015.
  • 7. HealthS-Tweet ● "Health Surveillance through Twitter" is a specialized version for the healthcare domain of a previous work about Online Clustering for Topic Detection[3] ● Incremental clustering ● Able to detect low-frequency topics ● not based on word frequencies, but on similarity of the words and hashtag used ● Tweets posted in USA from September 2015 to April 2016
  • 8. HealthS-Tweet: Data ● tweet → tw: (id, u, l, tf, fv) ○ id: tweet id ○ u: user id ○ l: location [Lat.; Lon.] ○ tf: creation time ○ fv: (wu, wb, hu, hb) ■ wu, wb: word unigram, word bigram ■ hu, hb: hashtag unigram, hashtag bigram ● Topic Summary → S like a tweet extended with (ht, t0, tc, fvt) ○ ht: health topic, the label ○ t0, tc: creation time, last update time ○ fvt: (fv, ff) ■ fv: analogous to the tweet fv ■ ff (fwu, fwb, fhu, fhb): the frequencies associate to unigram and bigram
  • 12. HealthS-Tweet: Results ● outperform traditional topic modeling ● the topic are very different (spanning from seasonal disease to common disease) ● future developments*: ○ HT sentiments ○ improve the labeling about HT ○ automatic hyperparameters tuning
  • 13. Online DSS A digital Italian health tech startup, Paginemediche, developed a noncommercial, online DSS with a chat user interface early 2020: VISITAMI This study aimed to compare the trend in online DSS sessions with that of COVID-19 cases reported by the national health surveillance system in Italy, from February 2020 to March 2021.[4]
  • 14. Online DSS: dataset ● 75557 sessions ○ 65207 were sessions by symptomatic users ○ 19062 were by contacts of individuals with COVID-19
  • 15. Online DSS: Analysis ● To perform the comparison, we first used moving averages to display the curves. ○ K was set at a value of 7, as data from the national surveillance system were published weekly. ● We then scaled each time series in a range between 0 and 1 On the scaled Time Series: ● We first applied Symbolic Aggregate approXimation (SAX) encoding ● Then, we calculated the Hamming distance between SAX Strings ● To verify that the online DSS anticipated the trends observed in notified cases, we shifted its time series 1 week ahead.
  • 17. SAX Encoding Symbolic Aggregate approXimation Encoding: ● developed by Keogh and Lin in 2002 ● transforms time series in sequence of symbols ● robust for missing values ● unsupervised Adolphe Sax saxophone creator. Correlation(SAX, Sax) = 0.00
  • 18. Data Preparation Il SAX Encoding needs data organized as follows: ● for each row a time series ● for each column a timestep ● standardized data protezione civile’s data about Covid-19
  • 19. Piecewise Aggregate Approximation Idea “SAX encoding is a method used to simplify time series by symbolically representing periods, the data becomes much smaller and easier to deal with, while still capturing its important aspects.” A Time Series Y = [Y₁, Y₂, …,Yn] can be reduced in a sequence X = [X₁, X₂, …, Xm] with m≤n using: special cases: ● m=n return the original series ● m=1 return only one value, the average of the original series
  • 20. Piecewise Aggregate Approximation Important, the setting of hyperparameter w, the time window. It controls the number of segments, if len(TS)/w isn’t integer round up so as not to lose information.
  • 21. SAX String Creation of the SAX Strings: ● choose how many levels ● choose the bounds ● labeling periods returned by PAA
  • 22. Online DSS: Results ● The series "user with contacts" was more consistent with the trend in confirmed cases than "user with symptoms" ● Throughout the period, after applying the 1-week shift, the Hamming distance improved from 0.49 to 0.46 ● July - December 2020, after applying the 1-week shift, the Hamming distance improved from 0.16 to 0.08
  • 23. I haven’t a “Conclusion” Thank You! Let’s Chat!
  • 24. References 1. Ginsberg, Jeremy, et al. "Detecting influenza epidemics using search engine query data." Nature 457.7232 (2009): 1012-1014. 2. Comito, Carmela, Clara Pizzuti, and Nicola Procopio. "How people talk about health? Detecting health topics from Twitter streams." Proceedings of the International Conference on Big Data and Internet of Thing. 2017. 3. C. Comito, C. Pizzuti and N. Procopio, "Online Clustering for Topic Detection in Social Data Streams," 2016 IEEE 28th International Conference on Tools with Artificial Intelligence (ICTAI), 2016, pp. 362-369, doi: 10.1109/ICTAI.2016.0062. 4. Tozzi A, Gesualdo F, Urbani E, Sbenaglia A, Ascione R, Procopio N, Croci I, Rizzo C. “Digital Surveillance Through an Online Decision Support Tool for COVID-19 Over One Year of the Pandemic in Italy: Observational Study”, J Med Internet Res 2021;23(8):e29556. 5. “Individuare Pattern con il SAX Encoding”. IAML Blog