The document presents a method for transferring unsupervised influenza-like illness models trained on online search data from a source location to a target location. The method involves:
1. Training a regression model on search query and illness rate data from the source location.
2. Mapping the important source queries to semantically and temporally similar target queries.
3. Transferring the trained source model weights to make predictions for the target location, without direct access to illness rate data there.
The goal is to estimate illness rates for a target location where such data is unavailable, by leveraging data and a model from a related source location.
BalloonNet: A Deploying Method for a Three-Dimensional Wireless Network Surro...Naoki Shibata
Aiming at fast establishment of a wireless network around a multi-level building in a disaster area, we propose an efficient method to determine the locations of network nodes in the air. Nodes are attached to balloons outside a building and deployed in the air so that the network can be accessed from anywhere in the building. In this paper, we introduce an original radio propagation model for predicting path loss from an outdoor position to a position inside a building. In order to address the three-dimensional deployment problem, the proposed method optimizes an objective function for satisfying two goals: (1) guarantee the coverage: the target space needs to be covered by over a certain percentage by wireless network nodes, (2) minimize the number of network nodes. For solving this problem, we propose an algorithm based on a genetic algorithm. To evaluate the proposed method, we compared our method with three benchmark methods, and the results show that the proposed method requires fewer nodes than other methods.
Finding Meaning in Points, Areas and Surfaces: Spatial Analysis in RRevolution Analytics
Everything happens somewhere and spatial analysis attempts to use location as an explanatory variable. Such analysis is made complex by the very many ways we habitually record spatial location, the complexity of spatial data structures, and the wide variety of possible domain-driven questions we might ask. One option is to develop and use software for specific types of spatial data, another is to use a purpose-built geographical information system (GIS), but determined work by R enthusiasts has resulted in a multiplicity of packages in the R environment that can also be used.
BalloonNet: A Deploying Method for a Three-Dimensional Wireless Network Surro...Naoki Shibata
Aiming at fast establishment of a wireless network around a multi-level building in a disaster area, we propose an efficient method to determine the locations of network nodes in the air. Nodes are attached to balloons outside a building and deployed in the air so that the network can be accessed from anywhere in the building. In this paper, we introduce an original radio propagation model for predicting path loss from an outdoor position to a position inside a building. In order to address the three-dimensional deployment problem, the proposed method optimizes an objective function for satisfying two goals: (1) guarantee the coverage: the target space needs to be covered by over a certain percentage by wireless network nodes, (2) minimize the number of network nodes. For solving this problem, we propose an algorithm based on a genetic algorithm. To evaluate the proposed method, we compared our method with three benchmark methods, and the results show that the proposed method requires fewer nodes than other methods.
Finding Meaning in Points, Areas and Surfaces: Spatial Analysis in RRevolution Analytics
Everything happens somewhere and spatial analysis attempts to use location as an explanatory variable. Such analysis is made complex by the very many ways we habitually record spatial location, the complexity of spatial data structures, and the wide variety of possible domain-driven questions we might ask. One option is to develop and use software for specific types of spatial data, another is to use a purpose-built geographical information system (GIS), but determined work by R enthusiasts has resulted in a multiplicity of packages in the R environment that can also be used.
Modeling Electronic Health Records with Recurrent Neural NetworksJosh Patterson
Time series data is increasingly ubiquitous. This trend is especially obvious in health and wellness, with both the adoption of electronic health record (EHR) systems in hospitals and clinics and the proliferation of wearable sensors. In 2009, intensive care units in the United States treated nearly 55,000 patients per day, generating digital-health databases containing millions of individual measurements, most of those forming time series. In the first quarter of 2015 alone, over 11 million health-related wearables were shipped by vendors. Recording hundreds of measurements per day per user, these devices are fueling a health time series data explosion. As a result, we will need ever more sophisticated tools to unlock the true value of this data to improve the lives of patients worldwide.
Deep learning, specifically with recurrent neural networks (RNNs), has emerged as a central tool in a variety of complex temporal-modeling problems, such as speech recognition. However, RNNs are also among the most challenging models to work with, particularly outside the domains where they are widely applied. Josh Patterson, David Kale, and Zachary Lipton bring the open source deep learning library DL4J to bear on the challenge of analyzing clinical time series using RNNs. DL4J provides a reliable, efficient implementation of many deep learning models embedded within an enterprise-ready open source data ecosystem (e.g., Hadoop and Spark), making it well suited to complex clinical data. Josh, David, and Zachary offer an overview of deep learning and RNNs and explain how they are implemented in DL4J. They then demonstrate a workflow example that uses a pipeline based on DL4J and Canova to prepare publicly available clinical data from PhysioNet and apply the DL4J RNN.
Text REtrieval Conference (TREC) Dynamic Domain Track 2015Grace Hui Yang
This is the introductory talk for the TREC Dynamic Domain Track. The Track ran from 2015 to 2017, aiming to evaluate and advance research in dynamic search and domain-specific search. This talk was prepared to introduce the ideas and setups in the upcoming Track to the research community.
Spatio-‐temporal Sensor Integration, Analysis, Classification or Can Exascal...Joel Saltz
Presentation at Clusters, Clouds and Data for Scientific Computing 2014
Integrative analyses of large scale spatio-temporal datasets play increasingly important roles in many areas of science and engineering. Our recent work in this area is motivated by application scenarios involving complementary digital microscopy, radiology and “omic” analyses in cancer research. In these scenarios, the objective is to use a coordinated set of image analysis, feature extraction and machine learning methods to predict disease progression and to aid in targeting new therapies. I will describe tools and methods our group has developed for extraction, management, and analysis of features along with the systems software methods for optimizing execution on high end CPU/GPU platforms. Once having provided our current work as an introduction, I will then describe 1) related but much more ambitious exascale biomedical and non-biomedical use cases that also involve the complex interplay between multi-scale structure and molecular mechanism and 2) concepts and requirements for methods and tools that address these challenges.
[Digest] Eigenbehaviors- identifying structure in routineHsing-chuan Hsieh
1. Preface
-People’s daily routines are typically coupled with routines across other temporal scales
-E.g., going out on the town with friends on Saturday nights, or spending time with family during the December holidays.
-So are animals (daily and seasonal cycle)
2. Purpose: present a methodology-- “eigenbehaviors” to quantify these universal patterns in the behavior of individuals and communities within a social network.
-Eigenbehaviors- the principal components of an individual’s behavioral dataset.
For full information, please refer to the original paper:
http://realitycommons.media.mit.edu/pdfs/eigenbehaviors.pdf
Some engineering and scientific computer models that have high dimensional input space are actually only affected by a few essential input variables. If these active variables are identified, it would reduce the computation in the estimation of the Gaussian process (GP) model and help
researchers understand the system modeled by the computer simulation. More importantly, reducing the input dimensions would also increase the prediction accuracy, as it alleviates the "curse of dimensionality" problem.
In this talk, we propose a new approach to reduce the input dimension of the Gaussian process model. Specifically, we develop an optimization method to identify a convex combination of a subset of kernels of lower dimensions from a large candidate set of kernels, as the correlation function for the GP model. To make sure a sparse subset is selected, we add a penalty on the weights of kernels. Several numerical examples are shown to show the advantages of the
method. The proposed method has many connections with the existing methods including active subspace, additive GP, and composite GP models in the Uncertainty Quantification literature.
The Web of Data: do we actually understand what we built?Frank van Harmelen
Despite its obvious success (largest knowledge base ever built, used in practice by companies and governments alike), we actually understand very little of the structure of the Web of Data. Its formal meaning is specified in logic, but with its scale, context dependency and dynamics, the Web of Data has outgrown its traditional model-theoretic semantics.
Is the meaning of a logical statement (an edge in the graph) dependent on the cluster ("context") in which it appears? Does a more densely connected concept (node) contain more information? Is the path length between two nodes related to their semantic distance?
Properties such as clustering, connectivity and path length are not described, much less explained by model-theoretic semantics. Do such properties contribute to the meaning of a knowledge graph?
To properly understand the structure and meaning of knowledge graphs, we should no longer treat knowledge graphs as (only) a set of logical statements, but treat them properly as a graph. But how to do this is far from clear.
In this talk, I report on some of our early results on some of these questions, but I ask many more questions for which we don't have answers yet.
Track 13. New Trends in Digital Humanities
Authors: Alejandro Benito; Antonio G. Losada; Roberto Theron; Amelie Dorn; Melanie Seltmann; Eveline Wandl-Vogt
https://youtu.be/5tTot6vinZk
Survival analysis is an important method for analysis time to event data for biomedical and reliability applications. It is often done with semiparametric methods e.g. the Cox proportional hazards model. In this presentation I discuss an alternative parametric approach to survival analysis that can overcome some of the limitations of the Cox model and provide additional flexibility to the modeler. This approach may also be justified from a Bayesian perspective and the connection is shown as well. Simulations and case studies that illustrate the flexibility of the GAM approach for survival analysis and its equivalent performance to existing methods for survival data are discussed in the text.
The material presented herein are based on two publications:
1) Argyropoulos C, Unruh ML. Analysis of time to event outcomes in randomized controlled trials by generalized additive models. PLoS One. 2015 Apr 23;10(4):e0123784. doi: 10.1371/journal.pone.0123784. PMID: 25906075; PMCID: PMC4408032.
2)Bologa CG, Pankratz VS, Unruh ML, Roumelioti ME, Shah V, Shaffi SK, Arzhan S, Cook J, Argyropoulos C. High performance implementation of the hierarchical likelihood for generalized linear mixed models: an application to estimate the potassium reference range in massive electronic health records datasets. BMC Med Res Methodol. 2021 Jul 24;21(1):151. doi: 10.1186/s12874-021-01318-6. PMID: 34303362; PMCID: PMC8310602.
Slides from my lecture for the Information Retrieval and Data Mining course at University College London
The slides cover introductory concepts on topic models, vector semantics and basic end applications
More Related Content
Similar to Transfer learning for unsupervised influenza-like illness models from online search data
Modeling Electronic Health Records with Recurrent Neural NetworksJosh Patterson
Time series data is increasingly ubiquitous. This trend is especially obvious in health and wellness, with both the adoption of electronic health record (EHR) systems in hospitals and clinics and the proliferation of wearable sensors. In 2009, intensive care units in the United States treated nearly 55,000 patients per day, generating digital-health databases containing millions of individual measurements, most of those forming time series. In the first quarter of 2015 alone, over 11 million health-related wearables were shipped by vendors. Recording hundreds of measurements per day per user, these devices are fueling a health time series data explosion. As a result, we will need ever more sophisticated tools to unlock the true value of this data to improve the lives of patients worldwide.
Deep learning, specifically with recurrent neural networks (RNNs), has emerged as a central tool in a variety of complex temporal-modeling problems, such as speech recognition. However, RNNs are also among the most challenging models to work with, particularly outside the domains where they are widely applied. Josh Patterson, David Kale, and Zachary Lipton bring the open source deep learning library DL4J to bear on the challenge of analyzing clinical time series using RNNs. DL4J provides a reliable, efficient implementation of many deep learning models embedded within an enterprise-ready open source data ecosystem (e.g., Hadoop and Spark), making it well suited to complex clinical data. Josh, David, and Zachary offer an overview of deep learning and RNNs and explain how they are implemented in DL4J. They then demonstrate a workflow example that uses a pipeline based on DL4J and Canova to prepare publicly available clinical data from PhysioNet and apply the DL4J RNN.
Text REtrieval Conference (TREC) Dynamic Domain Track 2015Grace Hui Yang
This is the introductory talk for the TREC Dynamic Domain Track. The Track ran from 2015 to 2017, aiming to evaluate and advance research in dynamic search and domain-specific search. This talk was prepared to introduce the ideas and setups in the upcoming Track to the research community.
Spatio-‐temporal Sensor Integration, Analysis, Classification or Can Exascal...Joel Saltz
Presentation at Clusters, Clouds and Data for Scientific Computing 2014
Integrative analyses of large scale spatio-temporal datasets play increasingly important roles in many areas of science and engineering. Our recent work in this area is motivated by application scenarios involving complementary digital microscopy, radiology and “omic” analyses in cancer research. In these scenarios, the objective is to use a coordinated set of image analysis, feature extraction and machine learning methods to predict disease progression and to aid in targeting new therapies. I will describe tools and methods our group has developed for extraction, management, and analysis of features along with the systems software methods for optimizing execution on high end CPU/GPU platforms. Once having provided our current work as an introduction, I will then describe 1) related but much more ambitious exascale biomedical and non-biomedical use cases that also involve the complex interplay between multi-scale structure and molecular mechanism and 2) concepts and requirements for methods and tools that address these challenges.
[Digest] Eigenbehaviors- identifying structure in routineHsing-chuan Hsieh
1. Preface
-People’s daily routines are typically coupled with routines across other temporal scales
-E.g., going out on the town with friends on Saturday nights, or spending time with family during the December holidays.
-So are animals (daily and seasonal cycle)
2. Purpose: present a methodology-- “eigenbehaviors” to quantify these universal patterns in the behavior of individuals and communities within a social network.
-Eigenbehaviors- the principal components of an individual’s behavioral dataset.
For full information, please refer to the original paper:
http://realitycommons.media.mit.edu/pdfs/eigenbehaviors.pdf
Some engineering and scientific computer models that have high dimensional input space are actually only affected by a few essential input variables. If these active variables are identified, it would reduce the computation in the estimation of the Gaussian process (GP) model and help
researchers understand the system modeled by the computer simulation. More importantly, reducing the input dimensions would also increase the prediction accuracy, as it alleviates the "curse of dimensionality" problem.
In this talk, we propose a new approach to reduce the input dimension of the Gaussian process model. Specifically, we develop an optimization method to identify a convex combination of a subset of kernels of lower dimensions from a large candidate set of kernels, as the correlation function for the GP model. To make sure a sparse subset is selected, we add a penalty on the weights of kernels. Several numerical examples are shown to show the advantages of the
method. The proposed method has many connections with the existing methods including active subspace, additive GP, and composite GP models in the Uncertainty Quantification literature.
The Web of Data: do we actually understand what we built?Frank van Harmelen
Despite its obvious success (largest knowledge base ever built, used in practice by companies and governments alike), we actually understand very little of the structure of the Web of Data. Its formal meaning is specified in logic, but with its scale, context dependency and dynamics, the Web of Data has outgrown its traditional model-theoretic semantics.
Is the meaning of a logical statement (an edge in the graph) dependent on the cluster ("context") in which it appears? Does a more densely connected concept (node) contain more information? Is the path length between two nodes related to their semantic distance?
Properties such as clustering, connectivity and path length are not described, much less explained by model-theoretic semantics. Do such properties contribute to the meaning of a knowledge graph?
To properly understand the structure and meaning of knowledge graphs, we should no longer treat knowledge graphs as (only) a set of logical statements, but treat them properly as a graph. But how to do this is far from clear.
In this talk, I report on some of our early results on some of these questions, but I ask many more questions for which we don't have answers yet.
Track 13. New Trends in Digital Humanities
Authors: Alejandro Benito; Antonio G. Losada; Roberto Theron; Amelie Dorn; Melanie Seltmann; Eveline Wandl-Vogt
https://youtu.be/5tTot6vinZk
Survival analysis is an important method for analysis time to event data for biomedical and reliability applications. It is often done with semiparametric methods e.g. the Cox proportional hazards model. In this presentation I discuss an alternative parametric approach to survival analysis that can overcome some of the limitations of the Cox model and provide additional flexibility to the modeler. This approach may also be justified from a Bayesian perspective and the connection is shown as well. Simulations and case studies that illustrate the flexibility of the GAM approach for survival analysis and its equivalent performance to existing methods for survival data are discussed in the text.
The material presented herein are based on two publications:
1) Argyropoulos C, Unruh ML. Analysis of time to event outcomes in randomized controlled trials by generalized additive models. PLoS One. 2015 Apr 23;10(4):e0123784. doi: 10.1371/journal.pone.0123784. PMID: 25906075; PMCID: PMC4408032.
2)Bologa CG, Pankratz VS, Unruh ML, Roumelioti ME, Shah V, Shaffi SK, Arzhan S, Cook J, Argyropoulos C. High performance implementation of the hierarchical likelihood for generalized linear mixed models: an application to estimate the potassium reference range in massive electronic health records datasets. BMC Med Res Methodol. 2021 Jul 24;21(1):151. doi: 10.1186/s12874-021-01318-6. PMID: 34303362; PMCID: PMC8310602.
Slides from my lecture for the Information Retrieval and Data Mining course at University College London
The slides cover introductory concepts on topic models, vector semantics and basic end applications
Mining online data for public health surveillanceVasileios Lampos
Online user-generated data (search engine activity, social media) offers health-related insights that traditional health monitoring systems cannot always provide. A central paradigm for this has been the estimation of the rate of a disease in a population by modelling the corresponding online search engine activity. However, the various inaccuracies in the estimates of Google Flu Trends (GFT) ―a popular approach for estimating influenza-like illness (ILI) rates from online search logs― have generated a debate as to whether this is something feasible. In this talk, we will first go through GFT and try to explain why it has been unsuccessful. We will then present alternative approaches which deploy more adequate statistical as well as semantic modelling techniques to successfully capture ILI rates both in the US and England. Long-term empirical analysis will showcase that models retain a strong predictive performance across many consecutive flu seasons. Expanding on the previous approaches, we will also present a brief overview of more recent work, where multi-task learning across different geographies is used to further improve ILI models, especially in situations where sporadic historical health reports are available.
Mining socio-political and socio-economic signals from social media contentVasileios Lampos
Over the past few years user-generated content, primarily originating from social media platforms, has been the focus of considerable research effort. One of the main motivations is the potential of using this data as an additional source of information for enhancing our understanding of societal or more personalised
signals. This talk will showcase a series of ideas which explored that basic hypothesis, proposing statistical natural language processing frameworks for the estimation of collective patterns (e.g. emotions or voting intentions) or of more focused demographic user attributes (e.g. the occupation, income and socioeconomic status). Apart from the essential statistical evaluation of the proposed methods, interesting insights, stemming from a qualitative analysis, will be presented as well.
Inferring the Socioeconomic Status of Social Media Users based on Behaviour a...Vasileios Lampos
This paper presents a method to classify social media users based on their socioeconomic status. Our experiments are conducted on a curated set of Twitter profiles, where each user is represented by the posted text, topics of discussion, interactive behaviour and estimated impact on the microblogging platform. Initially, we formulate a 3-way classification task, where users are classified as having an upper, middle or lower socioeconomic status. A nonlinear, generative learning approach using a composite Gaussian Process kernel provides significantly better classification accuracy (75%) than a competitive linear alternative. By turning this task into a binary classification - upper vs. medium and lower class - the proposed classifier reaches an accuracy of 82%.
An introduction to digital health surveillance from online user-generated con...Vasileios Lampos
This talk provides a brief introduction on methods for performing health surveillance tasks using online user-generated data. Four case studies are being presented: a) the original Google Flu Trends model, b) a basic model for mapping Twitter data to influenza rates, c) an improved Google Flu Trends model, and d) a method for assessing the impact of a health intervention from Internet data.
User-generated content: collective and personalised inference tasksVasileios Lampos
Over the past few years large-scale, user-generated, online resources, such as social media or search engine queries, have been driving various interesting research efforts. This talk will showcase a series of ideas that we have developed based on the hypothesis that this online stream of content should represent a biased -at least- fraction of real-world situations, opinions or behaviour. Firstly, core algorithms have been proposed for mapping Twitter or Google search data to estimations about the current rate of an infectious disease, such as influenza. Then, through the addition of a user dimension in linear regularised text regression models, a family of bilinear models has been introduced, capable of improving approximations of voting intention trends. Finally, by redirecting the previous modelling principle, social media content has been used to learn as well as
qualitatively interpret user attributes, such as impact, occupation,
income and socioeconomic status.
Instead of just modelling word frequencies (or in general word characteristics) by learning a weight vector w, we are also learning a weight for each user (u). Thus, from a linear regression model, we now go bilinear. This idea was applied for predicting voting intention polls from tweets in two countries (Austria and the UK), but it is also applicable on various other NLP tasks, such as the extraction of socioeconomic patterns from the news.
Extracting interesting concepts from large-scale textual dataVasileios Lampos
Over the past few years large-scale textual resources, such as news articles, social media, search queries or even digitised books, have been the centre of various research efforts. In particular, the open nature of the microblogging platform of Twitter provided the unique opportunity for various appealing ideas to be evaluated. Based on the hypothesis that this online stream of content should represent at least a biased fraction of real-world situations or opinions, we have proposed core algorithms for estimating the current rate of an infectious disease, such as influenza, or even of a natural, less stable phenomenon like rainfall rates. A simplified emotion analysis on a longitudinal set of tweets revealed interesting patterns, including signs of rising anger and fear before the UK riots in August, 2011. A similar analysis on the Google N-gram book database uncovered collective patterns of affect over the course of the 20th century. Through the extension of linear text regression approaches by the addition of a user dimension, we proposed a family of bilinear regularised regression models, which found application in the approximation of voting intention trends. Finally, we reversed the previous modelling principle in an attempt to qualitatively analyse user attributes or behaviours.
Assessing the impact of a health intervention via user-generated Internet con...Vasileios Lampos
Assessing the effect of a health-oriented intervention by traditional epidemiological methods is commonly based only on population segments that use healthcare services. Here we introduce a complementary framework for evaluating the impact of a targeted intervention, such as a vaccination campaign against an infectious disease, through a statistical analysis of user-generated content submitted on web platforms. Using supervised learning, we derive a nonlinear regression model for estimating the prevalence of a health event in a population from Internet data. This model is applied to identify control location groups that correlate historically with the areas, where a specific intervention campaign has taken place. We then determine the impact of the intervention by inferring a projection of the disease rates that could have emerged in the absence of a campaign. Our case study focuses on the influenza vaccination program that was launched in England during the 2013/14 season, and our observations consist of millions of geo-located search queries to the Bing search engine and posts on Twitter. The impact estimates derived from the application of the proposed statistical framework support conventional assessments of the campaign.
Richard's entangled aventures in wonderlandRichard Gill
Since the loophole-free Bell experiments of 2020 and the Nobel prizes in physics of 2022, critics of Bell's work have retreated to the fortress of super-determinism. Now, super-determinism is a derogatory word - it just means "determinism". Palmer, Hance and Hossenfelder argue that quantum mechanics and determinism are not incompatible, using a sophisticated mathematical construction based on a subtle thinning of allowed states and measurements in quantum mechanics, such that what is left appears to make Bell's argument fail, without altering the empirical predictions of quantum mechanics. I think however that it is a smoke screen, and the slogan "lost in math" comes to my mind. I will discuss some other recent disproofs of Bell's theorem using the language of causality based on causal graphs. Causal thinking is also central to law and justice. I will mention surprising connections to my work on serial killer nurse cases, in particular the Dutch case of Lucia de Berk and the current UK case of Lucy Letby.
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...University of Maribor
Slides from:
11th International Conference on Electrical, Electronics and Computer Engineering (IcETRAN), Niš, 3-6 June 2024
Track: Artificial Intelligence
https://www.etran.rs/2024/en/home-english/
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Sérgio Sacani
Since volcanic activity was first discovered on Io from Voyager images in 1979, changes
on Io’s surface have been monitored from both spacecraft and ground-based telescopes.
Here, we present the highest spatial resolution images of Io ever obtained from a groundbased telescope. These images, acquired by the SHARK-VIS instrument on the Large
Binocular Telescope, show evidence of a major resurfacing event on Io’s trailing hemisphere. When compared to the most recent spacecraft images, the SHARK-VIS images
show that a plume deposit from a powerful eruption at Pillan Patera has covered part
of the long-lived Pele plume deposit. Although this type of resurfacing event may be common on Io, few have been detected due to the rarity of spacecraft visits and the previously low spatial resolution available from Earth-based telescopes. The SHARK-VIS instrument ushers in a new era of high resolution imaging of Io’s surface using adaptive
optics at visible wavelengths.
What is greenhouse gasses and how many gasses are there to affect the Earth.moosaasad1975
What are greenhouse gasses how they affect the earth and its environment what is the future of the environment and earth how the weather and the climate effects.
Slide 1: Title Slide
Extrachromosomal Inheritance
Slide 2: Introduction to Extrachromosomal Inheritance
Definition: Extrachromosomal inheritance refers to the transmission of genetic material that is not found within the nucleus.
Key Components: Involves genes located in mitochondria, chloroplasts, and plasmids.
Slide 3: Mitochondrial Inheritance
Mitochondria: Organelles responsible for energy production.
Mitochondrial DNA (mtDNA): Circular DNA molecule found in mitochondria.
Inheritance Pattern: Maternally inherited, meaning it is passed from mothers to all their offspring.
Diseases: Examples include Leber’s hereditary optic neuropathy (LHON) and mitochondrial myopathy.
Slide 4: Chloroplast Inheritance
Chloroplasts: Organelles responsible for photosynthesis in plants.
Chloroplast DNA (cpDNA): Circular DNA molecule found in chloroplasts.
Inheritance Pattern: Often maternally inherited in most plants, but can vary in some species.
Examples: Variegation in plants, where leaf color patterns are determined by chloroplast DNA.
Slide 5: Plasmid Inheritance
Plasmids: Small, circular DNA molecules found in bacteria and some eukaryotes.
Features: Can carry antibiotic resistance genes and can be transferred between cells through processes like conjugation.
Significance: Important in biotechnology for gene cloning and genetic engineering.
Slide 6: Mechanisms of Extrachromosomal Inheritance
Non-Mendelian Patterns: Do not follow Mendel’s laws of inheritance.
Cytoplasmic Segregation: During cell division, organelles like mitochondria and chloroplasts are randomly distributed to daughter cells.
Heteroplasmy: Presence of more than one type of organellar genome within a cell, leading to variation in expression.
Slide 7: Examples of Extrachromosomal Inheritance
Four O’clock Plant (Mirabilis jalapa): Shows variegated leaves due to different cpDNA in leaf cells.
Petite Mutants in Yeast: Result from mutations in mitochondrial DNA affecting respiration.
Slide 8: Importance of Extrachromosomal Inheritance
Evolution: Provides insight into the evolution of eukaryotic cells.
Medicine: Understanding mitochondrial inheritance helps in diagnosing and treating mitochondrial diseases.
Agriculture: Chloroplast inheritance can be used in plant breeding and genetic modification.
Slide 9: Recent Research and Advances
Gene Editing: Techniques like CRISPR-Cas9 are being used to edit mitochondrial and chloroplast DNA.
Therapies: Development of mitochondrial replacement therapy (MRT) for preventing mitochondrial diseases.
Slide 10: Conclusion
Summary: Extrachromosomal inheritance involves the transmission of genetic material outside the nucleus and plays a crucial role in genetics, medicine, and biotechnology.
Future Directions: Continued research and technological advancements hold promise for new treatments and applications.
Slide 11: Questions and Discussion
Invite Audience: Open the floor for any questions or further discussion on the topic.
A brief information about the SCOP protein database used in bioinformatics.
The Structural Classification of Proteins (SCOP) database is a comprehensive and authoritative resource for the structural and evolutionary relationships of proteins. It provides a detailed and curated classification of protein structures, grouping them into families, superfamilies, and folds based on their structural and sequence similarities.
This pdf is about the Schizophrenia.
For more details visit on YouTube; @SELF-EXPLANATORY;
https://www.youtube.com/channel/UCAiarMZDNhe1A3Rnpr_WkzA/videos
Thanks...!
Comparative structure of adrenal gland in vertebrates
Transfer learning for unsupervised influenza-like illness models from online search data
1. Transfer learning for unsupervised influenza-like illness
models from online search data
Bin Zou
Vasileios Lampos
(
lampos.net
)
Ingemar J. Cox
Department of Computer Science
University College London
2. From online searches to influenza-like illness rates
50 60 70 80 90 100
of queries
queries
op-scoring queries to include in the
2004 2005 2006
Year
2007 2008
2
4
6
8
10
ILIpercentage
0
12
Figure 2 | A comparison of model estimates for the mid-Atlantic region
(black) against CDC-reported ILI percentages (red), including points over
which the model was fit and validated. A correlation of 0.85 was obtained
over 128 points from this region to which the model was fit, whereas a
correlation of 0.96 was obtained over 42 validation points. Dotted lines
indicate 95% prediction intervals. The region comprises New York, New
LETTERS
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 1/29
3. From online searches to influenza-like illness rates
Online data for health (2/3)
Google Flu Trends (discontinued)Google Flu Trends (discontinued)
popularising an established idea
Ginsberg et al. (2009)
Eysenbach (2006); Polgreen et al. (2008)
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 1/29
4. From online searches to influenza-like illness rates
Task abstraction
• input – frequency of search queries over time: X∈Rn×s
• output – corresponding influenza-like illness (ILI) rate: y∈Rn
• regression task, i.e. learn f : X → y
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 2/29
5. From online searches to influenza-like illness rates
Task abstraction
• input – frequency of search queries over time: X∈Rn×s
• output – corresponding influenza-like illness (ILI) rate: y∈Rn
• regression task, i.e. learn f : X → y
Modelling
• originally proposed models were evidently not good solutions1
• new families of methods seem to work OK in various geographies2
1
Cook et al. (2011); Olson et al. (2013); Lazer et al. (2014)
2
Lampos et al. (2015a); Yang et al. (2015); Lampos et al. (2017); Wagner et al. (2018)
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 2/29
6. Why estimate ILI rates from online search statistics?
Common arguments for:
• complements traditional syndromic surveillance
✓ timeliness
✓ broader demographic coverage, larger cohort
✓ broader geographical coverage
✓ not affected by closure days or national holidays
✓ lower cost
• applicable to locations that lack an established health system
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 3/29
7. Why estimate ILI rates from online search statistics?
Common arguments for:
• complements traditional syndromic surveillance
✓ timeliness
✓ broader demographic coverage, larger cohort
✓ broader geographical coverage
✓ not affected by closure days or national holidays
✓ lower cost
• applicable to locations that lack an established health system
✓ oxymoron (supervised learning)
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 3/29
8. Why estimate ILI rates from online search statistics?
Common arguments for:
• complements traditional syndromic surveillance
✓ timeliness
✓ broader demographic coverage, larger cohort
✓ broader geographical coverage
✓ not affected by closure days or national holidays
✓ lower cost
• applicable to locations that lack an established health system
✓ oxymoron (supervised learning)
✓ motivated this paper
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 3/29
9. Our contribution in a nutshell
Main task
• train a model for a source location where historical syndromic
surveillance data is available, and
• transfer it to a target location where syndromic surveillance data is
not available or, in our experiments, ignored
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 4/29
10. Our contribution in a nutshell
Main task
• train a model for a source location where historical syndromic
surveillance data is available, and
• transfer it to a target location where syndromic surveillance data is
not available or, in our experiments, ignored
Transfer learning steps
1. Learn a linear regularised regression model for a source location
2. Map search queries from the source to the target domain
(languages may differ)
3. Transfer the source weights to the target domain
(might involve weight re-adjustment)
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 4/29
11. Transfer learning task definition
query frequency xij =
#query j issued during ∆ti
#all queries issued during ∆ti
for a location
Source domain
• DS =
{
(xi, yi)
}
, i∈{1, ..., n}
• xi ∈Rs
= {xij}, j ∈{1, ..., s}: frequency of source queries
• yi ∈R: ILI rate for time interval i
Target domain
• DT = {x′
i}, i∈{1, ..., m}
• x′
i ∈Rt
: frequency of target queries
• note that t need not equal s
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 5/29
12. Transfer learning task definition
query frequency xij =
#query j issued during ∆ti
#all queries issued during ∆ti
for a location
Source domain
• DS =
{
(xi, yi)
}
, i∈{1, ..., n}
• xi ∈Rs
= {xij}, j ∈{1, ..., s}: frequency of source queries
• yi ∈R: ILI rate for time interval i
Target domain
• DT = {x′
i}, i∈{1, ..., m}
• x′
i ∈Rt
: frequency of target queries
• note that t need not equal s
§
¦
¤
¥
Aim: Given DS and DT, estimate y′
i
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 5/29
13. Step 1 – Learn a regression function in the source domain
Source domain
• xi ∈Rs
= {xij}, j ∈{1, ..., s}: frequency of source queries
• yi ∈R: ILI rate for time interval i
Elastic net1
(constrained)
argmin
w,β
n∑
i=1
(
yi − β −
( s∑
j=1
xijwj
))2
+ λ1
s∑
j=1
|wj| + λ2
s∑
j=1
w2
j
subject to w ≥ 0
1
Zou and Hastie (2005)
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 6/29
14. Step 1 – Learn a regression function in the source domain
Elastic net (constrained)
argmin
w,β
n∑
i=1
(
yi − β −
( s∑
j=1
xijwj
))2
+ λ1
s∑
j=1
|wj| + λ2
s∑
j=1
w2
j
subject to w ≥ 0
Why use elastic net?
• more straightforward to transfer
• few training instances
• previous successful application1
• combines ℓ1- and ℓ2-norm regularisation: sparse solution, model
consistency under collinearity
1
Lampos et al. (2015a,b); Zou et al. (2016); Lampos et al. (2017)
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 7/29
15. Step 1 – Learn a regression function in the source domain
Elastic net (constrained)
argmin
w,β
n∑
i=1
(
yi − β −
( s∑
j=1
xijwj
))2
+ λ1
s∑
j=1
|wj| + λ2
s∑
j=1
w2
j
subject to w ≥ 0
Why apply a non-negative weight constraint?
• (how?) coordinate descent restricting negative updates to 0
• worse performing model for the source location
• but enables a more comprehensive transfer
• better performance at the target location
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 7/29
16. Step 1 – Learn a regression function in the source domain
Selecting queries prior to applying elastic net
• hybrid feature selection similarly to previous work1
• derive query embeddings eq using fastText2
• define a flu context/topic: T = {‘flu’, ‘fever’}
• compute each query’s similarity to T using
g (q, T ) = cos (eq, eT1 ) × cos (eq, eT2 )
cos(·, ·) is mapped to [0, 1]
1
Zou et al. (2016); Lampos et al. (2017); Zou et al. (2018)
2
Bojanowski et al. (2017)
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 8/29
17. Step 1 – Learn a regression function in the source domain
Selecting queries prior to applying elastic net
• hybrid feature selection similarly to previous work1
• derive query embeddings eq using fastText2
• define a flu context/topic: T = {‘flu’, ‘fever’}
• compute each query’s similarity to T using
g (q, T ) = cos (eq, eT1 ) × cos (eq, eT2 )
cos(·, ·) is mapped to [0, 1]
• filter out queries with either g ≤ 0.5 or r ≤ 0.3 (corr. with ILI)
§
¦
¤
¥
QS: remaining queries after applying elastic net
1
Zou et al. (2016); Lampos et al. (2017); Zou et al. (2018)
2
Bojanowski et al. (2017)
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 8/29
18. Step 2 – Mapping source to target queries
Task: map QS to a subset of PT (pool of target queries)
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 9/29
19. Step 2 – Mapping source to target queries
Task: map QS to a subset of PT (pool of target queries)
How?
• direct translation does not work
— invalid search queries
— worse performance
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 9/29
20. Step 2 – Mapping source to target queries
Task: map QS to a subset of PT (pool of target queries)
How?
• direct translation does not work
— invalid search queries
— worse performance
• semantic similarity, Θs: (cross-lingual) word embeddings
• temporal similarity, Θc: correlation between frequency time series
• hybrid similarity: Θ = γΘs + (1 − γ)Θc, γ ∈ [0, 1]
• consider 1-to-k mappings
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 9/29
21. Step 2 – Semantic similarity (Θs)
Same language in both domains?
• Use cosine similarity on query embeddings
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 10/29
22. Step 2 – Semantic similarity (Θs)
Same language in both domains?
• Use cosine similarity on query embeddings
If not, derive bi-lingual embeddings1
• m core translation pairs, σ→τ, with embeddings Eσ, Eτ ∈Rm×d
• learn a transformation matrix, W∈Rd×d
, by minimising:
argmin
W
∥EσW − Eτ ∥
2
2 , subject to W⊤
W = I
1
Smith et al. (2016)
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 10/29
23. Step 2 – Semantic similarity (Θs)
Same language in both domains?
• Use cosine similarity on query embeddings
If not, derive bi-lingual embeddings1
• m core translation pairs, σ→τ, with embeddings Eσ, Eτ ∈Rm×d
• learn a transformation matrix, W∈Rd×d
, by minimising:
argmin
W
∥EσW − Eτ ∥
2
2 , subject to W⊤
W = I
• orthogonality constraint:
— Eτ ≈ EσW and Eσ ≈ Eτ W⊤
— improves the performance of machine translation2
• solution: W = VU⊤
, where E⊤
τ Eσ = UΣV⊤
(SVD)
1
Smith et al. (2016) 2
Artetxe et al. (2016)
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 10/29
24. Step 2 – Semantic similarity (Θs)
Compute a query (source) to query (target) similarity matrix
• source, target query embedding: eqi
, eqj
∈R1×d
• cosine similarity matrix Ω∈Rs×|PT|
, ωij =
(
eqi
W e⊤
qj
)
(
∥eqi
W∥2∥eqj
∥2
)
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 11/29
25. Step 2 – Semantic similarity (Θs)
Compute a query (source) to query (target) similarity matrix
• source, target query embedding: eqi
, eqj
∈R1×d
• cosine similarity matrix Ω∈Rs×|PT|
, ωij =
(
eqi
W e⊤
qj
)
(
∥eqi
W∥2∥eqj
∥2
)
Inverted softmax
• using ωij directly for translations can generate hubs
— target query is similar to way too many different source queries
— reduces performance of machine translation1
• instead, given a source query qi, find a target qj that maximises
Pj→i =
exp (η ωij)
αj
s∑
z=1
exp (η ωiz)
1
Dinu et al. (2014); Smith et al. (2016)
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 11/29
26. Step 2 – Semantic similarity (Θs)
Pj→i =
exp (η ωij)
αj
s∑
z=1
exp (η ωiz)
• αj: ensures Pj→i is a probability
• s: number of source queries
• η: learned by maximising the log probability over the alignment
dictionary (σ→τ): argmax
η
∑
pairs ij
ln (Pj→i)
Inverted softmax
• probability that a target query translates back to the source query
• hub target query =⇒ large denominator
• top-k target queries are selected as possible mappings of qi
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 12/29
27. Step 2 – Semantic similarity (Θs)
Inverted softmax
• probability that a target query translates back to the source query
• hub target query =⇒ large denominator
• top-k target queries are selected as possible mappings of qi
Determine the semantic similarity score by
• using these top-k queries (average if k > 1)
• and computing
Θs(qi, qj) =
(
eqi
W e⊤
qj
)
/
(
∥eqi
W∥2∥eqj
∥2
)
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 13/29
28. Step 2 – Temporal similarity (Θc)
Exploit query relationship in the frequency space:
• important relationship; based on the core statistical input
information
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 14/29
29. Step 2 – Temporal similarity (Θc)
Exploit query relationship in the frequency space:
• important relationship; based on the core statistical input
information
• compute pair-wise correlation between the frequency time series of
source and target queries
• flu seasons may be offset in different locations
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 14/29
30. Step 2 – Temporal similarity (Θc)
Exploit query relationship in the frequency space:
• important relationship; based on the core statistical input
information
• compute pair-wise correlation between the frequency time series of
source and target queries
• flu seasons may be offset in different locations
✓ compute all correlations using a shifting window of ±ξ weeks
✓ optimal window lij (source query qi, target query qj) is
independently computed for each target query
Θc(qi, qj) = ρ
(
xi(t), xj(t + lij)
)
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 14/29
31. Step 3 – Determining weights for target queries
Previous steps
• source query qi allocated weight wi
• source query qi mapped to a set Ti of k ≥ 1 target queries
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 15/29
32. Step 3 – Determining weights for target queries
Previous steps
• source query qi allocated weight wi
• source query qi mapped to a set Ti of k ≥ 1 target queries
Weight transfer
• if k = 1, directly assign wi to the single target query
• if k > 1, wi is distributed across the k identified target queries
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 15/29
33. Step 3 – Determining weights for target queries
Previous steps
• source query qi allocated weight wi
• source query qi mapped to a set Ti of k ≥ 1 target queries
Weight transfer
• if k = 1, directly assign wi to the single target query
• if k > 1, wi is distributed across the k identified target queries
Weighting schemes
• uniform: w′
j = wi/k
• based on Θij, j ∈ {2, . . . , k}: w′
j =
wiΘij
∑
qj ∈Ti
Θij
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 15/29
34. Experiments – Transfer tasks
Source location: United States (US)
Target locations
• France (FR): from English to French
• Spain (ES): from English to Spanish
• Australia (AU): from English to English, different hemisphere,
greater temporal difference in flu outbreaks
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 16/29
35. Experiments – Transfer tasks
Source location: United States (US)
Target locations
• France (FR): from English to French
• Spain (ES): from English to Spanish
• Australia (AU): from English to English, different hemisphere,
greater temporal difference in flu outbreaks
Why choose locations where syndromic surveillance systems exist?
• more robust evaluation at this preliminary stage
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 16/29
36. Experiments – Data
Search query frequencies from Google
• retrieved from the Google Correlate endpoint
• z-scored (by default)
• weekly rates
• September 2007 to August 2016 (both inclusive)
• # queries: 34,121 (US), 29,996 (FR), 15,673 (ES), 8,764 (AU)
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 17/29
37. Experiments – Data
Search query frequencies from Google
• retrieved from the Google Correlate endpoint
• z-scored (by default)
• weekly rates
• September 2007 to August 2016 (both inclusive)
• # queries: 34,121 (US), 29,996 (FR), 15,673 (ES), 8,764 (AU)
Influenza-like illness (ILI) rates
• data from health organisations in these countries
(CDC, SN, SISSS, ASPREN)
• same date range, weekly ILI rates
• z-scored as the metric systems vary in these countries
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 17/29
38. Experiments – ILI rates in the source vs. target country
How similar are they?
US
2008 2009 2010 2011 2012 2013 2014 2015 2016
-1
0
1
2
3
4
5
ILIrates(z-scored)
US
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 18/29
39. Experiments – ILI rates in the source vs. target country
How similar are they?
US vs. FR
2008 2009 2010 2011 2012 2013 2014 2015 2016
-1
0
1
2
3
4
5
ILIrates(z-scored)
US
FR
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 18/29
40. Experiments – ILI rates in the source vs. target country
How similar are they?
US vs. ES
2008 2009 2010 2011 2012 2013 2014 2015 2016
-1
0
1
2
3
4
5
ILIrates(z-scored)
US
ES
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 18/29
41. Experiments – ILI rates in the source vs. target country
How similar are they?
US vs. AU
2008 2009 2010 2011 2012 2013 2014 2015 2016
-1
0
1
2
3
4
5
ILIrates(z-scored)
US
AU
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 18/29
42. Experiments – Evaluation
Protocol
• train a model using 5 flu seasons, test it on the next
• evaluate performance on the the last 4 flu seasons of our data set
• Θc: use a window of ξ = ±6 weeks
• source query → k = {1, ..., 5} target queries
• Pearson correlation, mean absolute error (MAE), root mean
squared error (RMSE)
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 19/29
43. Experiments – Evaluation
Protocol
• train a model using 5 flu seasons, test it on the next
• evaluate performance on the the last 4 flu seasons of our data set
• Θc: use a window of ξ = ±6 weeks
• source query → k = {1, ..., 5} target queries
• Pearson correlation, mean absolute error (MAE), root mean
squared error (RMSE)
Baseline models
• worst case baseline (R): random shuffling of identified query pairs
• unsupervised learning (U) using most semantically relevant queries
• best case threshold (S): supervised learning using elastic net
• transfer component analysis (TCA)1
1
Pan et al. (2009)
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 19/29
44. Experiments – General observations
In general:
• semantic similarity (Θs) is performing better than temporal similarity
(Θc) when used in isolation
• using semantic or temporal similarity in isolation provides inferior
performance, i.e. hybrid similarity works best
• values for k > 1 did not help the hybrid similarity to improve
• when k > 1, the non-uniform way of weighting was performing better
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 20/29
45. Experiments – General observations
In general:
• semantic similarity (Θs) is performing better than temporal similarity
(Θc) when used in isolation
• using semantic or temporal similarity in isolation provides inferior
performance, i.e. hybrid similarity works best
• values for k > 1 did not help the hybrid similarity to improve
• when k > 1, the non-uniform way of weighting was performing better
Closer look at results for γ = 0, γ = 1 and the best choice of γ
§
¦
¤
¥
Θ = γΘs + (1 − γ)Θc, γ ∈ [0, 1]
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 20/29
51. Experiments – Results for Australia
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 26/29
52. Experiments – Results for different values of γ
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 27/29
53. Experiments – Results for different values of γ
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 27/29
54. Experiments – Results for different values of γ
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 27/29
55. Experiments – Results for different values of γ
• hybrid similarity optima differ
per target country
• optimal γ depends on the
characteristics of the input
space
• µ(Θc)/µ(Θs) across queries
relates to optimal γ: 1.143
(FR), 0.982 (ES), 2.261 (AU)
• identifying optimal γ
automatically is an open task
• γ = 0.5 provides better results
than non hybrid similarities
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 27/29
56. Experiments – Where do some of the errors come from?
Error analysis setup
• investigate the models for the optimal gammas
• compute the mean ILI estimate impact (%) during the 10 weeks
with highest MAE across all test periods per target country
• identify the worst-5 query pairings
— – — – — – —
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 28/29
57. Experiments – Where do some of the errors come from?
Error analysis setup
• investigate the models for the optimal gammas
• compute the mean ILI estimate impact (%) during the 10 weeks
with highest MAE across all test periods per target country
• identify the worst-5 query pairings
— – — – — – —
France – from English (US) to French
• 24 hour flu → grippe intestinale
• influenza a treatment → grippe traitement
• remedies for colds → rhume de cerveau
• child temperature → température du corps
• child fever → fièvre adulte
(13.24%)
(8.07%)
(6.75%)
(6.37%)
(6.04%)
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 28/29
58. Experiments – Where do some of the errors come from?
Error analysis setup
• investigate the models for the optimal gammas
• compute the mean ILI estimate impact (%) during the 10 weeks
with highest MAE across all test periods per target country
• identify the worst-5 query pairings
— – — – — – —
Spain – from English (US) to Spanish
• mucinez for kids → tratmiento de la grippe
• child fever → sinusitis
• influenza a treatment → con gripe
• symptoms pneumonia → bronquitis
• child temperature → temperatura corporal
(20.76%)
(7.76%)
(7.02%)
(6.04%)
(5.62%)
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 28/29
59. Experiments – Where do some of the errors come from?
Error analysis setup
• investigate the models for the optimal gammas
• compute the mean ILI estimate impact (%) during the 10 weeks
with highest MAE across all test periods per target country
• identify the worst-5 query pairings
— – — – — – —
Australia – from English (US) to English (AU)
• 24 hour flu → flu duration
• child temperature → warmer
• how to treat a fever → have a fever
• tamiflu and breastfeeding → flu while pregnant
• robitussin cf → colds
(11.51%)
(9.77%)
(6.94%)
(6.81%)
(5.18%)
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 28/29
60. Conclusions and future work
Summary of outcomes
• previous efforts were heavily based on supervised learning models
• transfer learning method to enable modelling in areas that lack an
established syndromic surveillance system
— unsupervised (no ground truth data at the target location)
— core operation: how to map source to target queries
• satisfactory performance (e.g. r > .92)
• 21.6% increase in RMSE compared to a fully supervised model
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 29/29
61. Conclusions and future work
Summary of outcomes
• previous efforts were heavily based on supervised learning models
• transfer learning method to enable modelling in areas that lack an
established syndromic surveillance system
— unsupervised (no ground truth data at the target location)
— core operation: how to map source to target queries
• satisfactory performance (e.g. r > .92)
• 21.6% increase in RMSE compared to a fully supervised model
Future work
• study where target location is a low or middle income country
— harder to evaluate; qualitative analysis by experts
• investigate parameters γ (similarity balance) and k (number of
target queries in a mapping) further and learn them from the data
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19. 29/29
62. Questions
?
Acknowledgements
• Funded by the EPSRC project “i-sense” (EP/K031953/1, EP/R00529X/1)
• SISSS and Amparo Larrauri (Spain) for providing syndromic surveillance data
• Simon Moura and Peter Hayes for offering constructive feedback
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19.
63. References
Artetxe, M., Labaka, G., and Agirre, E. (2016). Learning Principled Bilingual Mappings of Word
Embeddings while Preserving Monolingual Invariance. In Proceedings of the 2016 Conference
on Empirical Methods in Natural Language Processing, pages 2289–2294.
Bojanowski, P., Grave, E., Joulin, A., and Mikolov, T. (2017). Enriching Word Vectors with
Subword Information. Transactions of the Association of Computational Linguistics,
5(1):135–146.
Cook, S., Conrad, C., Fowlkes, A. L., and Mohebbi, M. H. (2011). Assessing Google Flu Trends
Performance in the United States during the 2009 Influenza Virus A (H1N1) Pandemic. PLOS
ONE, 6(8).
Dinu, G., Lazaridou, A., and Baroni, M. (2014). Improving Zero-shot Learning by Mitigating the
Hubness Problem. arXiv preprint arXiv:1412.6568.
Eysenbach, G. (2006). Infodemiology: tracking flu-related searches on the web for syndromic
surveillance. Proc. of AMIA Annual Symposium, pages 244–248.
Ginsberg, J., Mohebbi, M. H., Patel, R. S., Brammer, L., Smolinski, M. S., and Brilliant, L. (2009).
Detecting Influenza Epidemics using Search Engine Query Data. Nature, 457(7232):1012–1014.
Lampos, V., Miller, A. C., Crossan, S., and Stefansen, C. (2015a). Advances in Nowcasting
Influenza-like Illness Rates using Search Query Logs. Scientific Reports, 5(12760).
Lampos, V., Yom-Tov, E., Pebody, R., and Cox, I. J. (2015b). Assessing the Impact of a Health
Intervention via User-Generated Internet Content. Data Mining and Knowledge Discovery,
29(5):1434–1457.
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19.
64. References
Lampos, V., Zou, B., and Cox, I. J. (2017). Enhancing Feature Selection Using Word Embeddings:
The Case of Flu Surveillance. In Proceedings of the 26th International Conference on World
Wide Web, pages 695–704.
Lazer, D., Kennedy, R., King, G., and Vespignani, A. (2014). The Parable of Google Flu: Traps in
Big Data Analysis. Science, 343(6176):1203–1205.
Olson, D. R., Konty, K. J., Paladini, M., Viboud, C., and Simonsen, L. (2013). Reassessing Google
Flu Trends Data for Detection of Seasonal and Pandemic Influenza: A Comparative
Epidemiological Study at Three Geographic Scales. PLOS Computational Biology, 9(10).
Pan, S. J., Tsang, I. W., Kwok, J. T., and Yang, Q. (2009). Domain Adaptation via Transfer
Component Analysis. In Proceedings of the 21st International Joint Conference on Artificial
Intelligence, pages 1187–1192.
Polgreen, P. M., Chen, Y., Pennock, D. M., Nelson, F. D., and Weinstein, R. A. (2008). Using
Internet Searches for Influenza Surveillance. Clinical Infectious Diseases, 47(11):1443–1448.
Smith, S. L., Turban, D. H. P., Hamblin, S., and Hammerla, N. Y. (2016). Offline Bilingual Word
Vectors, Orthogonal Transformations and the Inverted Softmax. arXiv preprint
arXiv:1702.03859.
Wagner, M., Lampos, V., Cox, I. J., and Pebody, R. (2018). The added value of online
user-generated content in traditional methods for influenza surveillance. Scientific reports,
8(1):13963.
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19.
65. References
Yang, S., Santillana, M., and Kou, S. C. (2015). Accurate Estimation of Influenza Epidemics using
Google Search Data via ARGO. Proceedings of the National Academy of Sciences,
112(47):14473–14478.
Zou, B., Lampos, V., and Cox, I. J. (2018). Multi-Task Learning Improves Disease Models from
Web Search. In Proceedings of the 2018 World Wide Web Conference, pages 87–96.
Zou, B., Lampos, V., Gorton, R., and Cox, I. J. (2016). On Infectious Intestinal Disease
Surveillance using Social Media Content. In Proceedings of the 6th International Conference on
Digital Health, pages 157–161.
Zou, H. and Hastie, T. (2005). Regularization and Variable Selection via the Elastic Net. Journal
of the Royal Statistical Society: Series B (Statistical Methodology), 67(2):301–320.
Zou, Lampos, Cox. Transfer learning for unsupervised flu models from online search. WWW ’19.