SlideShare a Scribd company logo
1 of 15
Download to read offline
Outline General overview of the project Last 6 months Next 6 months
3rd
Ph.D. Review
Tracking trends on the web using novel Machine Learning
methods
Vasileios Lampos
advised by
Professor Nello Cristianini
Intelligent Systems Laboratory, University of Bristol
May 25, 2010
Vasileios Lampos 3rd
Ph.D. Review
Outline General overview of the project Last 6 months Next 6 months
1 General overview of the project
Aims
Being more specific
2 Last 6 months
P1: Tracking the flu pandemic by monitoring the Social Web
P2: Flu detector - Tracking epidemics on Twitter
Other activities
3 Next 6 months
General goals & activities
A more time specific tentative plan
Vasileios Lampos 3rd
Ph.D. Review
Outline General overview of the project Last 6 months Next 6 months
Aims
The general aims of our research project can be summarised in
the following points:
1 Track trends on the Web by applying Machine Learning
methods (track expresses the notions of infer or predict as
well)
2 Extend current or invent new methodologies (where and if
needed) for accomplishing our primary aim
3 Build tools that apply the experimental/theoretical results in
real and large-scale applications (featured research)
Vasileios Lampos 3rd
Ph.D. Review
Outline General overview of the project Last 6 months Next 6 months
Being more specific
1 Trends about what? Examples?
Predict flu rates (epidemics)
Infer vote intensions (politics)
Infer traffic/weather conditions (toy problems)
Vasileios Lampos 3rd
Ph.D. Review
Outline General overview of the project Last 6 months Next 6 months
Being more specific
1 Trends about what? Examples?
Predict flu rates (epidemics)
Infer vote intensions (politics)
Infer traffic/weather conditions (toy problems)
2 Methodologies?
Feature extraction/selection
Exploit probabilistic relationships (PGMs)
Regression/classification/ranking scenarios
Active learning
Vasileios Lampos 3rd
Ph.D. Review
Outline General overview of the project Last 6 months Next 6 months
Being more specific
1 Trends about what? Examples?
Predict flu rates (epidemics)
Infer vote intensions (politics)
Infer traffic/weather conditions (toy problems)
2 Methodologies?
Feature extraction/selection
Exploit probabilistic relationships (PGMs)
Regression/classification/ranking scenarios
Active learning
3 Applications?
Back-end infrastructure for data collection/retrieval/mining
Real time online tools for making and displaying predictions
(like the Flu detector)
Vasileios Lampos 3rd
Ph.D. Review
Outline General overview of the project Last 6 months Next 6 months
P1 - Summary (1 of 3)
Title: Tracking the flu pandemic by monitoring the Social Web
Authors: V. Lampos and N. Cristianini
Submitted to: IAPR Cognitive Information Processing 2010 (accepted)
Twitter and Health Protection Agency data for weeks 26-49,
2009 (on average 160,000 tweets collected per day geolocated
in 54 urban centres in the UK)
Frequency of 41 flu related words (markers) in Twitter
corpus had a correlation of >80% with the HPA flu rates in
all UK regions
Learn a better list of weighted markers automatically:
Generate a list of candidate markers (1560 words taken from
flu related web pages)
Use LASSO for feature selection
Vasileios Lampos 3rd
Ph.D. Review
Outline General overview of the project Last 6 months Next 6 months
P1 - Summary (2 of 3)
Validation schemes:
1 Train on one region, validate regularisation parameter on
another, test on the remaining regions (for all possible
combinations)
Train/Validate (regions) A B C D E
A - 0.9594 0.9375 0.9348 0.9297
B 0.9455 - 0.9476 0.9267 0.9003
C 0.9154 0.9513 - 0.8188 0.908
D 0.9463 0.9459 0.9424 - 0.9337
E 0.8798 0.9506 0.9455 0.8935 -
Total Avg. 0.9256
97 selected words: lung, unwel, temperatur, like, headach, season, unusu, chronic, child,
dai, appetit, stai, symptom, spread, diarrhoea, start, muscl, weaken, immun, feel, liver, plenti, antivir,
follow, sore, peopl, nation, small, pandem, pregnant, thermomet, bed, loss, heart, mention, condit, ...
2 Aggregate data from all regions, test on weeks 28 and 41
(2009) and train using the rest of the data set
Vasileios Lampos 3rd
Ph.D. Review
Outline General overview of the project Last 6 months Next 6 months
P1 - Summary (3 of 3)
Vasileios Lampos 3rd
Ph.D. Review
Outline General overview of the project Last 6 months Next 6 months
P2 - Summary
Title: Flu detector - Tracking epidemics on Twitter
Authors: V. Lampos, T. De Bie, and N. Cristianini
Submitted to: ECML PKDD 2010 Demos (under review)
Extending and making more robust the methodology of P1
Larger data sets (bigger time series) and more (2675)
candidate features
Select a list of features (markers) using BoLASSO (bootstrap
version of LASSO)
Then learn weights of those markers via linear least squares
regression
Stricter evaluation of the methodology - Available online
Put all this into practice and come up with the Flu detector
Vasileios Lampos 3rd
Ph.D. Review
Outline General overview of the project Last 6 months Next 6 months
Other activities
Studied/implemented the necessary statistical tools and
algorithms (in MATLAB or Java)
Extended further the infrastructure for conducting large scale
experiments and data retrieval on demand
TA for Intro to AI, Data Analysis and Pattern Analysis &
Statistical Learning
Attended some of the ISL meetings and seminars
Vasileios Lampos 3rd
Ph.D. Review
Outline General overview of the project Last 6 months Next 6 months
General goals & activities
1 ... (content omitted)
2 ... (content omitted)
3 ... (content omitted)
4 ... (content omitted)
Vasileios Lampos 3rd
Ph.D. Review
Outline General overview of the project Last 6 months Next 6 months
A more time specific tentative plan
In June: ... (content omitted)
In July: ... (content omitted)
In August: ... (content omitted)
In September - November: ... (content omitted)
Vasileios Lampos 3rd
Ph.D. Review
Outline General overview of the project Last 6 months Next 6 months
This is the last slide.
Vasileios Lampos 3rd
Ph.D. Review
Outline General overview of the project Last 6 months Next 6 months
This is the last slide.
Any questions?
Vasileios Lampos 3rd
Ph.D. Review

More Related Content

Viewers also liked

Effective Service Delivery - Streamlining Your Workflows
Effective Service Delivery - Streamlining Your WorkflowsEffective Service Delivery - Streamlining Your Workflows
Effective Service Delivery - Streamlining Your WorkflowsRonnie Parisella
 
Autotask how to stop being a whiner 2013
Autotask how to stop being a whiner 2013Autotask how to stop being a whiner 2013
Autotask how to stop being a whiner 2013Ronnie Parisella
 
Stories yet to be told
Stories yet to be toldStories yet to be told
Stories yet to be toldCCIHP
 

Viewers also liked (6)

Effective Service Delivery - Streamlining Your Workflows
Effective Service Delivery - Streamlining Your WorkflowsEffective Service Delivery - Streamlining Your Workflows
Effective Service Delivery - Streamlining Your Workflows
 
futue goals
 futue goals futue goals
futue goals
 
Glosario BD
Glosario BDGlosario BD
Glosario BD
 
Autotask how to stop being a whiner 2013
Autotask how to stop being a whiner 2013Autotask how to stop being a whiner 2013
Autotask how to stop being a whiner 2013
 
Facilitate Development
Facilitate DevelopmentFacilitate Development
Facilitate Development
 
Stories yet to be told
Stories yet to be toldStories yet to be told
Stories yet to be told
 

Similar to Presentation

Reproducibility, preregistration, etc.: Making good science even better
Reproducibility,  preregistration, etc.: Making good science even betterReproducibility,  preregistration, etc.: Making good science even better
Reproducibility, preregistration, etc.: Making good science even betterAlex Holcombe
 
research design
research designresearch design
research designRajeshwori
 
How can the use of computer simulation benefit the monitoring and mitigation ...
How can the use of computer simulation benefit the monitoring and mitigation ...How can the use of computer simulation benefit the monitoring and mitigation ...
How can the use of computer simulation benefit the monitoring and mitigation ...BrennanMinns
 
Journal Club - Best Practices for Scientific Computing
Journal Club - Best Practices for Scientific ComputingJournal Club - Best Practices for Scientific Computing
Journal Club - Best Practices for Scientific ComputingBram Zandbelt
 
Observational Research designs: detailed description
Observational Research designs: detailed description Observational Research designs: detailed description
Observational Research designs: detailed description Tarek Tawfik Amin
 
WK 2_Oppenheim 1992_pp 6-29.pdf
WK 2_Oppenheim 1992_pp 6-29.pdfWK 2_Oppenheim 1992_pp 6-29.pdf
WK 2_Oppenheim 1992_pp 6-29.pdfsnazkhan8686
 
Research Methods for Business 6-ch06 (research design).pptx
Research Methods for Business 6-ch06 (research design).pptxResearch Methods for Business 6-ch06 (research design).pptx
Research Methods for Business 6-ch06 (research design).pptxAhmedAlrashid7
 
Sir Tariq M. Research.2 (1)
Sir Tariq M. Research.2 (1)Sir Tariq M. Research.2 (1)
Sir Tariq M. Research.2 (1)Ashar Azam
 
Study Proposal format-SRC.docx
Study Proposal format-SRC.docxStudy Proposal format-SRC.docx
Study Proposal format-SRC.docxKeerthana990449
 
Ppt research Methods and statistical consultancy
Ppt  research Methods and statistical consultancyPpt  research Methods and statistical consultancy
Ppt research Methods and statistical consultancyTilayeMatebe
 
How to write chapter three of your research project
How to write chapter three of your research projectHow to write chapter three of your research project
How to write chapter three of your research projectEtieneIma123
 
How to write chapter three of your research project
How to write chapter three of your research projectHow to write chapter three of your research project
How to write chapter three of your research projectEtieneIma123
 
MAC411(A) Analysis in Communication Researc.ppt
MAC411(A) Analysis in Communication Researc.pptMAC411(A) Analysis in Communication Researc.ppt
MAC411(A) Analysis in Communication Researc.pptPreciousOsoOla
 
Overall presentation Matram project
Overall presentation Matram project Overall presentation Matram project
Overall presentation Matram project RaphaelGirod
 
BRM Research Outline, Ch 1-7 NEW.pptx
BRM Research Outline, Ch 1-7 NEW.pptxBRM Research Outline, Ch 1-7 NEW.pptx
BRM Research Outline, Ch 1-7 NEW.pptxHaleemaAbdella
 
2 hours agoLuke Powell Main Post - Luke PowellCOLLAPSETo.docx
2 hours agoLuke Powell Main Post - Luke PowellCOLLAPSETo.docx2 hours agoLuke Powell Main Post - Luke PowellCOLLAPSETo.docx
2 hours agoLuke Powell Main Post - Luke PowellCOLLAPSETo.docxvickeryr87
 
John Lavis | Making research work for decision makers: international perspect...
John Lavis | Making research work for decision makers: international perspect...John Lavis | Making research work for decision makers: international perspect...
John Lavis | Making research work for decision makers: international perspect...Sax Institute
 

Similar to Presentation (20)

Example 1
Example 1Example 1
Example 1
 
Reproducibility, preregistration, etc.: Making good science even better
Reproducibility,  preregistration, etc.: Making good science even betterReproducibility,  preregistration, etc.: Making good science even better
Reproducibility, preregistration, etc.: Making good science even better
 
research design
research designresearch design
research design
 
How can the use of computer simulation benefit the monitoring and mitigation ...
How can the use of computer simulation benefit the monitoring and mitigation ...How can the use of computer simulation benefit the monitoring and mitigation ...
How can the use of computer simulation benefit the monitoring and mitigation ...
 
Journal Club - Best Practices for Scientific Computing
Journal Club - Best Practices for Scientific ComputingJournal Club - Best Practices for Scientific Computing
Journal Club - Best Practices for Scientific Computing
 
Observational Research designs: detailed description
Observational Research designs: detailed description Observational Research designs: detailed description
Observational Research designs: detailed description
 
WK 2_Oppenheim 1992_pp 6-29.pdf
WK 2_Oppenheim 1992_pp 6-29.pdfWK 2_Oppenheim 1992_pp 6-29.pdf
WK 2_Oppenheim 1992_pp 6-29.pdf
 
Research Methods for Business 6-ch06 (research design).pptx
Research Methods for Business 6-ch06 (research design).pptxResearch Methods for Business 6-ch06 (research design).pptx
Research Methods for Business 6-ch06 (research design).pptx
 
Sir Tariq M. Research.2 (1)
Sir Tariq M. Research.2 (1)Sir Tariq M. Research.2 (1)
Sir Tariq M. Research.2 (1)
 
Study Proposal format-SRC.docx
Study Proposal format-SRC.docxStudy Proposal format-SRC.docx
Study Proposal format-SRC.docx
 
Ppt research Methods and statistical consultancy
Ppt  research Methods and statistical consultancyPpt  research Methods and statistical consultancy
Ppt research Methods and statistical consultancy
 
How to write chapter three of your research project
How to write chapter three of your research projectHow to write chapter three of your research project
How to write chapter three of your research project
 
How to write chapter three of your research project
How to write chapter three of your research projectHow to write chapter three of your research project
How to write chapter three of your research project
 
MAC411(A) Analysis in Communication Researc.ppt
MAC411(A) Analysis in Communication Researc.pptMAC411(A) Analysis in Communication Researc.ppt
MAC411(A) Analysis in Communication Researc.ppt
 
Overall presentation Matram project
Overall presentation Matram project Overall presentation Matram project
Overall presentation Matram project
 
Mm22
Mm22Mm22
Mm22
 
Mm23
Mm23Mm23
Mm23
 
BRM Research Outline, Ch 1-7 NEW.pptx
BRM Research Outline, Ch 1-7 NEW.pptxBRM Research Outline, Ch 1-7 NEW.pptx
BRM Research Outline, Ch 1-7 NEW.pptx
 
2 hours agoLuke Powell Main Post - Luke PowellCOLLAPSETo.docx
2 hours agoLuke Powell Main Post - Luke PowellCOLLAPSETo.docx2 hours agoLuke Powell Main Post - Luke PowellCOLLAPSETo.docx
2 hours agoLuke Powell Main Post - Luke PowellCOLLAPSETo.docx
 
John Lavis | Making research work for decision makers: international perspect...
John Lavis | Making research work for decision makers: international perspect...John Lavis | Making research work for decision makers: international perspect...
John Lavis | Making research work for decision makers: international perspect...
 

Recently uploaded

How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetEnjoy Anytime
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Hyundai Motor Group
 

Recently uploaded (20)

How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2
 

Presentation

  • 1. Outline General overview of the project Last 6 months Next 6 months 3rd Ph.D. Review Tracking trends on the web using novel Machine Learning methods Vasileios Lampos advised by Professor Nello Cristianini Intelligent Systems Laboratory, University of Bristol May 25, 2010 Vasileios Lampos 3rd Ph.D. Review
  • 2. Outline General overview of the project Last 6 months Next 6 months 1 General overview of the project Aims Being more specific 2 Last 6 months P1: Tracking the flu pandemic by monitoring the Social Web P2: Flu detector - Tracking epidemics on Twitter Other activities 3 Next 6 months General goals & activities A more time specific tentative plan Vasileios Lampos 3rd Ph.D. Review
  • 3. Outline General overview of the project Last 6 months Next 6 months Aims The general aims of our research project can be summarised in the following points: 1 Track trends on the Web by applying Machine Learning methods (track expresses the notions of infer or predict as well) 2 Extend current or invent new methodologies (where and if needed) for accomplishing our primary aim 3 Build tools that apply the experimental/theoretical results in real and large-scale applications (featured research) Vasileios Lampos 3rd Ph.D. Review
  • 4. Outline General overview of the project Last 6 months Next 6 months Being more specific 1 Trends about what? Examples? Predict flu rates (epidemics) Infer vote intensions (politics) Infer traffic/weather conditions (toy problems) Vasileios Lampos 3rd Ph.D. Review
  • 5. Outline General overview of the project Last 6 months Next 6 months Being more specific 1 Trends about what? Examples? Predict flu rates (epidemics) Infer vote intensions (politics) Infer traffic/weather conditions (toy problems) 2 Methodologies? Feature extraction/selection Exploit probabilistic relationships (PGMs) Regression/classification/ranking scenarios Active learning Vasileios Lampos 3rd Ph.D. Review
  • 6. Outline General overview of the project Last 6 months Next 6 months Being more specific 1 Trends about what? Examples? Predict flu rates (epidemics) Infer vote intensions (politics) Infer traffic/weather conditions (toy problems) 2 Methodologies? Feature extraction/selection Exploit probabilistic relationships (PGMs) Regression/classification/ranking scenarios Active learning 3 Applications? Back-end infrastructure for data collection/retrieval/mining Real time online tools for making and displaying predictions (like the Flu detector) Vasileios Lampos 3rd Ph.D. Review
  • 7. Outline General overview of the project Last 6 months Next 6 months P1 - Summary (1 of 3) Title: Tracking the flu pandemic by monitoring the Social Web Authors: V. Lampos and N. Cristianini Submitted to: IAPR Cognitive Information Processing 2010 (accepted) Twitter and Health Protection Agency data for weeks 26-49, 2009 (on average 160,000 tweets collected per day geolocated in 54 urban centres in the UK) Frequency of 41 flu related words (markers) in Twitter corpus had a correlation of >80% with the HPA flu rates in all UK regions Learn a better list of weighted markers automatically: Generate a list of candidate markers (1560 words taken from flu related web pages) Use LASSO for feature selection Vasileios Lampos 3rd Ph.D. Review
  • 8. Outline General overview of the project Last 6 months Next 6 months P1 - Summary (2 of 3) Validation schemes: 1 Train on one region, validate regularisation parameter on another, test on the remaining regions (for all possible combinations) Train/Validate (regions) A B C D E A - 0.9594 0.9375 0.9348 0.9297 B 0.9455 - 0.9476 0.9267 0.9003 C 0.9154 0.9513 - 0.8188 0.908 D 0.9463 0.9459 0.9424 - 0.9337 E 0.8798 0.9506 0.9455 0.8935 - Total Avg. 0.9256 97 selected words: lung, unwel, temperatur, like, headach, season, unusu, chronic, child, dai, appetit, stai, symptom, spread, diarrhoea, start, muscl, weaken, immun, feel, liver, plenti, antivir, follow, sore, peopl, nation, small, pandem, pregnant, thermomet, bed, loss, heart, mention, condit, ... 2 Aggregate data from all regions, test on weeks 28 and 41 (2009) and train using the rest of the data set Vasileios Lampos 3rd Ph.D. Review
  • 9. Outline General overview of the project Last 6 months Next 6 months P1 - Summary (3 of 3) Vasileios Lampos 3rd Ph.D. Review
  • 10. Outline General overview of the project Last 6 months Next 6 months P2 - Summary Title: Flu detector - Tracking epidemics on Twitter Authors: V. Lampos, T. De Bie, and N. Cristianini Submitted to: ECML PKDD 2010 Demos (under review) Extending and making more robust the methodology of P1 Larger data sets (bigger time series) and more (2675) candidate features Select a list of features (markers) using BoLASSO (bootstrap version of LASSO) Then learn weights of those markers via linear least squares regression Stricter evaluation of the methodology - Available online Put all this into practice and come up with the Flu detector Vasileios Lampos 3rd Ph.D. Review
  • 11. Outline General overview of the project Last 6 months Next 6 months Other activities Studied/implemented the necessary statistical tools and algorithms (in MATLAB or Java) Extended further the infrastructure for conducting large scale experiments and data retrieval on demand TA for Intro to AI, Data Analysis and Pattern Analysis & Statistical Learning Attended some of the ISL meetings and seminars Vasileios Lampos 3rd Ph.D. Review
  • 12. Outline General overview of the project Last 6 months Next 6 months General goals & activities 1 ... (content omitted) 2 ... (content omitted) 3 ... (content omitted) 4 ... (content omitted) Vasileios Lampos 3rd Ph.D. Review
  • 13. Outline General overview of the project Last 6 months Next 6 months A more time specific tentative plan In June: ... (content omitted) In July: ... (content omitted) In August: ... (content omitted) In September - November: ... (content omitted) Vasileios Lampos 3rd Ph.D. Review
  • 14. Outline General overview of the project Last 6 months Next 6 months This is the last slide. Vasileios Lampos 3rd Ph.D. Review
  • 15. Outline General overview of the project Last 6 months Next 6 months This is the last slide. Any questions? Vasileios Lampos 3rd Ph.D. Review