Ongoing work on context inference, place detection. Presented at the International Conference and Exhibition of Ubiquitous Positioning, Indoor Positioning and Location Based Services. Corpus Christi, Texas. Nov 2014.
Visualising large spatial databases and Building bespoke geodemographicsDr Muhammad Adnan
This presentation outlines my work at the Local Futures and the PhD research. I have been working on a combined project between Local Futures and UCL and the presentation starts by giving an introduction of the project. My PhD investigated the creation of Real-time bespoke geodemographics, and this presentation presents the work i did during the PhD journey.
Designing a Better Planet with Big Data and Sensor Networks (for Intelligent ...Rainer Sternfeld
Planet OS is a data discovery engine designed for real world sensor data. One interface to access your local, remote, open and vendor data.
This presentation answers questions like:
• How is the growth of sensor data challenging traditional data management, storage and usability of it?
• What are the trends in machine data and how will sensor data change Big Data over the next decade?
• How many devices are there on the Internet today? What will happen to this map in 10 years?
• What is the sensor data value chain, what gives you competitive advantage over others?
• Why is sensor data hard?
• Examples and use cases of the markets utilizing the latest robotic and mobile sensing platforms on land (energy production, agriculture, connected cars, weather forecasting), in the ocean (oil & gas, marine acoustics, shipping, environmental monitoring), air (drones) and space (nanosatellites, data-driven weather forecasting).
• How Planet OS is solving these challenges with it's Data Discovery Engine and a mission to index the real world? What are the data types we work with? What are the applications and how having a single interface and a single index help organizations to increase their ROI of operations, emergency response and planning?
• The Industrial Internet (GE), The Internet of Everything (Cisco)
• Why Big Data clouds need trust management for secure operations over open networks? (Intertrust)
Multiple user decision making is important in to-day’s location-based service scenarios.Existing query services such as kNN and Skyline queries only consider single user and do not consider user’s preferencesThis system is designed using an authenticated query processing framework based on MR-tree.
Comprehensive experiments and Researchers have tested the effectiveness and robustness of this query and its 100% functional
Visualising large spatial databases and Building bespoke geodemographicsDr Muhammad Adnan
This presentation outlines my work at the Local Futures and the PhD research. I have been working on a combined project between Local Futures and UCL and the presentation starts by giving an introduction of the project. My PhD investigated the creation of Real-time bespoke geodemographics, and this presentation presents the work i did during the PhD journey.
Designing a Better Planet with Big Data and Sensor Networks (for Intelligent ...Rainer Sternfeld
Planet OS is a data discovery engine designed for real world sensor data. One interface to access your local, remote, open and vendor data.
This presentation answers questions like:
• How is the growth of sensor data challenging traditional data management, storage and usability of it?
• What are the trends in machine data and how will sensor data change Big Data over the next decade?
• How many devices are there on the Internet today? What will happen to this map in 10 years?
• What is the sensor data value chain, what gives you competitive advantage over others?
• Why is sensor data hard?
• Examples and use cases of the markets utilizing the latest robotic and mobile sensing platforms on land (energy production, agriculture, connected cars, weather forecasting), in the ocean (oil & gas, marine acoustics, shipping, environmental monitoring), air (drones) and space (nanosatellites, data-driven weather forecasting).
• How Planet OS is solving these challenges with it's Data Discovery Engine and a mission to index the real world? What are the data types we work with? What are the applications and how having a single interface and a single index help organizations to increase their ROI of operations, emergency response and planning?
• The Industrial Internet (GE), The Internet of Everything (Cisco)
• Why Big Data clouds need trust management for secure operations over open networks? (Intertrust)
Multiple user decision making is important in to-day’s location-based service scenarios.Existing query services such as kNN and Skyline queries only consider single user and do not consider user’s preferencesThis system is designed using an authenticated query processing framework based on MR-tree.
Comprehensive experiments and Researchers have tested the effectiveness and robustness of this query and its 100% functional
Hosted PBX- Should You Be a Provider or a Reseller?NetSapiens
The growing demands for Hosted PBX services in the SMB and Enterprise markets have opened up profitable opportunities for service providers. It is easy to decide to capture these opportunities but how should you build your business model? In this webinar, we will discuss the differences in the two most popular strategies for entering into the Hosted PBX space and answer the question that many growing ITSPs have on their mind: "Should I be a provider or a reseller?"
Hadoop Summit San Jose 2014: Costing Your Big Data Operations Sumeet Singh
As organizations begin to make use of large data sets, approaches to understand and manage true costs of big data will become an important facet with increasing scale of operations.
Whether an on-premise or cloud-based platform is used for storing, processing and analyzing data, our approach explains how to calculate the total cost of ownership (TCO), develop a deeper understanding of compute and storage resources, and run the big data operations with its own P&L, full transparency in costs, and with metering and billing provisions. While our approach is generic, we will illustrate the methodology with three primary deployments in the Apache Hadoop ecosystem, namely MapReduce and HDFS, HBase, and Storm due to the significance of capital investments with increasing scale in data nodes, region servers, and supervisor nodes respectively.
As we discuss our approach, we will share insights gathered from the exercise conducted on one of the largest data infrastructures in the world. We will illustrate how to organize cluster resources, compile data required and typical sources, develop TCO models tailored for individual situations, derive unit costs of usage, measure resources consumed, optimize for higher utilization and ROI, and benchmark the cost.
Introducing Multi Valued Vectors Fields in Apache LuceneSease
Since the introduction of native vector-based search in Apache Lucene happened, many features have been developed, but the support for multiple vectors in a dedicated KNN vector field remained to explore. Having the possibility of indexing (and searching) multiple values per field unlocks the possibility of working with long textual documents, splitting them in paragraphs and encoding each paragraph as a separate vector: scenario that is often encountered by many businesses. This talk explores the challenges, the technical design and the implementation activities happened during the work for this contribution to the Apache Lucene project. The audience is expected to get an understanding of how multi-valued fields can work in a vector-based search use-case and how this feature has been implemented.
Optique - to provide semantic end-to-end connection between users and data sources; enable users to rapidly formulate intuitive queries using familiar vocabularies and conceptualisations and return timely answers from large scale and heterogeneous data sources.
If your business is heavily dependent on the Internet, you may be facing an unprecedented level of network traffic analytics data. How to make the most of that data is the challenge. This presentation from Kentik VP Product and former EMA analyst Jim Frey explores the evolving need, the architecture and key use cases for BGP and NetFlow analysis based on scale-out cloud computing and Big Data technologies.
In these slides, we explore the unique challenges that mobile data present. The high cardinality, low signal to noise ratio and realtime needs have significant system implications. We outline how InMobi tackles these challenges. A specific Data Science use case is also presented. We outline our approach to user segmentation. A brief description of the challenges faced and our attempts to address them is also included.
Survey on confidentiality of the user and query processing on spatial networkeSAT Journals
Abstract
The administration of transhipment systems has become increasingly important in many applications such as position-based services, supply cycle management, travel control, and so on. These applications usually involve queries over spatial networks with vigorously changing and problematical travel conditions. There may be possibilities of user's privacy violated when they are querying about the location information on the third party servers where the location information about the users will be tracked. The malicious attackers may steal the location information about the users. The k nearest neighbour query verification with location points on Voronoi diagram increases the verification cost on mobile clients. The reverse nearest neighbour queries by assigning each object and query with a safe region is applied such that the expensive recomputation is not required as long as the query and objects remain in their respective safe regions. The proposed system reduces the communication cost in client-server architectures because an object does not report its location to the server unless it leaves its safe region or the server sends a location update request. Hilbert curve is used here for the capability of partially retaining the neighbouring adjacency of the original data. The user data is protected by applying Hilbert transform over the original values and storing the transformed values in the Hilbert curve.
Keywords— Hilbert Curve, Voronoi diagram, Hilbert Transform
Data Usability Assessment for Remote Sensing Data: Accuracy of Interactive Da...Beniamino Murgante
Data Usability Assessment for Remote Sensing Data: Accuracy of Interactive Data Quality Interpretation
Erik Borg, Bernd Fichtelmann - German Aerospace Center, German Remote Sensing Data Center
Hartmut Asche - Department of Geography, University of Potsdam
Since the introduction of native vector-based search in Apache Lucene happened, many features have been developed, but the support for multiple vectors in a dedicated KNN vector field remained to explore. Having the possibility of indexing (and searching) multiple values per field unlocks the possibility of working with long textual documents, splitting them into paragraphs, and encoding each paragraph as a separate vector: a scenario that is often encountered by many businesses. This talk explores the challenges, the technical design and the implementation activities that happened during the work for this contribution to the Apache Lucene project. The audience is expected to get an understanding of how multi-valued fields can work in a vector-based search use case and how this feature has been implemented.
Slide 1: Title Slide
Extrachromosomal Inheritance
Slide 2: Introduction to Extrachromosomal Inheritance
Definition: Extrachromosomal inheritance refers to the transmission of genetic material that is not found within the nucleus.
Key Components: Involves genes located in mitochondria, chloroplasts, and plasmids.
Slide 3: Mitochondrial Inheritance
Mitochondria: Organelles responsible for energy production.
Mitochondrial DNA (mtDNA): Circular DNA molecule found in mitochondria.
Inheritance Pattern: Maternally inherited, meaning it is passed from mothers to all their offspring.
Diseases: Examples include Leber’s hereditary optic neuropathy (LHON) and mitochondrial myopathy.
Slide 4: Chloroplast Inheritance
Chloroplasts: Organelles responsible for photosynthesis in plants.
Chloroplast DNA (cpDNA): Circular DNA molecule found in chloroplasts.
Inheritance Pattern: Often maternally inherited in most plants, but can vary in some species.
Examples: Variegation in plants, where leaf color patterns are determined by chloroplast DNA.
Slide 5: Plasmid Inheritance
Plasmids: Small, circular DNA molecules found in bacteria and some eukaryotes.
Features: Can carry antibiotic resistance genes and can be transferred between cells through processes like conjugation.
Significance: Important in biotechnology for gene cloning and genetic engineering.
Slide 6: Mechanisms of Extrachromosomal Inheritance
Non-Mendelian Patterns: Do not follow Mendel’s laws of inheritance.
Cytoplasmic Segregation: During cell division, organelles like mitochondria and chloroplasts are randomly distributed to daughter cells.
Heteroplasmy: Presence of more than one type of organellar genome within a cell, leading to variation in expression.
Slide 7: Examples of Extrachromosomal Inheritance
Four O’clock Plant (Mirabilis jalapa): Shows variegated leaves due to different cpDNA in leaf cells.
Petite Mutants in Yeast: Result from mutations in mitochondrial DNA affecting respiration.
Slide 8: Importance of Extrachromosomal Inheritance
Evolution: Provides insight into the evolution of eukaryotic cells.
Medicine: Understanding mitochondrial inheritance helps in diagnosing and treating mitochondrial diseases.
Agriculture: Chloroplast inheritance can be used in plant breeding and genetic modification.
Slide 9: Recent Research and Advances
Gene Editing: Techniques like CRISPR-Cas9 are being used to edit mitochondrial and chloroplast DNA.
Therapies: Development of mitochondrial replacement therapy (MRT) for preventing mitochondrial diseases.
Slide 10: Conclusion
Summary: Extrachromosomal inheritance involves the transmission of genetic material outside the nucleus and plays a crucial role in genetics, medicine, and biotechnology.
Future Directions: Continued research and technological advancements hold promise for new treatments and applications.
Slide 11: Questions and Discussion
Invite Audience: Open the floor for any questions or further discussion on the topic.
Seminar of U.V. Spectroscopy by SAMIR PANDASAMIR PANDA
Spectroscopy is a branch of science dealing the study of interaction of electromagnetic radiation with matter.
Ultraviolet-visible spectroscopy refers to absorption spectroscopy or reflect spectroscopy in the UV-VIS spectral region.
Ultraviolet-visible spectroscopy is an analytical method that can measure the amount of light received by the analyte.
Hosted PBX- Should You Be a Provider or a Reseller?NetSapiens
The growing demands for Hosted PBX services in the SMB and Enterprise markets have opened up profitable opportunities for service providers. It is easy to decide to capture these opportunities but how should you build your business model? In this webinar, we will discuss the differences in the two most popular strategies for entering into the Hosted PBX space and answer the question that many growing ITSPs have on their mind: "Should I be a provider or a reseller?"
Hadoop Summit San Jose 2014: Costing Your Big Data Operations Sumeet Singh
As organizations begin to make use of large data sets, approaches to understand and manage true costs of big data will become an important facet with increasing scale of operations.
Whether an on-premise or cloud-based platform is used for storing, processing and analyzing data, our approach explains how to calculate the total cost of ownership (TCO), develop a deeper understanding of compute and storage resources, and run the big data operations with its own P&L, full transparency in costs, and with metering and billing provisions. While our approach is generic, we will illustrate the methodology with three primary deployments in the Apache Hadoop ecosystem, namely MapReduce and HDFS, HBase, and Storm due to the significance of capital investments with increasing scale in data nodes, region servers, and supervisor nodes respectively.
As we discuss our approach, we will share insights gathered from the exercise conducted on one of the largest data infrastructures in the world. We will illustrate how to organize cluster resources, compile data required and typical sources, develop TCO models tailored for individual situations, derive unit costs of usage, measure resources consumed, optimize for higher utilization and ROI, and benchmark the cost.
Introducing Multi Valued Vectors Fields in Apache LuceneSease
Since the introduction of native vector-based search in Apache Lucene happened, many features have been developed, but the support for multiple vectors in a dedicated KNN vector field remained to explore. Having the possibility of indexing (and searching) multiple values per field unlocks the possibility of working with long textual documents, splitting them in paragraphs and encoding each paragraph as a separate vector: scenario that is often encountered by many businesses. This talk explores the challenges, the technical design and the implementation activities happened during the work for this contribution to the Apache Lucene project. The audience is expected to get an understanding of how multi-valued fields can work in a vector-based search use-case and how this feature has been implemented.
Optique - to provide semantic end-to-end connection between users and data sources; enable users to rapidly formulate intuitive queries using familiar vocabularies and conceptualisations and return timely answers from large scale and heterogeneous data sources.
If your business is heavily dependent on the Internet, you may be facing an unprecedented level of network traffic analytics data. How to make the most of that data is the challenge. This presentation from Kentik VP Product and former EMA analyst Jim Frey explores the evolving need, the architecture and key use cases for BGP and NetFlow analysis based on scale-out cloud computing and Big Data technologies.
In these slides, we explore the unique challenges that mobile data present. The high cardinality, low signal to noise ratio and realtime needs have significant system implications. We outline how InMobi tackles these challenges. A specific Data Science use case is also presented. We outline our approach to user segmentation. A brief description of the challenges faced and our attempts to address them is also included.
Survey on confidentiality of the user and query processing on spatial networkeSAT Journals
Abstract
The administration of transhipment systems has become increasingly important in many applications such as position-based services, supply cycle management, travel control, and so on. These applications usually involve queries over spatial networks with vigorously changing and problematical travel conditions. There may be possibilities of user's privacy violated when they are querying about the location information on the third party servers where the location information about the users will be tracked. The malicious attackers may steal the location information about the users. The k nearest neighbour query verification with location points on Voronoi diagram increases the verification cost on mobile clients. The reverse nearest neighbour queries by assigning each object and query with a safe region is applied such that the expensive recomputation is not required as long as the query and objects remain in their respective safe regions. The proposed system reduces the communication cost in client-server architectures because an object does not report its location to the server unless it leaves its safe region or the server sends a location update request. Hilbert curve is used here for the capability of partially retaining the neighbouring adjacency of the original data. The user data is protected by applying Hilbert transform over the original values and storing the transformed values in the Hilbert curve.
Keywords— Hilbert Curve, Voronoi diagram, Hilbert Transform
Data Usability Assessment for Remote Sensing Data: Accuracy of Interactive Da...Beniamino Murgante
Data Usability Assessment for Remote Sensing Data: Accuracy of Interactive Data Quality Interpretation
Erik Borg, Bernd Fichtelmann - German Aerospace Center, German Remote Sensing Data Center
Hartmut Asche - Department of Geography, University of Potsdam
Since the introduction of native vector-based search in Apache Lucene happened, many features have been developed, but the support for multiple vectors in a dedicated KNN vector field remained to explore. Having the possibility of indexing (and searching) multiple values per field unlocks the possibility of working with long textual documents, splitting them into paragraphs, and encoding each paragraph as a separate vector: a scenario that is often encountered by many businesses. This talk explores the challenges, the technical design and the implementation activities that happened during the work for this contribution to the Apache Lucene project. The audience is expected to get an understanding of how multi-valued fields can work in a vector-based search use case and how this feature has been implemented.
Slide 1: Title Slide
Extrachromosomal Inheritance
Slide 2: Introduction to Extrachromosomal Inheritance
Definition: Extrachromosomal inheritance refers to the transmission of genetic material that is not found within the nucleus.
Key Components: Involves genes located in mitochondria, chloroplasts, and plasmids.
Slide 3: Mitochondrial Inheritance
Mitochondria: Organelles responsible for energy production.
Mitochondrial DNA (mtDNA): Circular DNA molecule found in mitochondria.
Inheritance Pattern: Maternally inherited, meaning it is passed from mothers to all their offspring.
Diseases: Examples include Leber’s hereditary optic neuropathy (LHON) and mitochondrial myopathy.
Slide 4: Chloroplast Inheritance
Chloroplasts: Organelles responsible for photosynthesis in plants.
Chloroplast DNA (cpDNA): Circular DNA molecule found in chloroplasts.
Inheritance Pattern: Often maternally inherited in most plants, but can vary in some species.
Examples: Variegation in plants, where leaf color patterns are determined by chloroplast DNA.
Slide 5: Plasmid Inheritance
Plasmids: Small, circular DNA molecules found in bacteria and some eukaryotes.
Features: Can carry antibiotic resistance genes and can be transferred between cells through processes like conjugation.
Significance: Important in biotechnology for gene cloning and genetic engineering.
Slide 6: Mechanisms of Extrachromosomal Inheritance
Non-Mendelian Patterns: Do not follow Mendel’s laws of inheritance.
Cytoplasmic Segregation: During cell division, organelles like mitochondria and chloroplasts are randomly distributed to daughter cells.
Heteroplasmy: Presence of more than one type of organellar genome within a cell, leading to variation in expression.
Slide 7: Examples of Extrachromosomal Inheritance
Four O’clock Plant (Mirabilis jalapa): Shows variegated leaves due to different cpDNA in leaf cells.
Petite Mutants in Yeast: Result from mutations in mitochondrial DNA affecting respiration.
Slide 8: Importance of Extrachromosomal Inheritance
Evolution: Provides insight into the evolution of eukaryotic cells.
Medicine: Understanding mitochondrial inheritance helps in diagnosing and treating mitochondrial diseases.
Agriculture: Chloroplast inheritance can be used in plant breeding and genetic modification.
Slide 9: Recent Research and Advances
Gene Editing: Techniques like CRISPR-Cas9 are being used to edit mitochondrial and chloroplast DNA.
Therapies: Development of mitochondrial replacement therapy (MRT) for preventing mitochondrial diseases.
Slide 10: Conclusion
Summary: Extrachromosomal inheritance involves the transmission of genetic material outside the nucleus and plays a crucial role in genetics, medicine, and biotechnology.
Future Directions: Continued research and technological advancements hold promise for new treatments and applications.
Slide 11: Questions and Discussion
Invite Audience: Open the floor for any questions or further discussion on the topic.
Seminar of U.V. Spectroscopy by SAMIR PANDASAMIR PANDA
Spectroscopy is a branch of science dealing the study of interaction of electromagnetic radiation with matter.
Ultraviolet-visible spectroscopy refers to absorption spectroscopy or reflect spectroscopy in the UV-VIS spectral region.
Ultraviolet-visible spectroscopy is an analytical method that can measure the amount of light received by the analyte.
Toxic effects of heavy metals : Lead and Arsenicsanjana502982
Heavy metals are naturally occuring metallic chemical elements that have relatively high density, and are toxic at even low concentrations. All toxic metals are termed as heavy metals irrespective of their atomic mass and density, eg. arsenic, lead, mercury, cadmium, thallium, chromium, etc.
Professional air quality monitoring systems provide immediate, on-site data for analysis, compliance, and decision-making.
Monitor common gases, weather parameters, particulates.
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...Studia Poinsotiana
I Introduction
II Subalternation and Theology
III Theology and Dogmatic Declarations
IV The Mixed Principles of Theology
V Virtual Revelation: The Unity of Theology
VI Theology as a Natural Science
VII Theology’s Certitude
VIII Conclusion
Notes
Bibliography
All the contents are fully attributable to the author, Doctor Victor Salas. Should you wish to get this text republished, get in touch with the author or the editorial committee of the Studia Poinsotiana. Insofar as possible, we will be happy to broker your contact.
Richard's aventures in two entangled wonderlandsRichard Gill
Since the loophole-free Bell experiments of 2020 and the Nobel prizes in physics of 2022, critics of Bell's work have retreated to the fortress of super-determinism. Now, super-determinism is a derogatory word - it just means "determinism". Palmer, Hance and Hossenfelder argue that quantum mechanics and determinism are not incompatible, using a sophisticated mathematical construction based on a subtle thinning of allowed states and measurements in quantum mechanics, such that what is left appears to make Bell's argument fail, without altering the empirical predictions of quantum mechanics. I think however that it is a smoke screen, and the slogan "lost in math" comes to my mind. I will discuss some other recent disproofs of Bell's theorem using the language of causality based on causal graphs. Causal thinking is also central to law and justice. I will mention surprising connections to my work on serial killer nurse cases, in particular the Dutch case of Lucia de Berk and the current UK case of Lucy Letby.
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...Wasswaderrick3
In this book, we use conservation of energy techniques on a fluid element to derive the Modified Bernoulli equation of flow with viscous or friction effects. We derive the general equation of flow/ velocity and then from this we derive the Pouiselle flow equation, the transition flow equation and the turbulent flow equation. In the situations where there are no viscous effects , the equation reduces to the Bernoulli equation. From experimental results, we are able to include other terms in the Bernoulli equation. We also look at cases where pressure gradients exist. We use the Modified Bernoulli equation to derive equations of flow rate for pipes of different cross sectional areas connected together. We also extend our techniques of energy conservation to a sphere falling in a viscous medium under the effect of gravity. We demonstrate Stokes equation of terminal velocity and turbulent flow equation. We look at a way of calculating the time taken for a body to fall in a viscous medium. We also look at the general equation of terminal velocity.
1. Semantic Labeling of Places
based on Phone Usage Features
using Supervised Learning
A. Rivero-Rodriguez, H. Leppäkoski ,R. Piché
1
19.11.2014
Tampere University of Technology
Tampere, Finland
www.tut.fi/posgroup
November 21, 2014
Corpus Christi, Texas, USA
UPIN-LBS
Context inference and awareness
2. This talk describes the design of the algorithms
for a smartphone to learn your significant
places
Training data Features Classifiers
2
19.11.2014
3. MDC dataset
Idiap and NRC-Lausanne
Lausanne Data Collection Campaign (2009-2011)
Records of 200 users over 18 months
Captures all types of information
Users provide extra information (labels!)
Anonymisation
46 GB of data!
3
19.11.2014
Training Data
4. Active Phone Usage
calls, messages
calendar, contacts
application usage
Pasive Phone Usage
network information
system Information
location & movement
4
19.11.2014
Features Available
Training Data
5. 5
19.11.2014
The places were identified by
clustering, then labeled by the user Training Data
200 m
Friend’s Home
Restaurant
Work
Home
6. 6
19.11.2014
We selected 14 features that could be
used by a place-labelling application
Call logs
callsTimeRatio
callsPerHour
Accelerometer
idleStillRatio
walkRatio
vehicleRatio
sportRatio
Features
System
duration
startHour
endHour
nightStay
batteryAvg
chargingTimeRatio
sysActiveRatio
sysActStartsPerHour
8. 8
19.11.2014
visits_20min.csv
places.csv
Definitions
for DB queries Make queries
system
call logs
accel activity
start times,
end times,
used ids,
place labels
Accumulate times & counts,
weight averages
feature vectors
for places
for each
user & place
Compute times,
counts, averages
for each
visit
Compute ratios Compute ratios
feature vectors
for visits
Features
We preprocessed the data to obtain
the features for both approaches
9. 9
19.11.2014
We applied five popular classification
methods to the data Classifiers
ܲ X | ܣ, ܤ =
ܲ ܣ| ܺ ܲ B|ܺ ܲ ܺ )
ܲ ܣ ܲ(ܤ)
Naïve Bayes (NB)
Decision Tree (DT)
K-nearest neighbors (K-NN)
Bagged Tree (DT)
Neural Networks (NN)
10. 10
19.11.2014
10000
9000
8000
7000
6000
5000
4000
3000
2000
1000
0
70%
O
W
H
28%
O
H
W
65%
W
H
O
82%
O
W
H
80%
O
H
W
12%
W
H
O
80%
O
W
H
89%
O
H
W
7%
W
H
O
96%
W O
H
29%
O
H
W
2%
W
H
O
93%
W O
H
25%
O
H
W
7%
W
H
O
Number of cases(visits)
Well Classified Misclassified
NB
53%
DT
75%
BT
77%
NN
61%
KNN
58%
H: Home
W: Work
O: Others
Results - Visits approach
Classifiers
11. 11
19.11.2014
40
35
30
25
20
15
10
5
0
97%
O
H
88%
O
H
W
69%
W
H
O
86%
O
H
91%
O
H
W
67%
W
H
O
97%
O
H
91%
O
H
W
69%
W
H
O
93%
O
H
85%
O
H
W
69%
W
H
O
86%
O
H
79%
O
H
W
53%
W
H
O
Number of cases(visits)
Well Classified Misclassified
NN
71%
DT
81%
NB
84%
BT
85%
KNN
71%
H: Home
W: Work
O: Others
Results - Places approach
Classifiers
12. Naive Bayes and Bagged Decision Tree with Places data-representation
are best
NN and K-NN underperform and are computationally demanding
Most relevant features are: night stay, stay duration, start time,
battery status, idle time
Other classifiers (logistic regresion, support vector machine)
Combine Places and Visits data-representations
12
19.11.2014
Classifiers
Results & Future Work
Alejandro Rivero
alejandro.rivero@tut.fi