Bridging the Gap: Machine Learning for Ubiquitous Computing -- Study Design and Deployment

Study Design and Deployments
Machine Learning for UbiComp
Mayan Goel

Generalization from a fraction of data

Problem
Depends on:
&
Phenomenon Sensed

What do you need to be careful about?
• Study environment/situation
• Participants
• Quantity of data
• Data annotation
• Data collection procedure
• Basically everything!

Study Environment/Situation
• Maintain a balance between controlled and realistic
• Change variables one-by-one
• Keeps the analysis tractable

Participants
• Equal representation

Unlike usual  
machine learning data,

sensor data is
expensive.

How much data?
• Don’t treat it like a black box

• Depends on the algorithm as well

• E.g., Parametric & Non-parametric algorithms

Let’s look at an example
Reference: C.M. Bishop. Pattern Recognition and Machine Learning

Reference: C.M. Bishop. Pattern Recognition and Machine Learning

One heuristic:

Data should be 5-10 times
the number of features

But do we always know
what features we will use?

Run a pilot
study
Visualize &
Analyze
Run
statistics
Collect more
data

Statistics and Visualizations
• Bland Altman Plots
0.7 0.8 0.9 1
−30
−20
−10
0
10
20
30
Flow−rate (Liters/s)
PercentageDifference
0.7
−30
−20
−10
0
10
20
30
Flow−
30 30
Estimated Value

.8 0.9 1
e (Liters/s)
0.7 0.8 0.9 1
−30
−20
−10
0
10
20
30
Flow−rate (Liters/s)
30
Estimated Value
• Bland Altman Plots

• Power Analysis
• t-test
• chi-square test
• ANOVA
• etc.
You either need prior data or an intuition of your problem
to run these analyses

Data Collection Sessions
• Sometimes you need to collect same data multiple
times, in different sessions.
• Example:
4Appliances
1Minute
20Instances

Features
• Mean, Median, Standard Deviation
Mean
StandardDeviation
100%

4Appliances
1Minute
20Instances
3Sessions

In summary
• No secret sauce
• Depends on the problem & the phenomenon sensed
• Run a pilot study
• Be prepared to iterate and collect more data
• Try to make a real time system and test it out

Bridging the Gap: Machine Learning for Ubiquitous Computing -- Study Design and Deployment

Recommended

Recommended

More Related Content

Similar to Bridging the Gap: Machine Learning for Ubiquitous Computing -- Study Design and Deployment

Similar to Bridging the Gap: Machine Learning for Ubiquitous Computing -- Study Design and Deployment (20)

Recently uploaded

Recently uploaded (20)

Bridging the Gap: Machine Learning for Ubiquitous Computing -- Study Design and Deployment