1. The document proposes OptWedge, an optimized visualization cue for guiding users to off-screen points of interest.
2. It develops a cognitive cost model based on how wedge shape impacts user estimation error and accounts for bias and individual differences.
3. An experiment compares user estimation accuracy for vanilla, unbiased, and biased wedge visualizations, finding optimized wedges improve accuracy for short distances.
Google Glass, The META and Co. - How to calibrate your Optical See-Through He...Jens Grubert
Slides from our ISMAR 2014 tutorial http://stctutorial.icg.tugraz.at/
Abstract:
Head Mounted Displays such as Google Glass and the META have the potential to spur consumer-oriented Optical See-Through Augmented Reality applications. A correct spatial registration of those displays relative to a user’s eye(s) is an essential problem for any HMD-based AR application.
At our ISMAR 2014 tutorial we provide an overview of established and novel approaches for the calibration of those displays (OST calibration) including hands on experience in which participants will calibrate such head mounted displays.
For the full video of this presentation, please visit: https://www.edge-ai-vision.com/2021/08/robust-object-detection-under-dataset-shifts-a-presentation-from-arm/
Partha Maji, Principal Research Scientist at Arm’s Machine Learning Research Lab, presents the “Robust Object Detection Under Dataset Shifts” tutorial at the May 2021 Embedded Vision Summit.
In image classification tasks, the evaluation of models’ robustness to increased dataset shifts with a probabilistic framework is very well studied. However, object detection (OD) tasks pose other challenges for uncertainty estimation and evaluation. For example, one needs to evaluate both the quality of the label uncertainty (i.e., what?) and spatial uncertainty (i.e., where?) for a given bounding box, but that evaluation cannot be performed with more traditional average precision metrics (e.g., mAP).
In this talk, Maji discusses how to adapt well-established object detection models to generate uncertainty estimations by introducing stochasticity in the form of Monte Carlo Dropout (MC-Drop). He also discusses how such techniques could be extended to a broad class of embedded vision tasks to improve robustness.
A Probabilistic U-Net for Segmentation of Ambiguous ImagesSeunghyun Hwang
Review : A Probabilistic U-Net for Segmentation of Ambiguous Images
- by Seunghyun Hwang (Yonsei University, Severance Hospital, Center for Clinical Data Science)
PR-270: PP-YOLO: An Effective and Efficient Implementation of Object DetectorJinwon Lee
TensorFlow Korea 논문읽기모임 PR12 270번째 논문 review입니다.
이번 논문은 Baidu에서 나온 PP-YOLO: An Effective and Efficient Implementation of Object Detector입니다. YOLOv3에 다양한 방법을 적용하여 매우 높은 성능과 함께 매우 빠른 속도 두마리 토끼를 다 잡아버린(?) 그런 논문입니다. 논문에서 사용한 다양한 trick들에 대해서 좀 더 깊이있게 살펴보았습니다. Object detection에 사용된 기법 들 중에 Deformable convolution, Exponential Moving Average, DropBlock, IoU aware prediction, Grid sensitivity elimination, MatrixNMS, CoordConv, 등의 방법에 관심이 있으시거나 알고 싶으신 분들은 영상과 발표자료를 참고하시면 좋을 것 같습니다!
논문링크: https://arxiv.org/abs/2007.12099
영상링크: https://youtu.be/7v34cCE5H4k
Google Glass, The META and Co. - How to calibrate your Optical See-Through He...Jens Grubert
Slides from our ISMAR 2014 tutorial http://stctutorial.icg.tugraz.at/
Abstract:
Head Mounted Displays such as Google Glass and the META have the potential to spur consumer-oriented Optical See-Through Augmented Reality applications. A correct spatial registration of those displays relative to a user’s eye(s) is an essential problem for any HMD-based AR application.
At our ISMAR 2014 tutorial we provide an overview of established and novel approaches for the calibration of those displays (OST calibration) including hands on experience in which participants will calibrate such head mounted displays.
For the full video of this presentation, please visit: https://www.edge-ai-vision.com/2021/08/robust-object-detection-under-dataset-shifts-a-presentation-from-arm/
Partha Maji, Principal Research Scientist at Arm’s Machine Learning Research Lab, presents the “Robust Object Detection Under Dataset Shifts” tutorial at the May 2021 Embedded Vision Summit.
In image classification tasks, the evaluation of models’ robustness to increased dataset shifts with a probabilistic framework is very well studied. However, object detection (OD) tasks pose other challenges for uncertainty estimation and evaluation. For example, one needs to evaluate both the quality of the label uncertainty (i.e., what?) and spatial uncertainty (i.e., where?) for a given bounding box, but that evaluation cannot be performed with more traditional average precision metrics (e.g., mAP).
In this talk, Maji discusses how to adapt well-established object detection models to generate uncertainty estimations by introducing stochasticity in the form of Monte Carlo Dropout (MC-Drop). He also discusses how such techniques could be extended to a broad class of embedded vision tasks to improve robustness.
A Probabilistic U-Net for Segmentation of Ambiguous ImagesSeunghyun Hwang
Review : A Probabilistic U-Net for Segmentation of Ambiguous Images
- by Seunghyun Hwang (Yonsei University, Severance Hospital, Center for Clinical Data Science)
PR-270: PP-YOLO: An Effective and Efficient Implementation of Object DetectorJinwon Lee
TensorFlow Korea 논문읽기모임 PR12 270번째 논문 review입니다.
이번 논문은 Baidu에서 나온 PP-YOLO: An Effective and Efficient Implementation of Object Detector입니다. YOLOv3에 다양한 방법을 적용하여 매우 높은 성능과 함께 매우 빠른 속도 두마리 토끼를 다 잡아버린(?) 그런 논문입니다. 논문에서 사용한 다양한 trick들에 대해서 좀 더 깊이있게 살펴보았습니다. Object detection에 사용된 기법 들 중에 Deformable convolution, Exponential Moving Average, DropBlock, IoU aware prediction, Grid sensitivity elimination, MatrixNMS, CoordConv, 등의 방법에 관심이 있으시거나 알고 싶으신 분들은 영상과 발표자료를 참고하시면 좋을 것 같습니다!
논문링크: https://arxiv.org/abs/2007.12099
영상링크: https://youtu.be/7v34cCE5H4k
Design of Experiments & Regression AnalysisSagar Sable
My role in the project:
• Designed five factorial experiment for analysis of maximum distance travelled in catapult.
• Identified the nuisance’s variable which helped to reduce the noise in the system.
• Analysed the output of experiment using ANOVA method to find significant factors.
• Performed enough replications to create a representative body of data.
• Summarized the results and methodology in a report. (Tool Used-Minitab).
Overview of costs and benefits of virtual reality, potential savings, aspects and models for cost-effectiveness assessment, created by the Virtual Dimension Center (VDC) in Fellbach.
Screencasting and Presenting for EngineersKunal Johar
Engineers often think about the 'how' as the most exciting part of their work. These details often bore what would be candid listeners.
Take a step back, think about what excites others, then ease in your grand challenges. Tie it all together in a story.
he presentation explores the different technique of assigning confidence score at a single feature-level and node-level ( a set of features ). This steps are crucial for constructing a graph representation of video segment and for determining node cost/penalty values before applying graph-cut method.
Lecture notes of the course Future Models I (AR1TWF030), The Why Factory, Directed by Prof. Winy Mass, TU Delft, Faculty of Architecture and Built Environment
Model-Based User Interface Optimization: Part IV: ADVANCED TOPICS - At SICSA ...Aalto University
Tutorial on Model-Based User Interface Optimization. Part IV: ADVANCED TOPICS.
Presented by Antti Oulasvirta (Aalto University) at SICSA Summer School on Computational Interaction in 2015 in Glasgow. Note: This one-day lecture is divided into multiple parts.
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...Neo4j
Leonard Jayamohan, Partner & Generative AI Lead, Deloitte
This keynote will reveal how Deloitte leverages Neo4j’s graph power for groundbreaking digital twin solutions, achieving a staggering 100x performance boost. Discover the essential role knowledge graphs play in successful generative AI implementations. Plus, get an exclusive look at an innovative Neo4j + Generative AI solution Deloitte is developing in-house.
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!SOFTTECHHUB
As the digital landscape continually evolves, operating systems play a critical role in shaping user experiences and productivity. The launch of Nitrux Linux 3.5.0 marks a significant milestone, offering a robust alternative to traditional systems such as Windows 11. This article delves into the essence of Nitrux Linux 3.5.0, exploring its unique features, advantages, and how it stands as a compelling choice for both casual users and tech enthusiasts.
More Related Content
Similar to OptWedge: Cognitive Optimized Guidance toward Off-screen POIs (PDPTA 2021)
Design of Experiments & Regression AnalysisSagar Sable
My role in the project:
• Designed five factorial experiment for analysis of maximum distance travelled in catapult.
• Identified the nuisance’s variable which helped to reduce the noise in the system.
• Analysed the output of experiment using ANOVA method to find significant factors.
• Performed enough replications to create a representative body of data.
• Summarized the results and methodology in a report. (Tool Used-Minitab).
Overview of costs and benefits of virtual reality, potential savings, aspects and models for cost-effectiveness assessment, created by the Virtual Dimension Center (VDC) in Fellbach.
Screencasting and Presenting for EngineersKunal Johar
Engineers often think about the 'how' as the most exciting part of their work. These details often bore what would be candid listeners.
Take a step back, think about what excites others, then ease in your grand challenges. Tie it all together in a story.
he presentation explores the different technique of assigning confidence score at a single feature-level and node-level ( a set of features ). This steps are crucial for constructing a graph representation of video segment and for determining node cost/penalty values before applying graph-cut method.
Lecture notes of the course Future Models I (AR1TWF030), The Why Factory, Directed by Prof. Winy Mass, TU Delft, Faculty of Architecture and Built Environment
Model-Based User Interface Optimization: Part IV: ADVANCED TOPICS - At SICSA ...Aalto University
Tutorial on Model-Based User Interface Optimization. Part IV: ADVANCED TOPICS.
Presented by Antti Oulasvirta (Aalto University) at SICSA Summer School on Computational Interaction in 2015 in Glasgow. Note: This one-day lecture is divided into multiple parts.
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...Neo4j
Leonard Jayamohan, Partner & Generative AI Lead, Deloitte
This keynote will reveal how Deloitte leverages Neo4j’s graph power for groundbreaking digital twin solutions, achieving a staggering 100x performance boost. Discover the essential role knowledge graphs play in successful generative AI implementations. Plus, get an exclusive look at an innovative Neo4j + Generative AI solution Deloitte is developing in-house.
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!SOFTTECHHUB
As the digital landscape continually evolves, operating systems play a critical role in shaping user experiences and productivity. The launch of Nitrux Linux 3.5.0 marks a significant milestone, offering a robust alternative to traditional systems such as Windows 11. This article delves into the essence of Nitrux Linux 3.5.0, exploring its unique features, advantages, and how it stands as a compelling choice for both casual users and tech enthusiasts.
GridMate - End to end testing is a critical piece to ensure quality and avoid...ThomasParaiso2
End to end testing is a critical piece to ensure quality and avoid regressions. In this session, we share our journey building an E2E testing pipeline for GridMate components (LWC and Aura) using Cypress, JSForce, FakerJS…
The Art of the Pitch: WordPress Relationships and SalesLaura Byrne
Clients don’t know what they don’t know. What web solutions are right for them? How does WordPress come into the picture? How do you make sure you understand scope and timeline? What do you do if sometime changes?
All these questions and more will be explored as we talk about matching clients’ needs with what your agency offers without pulling teeth or pulling your hair out. Practical tips, and strategies for successful relationship building that leads to closing the deal.
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...SOFTTECHHUB
The choice of an operating system plays a pivotal role in shaping our computing experience. For decades, Microsoft's Windows has dominated the market, offering a familiar and widely adopted platform for personal and professional use. However, as technological advancements continue to push the boundaries of innovation, alternative operating systems have emerged, challenging the status quo and offering users a fresh perspective on computing.
One such alternative that has garnered significant attention and acclaim is Nitrux Linux 3.5.0, a sleek, powerful, and user-friendly Linux distribution that promises to redefine the way we interact with our devices. With its focus on performance, security, and customization, Nitrux Linux presents a compelling case for those seeking to break free from the constraints of proprietary software and embrace the freedom and flexibility of open-source computing.
Epistemic Interaction - tuning interfaces to provide information for AI supportAlan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
DevOps and Testing slides at DASA ConnectKari Kakkonen
My and Rik Marselis slides at 30.5.2024 DASA Connect conference. We discuss about what is testing, then what is agile testing and finally what is Testing in DevOps. Finally we had lovely workshop with the participants trying to find out different ways to think about quality and testing in different parts of the DevOps infinity loop.
In his public lecture, Christian Timmerer provides insights into the fascinating history of video streaming, starting from its humble beginnings before YouTube to the groundbreaking technologies that now dominate platforms like Netflix and ORF ON. Timmerer also presents provocative contributions of his own that have significantly influenced the industry. He concludes by looking at future challenges and invites the audience to join in a discussion.
Pushing the limits of ePRTC: 100ns holdover for 100 daysAdtran
At WSTS 2024, Alon Stern explored the topic of parametric holdover and explained how recent research findings can be implemented in real-world PNT networks to achieve 100 nanoseconds of accuracy for up to 100 days.
In the rapidly evolving landscape of technologies, XML continues to play a vital role in structuring, storing, and transporting data across diverse systems. The recent advancements in artificial intelligence (AI) present new methodologies for enhancing XML development workflows, introducing efficiency, automation, and intelligent capabilities. This presentation will outline the scope and perspective of utilizing AI in XML development. The potential benefits and the possible pitfalls will be highlighted, providing a balanced view of the subject.
We will explore the capabilities of AI in understanding XML markup languages and autonomously creating structured XML content. Additionally, we will examine the capacity of AI to enrich plain text with appropriate XML markup. Practical examples and methodological guidelines will be provided to elucidate how AI can be effectively prompted to interpret and generate accurate XML markup.
Further emphasis will be placed on the role of AI in developing XSLT, or schemas such as XSD and Schematron. We will address the techniques and strategies adopted to create prompts for generating code, explaining code, or refactoring the code, and the results achieved.
The discussion will extend to how AI can be used to transform XML content. In particular, the focus will be on the use of AI XPath extension functions in XSLT, Schematron, Schematron Quick Fixes, or for XML content refactoring.
The presentation aims to deliver a comprehensive overview of AI usage in XML development, providing attendees with the necessary knowledge to make informed decisions. Whether you’re at the early stages of adopting AI or considering integrating it in advanced XML development, this presentation will cover all levels of expertise.
By highlighting the potential advantages and challenges of integrating AI with XML development tools and languages, the presentation seeks to inspire thoughtful conversation around the future of XML development. We’ll not only delve into the technical aspects of AI-powered XML development but also discuss practical implications and possible future directions.
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
Sudheer Mechineni, Head of Application Frameworks, Standard Chartered Bank
Discover how Standard Chartered Bank harnessed the power of Neo4j to transform complex data access challenges into a dynamic, scalable graph database solution. This keynote will cover their journey from initial adoption to deploying a fully automated, enterprise-grade causal cluster, highlighting key strategies for modelling organisational changes and ensuring robust disaster recovery. Learn how these innovations have not only enhanced Standard Chartered Bank’s data infrastructure but also positioned them as pioneers in the banking sector’s adoption of graph technology.
2. ※ POI: 2D or 3D Point-of-Interest
Guidance toward Off-screen POIs
Off-screen
POI
Display
On-screen
POI
Visualization of
Off-screen POI
3. 1. Draw human attention
to the out of view
2. Providing an additional
POI for small sized display
Applications
obstacle
destination
4. Visualization types for Off-screen POIs
[Müller+14]
1. Overview+Detail 2. Focus+Context 3. Cue-based
More space Not intuitive
Less space
Intuitive
[Miau+16]
Overview
Detailed
view
[Jo+11] [Burigat+]
6. [Petford+19]
Comparing cues
for room-scaled
visualization vs.
Wedge Flashing Point animation
vs.
Comparing cues
for head-mounted display
[Gruenefeld+18]
vs.
Halo Wedge
Research Question in Cue-based Visualization
To decrease an error in the users’ estimation,
1. What cue should we use?
2. How should we visualize the selected cue?
7. This study uses Wedge as a cue and optimize
the shape to decrease an estimation error
Research Question in Cue-based Visualization
To decrease an error in the users’ estimation,
1. What cue should we use?
2. How should we visualize the selected cue?
• shape, color, scale, ...
8. Amodal complement enables us
to imagine the whole from the part
Wedge
POI
Display
Wedge: partially invisible
isosceles triangle
We can estimate POI
location on a invisible
vertex due to
amodal complement
9. Vanilla Wedge (VW) heuristically determined values of
shape-related parameter (𝜃, 𝑑, 𝑙)
Vanilla Wedge [Gustafson+07]
distance 𝒅
leg 𝒍
aperture 𝜽
POI
Problem
• Validity of parameter values
is unclear
• Does not consider cognitive
impact in amodal complement
• Cannot handle constraints on
displaying area
10. Our Solution: Optimization Problem
Problem
• Validity of parameter values
is unclear
• Does not consider cognitive
impact in amodal complement
• Cannot handle constraints on
displaying area
Solution
• Validity of parameter values
is clear
• Consider bias and individual
difference as cognitive impact
• Can introduce any constraint
into optimization problems
OptWedge (Optimized Wedge)
based on minimizing a cognitive cost
11. Normal distribution 𝑷
(depends on Wedge shape)
Estimated points are samples of normal distribution 𝑷(𝒃, 𝝈𝒙, 𝝈𝒚)
individual
difference 𝜎𝑥
bias 𝑏
individual
difference
σy
Proposed method
𝒙
𝒚
Estimated
point
Assumption
12. Ideal normal distribution 𝐐
(mean is on POI)
Proposed method
Normal distribution 𝑷
(depends on Wedge shape)
𝒙
𝒚
Distance from 𝑷(𝒃, 𝝈𝒙, 𝝈𝒚) to ideal normal distribution Q
Cognitive cost
e.g., Kullback-Leibler divergence
13. Proposed method
bias 𝑏
UOW (Unbiased OptWedge)
Vertex position is fixed on
POI and optimization
makes bias zero
BOW
bias 𝑏
(Biased OptWedge)
Vertex position is optimized
so as to counteract bias
15. • Presented 20 subjects with
375 different shapes of Wedge
• Asked subjects to estimate
POI location
• Conducted an experiment
in VR environment to reduce
a burden of input operation
10m
VR controller
ray
Ex.1: Setting
16. • The larger distance 𝑑 is,
the smaller bias 𝑏 is
• There is a trade-off error between
𝜎𝑥 and 𝜎𝑦 concerning aperture 𝜃
Ex.1: Result
𝜎𝑥
𝜎𝑥
𝜎𝑦
𝜎𝑦
𝑏 > 0
𝑏 < 0
𝑑
𝑑
17. individual difference 𝝈𝒙(𝜽, 𝒅, 𝒍)
distance 𝑑
distance 𝑑
individual difference 𝝈𝒚(𝜽, 𝒅, 𝒍)
Ex.1: Result
distance 𝑑
leg 𝑙
aperture 𝜃
bias 𝒃(𝜽, 𝒅, 𝒍)
leg 𝑙
leg 𝑙
Compared the Mean Squared Error [m2]
for polynomial regression (PR) and
Gaussian process regression (GR) models
and found GR model works better
Mean Squared Error [𝐦𝟐]
aperture 𝜃
aperture 𝜃
19. Ex.2 is similar to Ex.1, but..
• Showed VW/UOW/BOW
• Performed gradient descent using
VW parameter as initial point
• Considered constraints on domain
of definition and drawing area
Visualization of cognitive cost
VW
(initial point)
UOW
Constraints
𝜃[rad]
𝑙 [m]
Ex.2: Setting
20. 0
0.5
1
1.5
2
2.5
3
3.5
4
4.5
1 2 3 4 5 6
W1 W2 W3
**
*
*
†
VW
UOW
BOW
**: p < 0.01
*: p < 0.05
†: p < 0.10
Distance to POI [m]
Distance to POI [m]
Ex.2: Result
Cognitive cost
Experimental value
Model prediction
UOW
BOW
VW
Mean squared error
21. 0
0.5
1
1.5
2
2.5
3
3.5
4
4.5
1 2 3 4 5 6
W1 W2 W3
**
*
*
†
VW
UOW
BOW
**: p < 0.01
*: p < 0.05
†: p < 0.10
Distance to POI [m]
Experimental value
GR model prediction
UOW
BOW
VW
Cognitive cost Mean squared error
Ex.2: Result
Distance to POI [m]
UOW/BOW > VW
for short distance
22. Ex.2: Result
Cognitive cost Mean squared error
Experimental value
GR model prediction
UOW
BOW
VW
Distance to POI [m]
UOW/BOW ≒ VW
for middle distance.
0
0.5
1
1.5
2
2.5
3
3.5
4
4.5
1 2 3 4 5 6
W1 W2 W3
**
*
*
†
VW
UOW
BOW
**: p < 0.01
*: p < 0.05
†: p < 0.10
Distance to POI [m]
23. Our model is not accurate
for long distance
Ex.2: Result
Experimental value
GR model prediction
UOW
BOW
VW
Distance to POI [m]
Cognitive cost Mean squared error
0
0.5
1
1.5
2
2.5
3
3.5
4
4.5
1 2 3 4 5 6
W1 W2 W3
**
*
*
†
VW
UOW
BOW
**: p < 0.01
*: p < 0.05
†: p < 0.10
Distance to POI [m]
24. distance VW UOW BOW
short
medium
long △
○ ○
△
△ ×
×
△ △
Q1: Why was no significant difference obtained between
VW and UOW/BOW for medium distance?
Q2: Why did the performance of BOW drop?
Ex.2: Discussion
Q1
Q2
25. Q1: Why was no significant difference obtained between VW
and UOW/BOW for medium distance?
A1: VW has high validity because an initial point is sufficient
close to an optimal point
Ex.2: Discussion
short
distance
long
distance
Initial
point
(VW)
Optimal
point(UOW)
middle
distance
26. Q2: Why did the performance of BOW drop?
A2: Model generalization performance degrades because of
a lack of data around long distance 𝑑.
distance 𝑑
leg 𝑙
aperture 𝜃
bias 𝒃 individual difference 𝝈𝒙
distance 𝑑
leg 𝑙
aperture 𝜃
distance 𝑑
leg 𝑙
aperture 𝜃
individual difference 𝝈𝒚
Ex.2: Discussion
27. We proposed OptWedge based on minimizing a cognitive cost
and designed two kinds of OptWedge including UOW/BOW
From two experiments, we can see that
• OptWedge is more effective for short distance
• Vanilla Wedge has a high validity for middle distance
• Model generalization performance degrades
for long distance (⇒ Future work)
Conclusion
28.
29. Application for other cues
POI
聴収者
POI
POI
time
Circle
POI
透明度
POI
3D figure
Sound
Animation
30. VW
UOW
BOW
• Aperture get bigger as
distance increases for
UOW and BOW
• For BOW, model reflect a
trend that “human
underestimate a distance
to the POI”
Ex.2: Result