A Fusion Framework for Multimodal Interactive Applications

This research proposes a multimodal fusion framework for high-level data fusion between two or more modalities. It takes as input low-level features extracted from different system devices, then analyses them and identifies intrinsic meanings in these data. Extracted meanings are mutually compared to identify complementarities, ambiguities and inconsistencies, in order to better understand the user's intention when interacting with the system. The whole fusion life cycle is described and evaluated in an office environment scenario, where two co-workers interact by voice and movements that may reveal their intentions. The fusion in this case focuses on combining modalities to capture a context that enhances the user experience.

  1. A Fusion Framework for Multimodal Interactive Applications
     Presented by: Hildeberto Mendonça, Jean-Yves Lionel Lawson, Olga Vybornova, Benoit Macq, Jean Vanderdonckt
     ICMI-MLMI 2009, Cambridge MA, USA, November 2–6, 2009
     Special Session: Fusion Engines for Multimodal Interfaces, November 3, 2009
  2. Motivations
     • How can multimodal fusion be supported so as to maximize reuse and minimize complexity?
     • Any complexity in multimodal fusion should lie in the fusion itself.
     • What already exists should be reused with minimal adaptation.
     • A general life cycle can guarantee standard treatment for each modality.
  3. Research Goal
     To define and develop a multipurpose framework for high-level data fusion in multimodal interactive applications.
  4. Fusion Principles
     • Type: parallel + combined = synergistic
     • Each modality is endowed with meaning.
     • Level: feature (i.e., pattern extraction) + decision (i.e., recognized task)
     • Input devices: multiple
     • Notation: defined by the developer
     • Ambiguity resolution: defined by the developer
     • Time representation (quantitative – qualitative): both
     • Application type: the domain is defined using ontologies
  5. Process
     • Recognition: identification of patterns in input signals.
     • Segmentation: delimitation of identified areas.
     • Meaning extraction: deeper analysis to identify meanings and correlations between segments according to specific domains.
     • Annotation: formal description of segments through domain concepts.
  6. Process
     • The flow is fixed, but processing can start at any point as long as the remaining sequence is respected.
     • Not tied to any particular method: methods are “plugged” in.
     • The focus is on a good level of analysis, not on real-time processing.
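The fixed four-stage flow with pluggable methods and a variable entry point can be sketched as a minimal pipeline. The class name, stage identifiers, and list-based data below are illustrative assumptions, not the framework's actual API:

```python
from typing import Callable, Dict

class ModalityPipeline:
    """Fixed stage sequence; each stage's method is plugged in by the developer."""

    STAGES = ["recognition", "segmentation", "meaning_extraction", "annotation"]

    def __init__(self) -> None:
        self.methods: Dict[str, Callable] = {}

    def plug(self, stage: str, method: Callable) -> None:
        # Any recognition/segmentation/etc. method can be plugged in,
        # as long as it honors the stage's position in the flow.
        if stage not in self.STAGES:
            raise ValueError(f"unknown stage: {stage}")
        self.methods[stage] = method

    def run(self, data, start: str = "recognition"):
        # The flow may enter at any stage, but the remaining
        # sequence is always respected.
        for stage in self.STAGES[self.STAGES.index(start):]:
            data = self.methods[stage](data)
        return data
```

For example, data that arrives already segmented could enter the same pipeline at the meaning-extraction stage, skipping recognition and segmentation while still passing through the rest of the sequence.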
  7–9. OpenInterface
     (Figure-only slides; images not preserved in this transcript.)
  10. Fusion Mechanism
     • Define a process for each modality and run the processes in parallel.
     • Data from each stage are buffered and processed together for the purpose of fusion.
     • Agent-oriented: the problem is solved in a distributed fashion.
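The buffering step above can be sketched as one queue per parallel modality process, with a fusion step that combines the oldest buffered event from each. The class, its methods, and the dictionary-shaped events are assumptions for illustration, not the framework's actual mechanism:

```python
from collections import deque

class FusionBuffer:
    """One FIFO buffer per parallel modality process; fuse() combines them."""

    def __init__(self, modalities):
        self.buffers = {m: deque() for m in modalities}

    def push(self, modality, event):
        # Each modality's pipeline deposits its stage output here.
        self.buffers[modality].append(event)

    def fuse(self):
        """Pop the oldest event from every modality and combine them,
        or return None until every modality has buffered something."""
        if any(not q for q in self.buffers.values()):
            return None
        return {m: q.popleft() for m, q in self.buffers.items()}
```

Holding events in per-modality buffers is what lets the mechanism relate inputs that do not arrive at the same instant, which matters later when fusing events that do not coincide in time.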
  11–12. Fusion Mechanism
     (Figure-only slides; images not preserved in this transcript.)
  13. Fusion Mechanism – OpenInterface
     OI Modeling Tool (figure-only slide).
  14. Fusion Mechanism – Instance
     (Figure-only slide; image not preserved in this transcript.)
  15. Scenario
     “Maybe I can find a book about it in the library.”
     Ronald is moving towards the bookshelves.
  16. Results
     • Managed spatial relationships based on the fixed objects in the room.
     • Performed semantic fusion of events that do not coincide in time.
     • Achieved good results in speaker identification, synchronizing image and speech identification.
     • Created an open framework to manage fusion between two (in our case) or more modalities (in future work).
     • Designed the system so that each component can run on a separate machine, thanks to a distribution mechanism that interchanges data over a TCP/IP network.
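The last point, components interchanging data over a TCP/IP network, can be illustrated with a minimal loopback sketch: one component listens for an event, another sends it. The JSON message format, function names, and single-message handshake are assumptions for illustration only, not the framework's actual protocol:

```python
import json
import socket
import threading

def start_receiver(results):
    """Listen on an ephemeral loopback port for one JSON-encoded event."""
    srv = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    srv.bind(("127.0.0.1", 0))  # OS picks a free port
    port = srv.getsockname()[1]
    srv.listen(1)

    def handle():
        conn, _ = srv.accept()
        with conn:
            # Decode one event and hand it to the fusion side.
            results.append(json.loads(conn.recv(4096).decode()))
        srv.close()

    t = threading.Thread(target=handle)
    t.start()
    return t, port

def send_event(port, event):
    """A component (possibly on another machine) ships one event over TCP."""
    with socket.create_connection(("127.0.0.1", port)) as c:
        c.sendall(json.dumps(event).encode())
```

In a distributed deployment the loopback address would be replaced by the host running the fusion component; the sketch only shows the shape of the exchange.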
  17. Next Steps
     • Implement the segmentation and annotation of 3D content.
     • Migrate the framework to a real-time implementation.
     • Evaluate other methods under the rules of the framework.
     • Continuously extend the framework to support other fusion concepts and implementation methods.
  18. Thank you for your attention!
