3. Mark A. Greenwood
The University of Sheffield
WP 6 Presentation
Contextualization / Decontextualization
ForgetIT 1st Review Meeting, April 29-30, 2014
Kaiserslautern, Germany
4. WP Objectives
• Determine the context associated with an information object
• Extract an object from its natural context and store it with the
required context in a Submission Information Package (SIP)
• Reintegrate a previously archived information object into active user
Focus of Year 1
• Review the current state-of-the-art
• Generate a formal, ForgetIT specific, model of contextualization
• Develop prototype contextualization components
ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014
Objectives of WP and Year 1 Focus
5. ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014
What is Context? Why Do We Need Context?
The new Quarry Hunslet arrived today.Quarry Hunslet
Elider was built in
1889 as works
number 493 for the
Dinorwic Quarry in
Wales. It can now be
seen at the Llanberis
Lake Railway.
6. Formal Model of Contextualization
• Context, C(t): information surrounding one or more information
pieces at time t and that influences its interpretation
• Interpretation, I(i,C(t),B(t,u)): interpretation of information piece i
done with context C and with background knowledge B of user u at
time t
• Contextualization: for an information piece i, it is the process of
providing additional context c+(i,t0) C(t0) where I(i,C(t0),B(t0,u0)) is
referred to as the intended interpretation
ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014
Contextualization Model
7. Operative model for Contextualization
• Information pieces: items to be contextualized, e.g. text documents,
images, document collections
• Contextualization hooks: elements within the information piece, e.g.
entities, topics, temporal references, other metadata
• Contextualization source: source of the information (made of
contextualization units) exploited for contextualization, e.g. knowledge
bases, document collections
• Contextualization method: it provides a set of contextualization units
that help in understanding and interpreting the information piece
ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014
Contextualization Model
8. ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014
Example Contextualization Source
9. ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014
Image Contextualization
Step 1: find similar collections
Step 2: add contextual information
Archived Image Collections
Seed Image Collection
Distance CalculationSimilar
Step 1: find similar collections
Additional
images
10. ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014
Text Contextualization via Disambiguation
I might visit Paris while travelling through Texas to Oklahoma.
While information extraction would stop at this point, we collect the
surrounding context from the ontology which we store along with the
item being archived.
• Currently we store all concepts and relations within 3 relations of
the disambiguated concepts
11. ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014
Text Re-Contextualization
KB annotation
Wikipedia pages
Sentence
index
Annotated sentences
Sentence
Extraction
Named Entity
Recognition
Temporal
Expression
Extraction
Article annotation
Query
Formulation
Sentence
Ranking
Annotated document
Document
Queries
Sentences
Contextualized
document
Contextualizing
Sentences
Topic
Extraction
Term
Extraction
Sentence
Extraction
Named Entity
Recognition
Temporal
Expression
Extraction
Topic
Extraction
Term
Extraction
12. ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014
Role in Preserve-or-Forget Architecture
13. Andrea Ceroni, Nam Khanh Tran, Nattiya Kanhabua and Claudia Niederée,
Bridging Temporal Context Gaps using Time-Aware Re-Contextualization, (To
appear) Proceedings of the 37th Annual ACM SIGIR Conference (SIGIR'2014),
Goal Coast, Australia, July, 2014.
ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014
Publications