TREC 2016: Looking Forward Panel

Looking Forward
Prof.dr.ir. Arjen P. de Vries
arjen@acm.org
Gaithersburg MD, November 15th, 2016

Top Result:
 50 years of Star Trek
(Article on the Verge about Facebook Like buttons)

Science Fiction
 Defining a TREC task or a track is like time-travel in Back
to the Future
Note to the audience: that is just 74 characters
You could even add the hashtag #TREC #TRECCelebrations
and my Twitter handle @arjenpdevries

Better Search – “Deep Personalization”
 “Even more broadly than trying to get people the right
content based on their context, we as a community need to
be thinking about how to support people through the entire
search experience.”
Jaime Teevan on “Slow Search”
 Search as a dialogue
My first journal paper:
De Vries, Van der Veer and Blanken: Let’s talk about it: dialogues with multimedia databases (1998)

Moving Forward
 Elements of the “Slow Search movement” at TREC today:
- Sessions
- Tasks
- Dynamic domains
- Total recall
- Complex Answer Retrieval (new!)

Missing from TREC!
 Access to rich personal data including email, browsing
history, documents read and contents of the user’s home
directory…

Trade log data!
IR-809: (2011) Feild, H., Allan, J. and Glatt, J.,
"CrowdLogging: Distributed, private, and
anonymous search logging," Proceedings of the
International Conference on Research and
Development in Information Retrieval (SIGIR'11),
pp. 375-384. [View bibtex]
We describe an approach for distributed search log collection, storage, and mining,
with the dual goals of preserving privacy and making the mined information broadly
available. [..] The approach works with any search behavior artifact that can be
extracted from a search log, including queries, query reformulations, and query-
click pairs.

Open challenges
 How to select the part of your log data you are willing to
trade?
 How to estimate the value of this log data?
 And a social challenge, not so much scientific:
How to get people to participate?

The TREC Brand
 A community that creates reusable test collections

Reproducibility vs. Representativeness
 Increasing representativeness of a TREC task should not
come at the cost of sacrificing reproducibility
(104 characters )
Samar, T., Bellogín, A. & de Vries, A.P. Inf Retrieval J (2016) 19: 230.
doi: 10.1007/s10791-015-9276-9

Baltimore
 Title query of TREC topic 478 for the information need “Who is
the mayor of Baltimore”
 “The honest conclusion of this year’s evaluation should be that we
underestimated the problem of handling Web data. Surprising is
the performance of the title-only queries doing better than queries
including description or even narrative. It seems that the web-track
topics are really different from the previous TREC topics in the ad-
hoc task, for which we never weighted title terms different from
description or narrative.”
(Quote from the CWI TREC-9 paper)

TREC 2016: Looking Forward Panel

More Related Content

Similar to TREC 2016: Looking Forward Panel

More from Arjen de Vries

Recently uploaded

TREC 2016: Looking Forward Panel