Design for Interaction
by Daniel Tunkelang, Chief Scientist of Endeca
An invited presentation at SIGMOD '09 (http://sigmod09.org/)
Research in information retrieval has focused on presenting the most relevant results to a user in response to a free-text search query. Research in database systems assumes a model where the user enters a formal query, and the results are exactly those the user requested. Neither community has emphasized user interaction—a critical concern for practical information access.
As William Goffman noted in the 1960s and Nick Belkin continually reminds us today, the relationship between a document and query, though necessary, is not sufficient to determine relevance—yet ranked retrieval approaches rely heavily or exclusively on this relationship. Meanwhile, recent work on database usability by Jeff Naughton and H.V. Jagadish surfaces the rigidity of database systems that return nothing unless users know how to formulate precise queries.
This talk presents human-computer information retrieval (HCIR) as a general approach that addresses some of the key challenges facing both research communities. A vision first put forward by Gary Marchionini, HCIR expects people and systems to work together to implement information access. Such an approach requires rethinking information access not as a matching or ranking problem, but rather as a communication problem. Specifically, we need interfaces that optimize the bidirectional communication between the user and the system, thus optimizing the symbiotic division of labor between the two.
This talk reviews the history of HCIR efforts and presents ongoing work to implement the HCIR vision. In particular, it presents an interactive set retrieval approach that responds to queries with an overview of the user's current context and an organized set of options for incremental exploration.
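To make the interactive set retrieval idea concrete, here is a minimal stdlib-only sketch (not Endeca's actual implementation; the corpus, facet names, and function are invented for illustration). A query returns the matching set together with facet-value counts, which serve as the organized options for incremental exploration the abstract describes.

```python
from collections import Counter

# Toy corpus: each document has text and facet values (all names hypothetical).
DOCS = [
    {"text": "red running shoes", "brand": "Acme", "category": "shoes"},
    {"text": "blue running shoes", "brand": "Zenith", "category": "shoes"},
    {"text": "red rain jacket", "brand": "Acme", "category": "jackets"},
]

def set_retrieval(query, filters=None):
    """Return the matching set plus facet counts as refinement options."""
    filters = filters or {}
    matches = [
        d for d in DOCS
        if all(t in d["text"] for t in query.split())
        and all(d.get(f) == v for f, v in filters.items())
    ]
    # Summarize the result set: facet-value counts give the user an overview
    # of the current context and a menu of next refinements.
    options = {
        facet: Counter(d[facet] for d in matches)
        for facet in ("brand", "category")
    }
    return matches, options

matches, options = set_retrieval("red")
```

The point of the sketch is the return value: a set plus a summary of how it can be narrowed, rather than a ranked list alone.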
To Infinity and Beyond: 2012 Big Data / Internet Scale Update (November 2012), by John Sing
Presented at the IBM Australia / New Zealand Storage / x / SmartCloud Symposium on Tuesday Nov 13, 2012 in Auckland, New Zealand. Entertaining, fast-paced business and IT-oriented update tutorial on 2012 Internet Scale, Big Data - what it is, where it's going. You'll come away with answers to these questions: "What will happen with Big Data and Internet Scale in 2012 and beyond?" "What do I need to know about mobile technology, consumerization of IT, high performance analytics, and data center design?"
"What are innovative IT customers deploying today to provide competitive advantage?" "What does this mean for my IT infrastructure and my IT staff job skills?" Immediately useful, you'll come away with an up to date November 2012 Big Picture about modern IT technologies, workloads, innovation, and the job skills that will be demanded in 2013 and beyond. I provide this for all of our general benefit in the IT industry - we all need to and must be able to work together to successfully address the economic and sustainability challenges that all of our futures share. My only request is simply that you give me full credit as the authors of this research material.. The opinions expressed are those of myself - industry players and organizations mentioned are to illustrate the concepts described in this research. No express endorsement is intended or implied.
Software Asset Management Strategies Europe 2012 Agenda, by Maria Willamowius
With a comprehensive overview of the software used in the company, the licenses available, and the entitlements actually in use, organizations can both reduce the costs of possible over-licensing and mitigate the risks of under-licensing.
In practice, however, a number of factors often stand in the way of achieving this transparency: processes and roles among the departments involved are not clearly defined, structural changes within the company lead to renewed information deficits, and software vendors continually introduce new licensing models. Moreover, technological developments in IT (service-oriented architectures, virtualization, cloud computing, and Software as a Service) not only reveal significant savings potential but also harbor risks that must be taken into account, especially when some of the software used in the company runs on-premise while other parts run on demand or as Software as a Service.
As a result, quite a few companies are still busy managing their licensing situation operationally instead of deriving strategic value from it, for example for future hardware and software procurement or for the choice of a service partner.
At Software Asset Management Strategies 2012, representatives of renowned companies from a wide range of industries will present their strategies, processes, and system solutions, and report in case studies on concrete projects for realizing transparent and efficient software asset management. Participants will discuss individual solutions and practical approaches, ranging from the definition of fundamental roles, functions, and processes, through tools for software metering, to large-scale projects for the concerted establishment of software asset management.
In interactive discussion rounds held as a World Café, the different perspectives on the topic will be examined in depth. While the Project Café is devoted to exchanging concrete experiences with implementing tools and processes, the Virtuality Café focuses on technological trends and their consequences for software asset and license management. The Compliance Café addresses legal questions and measures for preparing for license audits, and the Efficiency Café covers strategies and detailed processes for using existing licenses more efficiently.
Visit Software Asset Management Strategies 2012 and meet top speakers from renowned companies. Use our interactive B2B platform and enjoy an exciting and rewarding exchange of experience in the heart of Berlin.
We look forward to meeting and welcoming you!
Your we.CONECT team
In this presentation, HintTech, a leading system integrator in digital media, looks at how the benefits of enterprise Web Content Management and Digital Asset Management can be fully realized by connecting them together, presenting real-world benefits and a high-level explanation of the integration options.
This is the content summary of the Novell Tour in Europe and South Africa 2012. Please register for your Tour stop at:
http://www.novell.com/events/tours/novell-tour-2012/
Cloud Communications: Top 5 Advantages for Your Enterprise, by XO Communications
Make no mistake about it: Cloud technologies are here, they’re real, and they’re the answer to your most vexing communications problems. Let’s begin our discussion with a quick overview of generic cloud-based technology. Keep reading.
"You don't need a bigger boat": serverless MLOps for reasonable companiesData Science Milan
It is indeed a wonderful time to build machine learning systems, as the growing ecosystem of tools and shared best practices makes even small teams incredibly productive at scale. In this talk, we present our philosophy for modern, no-nonsense data pipelines, highlighting the advantages of an (almost) pure serverless and open-source approach, and showing how the entire toolchain works - from raw data to model serving - on a real-world dataset.
Finally, we argue that the crucial component for analyzing data pipelines is not the model per se, but the surrounding DAG, and present our proposal for producing automated "DAG cards" from Metaflow classes.
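The "DAG card" proposal can be illustrated with a toy stand-in: a pipeline class whose steps declare their successors, and a function that renders a plain-text summary from that structure. This mirrors the idea of deriving documentation from the DAG itself; the class, its step names, and the rendering function are all invented here and are not Metaflow's actual API.

```python
class TrainFlow:
    """Hypothetical pipeline: each step names the step(s) it hands off to."""
    dag = {
        "start": ["prepare_data"],
        "prepare_data": ["train_model"],
        "train_model": ["serve"],
        "serve": ["end"],
        "end": [],
    }

def dag_card(flow_cls):
    """Render a minimal text 'DAG card' from the flow's declared structure."""
    lines = [f"DAG card for {flow_cls.__name__}"]
    for step, successors in flow_cls.dag.items():
        arrow = " -> ".join(successors) if successors else "(terminal)"
        lines.append(f"  {step}: {arrow}")
    return "\n".join(lines)

card = dag_card(TrainFlow)
```

The design choice the talk argues for is visible even in this sketch: the card is generated from the pipeline definition, so documentation cannot drift from the DAG it describes.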
Bio:
Jacopo Tagliabue was co-founder and CTO of Tooso, an A.I. company in San Francisco acquired by Coveo in 2019. Jacopo is currently the Lead A.I. Scientist at Coveo. When not busy building A.I. products, he is exploring research topics at the intersection of language, reasoning and learning, with several publications at major conferences (e.g. WWW, SIGIR, RecSys, NAACL). In previous lives, he managed to get a Ph.D., do scienc-y things for a pro basketball team, and simulate a pre-Columbian civilization.
Topics: MLOps, Metaflow, model cards.
Support as a Leader in Innovation: A Case Study with Cisco, by noHold, Inc.
Customer Case Study with Cisco.
Support is one of the closest organizations to the Voice of the Customer. Intelligence collected during a support interaction provides valuable insight for marketing, product development, engineering, and more. The challenge is that information is siloed and not transformed into measurable ROI. noHold's customer, the leader in networking devices, has found a way to break the mold and create a paradigm shift by syndicating opportunities across all business units.
Title:
Semantic Equivalence of e-Commerce Queries
Authors:
Aritra Mandal, Daniel Tunkelang, Zhe Wu
Presented at KDD 2023 Workshop on E-Commerce and Natural Language Processing (ECNLP 2023).
Helping Searchers Satisfice through Query Understanding, by Daniel Tunkelang
Behavioral economics transformed how we think about human decision making, rejecting expected utility maximization for the real world of heuristics, biases, and satisficing. In this talk, I'll argue that our thinking about search engines needs a similar transformation. I will compare the Probability Ranking Principle to expected utility maximization and offer ways that AI can help searchers satisfice through query understanding.
This was an invited talk given at the 2023 Walmart AI Summit.
Speaker Bio
Daniel Tunkelang is an independent consultant specializing in search, machine learning / AI, and data science. He completed undergraduate and master's degrees in Computer Science and Math at MIT and a PhD in computer science at CMU. He was a founding employee and chief scientist of Endeca, a search pioneer that Oracle acquired in 2011. He then led engineering and data science teams at Google and LinkedIn. He has written a book on Faceted Search, and he blogs on Medium about search-related topics — particularly query understanding. He has worked with numerous tech companies, retailers, and others, including Algolia, Apple, Canva, Coupang, eBay, Etsy, Flipkart, Home Depot, Oracle, Pinterest, Salesforce, Target, Yelp, and Zoom.
MMM, Search!
An opinionated discussion of search metrics, models, and methods. Presented to the Wikimedia Foundation on April 27, 2020.
About the Speaker
Daniel Tunkelang is an independent consultant specializing in search, discovery, machine learning / AI, and data science.
He was a founding employee of Endeca, a search pioneer that Oracle acquired. After 10 years at Endeca, he moved to Google, where he led a local search team. He then served as a director of data science and search at LinkedIn.
After leaving LinkedIn in 2015, he became an independent consultant. His clients have included Apple, eBay, Coupang, Etsy, Flipkart, Gartner, Pinterest, Salesforce, and Yelp; as well as some of the largest traditional retailers.
Daniel completed undergraduate and master's degrees in Computer Science and Math at MIT and a Ph.D. in computer science at CMU. He wrote a book on Faceted Search, published by Morgan & Claypool, and he blogs on Medium about search-related topics -- particularly about query understanding. He is also active on Twitter, LinkedIn, and Quora.
Enterprise Intelligence: Putting the Pieces Together
http://enterpriserelevance.com/kdd2016/keynote.html
These slides are for a keynote presentation delivered at the Workshop on Enterprise Intelligence, held in conjunction with the 22nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2016).
About the author:
Daniel Tunkelang is a data science and engineering executive who has built and led some of the strongest teams in the software industry. He studied computer science and math at MIT and has a PhD in computer science from CMU. He was a founding employee and chief scientist of Endeca, a search pioneer that Oracle acquired for $1.1B. He led a local search team at Google. He was a director of data science and engineering at LinkedIn, and he established their query understanding team. Daniel is a widely recognized writer and speaker. He is frequently invited to speak at academic and industry conferences, particularly in the areas of information retrieval, web science, and data science. He has written the definitive textbook on faceted search (now a standard for ecommerce sites), established an annual symposium on human-computer interaction and information retrieval, and authored 24 US patents. His social media posts have attracted over a million page views. Daniel advises and consults for companies that can benefit strategically from his expertise. His clients range from early-stage startups to "unicorn" technology companies like Etsy and Pinterest. He helps companies make decisions around algorithms, technology, product strategy, hiring, and organizational structure.
Query understanding is about focusing less on the results and more on the query. It’s about figuring out what the searcher wants, rather than scoring and ranking results. Once you’ve established this mindset, your approach to search changes: you focus on query performance rather than ranking.
Presented at QConSF 2016: https://qconsf.com/sf2016/presentation/query-understanding-manifesto
I delivered this keynote at the Fast Forward Labs Data Leadership Conference on April 28, 2016. You can find related materials in the following publications:
https://www.oreilly.com/ideas/where-should-you-put-your-data-scientists
http://firstround.com/review/doing-data-science-right-your-most-common-questions-answered/
Data Science: A Mindset for Productivity
Keynote at 2015 Ronin Labs West Coast CTO Summit
https://www.eventjoy.com/e/west-coast-cto-summit-2015
Abstract
Data science isn't just about using a collection of technologies and algorithms. Data science requires a mindset that solves problems at a higher level of abstraction. How do we model utility when we think about optimization? How do we decide which hypotheses to test? How do we allocate our scarce resources to make progress?
There are no silver bullets. But I'll share what I've learned from a variety of contexts over the course of my work at Endeca, Google, and LinkedIn; and I hope you'll leave this talk with some practical wisdom you can apply to your next data science project.
My Three Ex’s: A Data Science Approach for Applied Machine Learning
Daniel Tunkelang (LinkedIn)
Presented at QCon San Francisco 2014 in the Applied Machine Learning and Data Science track
https://qconsf.com/presentation/my-three-ex%E2%80%99s-data-science-approach-applied-machine-learning
Abstract
This talk is about applying machine learning to solve problems.
It’s not a talk about machine learning — or at least not about the theory of machine learning. Theoretical machine learning requires a deep understanding of computer science and statistics. It’s one of the most studied areas of computer science, and advances in theoretical machine learning give us hope of solving the world’s “AI-hard” problems.
Applied machine learning is more grounded but no less important. We are surrounded by opportunities to apply classifiers, learn rules, compute similarity, and assemble clusters. We don’t need to develop new algorithms for any of these problems — our textbooks and open-source libraries have done that hard work for us.
But algorithms are not enough. Applying machine learning to solve problems requires a data science mindset that transcends the algorithmic details.
In this talk, I’ll communicate the data science mindset by describing my three ex’s: express, explain, and experiment. These three activities are the pillars of a successful strategy for applying machine learning to solve problems. Whether you’re a machine learning novice or expert, I hope you’ll leave this talk with some practical wisdom you can apply to your next project.
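The claim that "our textbooks and open-source libraries have done that hard work for us" can be made concrete with one of the building blocks the abstract lists, computing similarity. The snippet below is a generic illustration using only the standard library; the example strings are invented.

```python
import math
from collections import Counter

# Applying, not inventing: cosine similarity between bag-of-words vectors,
# the kind of off-the-shelf building block the talk describes.

def cosine_similarity(a: str, b: str) -> float:
    va, vb = Counter(a.split()), Counter(b.split())
    dot = sum(va[t] * vb[t] for t in va)
    norm = math.sqrt(sum(c * c for c in va.values())) * \
           math.sqrt(sum(c * c for c in vb.values()))
    return dot / norm if norm else 0.0

sim = cosine_similarity("machine learning for search",
                        "search with machine learning")
```

The data science mindset enters not in writing this function but in the three ex's around it: expressing the problem as similarity, explaining what a score means, and experimenting to see whether it helps.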
Web Science: How is it different?
Daniel Tunkelang, LinkedIn
Keynote Address at ACM Web Science 2014 Conference
The scientific method of observation, measurement, and experiment may be our greatest achievement as a species. The technological innovation we enjoy today is the product of a culture of systematized scientific experimentation.
But historically scientific experimentation has been expensive. Experiments consumed natural resources, took a long time to conduct, and required even more time and labor to analyze. In order to be productive, scientists have had to factor these costs into their work and to optimize accordingly.
Web science is different. Not, as some have speciously argued, because big data has made the scientific method obsolete. The key difference is that web science has changed the economics of scientific experimentation. Thus, even as web scientists apply the traditional scientific method, they optimize based on very different economics.
In this talk, I'll survey how web science has changed our approach to experimentation, for better and for worse. Specifically, I'll talk about differences in hypothesis generation, offline analysis, and online testing.
Bio
Daniel Tunkelang is Head of Query Understanding at LinkedIn, where he previously formed and led the product data science team. LinkedIn search allows members to find people, companies, jobs, groups and other content. His team aims to provide users with the best possible results that satisfy their information needs and help to get insights from professional data. Tunkelang has BS and MS degrees in computer science and math from MIT, and a PhD in computer science from CMU. He co-founded the annual symposium on human-computer interaction and information retrieval (HCIR) and wrote the first book on Faceted Search (Morgan and Claypool 2009). Prior to joining LinkedIn, Tunkelang was Chief Scientist of Endeca (acquired by Oracle in 2011 for $1.1B) and leader of the local search quality team at Google, mapping local businesses to their home pages. He is the co-inventor of 20 patents.
Better Search Through Query Understanding
Presented as a Data Talk at Intuit on April 22, 2014
Search is a fundamental problem of our time — we use search engines daily to satisfy a variety of personal and professional information needs. But search engine development still feels stuck in an information retrieval paradigm that focuses on result ranking. In this talk, I’ll advocate an emphasis on query understanding. I’ll talk about how we implement query understanding at LinkedIn, and I’ll present examples from the broader web. Hopefully you’ll come out with a different perspective on search and share my appreciation for how we can improve search through query understanding.
About the Speaker
Daniel Tunkelang leads LinkedIn's efforts around query understanding. Before that, he led LinkedIn's product data science team. He previously led a local search quality team at Google and was a founding employee of Endeca (acquired by Oracle in 2011). He has written a textbook on faceted search, and is a recognized advocate of human-computer interaction and information retrieval (HCIR). He has a PhD in Computer Science from CMU, as well as BS and MS degrees from MIT.
Keynote at CIKM 2013 Workshop on Data-driven User Behavioral Modelling and Mining from Social Media
Social Search in a Professional Context
Daniel Tunkelang (LinkedIn)
Social networks bring a new dimension to search. Instead of looking for web pages or text documents, LinkedIn members search a world of entities connected by a rich graph of relationships. Search is a fundamental part of the LinkedIn ecosystem, as it helps our members find and be found. Unlike most search applications, LinkedIn's search experience is highly personalized: two LinkedIn members performing the same search query are likely to see completely different results. Delivering the right results to the right person depends on our ability to leverage each member's unique professional identity and network. In this talk, I'll describe the kinds of search behavior we see on LinkedIn, and some of the approaches we've taken to help our members address their information needs.
Find and Be Found: Information Retrieval at LinkedIn
SIGIR 2013 Industry Track Presentation
http://sigir2013.ie/industry_track.html
LinkedIn has a unique data collection: the 200M+ members who use LinkedIn are also the most valuable entities in our corpus, which consists of people, companies, jobs, and a rich content ecosystem. Our members use LinkedIn to satisfy a diverse set of navigational and exploratory information needs, which we address by leveraging semi-structured and social content to understand their query intent and deliver a personalized search experience. In this talk, we will discuss some of the unique challenges we face in building the LinkedIn search platform, the solutions we've developed so far, and the open problems we see ahead of us.
Shakti Sinha heads LinkedIn's search relevance team, and has been making key contributions to LinkedIn's search products since 2010. He previously worked at Google as both a research intern and a software engineer. He has an MS in Computer Science from Stanford, as well as a BS degree from College of Engineering, Pune.
Daniel Tunkelang leads LinkedIn's efforts around query understanding. Before that, he led LinkedIn's product data science team. He previously led a local search quality team at Google and was a founding employee of Endeca (acquired by Oracle in 2011). He has written a textbook on faceted search, and is a recognized advocate of human-computer interaction and information retrieval (HCIR). He has a PhD in Computer Science from CMU, as well as BS and MS degrees from MIT.
Search as Communication: Lessons from a Personal Journey
by Daniel Tunkelang (Head of Query Understanding, LinkedIn)
Presented at Etsy's Code as Craft Series on May 21, 2013
When I tell people I spent a decade studying computer science at MIT and CMU, most assume that I focused my studies in information retrieval — after all, I’ve spent most of my professional life working on search.
But that’s not how it happened. I learned about information extraction as a summer intern at IBM Research, where I worked on visual query reformulation. I learned how search engines work by building one at Endeca. It was only after I’d hacked my way through the problem for a few years that I started to catch up on the rich scholarly literature of the past few decades.
As a result, I developed a point of view about search without the benefit of academic conventional wisdom. Specifically, I came to see search not so much as a ranking problem as a communication problem.
In this talk, I’ll explain my communication-centric view of search, offering examples, general techniques, and open problems.
--
Daniel Tunkelang is Head of Query Understanding at LinkedIn. Educated at MIT and CMU, he has spent his career working on big data, addressing key challenges in search, data mining, user interfaces, and network analysis. He co-founded enterprise search and business intelligence pioneer Endeca, where he spent a decade as its Chief Scientist. In 2011, Endeca was acquired by Oracle for over $1B. Before LinkedIn, he led a team at Google working on local search quality. Daniel has authored fifteen patents, written a textbook on faceted search, and created the annual symposium on human-computer interaction and information retrieval.
Enterprise Search: How Do We Get There From Here?
by Daniel Tunkelang (Head of Query Understanding, LinkedIn)
Keynote at 2013 Enterprise Search Summit
We've been tackling the challenges of enterprise and site search for at least 3 decades. We've succeeded to the point that search is the gateway to many of our information repositories. Nonetheless, users of enterprise search systems are frustrated with these systems' shortcomings. We see this frustration in surveys, but, more importantly, most of us experience it personally in our daily work life. We all dream of a world where searching any information repository is as effective as searching the web—perhaps even more so. A world where we find what we're looking for, or quickly determine that it doesn't exist. Is this Utopia possible? If so, how do we get there from here? Or at least somewhere close? In this talk, Tunkelang reviews the track record of enterprise search. He talks about what's worked and what hasn't, especially as compared to web search. Finally, he proposes some paths to bring us closer to our dream.
--
Daniel Tunkelang is Head of Query Understanding at LinkedIn. Educated at MIT and CMU, he has spent his career working on big data, addressing key challenges in search, data mining, user interfaces, and network analysis. He co-founded enterprise search and business intelligence pioneer Endeca, where he spent a decade as its Chief Scientist. In 2011, Endeca was acquired by Oracle for over $1B. Before LinkedIn, he led a team at Google working on local search quality. Daniel has authored fifteen patents, written a textbook on faceted search, and created the annual symposium on human-computer interaction and information retrieval.
Big Data, We Have a Communication Problem
by Daniel Tunkelang
Presented on April 30, 2013 at the TTI/Vanguard Conference on Ginormous Systems
http://www.ttivanguard.com/conference/2013/ginormous.html
It's a cliché that we live in a world of Big Data. But the bottleneck in understanding data is not computational. Rather, the biggest challenge is designing technical solutions that effectively leverage human cognitive ability. Data analysis systems should augment people's capabilities rather than replace them. This argument is as old as computer science itself: in 1962, Doug Engelbart said that the goal of technology is “the enhancement of human intellect by increasing the capability of a human to approach a complex problem situation.” Algorithms extract signal from raw data, but people fill in the gaps, creating models and evaluating analyses.
Empowering people to understand data is not just a surface problem of building better interfaces and visualizations. We need to interact with data not only after performing computational analysis, but throughout the analysis process in order to improve our models and algorithms. In order to do so, we need tools and processes specifically designed to offer people transparency, guidance, and control.
Human-computer information retrieval has been revolutionizing our approach to information seeking -- no modern search engine limits users to black-box relevance ranking and ten blue links. We need to take similar steps in our analysis of big data, making people the center of the analysis process and developing the technical innovations that enable people to fulfill this role.
How To Interview a Data Scientist
Daniel Tunkelang
Presented at the O'Reilly Strata 2013 Conference
Video: https://www.youtube.com/watch?v=gUTuESHKbXI
Interviewing data scientists is hard. The tech press sporadically publishes lists of “best” interview questions, many of which are cringe-worthy.
At LinkedIn, we put a heavy emphasis on the ability to think through the problems we work on. For example, if someone claims expertise in machine learning, we ask them to apply it to one of our recommendation problems. And, when we test coding and algorithmic problem solving, we do it with real problems that we’ve faced in the course of our day jobs. In general, we try as hard as possible to make the interview process representative of actual work.
In this session, I’ll offer general principles and concrete examples of how to interview data scientists. I’ll also touch on the challenges of sourcing and closing top candidates.
Information, Attention, and Trust: A Hierarchy of Needs
Daniel Tunkelang
Presented by Daniel Tunkelang, LinkedIn Director of Data Science, at Stanford's 2nd annual conference on Computational Social Science (CSS), hosted by the Institute for Research in the Social Sciences (IRiSS).
Details at https://iriss.stanford.edu/css/conference-agenda-2013
Data By The People, For The People
Daniel Tunkelang
Director, Data Science at LinkedIn
Invited Talk at the 21st ACM International Conference on Information and Knowledge Management (CIKM 2012)
LinkedIn has a unique data collection: the 175M+ members who use LinkedIn are also the content that those same members access through our information retrieval products. LinkedIn members performed over 4 billion professionally-oriented searches in 2011, most of them to find and discover other people. Every LinkedIn search and recommendation is deeply personalized, reflecting the user's current employment, career history, and professional network. In this talk, I will describe some of the challenges and opportunities that arise from working with this unique corpus. I will discuss work we are doing in the areas of relevance, recommendation, and reputation, as well as the ecosystem we have developed to incentivize people to provide the high-quality semi-structured profiles that make LinkedIn so useful.
Bio:
Daniel Tunkelang leads the data science team at LinkedIn, which analyzes terabytes of data to produce products and insights that serve LinkedIn's members. Prior to LinkedIn, Daniel led a local search quality team at Google. Daniel was a founding employee of faceted search pioneer Endeca (recently acquired by Oracle), where he spent ten years as Chief Scientist. He has authored fourteen patents, written a textbook on faceted search, created the annual workshop on human-computer interaction and information retrieval (HCIR), and participated in the premier research conferences on information retrieval, knowledge management, databases, and data mining (SIGIR, CIKM, SIGMOD, SIAM Data Mining). Daniel holds a PhD in Computer Science from CMU, as well as BS and MS degrees from MIT.
Content, Connections, and Context
Daniel Tunkelang, LinkedIn
Keynote at Workshop on Recommender Systems and the Social Web
At 6th ACM International Conference on Recommender Systems (RecSys 2012)
Recommender systems for the social web combine three kinds of signals to relate the subject and object of recommendations: content, connections, and context.
Content comes first - we need to understand what we are recommending and to whom we are recommending it in order to decide whether the recommendation is relevant. Connections supply a social dimension, both as inputs to improve relevance and as social proof to explain the recommendations. Finally, context determines where and when a recommendation is appropriate.
I'll talk about how we use these three kinds of signals in LinkedIn's recommender systems, as well as the challenges we see in delivering social recommendations and measuring their relevance.
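To make the three-signal framing concrete, here is a minimal illustrative sketch of how content, connections, and context might combine into a single recommendation score. This is not LinkedIn's implementation; all names, weights, and the normalization choices are hypothetical.

```python
from dataclasses import dataclass, field

@dataclass
class Member:
    skills: set            # content signal: what this member is about
    connections: set = field(default_factory=set)  # connection signal

def content_score(viewer: Member, candidate: Member) -> float:
    # Content: Jaccard overlap between the viewer's and candidate's skills.
    if not viewer.skills or not candidate.skills:
        return 0.0
    return len(viewer.skills & candidate.skills) / len(viewer.skills | candidate.skills)

def connection_score(viewer: Member, candidate: Member) -> float:
    # Connections: mutual connections serve both as a relevance input
    # and as social proof ("you both know ..."); capped at 10 mutuals.
    mutual = viewer.connections & candidate.connections
    return min(1.0, len(mutual) / 10.0)

def recommend_score(viewer: Member, candidate: Member,
                    context_boost: float = 1.0) -> float:
    # Context: a multiplier for where/when the recommendation appears,
    # e.g., boosting job candidates when the viewer is on a hiring page.
    return context_boost * (0.6 * content_score(viewer, candidate)
                            + 0.4 * connection_score(viewer, candidate))
```

In practice the weights would be learned rather than hand-set, and context usually gates which recommendations are shown at all, not just how they are scored.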