Your SlideShare is downloading. ×
Searching Political Data by Strategy
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

×
Saving this for later? Get the SlideShare app to save on your phone or tablet. Read anywhere, anytime – even offline.
Text the download link to your phone
Standard text messaging rates apply

Searching Political Data by Strategy

638
views

Published on

Presentation on the Exposé demonstrator, to enable search of Dutch parliamentary proceedings (the Political Mashup data collection).

Presentation on the Exposé demonstrator, to enable search of Dutch parliamentary proceedings (the Political Mashup data collection).


0 Comments
4 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
638
On Slideshare
0
From Embeds
0
Number of Embeds
3
Actions
Shares
0
Downloads
8
Comments
0
Likes
4
Embeds 0
No embeds

Report content
Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
No notes for slide

Transcript

  • 1. SearchingPolitical Data by Strategy Roberto Cornacchia Jaap Kamps Wouter Alink Arjen P. de Vries info@spinque.com
  • 2. Search by Strategy An iterative 2-stage search process  Express domain knowledge as high-level search strategies  Generate search engine from the strategy  A dynamic REST API  UI controls for unspecified parameters Separate search strategy definition (the how) from actual searching and browsing of data collections (the what)
  • 3. https://devel.spinque.com/ExPoSeApp-20130116/?config=demo# dashboard/demo04: /p/topic/Mokken
  • 4. Search by Strategy captures: Arbitrary retrieval unit types (not just documents)  E.g., expert finding, entity search “Semantic” search  The building blocks operate on scored triples Semi-structured search  Data objects may be structured in hierarchies Exploratory search  Use facets as preferences
  • 5. Exposé Searching the parliamentary proceedings of the Dutch parliament  Complete transcripts of everything said in parliament  Organized by parliamentary session  Detailing who sais what in what role and context
  • 6. Exposé Original data is PDF, transformed into XML by award-winning project Political Mashup  http://politicalmashup.nl/
  • 7. In Politics… Essence is not only what is said, but also by who and to whom, and why Concrete example:  Wilders sais “knettergek” in parliament (in 2007) – is this remarkable?
  • 8. “Knettergek” caseThe word “knettergek” has been used many times in parliament…… but never to address a member of the government
  • 9. Varying result typesUtterances Person / Party / …
  • 10. Flexibility Concrete case:  Maarten: “I cannot find Prof. Mokken, who I know has been spoken about in parliament multiple times!”
  • 11. Flexibility Default indexing uses stemming and normalization But… searching for people’s names (and, as we mention it, many other domain specific terminology) can be negatively affected by stemming  “Mokken” transformed into “mok”, leading us to geographic locations “Mook” and “De Mok”, but not to the famous professor!
  • 12. https://devel.spinque.com/ExPoSeApp-20130116/?config=demo#dashboard/demo05: /p/topic/mokken/p/emphasis_stemming/0
  • 13. Joins to the rescue!Which house speakers from the Rotterdam harbour say what about “Amsterdam”?
  • 14. Semantic Searchbiographies describes person utterance
  • 15. Advantages Define and execute custom build search strategies  Specialized to the task, or even to the search at hand Search multiple data sources at once Explore and refine results interactively “Search provenance”  Complete transparency on how search results were obtained
  • 16. Position Statement Search professionals think in terms of search strategies already Let them design their own strategies, and thereby tailor their search engines So they learn to trust what we claim to be the effective information retrieval techniques!

×