The Horizons of News Search<br />SSM Workshop 2010<br />By Richard McCreadie, Craig Macdonald, IadhOunis<br />1<br />
How is News reported?<br />2<br />Newswire companies publish the article<br />Reporters arrive and write an article<br />N...
How do we access that news?<br /><ul><li>Integrated News Search</li></ul>etc . . .<br /><ul><li>News Aggregators</li></ul>...
Motivations for Improving News Search<br /><ul><li>Improving real-time search</li></ul>Detect events more quickly  (freshn...
How can User-Generated Content Help?<br />Freshness<br />Volume<br />Classifying<br />User Queries<br />Ranking<br />Resul...
How can User-Generated Content Help?<br /><ul><li>Observers report news `from the scene’</li></ul> We could detect new eve...
Challenges to be addressed<br /><ul><li>Confidence</li></ul>How can we know when a source is trustworthy?<br />Inaccuracie...
Upcoming SlideShare
Loading in …5
×

Insights on the Horizon of News Search

431 views

Published on

Presentation on future directions for news search. Prepared for the 2010 workshop on search and social media.

Published in: Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
431
On SlideShare
0
From Embeds
0
Number of Embeds
2
Actions
Shares
0
Downloads
4
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide
  • No coincidence chatprograms
  • News search – integrates news articles into rankingAggregators – listing of articles about the query
  • Real time speed, accuracy, coverage, wide def newsFreshness, coverage of viewpoints avoid bias, sample public opinion
  • Reporting speed, cite first sourceComentary detect queries, include in results, users like to see peer views, wide range of storiesNone of these will be easy however!
  • Challenges to overcomeConfidence trustworthyQuality ad value, good posts, sources?Coverage
  • Insights on the Horizon of News Search

    1. 1. The Horizons of News Search<br />SSM Workshop 2010<br />By Richard McCreadie, Craig Macdonald, IadhOunis<br />1<br />
    2. 2. How is News reported?<br />2<br />Newswire companies publish the article<br />Reporters arrive and write an article<br />News is commentated on and discussed . . .<br />Newswire<br />Company<br />Reporter<br />Internet<br />The public witnesses event<br />Event!<br />. . .<br />Both publicly<br />. . .<br />Observer<br />. . .<br />Observers now report events online<br />Instant Messaging <br />Logs<br />And implicitly . . . <br />
    3. 3. How do we access that news?<br /><ul><li>Integrated News Search</li></ul>etc . . .<br /><ul><li>News Aggregators</li></ul>etc . . .<br />3<br />
    4. 4. Motivations for Improving News Search<br /><ul><li>Improving real-time search</li></ul>Detect events more quickly (freshness)<br />Classify user queries more accurately<br /> Attain a better coverage of news stories<br /> Enable the serving of a wider definition of `news’<br /><ul><li>Improve coverage in news results</li></ul> Avoid institutional bias of newswire companies<br /> Provide a sample of `public opinion’<br />4<br />***<br />***<br />What’s the latest news from the SSM Workshop?<br />
    5. 5. How can User-Generated Content Help?<br />Freshness<br />Volume<br />Classifying<br />User Queries<br />Ranking<br />Results<br />5<br />
    6. 6. How can User-Generated Content Help?<br /><ul><li>Observers report news `from the scene’</li></ul> We could detect new events before they are reported<br />Could include the `first source’ of the story in the news ranking<br /><ul><li>Users commentate on news in social media</li></ul>Could we leverage user-generated content to detect news-related queries we might otherwise miss?<br />Inclusion of user-generated content into the results might provide interesting alternative coverage of an event, e.g. from a different perspective than that of the main newswire companies<br />Users like to see what their peers say on a story?<br />We could cover a wide range of news stories than that covered by news articles.<br />6<br />
    7. 7. Challenges to be addressed<br /><ul><li>Confidence</li></ul>How can we know when a source is trustworthy?<br />Inaccuracies/falsification/spam?<br /><ul><li>Quality</li></ul> Does user-generated content really add value?<br />How to distinguish `good’ posts?<br />What sources should we include?<br /><ul><li>Coverage</li></ul>What stories class as news?<br />How to distinguish between real events and `chat’?<br />Should results be diversified?<br />vs<br />?<br />?<br />?<br />?<br />7<br />
    8. 8. Outlook<br /><ul><li>Users now act as both reporters, commentators and consumers of news
    9. 9. This creates exciting possibilities in leveraging the rich information they produce to improve news search
    10. 10. CR: TREC Blog track, Top News Stories Identification</li></ul>8<br />

    ×