• Share
  • Email
  • Embed
  • Like
  • Save
  • Private Content
Financial Comic Information Retrieval System
 

Financial Comic Information Retrieval System

on

  • 431 views

 

Statistics

Views

Total Views
431
Views on SlideShare
427
Embed Views
4

Actions

Likes
1
Downloads
2
Comments
0

2 Embeds 4

http://progressreport4all.blogspot.com 3
http://www.slideshare.net 1

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

    Financial Comic Information Retrieval System Financial Comic Information Retrieval System Presentation Transcript

    • Financial ComicInformation Retrieval System
      2010/05/28
      1
    • Outline
      Architecture of IR system
      Indexing process
      Query process
      2
    • Indexing process
      MySQL Database
      Text Acquisition
      Index Creation
      Index
      Financial Comics
      資料來源:鉅融全球資本市場演進知識庫
      http://www.global5capital.com
      Text Transformation
      3
    • Indexing process
      Text Acquisition
      Store the description of Financial Comics in the database
      Database schema
      4
    • Indexing process
      Text Transformation
      Convert text encoding to UTF-8
      Stopping
      Filter punctuation and number from document
      Filter a single English alphabet
      5
    • Indexing process
      Index Creation
      Unigram
      Bigram
      Word Segmentation
      Yahoo! 斷章取義API
      Compute tf.idf weight for index term
      tf(term frequency)
      idf(inverse document frequency)
      6
    • 7
      idf value
      tf value
    • Query process
      MySQL Database
      User Interaction
      Ranking
      Index
      8
    • Query process
      User Interaction
      Construct the display of top 10 documents for a query
      Highlight keywords
      Ranking
      Measure by tf∙idf weight
      9
    • Demo
      10