8. How does that work?
Inverted Index
Term Normalization
1. Similar words (merge)
2. Stop words (remove)
3. +relevance –size on disk
Term Document Ids
And 1,2,3
Big 2,4,7
Fire 1
Keep 7,8
keeper 3,4
the 1,8
9. Analyzers
“@Andy52 went to school yesterday!”
StandardAnalyzer
[@Andy52] [went] [school] [yesterday!]
StopAnalyzer
[Andy] [went] [school] [yesterday]
SimpleAnalyzer
[andy] [went] [to] [school] [yesterday]
WhitespaceAnalyzer
[@Andy52] [went] [to] [school] [yesterday]
KeywordAnalyzer
[@Andy52 went to school yesterday!]