SlideShare uses cookies to improve functionality and performance, and to provide you with relevant advertising. If you continue browsing the site, you agree to the use of cookies on this website. See our User Agreement and Privacy Policy.
SlideShare uses cookies to improve functionality and performance, and to provide you with relevant advertising. If you continue browsing the site, you agree to the use of cookies on this website. See our Privacy Policy and User Agreement for details.
Successfully reported this slideshow.
Activate your 14 day free trial to unlock unlimited reading.
44.
テキストマイニングにおける形態素解析
l テキストマイニングとは?
l ⼤大量量の⽂文書データを解析して何らかの知⾒見見を得る技術の総称
l 例例:単語頻度度の偏りを検知する
l ここで欲しいのは、単語というより「概念念」
l 同⼀一概念念はまとめたい(多義語問題、名寄問題)
l そのため単語単位は⻑⾧長めで、同義表現などがまとまると嬉しい
Michael Jackson
同一概念
King of Pop
44