SlideShare a Scribd company logo
1 of 11
SIG-SWO 41st : ISWC2016 Survey Session
KAMEDA Akihiro (Kyoto University)
Multilinguality 10/19(水)
YAGO: a multilingual knowledge base from Wikipedia, Wordnet, and Geonames
Thomas Rebele, Fabian M. Suchanek, Johannes Hoffart, Joanna Biega, Erdal Kuzey and Gerhard
Weikum
Translating Ontologies in a Real-World Setting with ESSOT
Mihael Arcan, Mauro Dragoni and Paul Buitelaar
Zhishi.lemon:On Publishing Zhishi.me as Linguistic Linked Open Data
Haofen Wang, Zhijia Fang, Jorge Gracia, Julia Bosque-Gil and Tong Ruan
Domain Adaptation for Ontology Localization
John P. McCrae, Mihael Arcan, Kartik Asooja, Jorge Gracia, Paul Buitelaar and Philipp Cimiano
Natural Language Processing 10/21(金)
Linked Disambiguated Distributional Semantic Networks
Stefano Faralli, Alexander Panchenko, Chris Biemann and Simone Paolo Ponzetto
Abstract Meaning Representations as Linked Data
Gully Burns, Ulf Hermjakob and José Luis Ambite
A Replication Study of the Top Performing Systems in SemEval Twitter Sentiment Analysis
Efstratios Sygkounas, Giuseppe Rizzo and Raphaël Troncy
Building event-centric knowledge graphs from news
Marco Rospocher, Marieke van Erp, Piek Vossen, Antske Fokkens, Itziar Aldabe, German
Rigau, Aitor Soroa, Thomas Ploeger, Tessel Bogaard
Natural Language Processing
Session
左上にトラックタイプ{Research,
Resources, Applications, Journal}
Linked Disambiguated Distributional Semantic Networks
WordNet とか
BabelNet とか
説明側ベクトルも
disambiguate
Resources
Abstract Meaning Representations as Linked Data
SerpinE2 is overexpressed in intestinal epithelial cells
transformed by activated MEK1 and oncogenic RAS and BRAF
これを Linked Data 化
- RDF化しURLとスキーマ
を与える
https://github.com/BMKEG/amr-ld/
- 外のリソースにマッピン
グ
http://fril.sourceforge.net/
+tf-idfベースのシンプルな方法で補った
Resources
A Replication Study of the Top Performing Systems in
SemEval Twitter Sentiment Analysis
• SemEval Twitter Sentiment Analysis
• Twitter 上のツイートの感情判別(ポジ・ネガ・中立)を行うタスク
• 2013-2015で好成績を収めた5つのアルゴリズムの再現性確認
(Replication Study)
• ついでに、その4つ+ Stanford Sentiment System でアンサン
ブル学習
• 学び:パラメータとかは論文に書かれてなくてGitHubのコード
で助けられた。データとかライブラリとか特徴量設計の部分で
差異が出てしまった(まあ大体あってたけど)
• Feature engineering is an art and the devil is in the details.
Resources
Building Event-Centric Knowledge Graphs from News
• ニュースからイベント・セントリックな知識グラフを抽出する
• 既存の知識グラフは百科事典的で静的なものが多いからそれを補完
• English and Spanish, Italian and Dutch.
• まず文書ごとに抽出してから
(Step1)繋げる(Step2)
Journal
Multilinguality
Session
YAGO: a multilingual knowledge base from Wikipedia,
Wordnet, and Geonames
• YAGO は Wikipedia に基づいて10の言語を横断して作られてい
て、空間・時間の情報も付けられている。このプロセスおよび
品質保証の詳細
• 時間情報: Infobox に正規表現
• 空間情報: GeoNamesがWikipediaへのリンクを持っている。説明を手
掛かりに高F値でWordNetにもマッピング
• 「15人が76の関係に関する4412の事実を検証。98%が正しかった。2
カ月かかった」
Resources
Translating Ontologies in a Real-World Setting with ESSOT
• オントロジーを翻訳するときに、人手で修正・承認をするため
のサポートシステム作りました。
• 統計ベースの機械翻訳システムも自分でトレーニングしました
Microsoft の翻訳 API よりはつよかった(使ったデータの違い以外に
NLP的新規性はなさそうだけど、BLEUとかで比較して勝っている)
Applications
Zhishi.lemon:On Publishing Zhishi.me as Linguistic
Linked Open Data
• Lemon (The Lexicon Model for Ontologies)
• Lexical schema 最大手
WordNet も BabelNet もこれ
• Zhishi.me
• Baidu Baike, Hudong Baike
and Chinese Wikipedia.
• DBpediaとBabelNetを介在さ
せ西と英に対応付け
• DBpedia は C-Wikipedia。
BabelNetはカテゴリの重複度
を手掛かりに曖昧性解消
Resources
Domain Adaptation for Ontology Localization
tokenization
Phrase table Explicit Semantic Analysis (分散
表現の一種, not Latent)のCross-
Language 版。コサイン距離で意
味の近さを測る
N-gramモデルの各gramをドメ
インオントロジのfreqと対象言
語文書のfreqのコサイン距離
cos(tf_o, tf_d)で重みづけ
Journal
3つのDomain Adaptation工夫

More Related Content

Viewers also liked

види друкованої продукції та принципи їх оформлення та верстки
види друкованої продукції та принципи їх оформлення та версткивиди друкованої продукції та принципи їх оформлення та верстки
види друкованої продукції та принципи їх оформлення та версткиzaykoannaivanivna
 
YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology
YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged OntologyYAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology
YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged OntologyGerard de Melo
 
Homomorphic encryption in_cloud
Homomorphic encryption in_cloudHomomorphic encryption in_cloud
Homomorphic encryption in_cloudShivam Singh
 
практична робота №6
практична робота №6практична робота №6
практична робота №6Sasha Sasha
 
Penicillium Paecilomyces Aspergillus
Penicillium Paecilomyces AspergillusPenicillium Paecilomyces Aspergillus
Penicillium Paecilomyces AspergillusJosua Sitorus
 
2007 Slc Presenation
2007 Slc Presenation2007 Slc Presenation
2007 Slc PresenationHeithaus
 
Factors influencing popularity of branded content in Facebook fan pages
Factors influencing popularity of branded content in Facebook fan pagesFactors influencing popularity of branded content in Facebook fan pages
Factors influencing popularity of branded content in Facebook fan pagesToni Cañabate
 
Beijing to Burketown by Jane Hardy
Beijing to Burketown by Jane HardyBeijing to Burketown by Jane Hardy
Beijing to Burketown by Jane Hardybryceives
 
B2B marketing analytics-report
B2B marketing analytics-reportB2B marketing analytics-report
B2B marketing analytics-reportProgress
 
M learn 2014 slideshare
M learn 2014   slideshareM learn 2014   slideshare
M learn 2014 slideshareMike Sharples
 
PRESENTATION ABOUT SYN - ABAF
PRESENTATION ABOUT SYN  - ABAFPRESENTATION ABOUT SYN  - ABAF
PRESENTATION ABOUT SYN - ABAFbryceives
 
Beijing to Burketown by Jane Hardy
Beijing to Burketown by Jane HardyBeijing to Burketown by Jane Hardy
Beijing to Burketown by Jane Hardybryceives
 
Why Should You Join The Mobile Revolution?
Why Should You Join The Mobile Revolution?Why Should You Join The Mobile Revolution?
Why Should You Join The Mobile Revolution?Progress
 

Viewers also liked (18)

види друкованої продукції та принципи їх оформлення та верстки
види друкованої продукції та принципи їх оформлення та версткивиди друкованої продукції та принципи їх оформлення та верстки
види друкованої продукції та принципи їх оформлення та верстки
 
YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology
YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged OntologyYAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology
YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology
 
Homomorphic encryption in_cloud
Homomorphic encryption in_cloudHomomorphic encryption in_cloud
Homomorphic encryption in_cloud
 
ոսպնյակ
ոսպնյակոսպնյակ
ոսպնյակ
 
практична робота №6
практична робота №6практична робота №6
практична робота №6
 
Penicillium Paecilomyces Aspergillus
Penicillium Paecilomyces AspergillusPenicillium Paecilomyces Aspergillus
Penicillium Paecilomyces Aspergillus
 
Opin Upplysingataekni2
Opin Upplysingataekni2Opin Upplysingataekni2
Opin Upplysingataekni2
 
2007 Slc Presenation
2007 Slc Presenation2007 Slc Presenation
2007 Slc Presenation
 
Factors influencing popularity of branded content in Facebook fan pages
Factors influencing popularity of branded content in Facebook fan pagesFactors influencing popularity of branded content in Facebook fan pages
Factors influencing popularity of branded content in Facebook fan pages
 
Beijing to Burketown by Jane Hardy
Beijing to Burketown by Jane HardyBeijing to Burketown by Jane Hardy
Beijing to Burketown by Jane Hardy
 
B2B marketing analytics-report
B2B marketing analytics-reportB2B marketing analytics-report
B2B marketing analytics-report
 
Creative visiting
Creative visitingCreative visiting
Creative visiting
 
M learn 2014 slideshare
M learn 2014   slideshareM learn 2014   slideshare
M learn 2014 slideshare
 
Phyrev4
Phyrev4Phyrev4
Phyrev4
 
PRESENTATION ABOUT SYN - ABAF
PRESENTATION ABOUT SYN  - ABAFPRESENTATION ABOUT SYN  - ABAF
PRESENTATION ABOUT SYN - ABAF
 
Ph Yrev1
Ph Yrev1Ph Yrev1
Ph Yrev1
 
Beijing to Burketown by Jane Hardy
Beijing to Burketown by Jane HardyBeijing to Burketown by Jane Hardy
Beijing to Burketown by Jane Hardy
 
Why Should You Join The Mobile Revolution?
Why Should You Join The Mobile Revolution?Why Should You Join The Mobile Revolution?
Why Should You Join The Mobile Revolution?
 

More from Akihiro Kameda

iPRES Day 3 instruction
iPRES Day 3 instructioniPRES Day 3 instruction
iPRES Day 3 instructionAkihiro Kameda
 
Evaluating Visual Representations for Topic Understanding and Their Effect...
Evaluating Visual Representations for Topic Understanding and Their Effect...Evaluating Visual Representations for Topic Understanding and Their Effect...
Evaluating Visual Representations for Topic Understanding and Their Effect...Akihiro Kameda
 
いかにしてデータを手に入れるか
いかにしてデータを手に入れるかいかにしてデータを手に入れるか
いかにしてデータを手に入れるかAkihiro Kameda
 
"Joint Extraction of Events and Entities within a Document Context"の解説
"Joint Extraction of Events and Entities within a Document Context"の解説"Joint Extraction of Events and Entities within a Document Context"の解説
"Joint Extraction of Events and Entities within a Document Context"の解説Akihiro Kameda
 
Iodd2015osaka kameda-slideshare
Iodd2015osaka kameda-slideshareIodd2015osaka kameda-slideshare
Iodd2015osaka kameda-slideshareAkihiro Kameda
 
PATTY: A Taxonomy of Relational Patterns with Semantic Types
PATTY: A Taxonomy of Relational Patterns with Semantic TypesPATTY: A Taxonomy of Relational Patterns with Semantic Types
PATTY: A Taxonomy of Relational Patterns with Semantic TypesAkihiro Kameda
 
Reference Scope Identification in Citing Sentences
Reference Scope Identification in Citing SentencesReference Scope Identification in Citing Sentences
Reference Scope Identification in Citing SentencesAkihiro Kameda
 

More from Akihiro Kameda (7)

iPRES Day 3 instruction
iPRES Day 3 instructioniPRES Day 3 instruction
iPRES Day 3 instruction
 
Evaluating Visual Representations for Topic Understanding and Their Effect...
Evaluating Visual Representations for Topic Understanding and Their Effect...Evaluating Visual Representations for Topic Understanding and Their Effect...
Evaluating Visual Representations for Topic Understanding and Their Effect...
 
いかにしてデータを手に入れるか
いかにしてデータを手に入れるかいかにしてデータを手に入れるか
いかにしてデータを手に入れるか
 
"Joint Extraction of Events and Entities within a Document Context"の解説
"Joint Extraction of Events and Entities within a Document Context"の解説"Joint Extraction of Events and Entities within a Document Context"の解説
"Joint Extraction of Events and Entities within a Document Context"の解説
 
Iodd2015osaka kameda-slideshare
Iodd2015osaka kameda-slideshareIodd2015osaka kameda-slideshare
Iodd2015osaka kameda-slideshare
 
PATTY: A Taxonomy of Relational Patterns with Semantic Types
PATTY: A Taxonomy of Relational Patterns with Semantic TypesPATTY: A Taxonomy of Relational Patterns with Semantic Types
PATTY: A Taxonomy of Relational Patterns with Semantic Types
 
Reference Scope Identification in Citing Sentences
Reference Scope Identification in Citing SentencesReference Scope Identification in Citing Sentences
Reference Scope Identification in Citing Sentences
 

ISWC2016 1-slide-survey

  • 1. SIG-SWO 41st : ISWC2016 Survey Session KAMEDA Akihiro (Kyoto University) Multilinguality 10/19(水) YAGO: a multilingual knowledge base from Wikipedia, Wordnet, and Geonames Thomas Rebele, Fabian M. Suchanek, Johannes Hoffart, Joanna Biega, Erdal Kuzey and Gerhard Weikum Translating Ontologies in a Real-World Setting with ESSOT Mihael Arcan, Mauro Dragoni and Paul Buitelaar Zhishi.lemon:On Publishing Zhishi.me as Linguistic Linked Open Data Haofen Wang, Zhijia Fang, Jorge Gracia, Julia Bosque-Gil and Tong Ruan Domain Adaptation for Ontology Localization John P. McCrae, Mihael Arcan, Kartik Asooja, Jorge Gracia, Paul Buitelaar and Philipp Cimiano Natural Language Processing 10/21(金) Linked Disambiguated Distributional Semantic Networks Stefano Faralli, Alexander Panchenko, Chris Biemann and Simone Paolo Ponzetto Abstract Meaning Representations as Linked Data Gully Burns, Ulf Hermjakob and José Luis Ambite A Replication Study of the Top Performing Systems in SemEval Twitter Sentiment Analysis Efstratios Sygkounas, Giuseppe Rizzo and Raphaël Troncy Building event-centric knowledge graphs from news Marco Rospocher, Marieke van Erp, Piek Vossen, Antske Fokkens, Itziar Aldabe, German Rigau, Aitor Soroa, Thomas Ploeger, Tessel Bogaard
  • 3. Linked Disambiguated Distributional Semantic Networks WordNet とか BabelNet とか 説明側ベクトルも disambiguate Resources
  • 4. Abstract Meaning Representations as Linked Data SerpinE2 is overexpressed in intestinal epithelial cells transformed by activated MEK1 and oncogenic RAS and BRAF これを Linked Data 化 - RDF化しURLとスキーマ を与える https://github.com/BMKEG/amr-ld/ - 外のリソースにマッピン グ http://fril.sourceforge.net/ +tf-idfベースのシンプルな方法で補った Resources
  • 5. A Replication Study of the Top Performing Systems in SemEval Twitter Sentiment Analysis • SemEval Twitter Sentiment Analysis • Twitter 上のツイートの感情判別(ポジ・ネガ・中立)を行うタスク • 2013-2015で好成績を収めた5つのアルゴリズムの再現性確認 (Replication Study) • ついでに、その4つ+ Stanford Sentiment System でアンサン ブル学習 • 学び:パラメータとかは論文に書かれてなくてGitHubのコード で助けられた。データとかライブラリとか特徴量設計の部分で 差異が出てしまった(まあ大体あってたけど) • Feature engineering is an art and the devil is in the details. Resources
  • 6. Building Event-Centric Knowledge Graphs from News • ニュースからイベント・セントリックな知識グラフを抽出する • 既存の知識グラフは百科事典的で静的なものが多いからそれを補完 • English and Spanish, Italian and Dutch. • まず文書ごとに抽出してから (Step1)繋げる(Step2) Journal
  • 8. YAGO: a multilingual knowledge base from Wikipedia, Wordnet, and Geonames • YAGO は Wikipedia に基づいて10の言語を横断して作られてい て、空間・時間の情報も付けられている。このプロセスおよび 品質保証の詳細 • 時間情報: Infobox に正規表現 • 空間情報: GeoNamesがWikipediaへのリンクを持っている。説明を手 掛かりに高F値でWordNetにもマッピング • 「15人が76の関係に関する4412の事実を検証。98%が正しかった。2 カ月かかった」 Resources
  • 9. Translating Ontologies in a Real-World Setting with ESSOT • オントロジーを翻訳するときに、人手で修正・承認をするため のサポートシステム作りました。 • 統計ベースの機械翻訳システムも自分でトレーニングしました Microsoft の翻訳 API よりはつよかった(使ったデータの違い以外に NLP的新規性はなさそうだけど、BLEUとかで比較して勝っている) Applications
  • 10. Zhishi.lemon:On Publishing Zhishi.me as Linguistic Linked Open Data • Lemon (The Lexicon Model for Ontologies) • Lexical schema 最大手 WordNet も BabelNet もこれ • Zhishi.me • Baidu Baike, Hudong Baike and Chinese Wikipedia. • DBpediaとBabelNetを介在さ せ西と英に対応付け • DBpedia は C-Wikipedia。 BabelNetはカテゴリの重複度 を手掛かりに曖昧性解消 Resources
  • 11. Domain Adaptation for Ontology Localization tokenization Phrase table Explicit Semantic Analysis (分散 表現の一種, not Latent)のCross- Language 版。コサイン距離で意 味の近さを測る N-gramモデルの各gramをドメ インオントロジのfreqと対象言 語文書のfreqのコサイン距離 cos(tf_o, tf_d)で重みづけ Journal 3つのDomain Adaptation工夫