Rapid Prototyping of a Semantic-Web-based Research Workbench

859 views
773 views

Published on

Talk at the UDS-SJTU Joint Research Lab for Language Technology.
I describe I project I did for Totuba.

Published in: Technology, Education
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
859
On SlideShare
0
From Embeds
0
Number of Embeds
14
Actions
Shares
0
Downloads
13
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Rapid Prototyping of a Semantic-Web-based Research Workbench

  1. 1. Rapid Prototyping of a Semantic-Web-based Research Workbench Carsten Ullrich Dept. of Computer Science and Engineering, SJTU
  2. 2. Overview • Project done with Totuba, Inc. • Goal: develop a research workbench – bibliography manager – research network – support while writing research papers • Sorry, no new pure research results • But: overview on state-of-the-art of existing Web services / Web data
  3. 3. • context-sensitive further reading • related topics • drag&drop referencing
  4. 4. Entity Extraction The term "Web 2.0" is used to describe applications that distinguish themselves from previous generations of software by a number of principles. Existing work shows that Web 2.0 applications can be successfully exploited for technology- enhanced learning. However, in-depth analyses of the relationship between Web 2.0 technology on the one hand and teaching and learning on the other hand are still rare.
  5. 5. Entity Extraction Gur grez "Jro 2.0" vf hfrq gb qrfpevor nccyvpngvbaf gung qvfgvathvfu gurzfryirf sebz cerivbhf trarengvbaf bs fbsgjner ol n ahzore bs cevapvcyrf. Rkvfgvat jbex fubjf gung Jro 2.0 nccyvpngvbaf pna or fhpprffshyyl rkcybvgrq sbe grpuabybtl- raunaprq yrneavat. Ubjrire, va-qrcgu nanylfrf bs gur eryngvbafuvc orgjrra Jro 2.0 grpuabybtl ba gur bar unaq naq grnpuvat naq yrneavat ba gur bgure unaq ner fgvyy ener.
  6. 6. Entity Extraction Gur grez "Jro 2.0" vf hfrq gb qrfpevor nccyvpngvbaf gung qvfgvathvfu gurzfryirf sebz cerivbhf trarengvbaf bs fbsgjner ol n ahzore bs cevapvcyrf. Rkvfgvat jbex fubjf gung Jro 2.0 nccyvpngvbaf pna or fhpprffshyyl rkcybvgrq sbe grpuabybtl- raunaprq yrneavat. Ubjrire, va-qrcgu nanylfrf bs gur eryngvbafuvc orgjrra Jro 2.0 grpuabybtl ba gur bar unaq naq grnpuvat naq yrneavat ba gur bgure unaq ner fgvyy ener. OpenCalais • Jro 2.0 • grpuabybtl-raunaprq yrneavat
  7. 7. Open Calais • Thomson Reuters company • Web Service • Extracts entities, facts, events (about 100 types) • Free for noncommercial and commercial use Entities Anniversary, City, Company, Continent, Country, Currency, EmailAddress, EntertainmentAwardEvent, Facility, FaxNumber, Holiday, IndustryTerm, MarketIndex, MedicalCondition, MedicalTreatment, Movie, MusicAlbum, MusicGroup, NaturalFeature, OperatingSystem, Organization, Person, PhoneNumber, Position, Product, ProgrammingLanguage, ProvinceOrState, PublishedMedium, RadioProgram, RadioStation, Region, SportsEvent, SportsGame, SportsLeague, Technology, TVShow, TVStation, URL
  8. 8. Semantifying The term "Web 2.0“... OpenCalais • Web 2.0 • technology-supported learning DBPedia (others: Yago, Freebase, UMBEL) • http://dbpedia.org/resource/Web_2.0 • http://dbpedia.org/resource/Technology-Enhanced_Learning
  9. 9. Related Topics: Web_2.0 in DBPedia • skos:subject – dbpedia:Category:Buzzwords – dbpedia:Category:Branding – dbpedia:Category:Cloud_applications – dbpedia:Category:Internet_memes – dbpedia:Category:Social_Information_Processing – dbpedia:Category:World_Wide_Web – dbpedia:Category:Web_2.0 – dbpedia:Category:Web_services
  10. 10. Linked Open Data dataset cloud
  11. 11. Reuse • Highly efficient entity extraction • Enormous databases – describe the entities – link to related entities • Give a high-level starting position to explore new challenges – how to put this data into use? – context: what is relevant for user/current usage
  12. 12. Lessons Learned • Reuse enables progress – no duplication of work – focus on problems relevant for you • Having a landscape that encourages reuse creates advantages for research / commercial applications • Problems – mostly only English – few Chinese services / programming libraries • e.g., named entity extraction
  13. 13. Questions • I have some: – opinion mining – information extraction
  14. 14. Questions • I have some: – opinion mining – information extraction • Any toolkits available? RASCALLI? • Contact me in case you find this interesting • ullrich_c@sjtu.edu.cn

×