EDF2012  Aris Karanikas - PortDial
Upcoming SlideShare
Loading in...5
×

Like this? Share it with your network

Share
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Be the first to comment
    Be the first to like this
No Downloads

Views

Total Views
677
On Slideshare
424
From Embeds
253
Number of Embeds
4

Actions

Shares
Downloads
2
Comments
0
Likes
0

Embeds 253

http://2012.data-forum.eu 135
http://www.data-forum.eu 100
http://data-forum.eu 16
http://2013.data-forum.eu 2

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
    No notes for slide

Transcript

  • 1. PortDial: Language Resources forPortable Spoken Dialogue Systems Aris Karanikas, CCO, VoiceWeb European Data Forum June 6-7, 2012 Copenhagen, Denmark
  • 2. Spoken Dialogue SystemsDATA• Speech recordings (used for training acoustic models)• Text data (used for training language models)• Ontologies (used to define application domain)• Grammars (used for recognition) etc.
  • 3. ChallengeA major roadblock in spoken dialogue system(SDS) design is the lack of linguistic resourcesthat would enable the rapid porting of speechservices to new domains and languages
  • 4. PortDial Objectives• Devise machine-aided methods for creating, cleaning-up and publishing multilingual domain ontologies and grammars for SDS prototyping• Create a platform that supports cost-effective language resource building for the domain and language porting scenarios• Create and support a sustainable pool of users that contribute to a linguistic resources data exchange
  • 5. Main InnovationsTechnological Innovation: Combining knowledge-based and data-driven approaches for ontology and grammar induction from web-harvested dataMarket Innovation: – Speech services prototyping/porting platform reduces time-to-market and barrier-to-entry – Spoken dialogue resources/data as a service
  • 6. Partners• Expertise: language engineering, spoken dialogue systems, semantic web, speech services, semantic networks, linked-data
  • 7. PortDial Scenarios• Porting to a new application domain – Focus on adaptation• Porting to a new language – Focus on translation• Resource-rich scenario – Focus on reusability/adaptation of existing resources – Use targeted web mined data to enrich resources• Resource-poor scenario – Focus on data-driven bottom-up creation of resources using the web
  • 8. Main Concept
  • 9. Main Outputs• A commercial platform for rapid prototyping of speech service resources for new domains and languages• A collection of multilingual speech service resources (ontologies, grammars) for entertainment, banking and customer service domains• Languages covered: English, German, Italian, Spanish, Greek, Turkish, Hebrew
  • 10. Target Groups• SMEs worldwide in the mobile application development industry lacking the expertise/ resources to develop speech services in-house• Non-commercial actors including the research community that can maintain and enrich the free version of the data pool
  • 11. ImpactThe SDS linguistic resources will lower thebarrier to entry for European SMEs to speechservices, allowing for inexpensive proof-of-concept demonstrator development, openingup new markets and application domains.