A European collaboration between research and the publishing industry
TISP Workshop
ICT 2015 Lisbon, Oct 22nd 2015
  
Flanders digital research & entrepreneurship hub
  
Collaboration between 5 universities
But with strong link to industry
  
Multimedia Lab Research Group
Electronics and Information Systems department
University of Ghent
About Multimedialab
  
2-year Innovation Action
Feb 2015 – Feb 2017
  
Consortium
  
Multilingual enrichment through Machine Translation
Semantic enrichment using Linked Data
  
	
  
Multilingual enrichment
• Machine translation
• Patented technology
• Domain-focused translation
• Training data required
  
Semantic enrichment
• Harness the power of the still largely untapped Semantic Web
• Huge amount of data available on the web
• But… largely untapped due to
  – Lack of skills
  – Lack of awareness
  – Lack of tangible use cases
  
• Ease of use
• Standard formats
• Well documented
  
E-Services
FREME will provide an open source framework of e-services, in the form of easy-to-use REST APIs

e-Entity: Named entity recognition
e-Link: Fetch extra information about a specific entity
e-Terminology: Term recognition + translation suggestions
e-Translation: Enrich with translation
e-Publishing: Publish content as an EPUB digital book with SEO optimizations for content discovery
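To make the "easy-to-use REST APIs" idea concrete, here is a minimal sketch of how a client might prepare a call to one of the e-services. Everything specific in it is an assumption made for illustration: the base URL `freme.example.org`, the path per service, the `language` query parameter, and the helper name `build_request` do not come from the slides and are not the documented FREME endpoints. The request is only built, never sent.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Hypothetical base URL and paths: placeholders for illustration only,
# not the documented FREME endpoints.
BASE_URL = "https://freme.example.org/api"
SERVICE_PATHS = {
    "e-entity": "/e-entity",
    "e-link": "/e-link",
    "e-terminology": "/e-terminology",
    "e-translation": "/e-translation",
    "e-publishing": "/e-publishing",
}


def build_request(service: str, text: str, **params: str) -> Request:
    """Prepare (but do not send) a POST request to one e-service."""
    if service not in SERVICE_PATHS:
        raise ValueError(f"unknown e-service: {service}")
    url = BASE_URL + SERVICE_PATHS[service]
    if params:
        # Sort the parameters so the resulting URL is deterministic.
        url += "?" + urlencode(sorted(params.items()))
    return Request(
        url,
        data=text.encode("utf-8"),
        headers={"Content-Type": "text/plain; charset=utf-8"},
        method="POST",
    )


# Example: ask the (hypothetical) e-Entity service to annotate a sentence.
req = build_request("e-entity", "Ghent is a city in Flanders.", language="en")
print(req.get_method(), req.full_url)
```

Keeping each service behind its own path means a publisher could chain them, for example passing the output of one call as the input text of the next, without learning a separate client per service.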
  
 	
  	
  	
  
Validation of FREME in Publishing
• Academic & Trade book publishing – semantic & multilingual enrichment
• Financial press – semantic enrichment
• Video publishing – enrichment of metadata
• Tourism – translation of touristic content
  
e-Translation for Publishers: Applications
Book translation
• To speed up the book translation process by assisting human translators
• To facilitate international cooperation by gisting book content in other languages
Metadata translation
• To add cross-language access to information about books in publisher, library, and other catalogs
  
Business modeling
• Besides technological aspects also:
  o Business modeling
  o Creation of new job profiles
• Sustainability is a desired outcome!
• Turn FREME into a startup at end of project?
  
Questions?
  
	
  
	
  
Thank you for your attention!
Frank Salliau
Project Manager iMinds MMLab
Email frank.salliau@ugent.be
(Erratic) Twitter @fsalliau
  

Frank Salliau, iMinds @ ICT 2015, TISP workshop
