Uploaded on

 

More in: Technology , Education
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
  • ... im always surprised when i see how others are using my design and my personal pictures without my permission. It would be fair to ask or at least mention source.
    Are you sure you want to
    Your message goes here
    Be the first to like this
No Downloads

Views

Total Views
987
On Slideshare
0
From Embeds
0
Number of Embeds
1

Actions

Shares
Downloads
7
Comments
1
Likes
0

Embeds 0

No embeds

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
    No notes for slide

Transcript

  • 1. exchange ideas / share knowledge
    VIDEOLECTURES.NET
  • 2. Outline of the talk
    About videolectures.net and K4A
    Technical solutions in preparation
    Towards the content personalisation
    Automatic Transcriptions
    Enhanced Recommender Services
    Visitors analytics
    OCWC on videolectures.net
  • 3. Jozef Stefan Institute Department of Knowledge Technologies @ Center for Knowledge Transfer
    Jozef Stefan Institute (JSI) is the leading Slovene research institution for natural sciences (800+ people) in the areas of computer science, physics, chemistry
    Document-Atlas
    Department of Knowledge Technologies have around 60 people working in various areas of artificial intelligence (machine learning, data mining, semantic technologies, computational linguistics, decision support)
    Spinoff-s: Cyc-Europe, Quintelligence, LiveNetLife, Temida, XLab
    Selection of Portals and Products:
    • Text-Garden (http://www.textmining.net)
    • 4. Enrycher(http://enrycher.ijs.si/)
    • 5. VideoLectures.NET (http://videolectures.net/)
    • 6. IST-World (http://www.ist-world.org/)
    • 7. Project Intelligence (http://pi.ijs.si/)
    • 8. Search-Point (http://searchpoint.ijs.si/)
    • 9. OntoGen(http://ontogen.ijs.si/)
    • 10. Document-Atlas (http://docatlas.ijs.si/)
    • 11. AnswerArt(http://answerart.net/)
    Semantic-Graphs
    VideoLectures.NET
    Selection of FP6 & FP7 Projects (Integrated Projects and Networks of Excellence only):
    FP7 IP ACTIVE – Enabling the Knowledge Powered Enterprise
    FP7 IP COIN – COllaboration and INteroperability for networked enterprises
    FP7 IP EURIDICE – Inter-Disciplinary Research on Intelligent Cargo for Efficient, Safe and Environment-friendly Logistics
    FP7 NoE PASCAL2 – Pattern Analysis, Statistical Modeling and Computational Learning
    FP7 NoE T4ME – Machine Translation & Multilingual Information Retrieval
    FP6 IP NeOn– Lifecycle Support for Networked Ontologies
    FP6 IP ECOLEAD – European Collaborative Networked Organizations Leadership Initiative
    FP6 IP SEKT – Semantically-Enabled Knowledge Technologies
  • 12. Videolectures: Basic facts
    10000 videolectures - CC
    10000 unique visitors per day
    Recorded events 2009: 70, 2868 videos
    Shared business models:
    Research projects
    Events
    Academic institutions
    Baseline funds
    In-house developed services with strong support in research in semantics
    JSI infrastructure, 5 permanent, 10-15 part time
    Goal: Contributing to a global higher ed change by offering open access to high quality scientific material
  • 13. International dimension
    European research supported by the European Commission (from 3M to 10M Euro scale RTD projects)
    International institutions: EC, CEEMAN , CERN , Cluster Network , EFMD, IPSA , CLSP, MIT, UC Irvine , Yale, Stanford, TEDx, CMU, University of Ljubljana, Slovenian public research agency…
    Active participation in: Opencast, OCWC, EuroCRIS
    Knowledge4All foundation
  • 14. K4A
    Originates from Pascal NoE
    Knowledge and content exchange network
    Inspired and lead by most active institutions and organisations around the world from the area of free and open scientific content
    Effective and pragmatic
    Global impact
    Distributed, networked, bottom –up governance
    Funds , joint projects
    Using existing University networks and resources
    Distinctive element: all content to be scientifically approved
  • 15. K4A - Five pillars of activity
    Infrastructure: ICT Matterhorn - Interoperability, Channels, Semantics
    Science: Journal and conferences
    Online scientific video journal to global university
    Education: courses and content
    Quality assurance – peer reviewed content
    Research:
    facilitating the systems, accessing the content, enabling interaction
    IPRs, multilinguality, standards
    Business models (added value models)
    Other continent connections: case study in engagement and interaction
  • 16. World Summit Award 09
    World Summit award 09
    “With this, “Videolectures.Net” has approximately outrun 20.000 other products and projects from 157 countries participating in the 4th edition of the WSA, the United Nations based contest for e-content and creativity in the Information Society”.
  • 17. Technology stack
    5 servers serving 20 TB of data
    700,000 unique files
    300,000 web requests daily (90,000 dynamic)
  • 18. Technologies and Research
    Deep Semantics & Reasoning (Cyc)
    Light-Weight Semantic Technologies
    (OntoGen, OntoBridge)
    Decision Support (DEX)
    Social Computing/Web2.0 (LiveNetLife)
    Computational Linguistics
    (Enrycher, AnswerArt)
    Complex Data Visualization
    (DocAtlas, NewsExplorer, SearchPoint)
    Graph/Social Network Analysis
    (GraphGarden/SNAP, IST-World, FPIntelligence)
    Data/Web/Text/Stream-Mining
    (TextGarden Suite of tools)
    Statistical Machine Learning
  • 19. Personalisation
    Log files
    Content mining
    Adaptation
    Modeling
    (Needs and preferences)
  • 20. Towards personalisation @ videolectures.net
  • 21. User profiling service(Qminer)
    Ver1 – identifying segments: developed for NYT, Bloomberg
    Ver2 – individual profiling: web service for videolectures.net
    Analysing user logs and the content being accessed
    Textual description – need for transcripts
    Contextualisation – need for enriched content
    Deep analytics
    Modeling user behavior
    Detecting SIGs – marketing groups, investors,…
    Predicting and simulating user’s
    Detecting trends in visits
    Personalising content and methods

  • 22. User profiling – identifying segments
    QMiner
    System/services
    Log
    files
    User profiles
    Search fields
    Search field values
    Add state
    Non-persistent Query
    Get state
    Get states
    Update
    Rename state
    Delete state
    Change Index
    Exit
    Videos articles
    Editors
    Advertisers
    Authors
  • 23. Recommendation service(Recommender)
    Ver1: Developed and tested for videolectures.net
    Ver2: Operating at Bloomberg.com also for textual documents
    Each video is scored from three directions:
    Collaborative filtering
    Category – VL taxonomy and improved SVM module working on optimized categories
    Content – matching video against the user group’s history using all the enriched features
    All three scores are combined into final score using weights estimated from the collected training data
    Demonstration
  • 24. Content enrichment(enrycher)
    Providing wider context to the document
    … needed for efficient content mining and modeling
    A set of Web services (http://enrycher.ijs.si)
    Enriching a document with annotations presenting:
    Extracted known concepts to the machine
    Generated most descriptive sentences and dynamic abstracts
    Semantic graph
    Descriptions with existing ontologies
    Links to the external sources (wikipedia, dmoz, dbpedia, openlink data)
    Demonstration
  • 25. Transcription service(Transcriptor)
    Prototype service with automatic rapid vocabulary training of the speech recognition engine using:
    Lecture description
    Slides information
    Videolectures taxonomy
    Enriched complementary content
    Used for:
    Transcription
    Speech indexing
    Video content search
    Demonstration
  • 26. OCWC on videolectures.NET
    Videolectures.NET offers to organisations:
    Low cost service and channel
    Unlimited video preservation and fixed urls
    Organisation, project and personal videography pages
    Access to the back-office editorial and tools
    Many innovative viewing and content management features
    Sustainable innovation through research projects
    Demonstration
  • 27. Supporting OCWC
    Video and courses content distribution through videolectures.net
    User modeling and analytics
    … on a distributed network of OCWC sites
    … common access to the analytics services
    Opening existing services for independent use
    … transcription, categorisation, classification, content enrichment
    OCWC website on videolectures.net:
    … crawling, enriching, structuring, categorisingdistributed materials
    … common curriculum support
  • 28. mitja.jermol@ijs.si – head of Center for knowledge transfer at JSI
    marjana.plukavec@ijs.si – head of videolectures.net service
    davor.orlic@ijs.si – main editor at videolectures.net
    marko.grobelnik@ijs.si – head of the KT research group at JSI
    John Shawe -Taylor (jst@cs.ucl.ac.uk) – K4A director
    Colin de la Higuera (cdlh@univ-nantes.fr) – K4A director
    Enrycher: http://enrycher.ijs.si
    Recommender: http://videolectures.net
    Contextual search: http://searchpoint.ijs.si
  • 29. Support slides
  • 30. A movement/competition …
  • 31. Competitive advantage
    Access to lecture rooms and the three most active communities
    Videos + slides + comments
    Viewing features
    Semantically enriched functionalities
    Curriculum building and management support
    Efficient back-office
    Low cost and efficient service from recording to hosting
  • 32. Answering to challenges?
    OpenCourseWare
    MIT + >140 Universities
    Curriculum, standards, quality of training
    OpenCast
    Berkeley, ETH + 40 top World Universities
    OS for video recording at Universities
    VL as CDCs
    Knowledge4All foundation
    Open CDN
    Videolectures + JSI team
    Using University Internet links and servers
  • 33. K4A founders
    Europe – Pascal2 Network of Excellence:
    University College London
    Jozef Stefan Institute
    University of Bristol
    XEROX Research Centre Europe
    ETH Zurich
    CERN
    US:
    Berkeley + Opencast community
    MIT + OCW consortium
    Asia
    Korea University + Network of South Korean Universities
    Africa
    Voices of Africa, Kenya + East Africa Universities
    Kofi Annan Center for ICT and Development, Ghana + West Africa Universities
  • 34. K4A - reach
  • 35. Current development
    OpenCDN – OSS/Collaborative Content Distribution Network
    Automatic capturing, enriching, and synchronisation
    Deep semantic search through videos
    Accessibility, multilinguality
    Knowledge extraction
    Speech Indexing, Text Mining, Video mining,
    Automatic ontology construction,
    User Tracking and Profiling.
  • 36. SCOPE proposal
  • 37.
  • 38. Visitors
  • 39. Knowledge 4 All
  • 40. Expressed interest
    Internet Society Central America - Mexico
    Individual organisations: Trento, ULJ, Zagreb, Southampton, CNRS, VTT, Max Planck, TU Graz, TUB, Oxford, Carlos III de Madrid, UVA,…
    Commercial organisations: Springer Verlag, Elsevier Science
    Governmental bodies: Slovenia, European Commission
  • 41. Development
    Research
    Added value (business) models
    Emerging organisation models
    Innovative tools
    Operative
    Methods (individual, collaborative, business)
    Didactics, methodics, pedagogical models
    Systems, standards, interoperability
    Free, open access, high quality, scientific content
  • 42. Projects
    In preparation:
    AI Research institute for West Africa: implications for infrastructure, summer schools, course definition, interaction software, etc.
    Education kiosks in Africa
    Journal SCI registration – also in discussion with Springer about possible publication
    Virtual conference
    Virtual university
    Web 2.5 for learning: support for discussion groups, research communities
  • 43. Long-term options
    Innovation tube – industry/business use
    Virtual universities and virtual programmes
    Bottom-up, distributed, self-organised,
    Authoring services
    Support content enrichment for the content creators
    Services:
    On-the-fly personalisation and recommendation
    Video scene recognition, automatic annotation and categorisation
    Semantic and multilingual search
    Accessibility, Internationalization (subtitles, transcripts)
    Advanced presentation services with direct user involvement
    Textual, graphical, video (audio) content integration services and enrichment