SlideShare a Scribd company logo
1 of 22
Download to read offline
Building a Semantic Enterprise
 Content Management System from Scratch
How	
  we	
  built	
  a	
  prac/cal	
  ontology-­‐driven	
  corporate	
  intranet	
  portal
  in	
  the	
  cloud	
  in	
  three	
  months	
  using	
  off-­‐the-­‐shelf	
  technology


                SemTechBiz	
  San	
  Francisco,	
  June	
  6th	
  2012
               Ron	
  Michael	
  Ze-lemoyer	
  and	
  Cliff	
  Jurkiewicz
                         @ronmichael	
  and	
  @cessna_pilot
Mobile & Desktop Apps




                                                            Web Apps & Services




fynydd	
  	
  	
  	
  	
  :in-­‐id	
  -­‐	
  noun
                                                       Semantic Knowledge Management

1.	
  	
  a	
  word	
  of	
  Welsh	
  origin	
  
          meaning	
  mountain.
                                                            User Interface Design
2.	
  	
  a	
  company	
  of	
  big	
  thinkers,	
  
          innovative	
  problem	
  solvers,	
  
          and	
  doers.
                                                            Systems Architecture



              fynydd.com
                                                            Reporting & Analytics
How we got here

               @thomson                                 “TranslaAonal	
  
                reuters   #kolexperts
  @jwindz                                             medicine	
  meets	
  the	
  
                                                       semanAc	
  web”
                                                                                          #semtech
                                                                                                      2009
                                                #sla2009

                                                                         @candp

                                                                                     #stardog


 @ronmichael


        @fynydd                     Cambridge
                                                                                        #semtechbiz   2012



                           Steve	
  Jobs
                           Crea%vity	
  is	
  just	
  connec%ng	
  things.
Traditional enterprise content management




 Andy	
  Warhol
 They	
  say	
  that	
  /me	
  
 changes	
  things,	
  but	
  
 you	
  actually	
  have	
  to	
  
 change	
  them	
  yourself.
Semantic enterprise content management


  represents
  recognizes
 responds	
  to	
  

                      the	
  meaning	
  of	
  content
                      the	
  goals	
  of	
  users
Build it yourself




    Julius	
  Caesar
    Crea/ng	
  is	
  the
    essence	
  of	
  life.
Stand on the shoulders of giants


 Henry	
  Ford
 I	
  invented
 nothing	
  new.
 I	
  simply	
  assembled	
  
 the	
  discoveries	
  of	
  
 other	
  people.	
  Had	
  I	
  
worked	
  fiBy	
  or	
  ten	
  or	
  
even	
  five	
  years	
  
before,	
  I	
  would	
  have	
  
failed.	
  So	
  it	
  is	
  with	
  
every	
  new	
  thing.
Keep your head in the cloud




   Henry	
  David	
  Thoreau	
  
   If	
  you	
  have	
  built	
  castle
                                       s	
  in	
  the	
  air,	
  
   your	
  work	
  need	
  not	
  be
   that	
  is	
  where	
  they	
  sho   	
  lost;
                                       uld	
  be.
Be agile

     arles	
  Darwin
   Ch                                            the	
  species
                               trongest	
  of	
   ntelligent.
   I t	
  is	
  not	
  the	
  s r	
  the	
  most	
  i
   that         	
  survives	
  no the	
  most	
  adaptable
    It	
  is	
  the	
  o ne	
  that	
  is	
  
     to	
  change.
Tame your content




               Dr.	
  Seuss
               So	
  the	
  writer	
  who	
  breeds
               more	
  words	
  than	
  he	
  needs,
               is	
  making	
  a	
  chore
               for	
  the	
  reader	
  who	
  reads.
Architecture




         dotNetRDF
Foundation
             Microsoft

             SharePoint


                   ?
                    Cambridge
Ontology

  • Define	
  your	
  goal:	
  increase	
  content	
  findability
  • Build	
  simply	
  and	
  as	
  you	
  need	
  it
  • Provide	
  simple	
  management	
  tools
  • Sell	
  stakeholders	
  on	
  its	
  value
  • Hide	
  it	
  from	
  users
Browse

  • Research	
  and	
  curate	
  top	
  level	
  menus
  • Generate	
  dynamic	
  sub	
  menus
  • Generate	
  related	
  content	
  links
  • Adopt	
  friendly	
  URLs
  • Design	
  beau/ful	
  pages
Search

  • Start	
  with	
  autocomplete
  • Use	
  a	
  “snap-­‐to-­‐grid”	
  approach
  • Make	
  it	
  contextual	
  and	
  personalized
  • Provide	
  federated	
  and	
  adap/ve	
  results
  • Design	
  beau/ful	
  search	
  results
Search
                                             User	
  input




                                               Context


         Content	
     SPARQL                                OperaAons     SQL
         metadata                 Ontology       LINQ                                Content
                                                               data


                        Public                  Secret                   AnalyAcal
                       datasets                 sauce                      data




                                       Results	
  &	
  suggesAons
Administration

  • Give	
  authors	
  manual	
  &	
  automa/c	
  tagging
  • Show	
  content-­‐level	
  analy/cs	
  
  • Build	
  a	
  great	
  editor
  • Design	
  beau/ful	
  adminsitra/ve	
  tools
Keep moving

              Lexus
              Anything	
  not
              is	
  moving	
  bac	
  moving	
  forw
                                                   ard	
  
                                    kward.
Start building


 William	
  Wordsworth
 To	
  begin,	
  begin.
Libraries and Code
  dotNetRDF
  h-p://dotnetrdf.org

  Squickl	
  SQL	
  data	
  access	
  library
  h-ps://github.com/ronmichael/squickl.net

  AWS	
  Snapshot	
  Scheduler
  h-ps://github.com/ronmichael/aws-­‐snapshot-­‐scheduler

  Stardog	
  Bites	
  MSSQL	
  CLR	
  extensions
  h-ps://github.com/ronmichael/stardog-­‐bites-­‐mssql

  CFrame	
  Content	
  Management	
  Framework
  h-ps://github.com/ronmichael/cframe

  dotNetRDF	
  Stardog	
  Helper
  h-ps://github.com/ronmichael/dotnetrdf-­‐stardog-­‐helper
References
  IntegraAng	
  SemanAc	
  Systems
  John	
  F.	
  Sowa:	
  h-p://go.fynydd.com/vxzum

  An	
  Ontology-­‐Based	
  Knowledge	
  Management	
  Pla]orm
  Aldea	
  et	
  al:	
  h-p://go.fynydd.com/opble

  SemanAc	
  Enterprise	
  Content	
  Management
  Mark	
  Fisher,	
  Amit	
  Sheth:	
  h-p://go.fynydd.com/qfllv

  The	
  SemanAc	
  Web	
  and	
  Entertainment	
  Weekly
  Donna	
  Slawsky:	
  h-p://go.fynydd.com/dygpj

  Improving	
  Content	
  Management	
  with	
  SemanAc	
  Technologies
  Fernando	
  Carolo	
  and	
  Leonardo	
  Burlamaqui:	
  h-p://go.fynydd.com/bpvor

  Content	
  Management	
  Bible
  Bob	
  Boiko:	
  h-p://go.fynydd.com/xhjbi
fynydd.com




Don’t	
  forget	
  your	
  towel.

More Related Content

Similar to Building a semantic enterprise content management system from scratch v1

Data science tools of the trade
Data science tools of the tradeData science tools of the trade
Data science tools of the tradeFangda Wang
 
Finding harmony in web development
Finding harmony in web developmentFinding harmony in web development
Finding harmony in web developmentChristian Heilmann
 
"Updates on Semantic Fingerprinting", Francisco Webber, Inventor and Co-Found...
"Updates on Semantic Fingerprinting", Francisco Webber, Inventor and Co-Found..."Updates on Semantic Fingerprinting", Francisco Webber, Inventor and Co-Found...
"Updates on Semantic Fingerprinting", Francisco Webber, Inventor and Co-Found...Dataconomy Media
 
Hacking Marketing By Scott Brinker
Hacking Marketing By Scott BrinkerHacking Marketing By Scott Brinker
Hacking Marketing By Scott BrinkerMarTech Conference
 
Design Prototyping: Bringing Wireframes to Life
Design Prototyping: Bringing Wireframes to LifeDesign Prototyping: Bringing Wireframes to Life
Design Prototyping: Bringing Wireframes to Lifegoodfriday
 
DevoxxUK 2016: "DevOps: Microservices, containers, platforms, tooling... Oh y...
DevoxxUK 2016: "DevOps: Microservices, containers, platforms, tooling... Oh y...DevoxxUK 2016: "DevOps: Microservices, containers, platforms, tooling... Oh y...
DevoxxUK 2016: "DevOps: Microservices, containers, platforms, tooling... Oh y...Daniel Bryant
 
Coevolving Organisational and Technical Boundaries
Coevolving Organisational and Technical BoundariesCoevolving Organisational and Technical Boundaries
Coevolving Organisational and Technical BoundariesNick Tune
 
Accelerate to Cloud
Accelerate to CloudAccelerate to Cloud
Accelerate to CloudRightScale
 
Accelerate Big Data Application Development with Cascading
Accelerate Big Data Application Development with CascadingAccelerate Big Data Application Development with Cascading
Accelerate Big Data Application Development with CascadingCascading
 
Turn Data Into Actionable Insights - StampedeCon 2016
Turn Data Into Actionable Insights - StampedeCon 2016Turn Data Into Actionable Insights - StampedeCon 2016
Turn Data Into Actionable Insights - StampedeCon 2016StampedeCon
 
PDL Distinguished Alumni Talk
PDL Distinguished Alumni TalkPDL Distinguished Alumni Talk
PDL Distinguished Alumni TalkErik Riedel
 
Hadoop and the Relational Database: The Best of Both Worlds
Hadoop and the Relational Database: The Best of Both WorldsHadoop and the Relational Database: The Best of Both Worlds
Hadoop and the Relational Database: The Best of Both WorldsInside Analysis
 
RightScale Webinar: The DevOps Debate - Is the Enterprise Ready?
RightScale Webinar: The DevOps Debate - Is the Enterprise Ready?RightScale Webinar: The DevOps Debate - Is the Enterprise Ready?
RightScale Webinar: The DevOps Debate - Is the Enterprise Ready?RightScale
 

Similar to Building a semantic enterprise content management system from scratch v1 (20)

resume
resumeresume
resume
 
Data science tools of the trade
Data science tools of the tradeData science tools of the trade
Data science tools of the trade
 
Finding harmony in web development
Finding harmony in web developmentFinding harmony in web development
Finding harmony in web development
 
Appnovation Intro Deck
Appnovation Intro DeckAppnovation Intro Deck
Appnovation Intro Deck
 
"Updates on Semantic Fingerprinting", Francisco Webber, Inventor and Co-Found...
"Updates on Semantic Fingerprinting", Francisco Webber, Inventor and Co-Found..."Updates on Semantic Fingerprinting", Francisco Webber, Inventor and Co-Found...
"Updates on Semantic Fingerprinting", Francisco Webber, Inventor and Co-Found...
 
How to deliver effective data science projects
How to deliver effective data science projectsHow to deliver effective data science projects
How to deliver effective data science projects
 
resume
resumeresume
resume
 
Hacking Marketing By Scott Brinker
Hacking Marketing By Scott BrinkerHacking Marketing By Scott Brinker
Hacking Marketing By Scott Brinker
 
Design Prototyping: Bringing Wireframes to Life
Design Prototyping: Bringing Wireframes to LifeDesign Prototyping: Bringing Wireframes to Life
Design Prototyping: Bringing Wireframes to Life
 
Clojure at ardoq
Clojure at ardoqClojure at ardoq
Clojure at ardoq
 
DevoxxUK 2016: "DevOps: Microservices, containers, platforms, tooling... Oh y...
DevoxxUK 2016: "DevOps: Microservices, containers, platforms, tooling... Oh y...DevoxxUK 2016: "DevOps: Microservices, containers, platforms, tooling... Oh y...
DevoxxUK 2016: "DevOps: Microservices, containers, platforms, tooling... Oh y...
 
Coevolving Organisational and Technical Boundaries
Coevolving Organisational and Technical BoundariesCoevolving Organisational and Technical Boundaries
Coevolving Organisational and Technical Boundaries
 
Accelerate to Cloud
Accelerate to CloudAccelerate to Cloud
Accelerate to Cloud
 
What’s Up, EDoc?!
What’s Up,EDoc?!What’s Up,EDoc?!
What’s Up, EDoc?!
 
Accelerate Big Data Application Development with Cascading
Accelerate Big Data Application Development with CascadingAccelerate Big Data Application Development with Cascading
Accelerate Big Data Application Development with Cascading
 
Turn Data Into Actionable Insights - StampedeCon 2016
Turn Data Into Actionable Insights - StampedeCon 2016Turn Data Into Actionable Insights - StampedeCon 2016
Turn Data Into Actionable Insights - StampedeCon 2016
 
PDL Distinguished Alumni Talk
PDL Distinguished Alumni TalkPDL Distinguished Alumni Talk
PDL Distinguished Alumni Talk
 
Semtech2006
Semtech2006Semtech2006
Semtech2006
 
Hadoop and the Relational Database: The Best of Both Worlds
Hadoop and the Relational Database: The Best of Both WorldsHadoop and the Relational Database: The Best of Both Worlds
Hadoop and the Relational Database: The Best of Both Worlds
 
RightScale Webinar: The DevOps Debate - Is the Enterprise Ready?
RightScale Webinar: The DevOps Debate - Is the Enterprise Ready?RightScale Webinar: The DevOps Debate - Is the Enterprise Ready?
RightScale Webinar: The DevOps Debate - Is the Enterprise Ready?
 

Recently uploaded

Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Kaya Weers
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesManik S Magar
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)Mark Simos
 
Infrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platformsInfrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platformsYoss Cohen
 
All These Sophisticated Attacks, Can We Really Detect Them - PDF
All These Sophisticated Attacks, Can We Really Detect Them - PDFAll These Sophisticated Attacks, Can We Really Detect Them - PDF
All These Sophisticated Attacks, Can We Really Detect Them - PDFMichael Gough
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024TopCSSGallery
 
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...itnewsafrica
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructureitnewsafrica
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Nikki Chapple
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
A Glance At The Java Performance Toolbox
A Glance At The Java Performance ToolboxA Glance At The Java Performance Toolbox
A Glance At The Java Performance ToolboxAna-Maria Mihalceanu
 

Recently uploaded (20)

Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
 
Infrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platformsInfrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platforms
 
All These Sophisticated Attacks, Can We Really Detect Them - PDF
All These Sophisticated Attacks, Can We Really Detect Them - PDFAll These Sophisticated Attacks, Can We Really Detect Them - PDF
All These Sophisticated Attacks, Can We Really Detect Them - PDF
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024
 
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
A Glance At The Java Performance Toolbox
A Glance At The Java Performance ToolboxA Glance At The Java Performance Toolbox
A Glance At The Java Performance Toolbox
 

Building a semantic enterprise content management system from scratch v1

  • 1. Building a Semantic Enterprise Content Management System from Scratch How  we  built  a  prac/cal  ontology-­‐driven  corporate  intranet  portal in  the  cloud  in  three  months  using  off-­‐the-­‐shelf  technology SemTechBiz  San  Francisco,  June  6th  2012 Ron  Michael  Ze-lemoyer  and  Cliff  Jurkiewicz @ronmichael  and  @cessna_pilot
  • 2. Mobile & Desktop Apps Web Apps & Services fynydd          :in-­‐id  -­‐  noun Semantic Knowledge Management 1.    a  word  of  Welsh  origin   meaning  mountain. User Interface Design 2.    a  company  of  big  thinkers,   innovative  problem  solvers,   and  doers. Systems Architecture fynydd.com Reporting & Analytics
  • 3. How we got here @thomson “TranslaAonal   reuters #kolexperts @jwindz medicine  meets  the   semanAc  web” #semtech 2009 #sla2009 @candp #stardog @ronmichael @fynydd Cambridge #semtechbiz 2012 Steve  Jobs Crea%vity  is  just  connec%ng  things.
  • 4. Traditional enterprise content management Andy  Warhol They  say  that  /me   changes  things,  but   you  actually  have  to   change  them  yourself.
  • 5. Semantic enterprise content management represents recognizes responds  to   the  meaning  of  content the  goals  of  users
  • 6. Build it yourself Julius  Caesar Crea/ng  is  the essence  of  life.
  • 7. Stand on the shoulders of giants Henry  Ford I  invented nothing  new. I  simply  assembled   the  discoveries  of   other  people.  Had  I   worked  fiBy  or  ten  or   even  five  years   before,  I  would  have   failed.  So  it  is  with   every  new  thing.
  • 8. Keep your head in the cloud Henry  David  Thoreau   If  you  have  built  castle s  in  the  air,   your  work  need  not  be that  is  where  they  sho  lost; uld  be.
  • 9. Be agile arles  Darwin Ch the  species trongest  of   ntelligent. I t  is  not  the  s r  the  most  i that  survives  no the  most  adaptable It  is  the  o ne  that  is   to  change.
  • 10. Tame your content Dr.  Seuss So  the  writer  who  breeds more  words  than  he  needs, is  making  a  chore for  the  reader  who  reads.
  • 11. Architecture dotNetRDF
  • 12. Foundation Microsoft SharePoint ? Cambridge
  • 13. Ontology • Define  your  goal:  increase  content  findability • Build  simply  and  as  you  need  it • Provide  simple  management  tools • Sell  stakeholders  on  its  value • Hide  it  from  users
  • 14. Browse • Research  and  curate  top  level  menus • Generate  dynamic  sub  menus • Generate  related  content  links • Adopt  friendly  URLs • Design  beau/ful  pages
  • 15. Search • Start  with  autocomplete • Use  a  “snap-­‐to-­‐grid”  approach • Make  it  contextual  and  personalized • Provide  federated  and  adap/ve  results • Design  beau/ful  search  results
  • 16. Search User  input Context Content   SPARQL OperaAons SQL metadata Ontology LINQ Content data Public Secret AnalyAcal datasets sauce data Results  &  suggesAons
  • 17. Administration • Give  authors  manual  &  automa/c  tagging • Show  content-­‐level  analy/cs   • Build  a  great  editor • Design  beau/ful  adminsitra/ve  tools
  • 18. Keep moving Lexus Anything  not is  moving  bac  moving  forw ard   kward.
  • 19. Start building William  Wordsworth To  begin,  begin.
  • 20. Libraries and Code dotNetRDF h-p://dotnetrdf.org Squickl  SQL  data  access  library h-ps://github.com/ronmichael/squickl.net AWS  Snapshot  Scheduler h-ps://github.com/ronmichael/aws-­‐snapshot-­‐scheduler Stardog  Bites  MSSQL  CLR  extensions h-ps://github.com/ronmichael/stardog-­‐bites-­‐mssql CFrame  Content  Management  Framework h-ps://github.com/ronmichael/cframe dotNetRDF  Stardog  Helper h-ps://github.com/ronmichael/dotnetrdf-­‐stardog-­‐helper
  • 21. References IntegraAng  SemanAc  Systems John  F.  Sowa:  h-p://go.fynydd.com/vxzum An  Ontology-­‐Based  Knowledge  Management  Pla]orm Aldea  et  al:  h-p://go.fynydd.com/opble SemanAc  Enterprise  Content  Management Mark  Fisher,  Amit  Sheth:  h-p://go.fynydd.com/qfllv The  SemanAc  Web  and  Entertainment  Weekly Donna  Slawsky:  h-p://go.fynydd.com/dygpj Improving  Content  Management  with  SemanAc  Technologies Fernando  Carolo  and  Leonardo  Burlamaqui:  h-p://go.fynydd.com/bpvor Content  Management  Bible Bob  Boiko:  h-p://go.fynydd.com/xhjbi

Editor's Notes

  1. About three years ago Jesse Dudley was working at Thomson Reuters on a product called KOLexperts that identifies experts in the pharma and biotech industries by analyzing content in places like PubMed. She attended the Special Libraries Association (SLA) Conference in June of 2009 in DC and, because of her work on KOLexperts, she attended a presentation titled “ Translational medicine meets the semantic web” by Olivier Bodenreider from the National Library of Medicine. This was her introduction into semtech after which she started spreading semtech stuff to me and I spread it along to Fynydd. It had obvious value for a lot of enterprise knowledge management tools we work on. So as we worked with customers interested in improving their knowledge sharing tools and intranets we started experimenting and recommending it. We started working with Clark and Parsia and began building prototype content management systems that ran on Stardog, their new RDF database. This eventually resulted in a semantic content management prototype and framework we called Cambridge, which has been well received in various incarnations by a couple clients. And then almost exactly three years from SLA 2009 we are speaking at SemtechBiz 2012.
  2. Traditional ECM is most often the intranet portal. It’s primitive, slow to change, hard to deploy. It’s broken. It’s time to change.
  3. SECMS tries to solve some of these problems by understanding the meaning of content and the goals of users. SECMS is the intersection of meaning and goals. We store information in more logical and standard formats (RDF) and use more modern and standard tools (SPARQL) to query them.
  4. Some design principles. First is build it yourself. Often debated - no perfect answer. Why did we? -Semtech marketplace for this kind of thing is in its infancy, esp. UI and UX -Innovative and cutting edge solution -Tools shape thinking- differentiate yourself
  5. Next: don’t build all of it yourself. Its the age of the mashup. Get advice and assistance from the best in the field. Build using the best software components and tools, open source, commercial, etc.
  6. The cliched cloud slide. Why does the cloud matter? Provisioning real servers is slow and costly, bureaucratic. Even if final deployment is onsite, cloud is great for prototyping. Scale quickly. Cheaper and more efficient servers. While prototyping you can never be sure what resources you’ll need.
  7. Another cliched slide: agile development. But why does it matter? Talk to clients - end users, not management - understand problems. Build iteratively. Build a system that doesn't require lots of documentation Build iteratively. Respond to change in business, marketplace, technology, capabilities. I
  8. Last design principle: sometimes you need to upgrade your content. Our policy & procedure story. Started thinking how to build tool to deal with existing content. But content was written and organized for an old medium - paper - then pushed to PDF. Redundant, disorganized, mixed together. Once we switched gears, rewrote & improve content, solution was easier to build and better for users.
  9. Now for implementation. AWS: Incredibly flexible and innovative .NET and C#: great framework, language, well accepted in enterprise MSSQL: good for non-RDF needs, well accepted in enterprise, SQL Express is free Stardog: great RDF database, fast and easy to use dotNetRDF: open source, talk to Stardog with ease
  10. .NET is our platform but what about a foundation? Build or buy? Lots of debate and procrastination. All choices required similar development times Build your own: faster to prototype, most flexible, better ability to innovate Avoid politics of deciding between systems already in place [lotus quickr, teamsite, sharepoint] Generic .NET solution moves easily into whatever framework customer has/wants
  11. One of our biggest problem was overcomplicating the ontology, e.g. answer questions Define goal : findability. build as you need it Don’t make it complicated, build as you need it. Treat ontology like content not code. build nice tools, prepare for it to change often. Biggest thing of all - don’t talk to users too much ontology (or tech in general). it’s only a means to an end. But selling its value to stakeholders can work.
  12. Initially planned for dynamic menus based on role, but too complicated & unnecessary. Curated top menus based on user research, card sorting, etc worked best. Dynamic sub menus and related content links work. Friendly urls are often forgotten - good for experts, for sharing Beautiful page - UX - layouts - whitespace & margins- improve browsability and user satisfaction.
  13. Don’t delay autocomplete, it improves search dramatically. Take your inputs and “snap them to a grid” to find an answer. Context is important, personalization is important Federation: include all types of results. Adaptive: build in your own analytics early on and use them for self diagnosis and improvement Beautiful results are easier to read.
  14. Tagging: simple approach of picking “subject” (hasSubject) and “audience” (hasAudience) entities from a hierarchical view of select pieces of ontology. Expand to let them choose other relationships ( eg. hasDestination mars) Simple auto tagging recommendations by matching text; add more complex with tools like Open Calais? Inline analytics were very valuable tool for authors and mgmt. Of course, editor has to be great, as should entire admin -- too often ignored.
  15. Must constantly improve - plan and budget for it early on. Start with a basic tool that looks great and has some semantics, prove it, grow it. People are used to constant improvement - internet, cars, etc. Focus on search, navigation, UX and performance.