Potential Future Directions for ePADD
Peter Chan, ePADD Project Manager
Digital Archivist, Stanford Libraries
Workshop 2 "After the Digital Revolution"
London, January 25-26, 2018
Other Platforms used in Personal Digital Archives
Facebook
Twitter
WhatsApp, Slack
Calendar
YouTube
Photos
WordPress
Apply unique ePADD features to other File Types
Browsing of extracted entities
Merging of identities
Lexicon search
Query generator
Connection to authority files
Redacted version for public access
Restriction management
Target Different User Groups
Donors, archivists, and researchers of email archives in collecting repositories
Individuals who want to organize their own emails
Journalists who want to analyze email from their sources
Cultural institutions that want to retain the knowledge held in the email of departing staff
Improve Existing ePADD Functions
Better Screening of Sensitive Information
Improve Findability of Archives / Required Information
Better Search Capability
More Automatic Classification / Grouping of Contents
Better Label / Annotation Functions
Improve Technical Infrastructure and UX
Improve Interface with other Systems
Better Screening of Sensitive Information
Good basic and advanced search functions
Regular expressions according to institution policy
Ability to store structured keywords
Entities from DBpedia related to sensitive messages
Derive keywords from WordNet
Ontology created through systematic interviews with users
Classifier trained to recognize email with particular sensitivities
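As a minimal sketch of the regular-expression item above: the two patterns below are illustrative assumptions, not ePADD's shipped rules and not any institution's actual policy. The names `POLICY_PATTERNS` and `screen_message` are hypothetical.

```python
import re

# Hypothetical policy table: each label maps to a pattern an institution
# might choose to flag. Real policies would be defined by the repository.
POLICY_PATTERNS = {
    "us_ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "card_number": re.compile(r"\b(?:\d{4}[ -]?){3}\d{4}\b"),
}

def screen_message(text):
    """Return the sorted labels of every policy pattern found in text."""
    return sorted(
        label for label, pattern in POLICY_PATTERNS.items()
        if pattern.search(text)
    )
```

A message flagged this way would still need review by an archivist, since patterns of this kind produce both false positives and misses.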
Improve Findability / Discovery of Archives
Provide many thousands of metadata entries for a single archive
Provide cross collection search of metadata from archives in one institution
Provide a platform for all institutions to host their email archives
Provide cross institution search and browsing of common entities across all archives
Facilitate inclusion of metadata generated by ePADD in institutions’ catalog systems
Facilitate inclusion of metadata generated by ePADD in Wiki
Enhance the discovery module to facilitate crawling by Google
Better Search Capability
Simple search
Advanced search
Lexicon search
Query generator
Advanced query generator
Fuzzy search
Semantic search
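One way the fuzzy search item above could work is sketched below with Python's standard `difflib` module. This is an assumption for illustration only; the slides do not say which matching technique ePADD would use, and `fuzzy_search` and its `cutoff` default are hypothetical.

```python
import difflib

def fuzzy_search(query, vocabulary, cutoff=0.8):
    """Return vocabulary terms similar to query, best match first.

    Similarity is difflib's ratio (0.0 to 1.0); terms below cutoff are dropped.
    Matching is case-insensitive, but original spellings are returned.
    """
    lowered = {term.lower(): term for term in vocabulary}
    matches = difflib.get_close_matches(
        query.lower(), list(lowered), n=5, cutoff=cutoff
    )
    return [lowered[match] for match in matches]
```

With a vocabulary of extracted entity names, a misspelled query such as "Stanfrod" would still surface "Stanford".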
More Automatic Classification / Grouping of Contents
Correspondent with different email addresses
Traditional entity extraction (person, organization, location)
Fine-grained entity extraction (based on entity in DBpedia)
Word frequency
Content Classification - specific (booking confirmations, receipts, etc.)
Content Classification - general (sports, economics, etc.)
Topic modelling
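The first item above, one correspondent using several email addresses, can be sketched as grouping addresses by normalized display name. This is a simple heuristic assumed for illustration, not ePADD's actual identity-merging logic; `group_addresses_by_name` is a hypothetical name.

```python
from collections import defaultdict
from email.utils import getaddresses

def group_addresses_by_name(header_values):
    """Map each normalized display name to the set of addresses seen with it."""
    groups = defaultdict(set)
    for name, address in getaddresses(header_values):
        if not address:
            continue
        # Collapse case and whitespace; fall back to the address itself
        # when the header carries no display name.
        key = " ".join(name.lower().split()) or address.lower()
        groups[key].add(address.lower())
    return dict(groups)
```

Because different people can share a name, groups produced this way are candidates for an archivist to confirm rather than final merges.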
Better Label / Annotation Functions
Message-based annotations and labels with no semantics
Message-based labels with simple semantics
Message-based labels with more advanced semantics
Role-based annotations and labels with / without semantics
Text-based (alphanumeric) labels with semantics
Correspondent-based labels with semantics
Entity-based labels with semantics
Improve Technical Infrastructure and UX
Single user platform
Handle 650,000 messages
Multi-user support
Web-based application
Handle millions of messages
Better user experience
Remote Reading Room
Improve Interface with other Systems
Export headers in CSV file for network analysis
Export whole / part of archives in mbox for preservation or use in other email clients
Export confirmed correspondents in CSV file for finding aids
Connect to image recognition system to generate metadata
Export confirmed entities in RDF for other linked data systems
Connect to the Wayback Machine for dead URLs
Provide API (application programming interface) for machine consumption
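The header export for network analysis listed above can be sketched with Python's standard `mailbox` and `csv` modules. The column choice in `HEADER_FIELDS` and the function names are assumptions for illustration; this is not ePADD's export code or its CSV layout.

```python
import csv
import io
import mailbox
from email import message_from_string

# Hypothetical column set; a real export might include more headers.
HEADER_FIELDS = ["From", "To", "Cc", "Date", "Subject"]

def headers_to_csv(messages):
    """Return CSV text with one row of header values per message."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(HEADER_FIELDS)
    for message in messages:
        # Missing headers become empty cells.
        writer.writerow([str(message.get(field, "")) for field in HEADER_FIELDS])
    return buffer.getvalue()

def export_mbox_headers(mbox_path):
    """Read every message in an existing mbox file and return its headers as CSV."""
    return headers_to_csv(mailbox.mbox(mbox_path, create=False))

# Small in-memory example so the sketch runs without an mbox file.
sample = message_from_string(
    "From: alice@example.org\nTo: bob@example.org\nSubject: Hello\n\nBody text\n"
)
csv_text = headers_to_csv([sample])
```

The resulting From and To columns can be loaded into a network analysis tool as an edge list, with each message contributing one sender-to-recipient link.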
Technical Team
Sudheendra Hangal - Co-founder at Amuse Labs, Magic Lamp Software. Faculty
member at Ashoka University. Worked at Sun. Stanford PhD CS.
Chinmay Narayanan - PhD, Indian Institute of Technology. Research focuses on the
interactions of programming languages, logic, and formal semantics. Worked at Simen.
Chaiyasit (Sit) Manovit - Founder of Nimeyo, Ixora Technology. 1st hire at PwrLite,
acquired by Xilinx. Worked at NCR, Intel, Sun, and Xilinx. Stanford PhD EE.
Peter Chan - Digital Archivist, ePADD Project Manager for 6 years; Co-founder of
MyIPhoto.com; VP, Operations Planning at Bank of America, Asia.
Josh Schneider - Assistant University Archivist, ePADD Community Manager
Thanks!!
Visit library.stanford.edu/projects/epadd
Follow @e_padd
Watch youtu.be/vu1Oi8TiGiU
Receive epadd_list@stanford.lists.edu
Download / Contribute github.com/epadd
Participate epadd.nimeyo.com
Reach epadd_project@stanford.edu
