On Thursday, November 10, Joe Hilger and Sara Duane spoke at Text Analytics Forum about identifying secure and confidential information using auto-tagging. Information security continues to grow in importance in today's society. We hear stories all of the time about hackers accessing private information from companies and government agencies. Every organization struggles with employees who store confidential information on insecure network drives or cloud drives. Joe and Sara did a project with a federal research organization that used auto-tagging and text analytics to identify confidential information that needed to be moved to a secure location. During the presentation, we shared the approach we took to identify this information and how we made sure that the tagging and text analytics were accurate. Attendees learned best practices for designing a taxonomy for auto-tagging and tuning auto-tagging as well as ways to identify confidential information across the enterprise.
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
Heather Hedden, Senior Consultant at Enterprise Knowledge, presented “The Role of Taxonomy and Ontology in Semantic Layers” at a webinar hosted by Progress Semaphore on April 16, 2024.
Taxonomies at their core enable effective tagging and retrieval of content, and combined with ontologies they extend to the management and understanding of related data. There are even greater benefits of taxonomies and ontologies to enhance your enterprise information architecture when applying them to a semantic layer. A survey by DBP-Institute found that enterprises using a semantic layer see their business outcomes improve by four times, while reducing their data and analytics costs. Extending taxonomies to a semantic layer can be a game-changing solution, allowing you to connect information silos, alleviate knowledge gaps, and derive new insights.
Hedden, who specializes in taxonomy design and implementation, presented how the value of taxonomies shouldn’t reside in silos but be integrated with ontologies into a semantic layer.
Learn about:
- The essence and purpose of taxonomies and ontologies in information and knowledge management;
- Advantages of semantic layers leveraging organizational taxonomies; and
- Components and approaches to creating a semantic layer, including the integration of taxonomies and ontologies
Five fast ways to improve search and findability across enterprise networksKristian Norling
Ask employees what their main pain points are when it comes to using enterprise networks and chances are “search” will appear high on the list. Yet a recent survey conducted by Findwise shows that while 78% of respondents believe finding the right information is critical to business goals and success, only 24% have a search strategy in place. Only 9% claim it’s “fairly easy” to find content, compared to 64% who admit it’s “hard” or “very hard”.
While the problem requires resource and in-depth review to tackle effectively, there are simple ways to start the journey while getting a longer-term strategy in place.
And actually this presentation contains 7 ways and some bonus content too!
God søk er essentielt for et godt intranett. Likevel investeres det hverken i nødvendig teknologi eller kompetanseutvikling på søk. Resultatet er skremmende: dobbeltarbeid, dårlige beslutninger, forsinkelser og overskridelser, kaste bort ansattes tid på leting etter informasjon, treg respons på marked, konkurrenter osv. Med forholdsvis enkle grep kan du gjøre noe med dette i dag.
- Hjelp - intranettet flyter over av innhold
- Sammenhengen mellom søk, informasjon, arkitektur og hyperkoblinger
- Viktigheten av kontekst
- Hva har tillit å gjøre med søk
- Hva med mobilen og søk
- Eksempler på dårlig och god søk
Ask employees what their main pain points are when it comes to using the intranet and chances are “search” will appear in the top 5 of the list. Meanwhile the global survey on Enterprise Search conclude that while 78% of respondents believe finding the right information is critical to business goals and success, only 24% have a search strategy in place. Only 9% claim it’s “fairly easy” to find content, compared to 64% who admit it’s “hard” or “very hard”.
While improvements in search requires both resources and thorough reviews, there are some things you can do to start the journey while getting a longer-term strategy in place. Kristian Norling from IntraTeam will a crash-course in intranet search with 7 actionable to-dos to take home.
Kristian ger tips om de viktigaste sakerna du måste arbeta med för att lyckas med ditt intranätsök. Och nästan ingenting har med själva söktekniken att göra!
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
With the explosive popularity of ChatGPT, organizations are throwing massive budgets and executive attention at the implementation of AI technologies. Making these solutions work for the enterprise can deliver competitive advantage and open up new solutions and business opportunities that were never before possible. However, without the right Information Architecture (IA) foundations, these projects are bound to fail. In this presentation, Marino and Galdamez provided practical, actionable steps around IA that organizations can take in preparation for future AI solutions.
In this session, attendees:
- Reviewed key elements of IA and discovered how their successful design and implementation can lay the foundations for AI;
- Learned basic terminology surrounding AI, as well as different techniques and applications of AI in enterprise environments;
- Gained a deeper understanding of the feedback loops between IA and AI and the corresponding implications on user experience; and
- Received practical advice on IA design to facilitate its implementation and the success of AI efforts.
Taxonomy: Hero of Advanced Content - SXSW 2019Laura Creekmore
I gave this presentation at SXSW 2019, talking about how content structure can enhance your work on advanced content channels like AI, voice skills, chatbots, and ecommerce.
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
Heather Hedden, Senior Consultant at Enterprise Knowledge, presented “The Role of Taxonomy and Ontology in Semantic Layers” at a webinar hosted by Progress Semaphore on April 16, 2024.
Taxonomies at their core enable effective tagging and retrieval of content, and combined with ontologies they extend to the management and understanding of related data. There are even greater benefits of taxonomies and ontologies to enhance your enterprise information architecture when applying them to a semantic layer. A survey by DBP-Institute found that enterprises using a semantic layer see their business outcomes improve by four times, while reducing their data and analytics costs. Extending taxonomies to a semantic layer can be a game-changing solution, allowing you to connect information silos, alleviate knowledge gaps, and derive new insights.
Hedden, who specializes in taxonomy design and implementation, presented how the value of taxonomies shouldn’t reside in silos but be integrated with ontologies into a semantic layer.
Learn about:
- The essence and purpose of taxonomies and ontologies in information and knowledge management;
- Advantages of semantic layers leveraging organizational taxonomies; and
- Components and approaches to creating a semantic layer, including the integration of taxonomies and ontologies
Five fast ways to improve search and findability across enterprise networksKristian Norling
Ask employees what their main pain points are when it comes to using enterprise networks and chances are “search” will appear high on the list. Yet a recent survey conducted by Findwise shows that while 78% of respondents believe finding the right information is critical to business goals and success, only 24% have a search strategy in place. Only 9% claim it’s “fairly easy” to find content, compared to 64% who admit it’s “hard” or “very hard”.
While the problem requires resource and in-depth review to tackle effectively, there are simple ways to start the journey while getting a longer-term strategy in place.
And actually this presentation contains 7 ways and some bonus content too!
God søk er essentielt for et godt intranett. Likevel investeres det hverken i nødvendig teknologi eller kompetanseutvikling på søk. Resultatet er skremmende: dobbeltarbeid, dårlige beslutninger, forsinkelser og overskridelser, kaste bort ansattes tid på leting etter informasjon, treg respons på marked, konkurrenter osv. Med forholdsvis enkle grep kan du gjøre noe med dette i dag.
- Hjelp - intranettet flyter over av innhold
- Sammenhengen mellom søk, informasjon, arkitektur og hyperkoblinger
- Viktigheten av kontekst
- Hva har tillit å gjøre med søk
- Hva med mobilen og søk
- Eksempler på dårlig och god søk
Ask employees what their main pain points are when it comes to using the intranet and chances are “search” will appear in the top 5 of the list. Meanwhile the global survey on Enterprise Search conclude that while 78% of respondents believe finding the right information is critical to business goals and success, only 24% have a search strategy in place. Only 9% claim it’s “fairly easy” to find content, compared to 64% who admit it’s “hard” or “very hard”.
While improvements in search requires both resources and thorough reviews, there are some things you can do to start the journey while getting a longer-term strategy in place. Kristian Norling from IntraTeam will a crash-course in intranet search with 7 actionable to-dos to take home.
Kristian ger tips om de viktigaste sakerna du måste arbeta med för att lyckas med ditt intranätsök. Och nästan ingenting har med själva söktekniken att göra!
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
With the explosive popularity of ChatGPT, organizations are throwing massive budgets and executive attention at the implementation of AI technologies. Making these solutions work for the enterprise can deliver competitive advantage and open up new solutions and business opportunities that were never before possible. However, without the right Information Architecture (IA) foundations, these projects are bound to fail. In this presentation, Marino and Galdamez provided practical, actionable steps around IA that organizations can take in preparation for future AI solutions.
In this session, attendees:
- Reviewed key elements of IA and discovered how their successful design and implementation can lay the foundations for AI;
- Learned basic terminology surrounding AI, as well as different techniques and applications of AI in enterprise environments;
- Gained a deeper understanding of the feedback loops between IA and AI and the corresponding implications on user experience; and
- Received practical advice on IA design to facilitate its implementation and the success of AI efforts.
Taxonomy: Hero of Advanced Content - SXSW 2019Laura Creekmore
I gave this presentation at SXSW 2019, talking about how content structure can enhance your work on advanced content channels like AI, voice skills, chatbots, and ecommerce.
TEXT MINING-TAPPING HIDDEN KERNELS OF WISDOMITC Infotech
This paper discusses how automatic document classification, information retrieval, word frequency calculation, sentiment analysis, topic modelling and trend analysis can be utilized for root cause analysis, devising competitive strategies, enhancing customer experience and so on.
Heather Hedden, Senior Consultant at Enterprise Knowledge, presented "An Overview of Taxonomies and AI" on January 30th, 2024, in the inaugural webinar of the Artificial Intelligence webinar series: The promise and the perils,” hosted by the Knowledge & Information Management Group of CILIP, the library and information association of the UK. In her presentation, Heather explained, with examples, how both generative AI and other AI technologies support taxonomy development and use and how taxonomies can support AI applications.
Explore the presentation to learn:
Why both top-down and bottom-up methods are needed in taxonomy creation
What AI methods are used for auto-tagging and auto-classification with taxonomies
How AI methods can extract candidate terms for taxonomy creation
How generative AI can be used for certain bottom-up taxonomy development tasks
How AI can be used to analyze a taxonomy against a corpus of documents
How generative AI can be used in queries to analyze a taxonomy
What AI applications taxonomies can support
Climbing the Ontology Mountain to Achieve a Successful Knowledge GraphEnterprise Knowledge
Tatiana Baquero Cakici, Senior KM Consultant, and Jennifer Doughty, Senior Solution Consultant from Enterprise Knowledge’s Data and Information Management (DIME) Division presented at the Taxonomy Boot Camp (KMWorld 2022) on November 17, 2022. KMWorld is the world’s leading knowledge management event that takes place every year in Washington, DC.
Their presentation “Climbing the Ontology Mountain to Achieve a Successful Knowledge Graph” focused on how ontologies have gained momentum as a strong foundation for resolving business challenges through semantic search solutions, recommendation engines, and AI strategies. Cakici and Doughty explained that taxonomists are now faced with the challenge of gaining knowledge and experience in designing and documenting complex solutions that involve the integration of taxonomies, ontologies, and knowledge graphs. They also emphasized that taxonomists are well poised to learn how to design user-centric ontologies, analyze and map data from various systems, and understand the technological architecture of knowledge graph solutions. After describing the key roles and responsibilities needed for a team to successfully implement Knowledge Graph projects, Cakici and Doughty shared practical ontology design considerations and best practices based on their own experience. Lastly, Cakici and Doughty reviewed the most common use cases for knowledge graphs and presented real world applications through a case study that illustrated ontology design and the value of knowledge graphs.
When to use the different text analytics tools - Meaning CloudMeaningCloud
Classification, topic extraction, clustering... When to use the different Text Analytics tools?
How to leverage Text Analytics technology for your business
MeaningCloud webinar, February 8th, 2017
More information and recording of the webinar https://www.meaningcloud.com/blog/recorded-webinar-use-different-text-analytics-tools
www.meaningcloud.com
Sara Mae O’Brien Scott and Tatiana Baquero Cakici, Senior Consultants at Enterprise Knowledge (EK), presented “AI Fast Track to Search-Focused AI Solutions” at the Information Architecture Conference (IAC24) that took place on April 11, 2024 in Seattle, WA.
In their presentation, O’Brien-Scott and Cakici focused on what Enterprise AI is, why it is important, and what it takes to empower organizations to get started on a search-based AI journey and stay on track. The presentation explored the complexities of enterprise search challenges and how IA principles can be leveraged to provide AI solutions through the use of a semantic layer. O’Brien-Scott and Cakici showcased a case study where a taxonomy, an ontology, and a knowledge graph were used to structure content at a healthcare workforce solutions organization, providing personalized content recommendations and increasing content findability.
In this session, participants gained insights about the following:
Most common types of AI categories and use cases;
Recommended steps to design and implement taxonomies and ontologies, ensuring they evolve effectively and support the organization’s search objectives;
Taxonomy and ontology design considerations and best practices;
Real-world AI applications that illustrated the value of taxonomies, ontologies, and knowledge graphs; and
Tools, roles, and skills to design and implement AI-powered search solutions.
The Digital Workplace Powered by Intelligent SearchDaniel Faggella
This presentation covers the landscape of AI-enabled enterprise search.
The presentation was given at Sinequa's INFORM2019 events in both NYC and Paris.
Learn more about AI-enabled enterprise search on Emerj: https://emerj.com/?s=enterprise+search
SharePoint Saturday London - The Nuts and Bolts of Metadata Tagging and Taxon...Concept Searching, Inc
Taxonomies are often thought of as hard to use and needing specialized applications or IT skills. Not so.
Explore how taxonomies, auto-classification, and multi-term metadata generation unburden the IT team, eliminate end user tagging, and empower business users.
Understand the Return on Investment from an effective infrastructure solution for search, security, compliance, eDiscovery, records management, knowledge management, collaboration, and migration activities.
• Watch multi-term metadata being automatically generated.
• Learn how easy it is to use taxonomy tools and interactive features, such as auto-clue suggestion, instant feedback, and assigning weights to terms.
• Discover the value of dynamic screen updating to immediately see the impact of taxonomy changes.
• View how document movement feedback enables you to see the cause and effect of changes without re-indexing.
Understand must-have functionality, to help you evaluate classification and taxonomy software.
Starting with the importance of multi-term metadata, learn about the pros and cons of differing technologies, which questions to ask vendors, and what suits your organization.
Go beyond the basics, to find out what it takes to manage a taxonomy and integrate it with the SharePoint Term Store.
Take away an understanding of:
• Metadata generation – why it is so important.
• Auto-classification – why you can’t live without it.
• Taxonomy approaches that are manageable – by the staff you already have.
Structured authoring for business-critical contentJason Aiken
For decades, XML has armed technical documentation professionals with a component-based approach to content that overcomes the many challenges caused by standalone, static documents created in silos. The problem, however, is that there is so much other business-critical content out there that could benefit from a structured approach to authoring for content automation.
Learn why it is critical for technical documentation experts to translate their best practices into solutions that non-technical content creators can apply to business-critical content. Business-critical content is content you sell, content that helps you sell, or content that helps you run your business.
FEDSPUG Meeting: Intelligent Metadata and Auto-classification in Records Mana...Concept Searching, Inc
Auto-classification removes a burden from IT teams and end users. But what and where is the content being classified? Then what happens?
Auto-classification not only organizes your content but also provides an environment where information governance and compliance policies, and processes, can be implemented enterprise-wide. With automatic multi-term metadata generation and powerful taxonomy tools, the positive
impact on your business is quickly realized.
As well as the visible impact of search improvement, the elimination of end user tagging reduces both productivity drain and tagging errors, to safeguard information that should be protected, such as confidential information or records.
Find out how to clean up, optimize, and organize your enterprise content, providing a framework
for effective records management.
* Metadata generation – why it is so important
* Auto-classification – why you can’t live without it
* Taxonomy approaches that are manageable – by the staff you already have
Agile Mumbai 2022 - Rohit Handa | Combining Human and Artificial Intelligence...AgileNetwork
Agile Mumbai 2022
Combining Human and Artificial Intelligence for Business Agility
Rohit Handa
Director, Digital Products & Platforms, HCL Technologies Ltd
Heather Hedden, Senior Consultant at Enterprise Knowledge, presented “Enterprise Knowledge Graphs: The Importance of Semantics” on May 9, 2024, at the annual Data Summit in Boston.
In her presentation, Hedden describes the components of an enterprise knowledge graph and provides further insight into the semantic layer – or knowledge model – component, which includes an ontology and controlled vocabularies, such as taxonomies, for controlled metadata. While data experts tend to focus on the graph database components (RDF triple store or a label property graph), Hedden emphasizes they should not overlook the importance of the semantic layer.
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
Enterprise Knowledge’s Urmi Majumder, Principal Data Architecture Consultant, and Fernando Aguilar Islas, Senior Data Science Consultant, presented "Driving Behavioral Change for Information Management through Data-Driven Green Strategy" on March 27, 2024 at Enterprise Data World (EDW) in Orlando, Florida.
In this presentation, Urmi and Fernando discussed a case study describing how the information management division in a large supply chain organization drove user behavior change through awareness of the carbon footprint of their duplicated and near-duplicated content, identified via advanced data analytics. Check out their presentation to gain valuable perspectives on utilizing data-driven strategies to influence positive behavioral shifts and support sustainability initiatives within your organization.
In this session, participants gained answers to the following questions:
- What is a Green Information Management (IM) Strategy, and why should you have one?
- How can Artificial Intelligence (AI) and Machine Learning (ML) support your Green IM Strategy through content deduplication?
- How can an organization use insights into their data to influence employee behavior for IM?
- How can you reap additional benefits from content reduction that go beyond Green IM?
More Related Content
Similar to Identifying Security Risks Using Auto-Tagging and Text Analytics
TEXT MINING-TAPPING HIDDEN KERNELS OF WISDOMITC Infotech
This paper discusses how automatic document classification, information retrieval, word frequency calculation, sentiment analysis, topic modelling and trend analysis can be utilized for root cause analysis, devising competitive strategies, enhancing customer experience and so on.
Heather Hedden, Senior Consultant at Enterprise Knowledge, presented "An Overview of Taxonomies and AI" on January 30th, 2024, in the inaugural webinar of the Artificial Intelligence webinar series: The promise and the perils,” hosted by the Knowledge & Information Management Group of CILIP, the library and information association of the UK. In her presentation, Heather explained, with examples, how both generative AI and other AI technologies support taxonomy development and use and how taxonomies can support AI applications.
Explore the presentation to learn:
Why both top-down and bottom-up methods are needed in taxonomy creation
What AI methods are used for auto-tagging and auto-classification with taxonomies
How AI methods can extract candidate terms for taxonomy creation
How generative AI can be used for certain bottom-up taxonomy development tasks
How AI can be used to analyze a taxonomy against a corpus of documents
How generative AI can be used in queries to analyze a taxonomy
What AI applications taxonomies can support
Climbing the Ontology Mountain to Achieve a Successful Knowledge GraphEnterprise Knowledge
Tatiana Baquero Cakici, Senior KM Consultant, and Jennifer Doughty, Senior Solution Consultant from Enterprise Knowledge’s Data and Information Management (DIME) Division presented at the Taxonomy Boot Camp (KMWorld 2022) on November 17, 2022. KMWorld is the world’s leading knowledge management event that takes place every year in Washington, DC.
Their presentation “Climbing the Ontology Mountain to Achieve a Successful Knowledge Graph” focused on how ontologies have gained momentum as a strong foundation for resolving business challenges through semantic search solutions, recommendation engines, and AI strategies. Cakici and Doughty explained that taxonomists are now faced with the challenge of gaining knowledge and experience in designing and documenting complex solutions that involve the integration of taxonomies, ontologies, and knowledge graphs. They also emphasized that taxonomists are well poised to learn how to design user-centric ontologies, analyze and map data from various systems, and understand the technological architecture of knowledge graph solutions. After describing the key roles and responsibilities needed for a team to successfully implement Knowledge Graph projects, Cakici and Doughty shared practical ontology design considerations and best practices based on their own experience. Lastly, Cakici and Doughty reviewed the most common use cases for knowledge graphs and presented real world applications through a case study that illustrated ontology design and the value of knowledge graphs.
When to use the different text analytics tools - Meaning CloudMeaningCloud
Classification, topic extraction, clustering... When to use the different Text Analytics tools?
How to leverage Text Analytics technology for your business
MeaningCloud webinar, February 8th, 2017
More information and recording of the webinar https://www.meaningcloud.com/blog/recorded-webinar-use-different-text-analytics-tools
www.meaningcloud.com
Sara Mae O’Brien Scott and Tatiana Baquero Cakici, Senior Consultants at Enterprise Knowledge (EK), presented “AI Fast Track to Search-Focused AI Solutions” at the Information Architecture Conference (IAC24) that took place on April 11, 2024 in Seattle, WA.
In their presentation, O’Brien-Scott and Cakici focused on what Enterprise AI is, why it is important, and what it takes to empower organizations to get started on a search-based AI journey and stay on track. The presentation explored the complexities of enterprise search challenges and how IA principles can be leveraged to provide AI solutions through the use of a semantic layer. O’Brien-Scott and Cakici showcased a case study where a taxonomy, an ontology, and a knowledge graph were used to structure content at a healthcare workforce solutions organization, providing personalized content recommendations and increasing content findability.
In this session, participants gained insights about the following:
Most common types of AI categories and use cases;
Recommended steps to design and implement taxonomies and ontologies, ensuring they evolve effectively and support the organization’s search objectives;
Taxonomy and ontology design considerations and best practices;
Real-world AI applications that illustrated the value of taxonomies, ontologies, and knowledge graphs; and
Tools, roles, and skills to design and implement AI-powered search solutions.
The Digital Workplace Powered by Intelligent SearchDaniel Faggella
This presentation covers the landscape of AI-enabled enterprise search.
The presentation was given at Sinequa's INFORM2019 events in both NYC and Paris.
Learn more about AI-enabled enterprise search on Emerj: https://emerj.com/?s=enterprise+search
SharePoint Saturday London - The Nuts and Bolts of Metadata Tagging and Taxon...Concept Searching, Inc
Taxonomies are often thought of as hard to use and needing specialized applications or IT skills. Not so.
Explore how taxonomies, auto-classification, and multi-term metadata generation unburden the IT team, eliminate end user tagging, and empower business users.
Understand the Return on Investment from an effective infrastructure solution for search, security, compliance, eDiscovery, records management, knowledge management, collaboration, and migration activities.
• Watch multi-term metadata being automatically generated.
• Learn how easy it is to use taxonomy tools and interactive features, such as auto-clue suggestion, instant feedback, and assigning weights to terms.
• Discover the value of dynamic screen updating to immediately see the impact of taxonomy changes.
• View how document movement feedback enables you to see the cause and effect of changes without re-indexing.
Understand must-have functionality, to help you evaluate classification and taxonomy software.
Starting with the importance of multi-term metadata, learn about the pros and cons of differing technologies, which questions to ask vendors, and what suits your organization.
Go beyond the basics, to find out what it takes to manage a taxonomy and integrate it with the SharePoint Term Store.
Take away an understanding of:
• Metadata generation – why it is so important.
• Auto-classification – why you can’t live without it.
• Taxonomy approaches that are manageable – by the staff you already have.
Structured authoring for business-critical contentJason Aiken
For decades, XML has armed technical documentation professionals with a component-based approach to content that overcomes the many challenges caused by standalone, static documents created in silos. The problem, however, is that there is so much other business-critical content out there that could benefit from a structured approach to authoring for content automation.
Learn why it is critical for technical documentation experts to translate their best practices into solutions that non-technical content creators can apply to business-critical content. Business-critical content is content you sell, content that helps you sell, or content that helps you run your business.
FEDSPUG Meeting: Intelligent Metadata and Auto-classification in Records Mana...Concept Searching, Inc
Auto-classification removes a burden from IT teams and end users. But what and where is the content being classified? Then what happens?
Auto-classification not only organizes your content but also provides an environment where information governance and compliance policies, and processes, can be implemented enterprise-wide. With automatic multi-term metadata generation and powerful taxonomy tools, the positive
impact on your business is quickly realized.
As well as the visible impact of search improvement, the elimination of end user tagging reduces both productivity drain and tagging errors, to safeguard information that should be protected, such as confidential information or records.
Find out how to clean up, optimize, and organize your enterprise content, providing a framework
for effective records management.
* Metadata generation – why it is so important
* Auto-classification – why you can’t live without it
* Taxonomy approaches that are manageable – by the staff you already have
Agile Mumbai 2022 - Rohit Handa | Combining Human and Artificial Intelligence...AgileNetwork
Agile Mumbai 2022
Combining Human and Artificial Intelligence for Business Agility
Rohit Handa
Director, Digital Products & Platforms, HCL Technologies Ltd
Similar to Identifying Security Risks Using Auto-Tagging and Text Analytics (20)
Heather Hedden, Senior Consultant at Enterprise Knowledge, presented “Enterprise Knowledge Graphs: The Importance of Semantics” on May 9, 2024, at the annual Data Summit in Boston.
In her presentation, Hedden describes the components of an enterprise knowledge graph and provides further insight into the semantic layer – or knowledge model – component, which includes an ontology and controlled vocabularies, such as taxonomies, for controlled metadata. While data experts tend to focus on the graph database components (RDF triple store or a label property graph), Hedden emphasizes they should not overlook the importance of the semantic layer.
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
Enterprise Knowledge’s Urmi Majumder, Principal Data Architecture Consultant, and Fernando Aguilar Islas, Senior Data Science Consultant, presented "Driving Behavioral Change for Information Management through Data-Driven Green Strategy" on March 27, 2024 at Enterprise Data World (EDW) in Orlando, Florida.
In this presentation, Urmi and Fernando discussed a case study describing how the information management division in a large supply chain organization drove user behavior change through awareness of the carbon footprint of their duplicated and near-duplicated content, identified via advanced data analytics. Check out their presentation to gain valuable perspectives on utilizing data-driven strategies to influence positive behavioral shifts and support sustainability initiatives within your organization.
In this session, participants gained answers to the following questions:
- What is a Green Information Management (IM) Strategy, and why should you have one?
- How can Artificial Intelligence (AI) and Machine Learning (ML) support your Green IM Strategy through content deduplication?
- How can an organization use insights into their data to influence employee behavior for IM?
- How can you reap additional benefits from content reduction that go beyond Green IM?
Nonprofit KM Journey to Success: Lessons and Learnings at Feeding AmericaEnterprise Knowledge
Sara Duane, Senior Consultant within EK’s Strategic Consulting practice, and EK client Tom Summerfelt, former Chief Research Officer at Feeding America, presented on November 7, 2023 at KMWorld. The talk, “Nonprofit KM Journey to Success: Lessons & Learnings at Feeding America” focused on best practices for designing and implementing KM strategies that directly align with nonprofit organizational goals.
Duane and Summerfelt used their first-hand experience developing a multi-year comprehensive KM Strategy for Feeding America to outline real-world considerations and examples of:
Unique KM challenges faced by organizations in the nonprofit space
Considerations for strategic priorities and KM roadmaps for nonprofits
How to describe the business impact of KM for nonprofits
EK presented with Kate Vilches, Knowledge Management Lead at Ulteig, on November 6, 2022 at the Taxonomy Boot Camp Conference, co-located with KMWorld, in Washington, D.C. The talk, “Taxonomy Roller Coasters: Techniques to Keep Stakeholders on the Ride,” focused on proven stakeholder management techniques during enterprise taxonomy development and launch activities.
Gray and Vilches used their firsthand experience to relate advice, share practical tools, and provide real-life examples to ensure successful stakeholder involvement, reinforcing three key themes for attendees:
How to select partners and build coalitions to ensure long term success;
Overview of the steps, stages, challenges, and thrills of defining and implementing an enterprise taxonomy; and
The importance and finesse of effective change management efforts to ensure that stakeholders begin and remain excited and involved throughout the project.
DGIQ - Case Studies_ Applications of Data Governance in the Enterprise (Final...Enterprise Knowledge
Thomas Mitrevski, Senior Data Management and Governance Consultant and
Lulit Tesfaye, Partner and Vice President of Knowledge and Data Services
presented “Case Studies: Applications of Data Governance in the Enterprise” on December 6th, 2023 at DGIQ in Washington D.C.
In this presentation, Thomas and Lulit detailed their experiences developing strategies for multiple enterprise-scale data initiatives and provided an understanding of common data governance and maturity needs. Thomas and Lulit based their talk on real-world examples and case studies and provided the audience with examples of achieving buy-in to invest in governance tools and processes, as well as the expected return on investment (ROI).
Check out the presentation below to learn:
How Leading Organizations are Benchmarking Their Data Governance Maturity
Why End-User Training was Imperative in Seeing Scaled Governance Program Adoption
Which Tools and Frameworks were Critical in Getting Started with Data Governance
How Organizations Achieved Success with Data Governance in Under 12 Weeks
What Successful Data Governance Implementation Roadmaps Really Look Like
Sara Nash and Urmi Majumder, Principal Consultants at Enterprise Knowledge, presented on April 19, 2023 at KM World in Washington D.C. on the topic of Scaling Knowledge Graph Architectures with AI.
In this presentation, Sara and Urmi defined a Knowledge Graph architecture and reviewed how AI can support the creation and growth of Knowledge Graphs. Drawing from their experience in designing enterprise Knowledge Graphs based on knowledge embedded in unstructured content, Sara and Urmi defined approaches for entity and relationship extraction depending on Enterprise AI maturity and highlighted other key considerations to incorporate AI capabilities into the development of a Knowledge Graph.
View presentation below in order to learn about how:
Assess entity and relationship extraction readiness according to EK’s Extraction Maturity Spectrum and Relationship Extraction Maturity Spectrum.
Utilize knowledge extraction from content to gather important insights into organizational data.
Extract knowledge with three approaches:
RedEx Rule, Auto-Classification Rule, Custom ML Model
Examine key factors such as how to leverage SMEs, iterate AI processes, define use cases, and invest in establishing robust AI models.
This presentation was delivered by EK CEO Zach Wahl at the 2023 Midwest KM Symposium in Kent State, Ohio. The presentation defines Knowledge Management and its value. It also covers key industry trends and outcomes.
Building for the Knowledge Management Archetypes at Your CompanyEnterprise Knowledge
Building for the KM Archetypes at Your Company
Taylor Paschal, Knowledge and Information Management Consultant at Enterprise Knowledge, and Jessica Malloy, Senior Knowledge Manager at Harvard Business Publishing presented on April 19, 2023 at the APQC Conference in Houston, Texas on the topic of Building for the KM Archetypes at Your Company. In this presentation, Jessica and Taylor define common types of personalities that are often present when building a KM program. Jessica and Taylor prompted attendees to think through the root causes of various behaviors and the approaches for taking these into account when driving KM forward in round table discussions supported by this worksheet (link). Attendees left with the ability to:
Describe the importance of focusing on the unique culture of an organization when building and iterating on a KM program
Recognize organizational archetypes and know how to adapt their KM program to them
Conduct a cultural assessment of their own organization to ensure their KM program is meeting them where they are
Knowledge Graphs are Worthless, Knowledge Graph Use Cases are PricelessEnterprise Knowledge
At Knowledge Graph Forum 2022, Lulit Tesfaye and Sara Nash, Senior Consultant discuss the importance of establishing valuable and actionable use cases for knowledge graph efforts. The discussion draws on lessons learned from several knowledge graph development efforts to define how to diagnose a bad use case and outlined their impact on initiatives - including strained relationships with stakeholders, time spent reworking priorities, and team turnover. They also share guidance on how to navigate these scenarios and provide a checklist to assess a strong use case.
For KM practitioners, Agile frameworks have long been important for optimizing stakeholder value and satisfaction in KM initiatives. Over 20 years ago, a group of software developers revolutionized their field by introducing the Agile Manifesto to guide their industry in adopting Agile values, frameworks, and practices. However, until now, KM practitioners have lacked a formal framework demonstrating how to apply Agility to KM. In short, it is time to codify these Agile principles in a manner suited for the KM profession. Leveraging the original Agile Manifesto for inspiration, Andrew Politi and Megan Salerno introduced “The Agile KM Manifesto” at KM World 2022. The presentation is designed to initiate a conversation amongst KM practitioners across the industry about this initial version of the Agile KM Manifesto (the 'AKM'), and solicit feedback on future iterations.
Next, the presenters walked through three EK case studies demonstrating how the application of its principles could have saved significant time in those initiatives.
First, we described how a global non-profit approached EK to address duplicate and outdated content, and the lack of content creation standards.
Applicable AKM principle: "Content should only be available to users if it is new, essential, reliable, dynamic, and reusable. If these criteria are not met, the content must be cleaned-up or archived accordingly.”"
Next was a discussion of how national nuclear research laboratory struggled to share and discover knowledge from retiring employees and compartmentalized silos.
Applicable AKM principle: “Tacit knowledge and expertise should be proactively and formally captured and stored in the same manner as explicit knowledge.”
Finally, the presenters described how one of the largest multinational athletic apparel companies struggled to help geographically separated teams collectively and collaboratively reuse knowledge and create content across the globe, even functionally similar focus roles.
Applicable AKM principle: “All KM efforts must leverage a common language. Develop, socialize, and employ a common KM language so stakeholders don't speak past each other and can maintain consensus throughout your KM effort.”
Ultimately, this presentation served to introduce The AKM to the broader community, demonstrate its value, and solicit input from across the industry.
Road Maps & Roadblocks to Federal Electronic Records ManagementEnterprise Knowledge
Angela Pitts, Sr. Consultant at Enterprise Knowledge, and Dave Simmons, Sr. Records Officer at General Services Administration (GSA), presented a case study in federal electronic records management that detailed the success of the GSA's Enterprise Document Management Solution (EDMS). They detailed the strategies used to identify elements of organizational change management required to successfully transition standard functions of records management (RM)—capture, maintenance, disposal, transfer, assignment of metadata, and reporting—from manual, paper-based practices to more efficient and less costly electronic systems.
Records Management is a necessary component of successful Knowledge Management as it systematically manages valuable content created and owned by the business. With technological advancements, most agencies have seen the volume of document records increase exponentially because they are now frequently born and managed as digital content through the records lifecycle. Acknowledging the challenge of managing more content with fewer people, Angela and Dave explained how the design of GSA's lean and agile systems and workflows enabled the agency to reduce the resources and attention needed to manage content collections while maintaining legal compliance and quality standards.
Building an Innovative Learning Ecosystem at Scale with Graph TechnologiesEnterprise Knowledge
Todd Fahlberg of Enterprise Knowledge, and Amber Simpson, a Senior Manager at Walmart Academy, presented on November 9, 2022 at the KMWorld Conference in Washington, DC on the topic of Building an Innovative Learning Ecosystem at Scale with Graph Technologies. In this presentation, Todd and Amber share how they’re making it easier for Walmart’s learning organization to manage content used by 2.4 million global associates with a custom Digital Library. The presentation provides insight into the challenges they faced and the lessons they learned along the way, in addition to their approach to design and implement the Digital Library. Todd and Amber also detail how and why they used graph technologies to make certain their solution can continue to scale to meet the needs of Walmart’s massive workforce and evolving business needs.
Zach Wahl and Sara Mae O'Brien-Scott spoke at the 2022 Taxonomy Boot Camp in Washington, D.C. on taxonomy's critical role in delivering what every end user now expects—a seamless and personalized experience. Personalization is harnessed by the most successful organizations to anchor their content experience by allowing users to connect with content based on key characteristics. O’Brien-Scott and Wahl provided an understanding of how taxonomy powers personalization by detailing real-world use cases and best practices for taxonomy design for personalization. They discussed the personalization maturity scale, including how taxonomy lays the groundwork for enabling cutting-edge solutions such as recommendation engines, automated content assembly, and omnichannel delivery. They also shared expected outcomes of personalization such as increased conversion rates, a decrease in employee turnover, and stronger user engagement.
JPL’s Institutional Knowledge Graph II: A Foundation for Constructing Enterpr...Enterprise Knowledge
Previously at KMWorld 2021, EK joined JPL to share the vision, approach, and delivery of the Institutional Knowledge Graph (IKG), a centrally maintained, ever-evolving knowledge graph identifying and describing JPL’s enterprise-wide concepts, such as people, organizations, projects, and facilities, and the relationships between them. Since August 2020, the IKG has offered a single source of enterprise information that other JPL applications can leverage to reduce redundancy and out-of-date or inaccurate data. In production for 2 years and now with several releases under its belt, the IKG is beginning to fulfill its promise as a foundational layer in the semantic pyramid for additional taxonomies and knowledge graphs to build upon.
At KM World 2022, Bess Schrader, Senior Solutions Consultant at EK, and Ann Bernath, Software Systems Engineer at JPL, shared a follow-up to the IKG journey including a description of the Enterprise Semantic Platform, a look at new taxonomies and knowledge graphs at JPL (enterprise-wide, others specific to engineering, technical, or science domains) and how they are beginning to leverage the IKG’s foundation of JPL concepts to enrich their dataset into a broader context. This presentation discussed different techniques to federate or synchronize multiple knowledge graphs and how these diverse integrations benefit not only the new datasets, but also the IKG as it continues to pursue its overarching dream--providing answers to questions such as, “Who did what when?”, “Who should you call?”, and “Where is the Robotics Lab?”
Learning 360: Crafting a Comprehensive View of Learning by Using a GraphEnterprise Knowledge
Chris Marino, a Principal Solution Consultant at Enterprise Knowledge (EK), was a featured speaker at this year's Data Architecture Online event organized by Dataversity. Marino presented his webinar "Learning 360: Crafting a Comprehensive View of Learning Content Using a Graph" on July 20, 2022. In his presentation, Marino took participants through the entire Graph development process, including planning, designing, and developing the new tool, highlighting benefits to the organization and lessons learned throughout the process.
Making KM Clickable: The Rapidly Changing State of Knowledge ManagementEnterprise Knowledge
Initially delivered for the Bangalore K-Community Zoom Meetup: “The Digital Edge: Tech Roadmaps and Impacts on KM on June 15th, this deck covers the key takeaways from the leading Knowledge Management book, 'Making Knowledge Management Clickable,' by Zach Wahl and Joe Hilger of Enterprise Knowledge. The presentation covers definitions and value of KM, offers best practices on KM systems, details key types of KM technologies, and discusses some of the common types of KM solutions such as KM Portals and Knowledge Graphs.
How to Quickly Prototype a Scalable Graph Architecture: A Framework for Rapid...Enterprise Knowledge
Sara Nash and Thomas Mitrevski discuss the toolkit to scope and execute knowledge graph prototypes successfully in a matter of weeks. The framework discussed includes the development of a foundational semantic model (e.g. taxonomies/ontologies) and resources and skill sets needed for successful initiatives so that knowledge graph products can scale, as well as the data architecture and tooling required (e.g., orchestration and storage) for enterprise-scale implementation. This presentation was originally delivered at KGC 2022 in Boston, MA.
Translating AI from Concept to Reality: Five Keys to Implementing AI for Know...Enterprise Knowledge
Lulit Tesfaye explains how foundational knowledge management and knowledge engineering approaches can play a key role in ensuring enterprise Artificial Intelligence (AI) initiatives start right, quickly demonstrate business value, and “stick” within the organization. The presentation includes real world case studies and examples of how organizations are approaching their data and AI transformations through knowledge maturity models to translate organizational information and data into actionable and clickable solutions. Originally delivered at data.world Summit, Spring 2022.
This is the three-hour "Taxonomy 101" Presentation delivered at KMWorld 2021 (Virtual, KMWorld Connect). The presentation details taxonomy and ontology definitions, business value, and design methodologies. It also covers the concept of Knowledge Graphs in detail. Special attention is given to the differences between taxonomy and ontologies (both from a use and design perspective).
OmnichannelX 2021: How to Make Content a Maintainable Business Asset Through ...Enterprise Knowledge
This presentation was delivered by Yanko Ivanov, Principal Solution Consultant, on June 9th at the international OmnichannelX 2021 web conference. The content management discipline is constantly evolving, presenting content authors and strategists with new challenges in content maintenance efforts and delivering tailored user experiences through multi-channel publishing. In his presentation, Ivanov explained how to approach these challenges and explored the value of combining componentized content with a rich taxonomy and ontology.
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
Let's dive deeper into the world of ODC! Ricardo Alves (OutSystems) will join us to tell all about the new Data Fabric. After that, Sezen de Bruijn (OutSystems) will get into the details on how to best design a sturdy architecture within ODC.
"Impact of front-end architecture on development cost", Viktor TurskyiFwdays
I have heard many times that architecture is not important for the front-end. Also, many times I have seen how developers implement features on the front-end just following the standard rules for a framework and think that this is enough to successfully launch the project, and then the project fails. How to prevent this and what approach to choose? I have launched dozens of complex projects and during the talk we will analyze which approaches have worked for me and which have not.
Epistemic Interaction - tuning interfaces to provide information for AI supportAlan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Ramesh Iyer
In today's fast-changing business world, Companies that adapt and embrace new ideas often need help to keep up with the competition. However, fostering a culture of innovation takes much work. It takes vision, leadership and willingness to take risks in the right proportion. Sachin Dev Duggal, co-founder of Builder.ai, has perfected the art of this balance, creating a company culture where creativity and growth are nurtured at each stage.
Neuro-symbolic is not enough, we need neuro-*semantic*Frank van Harmelen
Neuro-symbolic (NeSy) AI is on the rise. However, simply machine learning on just any symbolic structure is not sufficient to really harvest the gains of NeSy. These will only be gained when the symbolic structures have an actual semantics. I give an operational definition of semantics as “predictable inference”.
All of this illustrated with link prediction over knowledge graphs, but the argument is general.
Search and Society: Reimagining Information Access for Radical FuturesBhaskar Mitra
The field of Information retrieval (IR) is currently undergoing a transformative shift, at least partly due to the emerging applications of generative AI to information access. In this talk, we will deliberate on the sociotechnical implications of generative AI for information access. We will argue that there is both a critical necessity and an exciting opportunity for the IR community to re-center our research agendas on societal needs while dismantling the artificial separation between the work on fairness, accountability, transparency, and ethics in IR and the rest of IR research. Instead of adopting a reactionary strategy of trying to mitigate potential social harms from emerging technologies, the community should aim to proactively set the research agenda for the kinds of systems we should build inspired by diverse explicitly stated sociotechnical imaginaries. The sociotechnical imaginaries that underpin the design and development of information access technologies needs to be explicitly articulated, and we need to develop theories of change in context of these diverse perspectives. Our guiding future imaginaries must be informed by other academic fields, such as democratic theory and critical theory, and should be co-developed with social science scholars, legal scholars, civil rights and social justice activists, and artists, among others.
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
Transcript: Selling digital books in 2024: Insights from industry leaders - T...BookNet Canada
The publishing industry has been selling digital audiobooks and ebooks for over a decade and has found its groove. What’s changed? What has stayed the same? Where do we go from here? Join a group of leading sales peers from across the industry for a conversation about the lessons learned since the popularization of digital books, best practices, digital book supply chain management, and more.
Link to video recording: https://bnctechforum.ca/sessions/selling-digital-books-in-2024-insights-from-industry-leaders/
Presented by BookNet Canada on May 28, 2024, with support from the Department of Canadian Heritage.
DevOps and Testing slides at DASA ConnectKari Kakkonen
My and Rik Marselis slides at 30.5.2024 DASA Connect conference. We discuss about what is testing, then what is agile testing and finally what is Testing in DevOps. Finally we had lovely workshop with the participants trying to find out different ways to think about quality and testing in different parts of the DevOps infinity loop.
PHP Frameworks: I want to break free (IPC Berlin 2024)Ralf Eggert
In this presentation, we examine the challenges and limitations of relying too heavily on PHP frameworks in web development. We discuss the history of PHP and its frameworks to understand how this dependence has evolved. The focus will be on providing concrete tips and strategies to reduce reliance on these frameworks, based on real-world examples and practical considerations. The goal is to equip developers with the skills and knowledge to create more flexible and future-proof web applications. We'll explore the importance of maintaining autonomy in a rapidly changing tech landscape and how to make informed decisions in PHP development.
This talk is aimed at encouraging a more independent approach to using PHP frameworks, moving towards a more flexible and future-proof approach to PHP development.
Essentials of Automations: Optimizing FME Workflows with ParametersSafe Software
Are you looking to streamline your workflows and boost your projects’ efficiency? Do you find yourself searching for ways to add flexibility and control over your FME workflows? If so, you’re in the right place.
Join us for an insightful dive into the world of FME parameters, a critical element in optimizing workflow efficiency. This webinar marks the beginning of our three-part “Essentials of Automation” series. This first webinar is designed to equip you with the knowledge and skills to utilize parameters effectively: enhancing the flexibility, maintainability, and user control of your FME projects.
Here’s what you’ll gain:
- Essentials of FME Parameters: Understand the pivotal role of parameters, including Reader/Writer, Transformer, User, and FME Flow categories. Discover how they are the key to unlocking automation and optimization within your workflows.
- Practical Applications in FME Form: Delve into key user parameter types including choice, connections, and file URLs. Allow users to control how a workflow runs, making your workflows more reusable. Learn to import values and deliver the best user experience for your workflows while enhancing accuracy.
- Optimization Strategies in FME Flow: Explore the creation and strategic deployment of parameters in FME Flow, including the use of deployment and geometry parameters, to maximize workflow efficiency.
- Pro Tips for Success: Gain insights on parameterizing connections and leveraging new features like Conditional Visibility for clarity and simplicity.
We’ll wrap up with a glimpse into future webinars, followed by a Q&A session to address your specific questions surrounding this topic.
Don’t miss this opportunity to elevate your FME expertise and drive your projects to new heights of efficiency.
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Identifying Security Risks Using Auto-Tagging and Text Analytics
1. Identifying Security Risks Using
Auto-Tagging & Text Analytics
Text Analytics Forum 2022
Joe Hilger and Sara Duane
2. ENTERPRISE KNOWLEDGE
Outline
EK at a Glance The Problem Our Approach Our Methodology
and Best
Practices
What You Will Learn
⬢ How to identify confidential information across an
enterprise
⬢ Best practices for leveraging and tuning auto-
tagging
⬢ How to design a taxonomy for auto-tagging
3. ⬢ 33 Years of Consulting Experience
⬢ Expert in Knowledge Management and
Knowledge Graph Technologies
⬢ Coauthor of Making KM Clickable (2022)
JOE
CTO AND COFOUNDER, ENTERPRISE KNOWLEDGE
HILGER
SARA
SENIOR TECHNICAL ANALYST, ENTERPRISE KNOWLEDGE
DUANE
⬢ Serves as project manager for technical
implementation and strategy projects
⬢ Conducted complex auto-tagging projects
for clients in both the commercial and federal
space
ENTERPRISE KNOWLEDGE
4. 10AREAS OF EXPERTISE
KM STRATEGY & DESIGN TAXONOMY & ONTOLOGY DESIGN
TECHNOLOGY SOLUTIONS AGILE, DESIGN THINKING, & FACILITATION
CONTENT & BRAND STRATEGY KNOWLEDGE GRAPHS, DATA MODELING, & AI
ENTERPRISE SEARCH INTEGRATED CHANGE MANAGEMENT
ENTERPRISE LEARNING CONTENT MANAGEMENT
80
+
EXPERT
CONSULTANTS
HEADQUARTERED IN WASHINGTON, DC,
USA
ESTABLISHED 2013 – OUR FOUNDERS AND PRINCIPALS HAVE BEEN PROVIDING
KNOWLEDGE MANAGEMENT CONSULTING TO GLOBAL CLIENTS FOR OVER 20 YEARS.
KMWORLD’S
100 COMPANIES THAT MATTER IN KM (2015, 2016, 2017, 2018,
2019, 2020, 2021, 2022)
TOP 50 TRAILBLAZERS IN AI (2020, 2021, 2022)
CIO REVIEW’S
20 MOST PROMISING KM SOLUTION PROVIDERS (2016)
INC MAGAZINE
#2,343 OF THE 5000 FASTEST GROWING COMPANIES (2021)
#2,574 OF THE 5000 FASTEST GROWING COMPANIES (2020)
#2,411 OF THE 5000 FASTEST GROWING COMPANIES (2019)
#1,289 OF THE 5000 FASTEST GROWING COMPANIES (2018)
INC MAGAZINE
BEST WORKPLACES (2018, 2019, 2021, 2022)
WASHINGTONIAN MAGAZINE’S
TOP 50 GREAT PLACES TO WORK (2017)
WASHINGTON BUSINESS JOURNAL’S
BEST PLACES TO WORK (2017, 2018, 2019, 2020)
ARLINGTON ECONOMIC DEVELOPMENT’S
FAST FOUR AWARD – FASTEST GROWING COMPANY (2016)
VIRGINIA CHAMBER OF COMMERCE’S
FANTASTIC 50 AWARD – FASTEST GROWING COMPANY
(2019, 2020)
AWARD-WINNING
CONSULTANCY
PRESENCE IN BRUSSELS, BELGIUM
EK At A Glance
STABLE CLIENT BASE
ENTERPRISE KNOWLEDGE
6. Problem Statement
At this federal research organization, researchers, proposal
authors, project managers, etc. all leverage project content, data,
and documentation on their shared drives.
They need to have a way to:
▪ Identify content that is controlled, CUI, or otherwise
sensitive
So that they can…
Move the relevant documents to a secure location
Prevent data loss and compliance issues
Ensure all documents have a classification
7. How Common Tools Solve the Problem
A lot of tools or solutions would solve this by
looking for PII information through pattern
recognition, including:
⬢ Using regex to identify the patterns behind PII
information, such as a phone number.
⬢ Identifying specific sensitivity labels within
the content itself, such as “top secret.”
These products and solutions don’t look for terms or categories of information
that reflect sensitive content. What if a piece of information within a document
is sensitive, but doesn’t contain the term “top secret” within it nor any identifiable
PII through pattern recognition?
8. Our Solution
Teaching
Technology
Identify the terms, words, and categories of information that
suggest secure information.
Develop a subject-oriented topic taxonomy of secure terms.
Conduct auto-tagging on documents with this subject-oriented
taxonomy to identify the secure content.
Leverage these tags and labels to begin the migration process.
1
2
3
4
9. What is a Taxonomy?
A taxonomy is a controlled vocabulary
used to describe or characterize explicit
concepts of information for the purpose
of capturing, managing, and
presenting.
Taxonomies are often driven by:
● Type of Content
● Medium
● Organization
● Purpose
● Topic (most relevant for our
approach)
11. Building Our Understanding
Conduct focus
groups with staff who
are creators, holders,
or consumers of
content to ensure a
complete
understanding of the
content they work
with and what
constitutes secure
information for them.
Analyze
documentation,
content, and data
that suggests secure
information as well as
documentation
without secure
information to
identify key topics.
Conduct a semantic
analysis of content
that identifies
significant terms
through a machine
learning algorithm
and can validate and
enhance the
designed taxonomy.
Focus Groups Document Review Corpus Analysis
Focus Groups
with Core
Team & SMEs
33+
Documents
Evaluated
287k
For this engagement, EK conducted a
thorough discovery phase:
12. Building the Taxonomy
Study Area Geography Method of Measure
Environment Application Content Type
EK used the field of environmental research to model what could be identified as secure information within a
specific domain.
The terms that made up these taxonomies were identified through focus groups with environmental research
SMEs, as well as four corpus analyses on subsets of relevant content.
The corpus analysis identified and added 37% of the taxonomy terms (i.e., terms and synonyms), thus
enriching the final POC taxonomy.
13. Solution Architecture
Project Solution Architecture
EK leveraged two main tools for this
POC:
o PoolParty: Hosted the taxonomy
and ontology, and via API, auto-
tagged the provided documents.
o GraphDB: Stored the documents
and their applied tags from the
taxonomy and ontology.
To successfully complete this
approach, EK created data pipelines
between the document storage
account, PoolParty, and GraphDB
using UnifiedViews, an ETL tool.
These pipelines facilitated the
necessary data transformation and
integration to power GraphSearch.
14. Visualizing Tags
⬢ EK leveraged PoolParty’s
GraphSearch server to allow
the organization to visualize
the results of the auto-tagging
process.
⬢ Users could filter and search
for documents based on the
identified tags.
⬢ During this phase, we could
visualize and analyze the
accuracy of the tags. View of PoolParty’s GraphSearch
17. Design Best Practices
Remember Your End User: A Machine
Design requirements for a machine are different than for a taxonomy leveraged by a human for
navigation, search, etc.
Granularity Is Important
The taxonomy should reflect the granularity of the content and get into the details of what is
presented in the content.
Synonyms at the Correct Level Are Your Friends
With relevant and accurate synonyms used correctly, auto-tagging can better parse
through the text and recognize what the content is about.
Ensure Taxonomy Terms are Reflective of the Content
The topics of your content items should help form the basis of your taxonomy.
19. Auto-tagging: An advanced application of taxonomy in which terms are automatically
applied to content as tags through text recognition, inheritance, or other automated means.
Basic level:
Searching the text for taxonomy
terms to apply, relying solely on the
term appearing in the content itself.
More complex level:
Using context and machine learning
to tag additional terms that may not
be in the content itself.
1
2
3
4
5
Metadata Inheritance
WHAT IS AUTO-TAGGING?
What Type of Auto-tagging Works for
Your Needs?
Migration Logic
NLP Extractor
ML Classification
Custom NER Models
20. AUTO-TAGGING WITH POOLPARTY
EXTRACTION
Auto-tagging is text extraction with
natural language processing (NLP) and
light machine learning (corpus scoring)
to score extracted concepts by a mix of
frequency, location in the document,
etc.
It’s important to understand both the
taxonomy and the content it will be
used to tag.
Auto-tagging will only tag well the
fields of the taxonomy that are
topical and well matched to the text
of the content items.
Core Components Necessary for Auto-
tagging:
● Synonym-rich taxonomy that is
aligned with the target content
● Taxonomy management tool
● “Learning” corpus capabilities
● Content management system with
target content
● Middle layer that can send content to
be tagged and then store the
suggested tags
Concept Extraction
21. Lemmatization and Stemming
Lemmatization reduces words to their common
base forms:
● am, are, is => be
● car, cars, car’s, cars’ => car
Stemming looks at the root of a word:
● accounts, accounting, accountant -> account
Concept extraction does not require that the exact term from the taxonomy be present in
the text. Techniques like stemming and lemmatization can help increase matches.
Important Note!
Stemming and lemmatization can be risky as they may
obscure real differences in meaning.
22. AUTO-TAGGING WITH POOLPARTY
EXTRACTION
Scoring methods:
● Frequency - the more often a term appears in a document, the higher it scores
● Location boosting - terms found in some locations in a document (for example,
the title), will have their score “boosted,” or weighted higher
● Term Frequency - Inverse Document Frequency (TF-IDF) scoring method
penalizes overly frequent terms and boosts rare terms. The frequency of a term
in a document is balanced against the frequency of that term across a
representative corpus of documents. For example, the most frequently used word
in many English documents is “the” - using TF-IDF scoring, this term will have a
low score
Scoring/Ranking Extraction
24. FINE-TUNING ITERATIVELY
● Blacklist
● Exact match
● Disambiguation
● Ontology
● Shadow concepts
● Corpus adjustment
● TF-IDF scoring
● F-score
Auto-tag
● Blacklist
● Exact match
● Synonyms
● Adjust taxonomy
● Prioritize content
segments (e.g., Title)
● Corpus scoring
Initial Fine-
tuning
Long-term
Fine-tuning
Initial Fine-
tuning
Long-term
Fine-tuning
Evaluate
Accuracy
Iterative Fine-tuning
You will need to conduct
multiple rounds, tweaking
the taxonomy and rules to
best fit the content you are
working with, and
evaluating the accuracy for
each round.
25. HOW TO ASSESS ACCURACY
GOLD STANDARD
ANECDOTAL
ACCURACY
F-SCORES AND IAA
(INTER ANNOTATOR
AGREEMENT)
How to Assess Accuracy
26. Q&A
Thank you for listening.
Questions?
JOE HILGER,
COO and Co-Founder of Enterprise
Knowledge
JHILGER@ENTERPRISE-KNOWLEDGE.COM
WWW.LINKEDIN.COM/IN/JOSEPH-HILGER/
SARA DUANE,
Senior Technical Analyst
SDUANE@ENTERPRISE-KNOWLEDGE.COM
WWW.LINKEDIN.COM/IN/SARA-DUANE/