Sara Nash and Urmi Majumder, Principal Consultants at Enterprise Knowledge, presented on April 19, 2023 at KM World in Washington D.C. on the topic of Scaling Knowledge Graph Architectures with AI.
In this presentation, Sara and Urmi defined a Knowledge Graph architecture and reviewed how AI can support the creation and growth of Knowledge Graphs. Drawing from their experience in designing enterprise Knowledge Graphs based on knowledge embedded in unstructured content, Sara and Urmi defined approaches for entity and relationship extraction depending on Enterprise AI maturity and highlighted other key considerations to incorporate AI capabilities into the development of a Knowledge Graph.
View presentation below in order to learn about how:
Assess entity and relationship extraction readiness according to EK’s Extraction Maturity Spectrum and Relationship Extraction Maturity Spectrum.
Utilize knowledge extraction from content to gather important insights into organizational data.
Extract knowledge with three approaches:
RedEx Rule, Auto-Classification Rule, Custom ML Model
Examine key factors such as how to leverage SMEs, iterate AI processes, define use cases, and invest in establishing robust AI models.
GraphQL is a query language for APIs and a runtime for fulfilling those queries. It gives clients the power to ask for exactly what they need, which makes it a great fit for modern web and mobile apps. In this talk, we explain why GraphQL was created, introduce you to the syntax and behavior, and then show how to use it to build powerful APIs for your data. We will also introduce you to AWS AppSync, a GraphQL-powered serverless backend for apps, which you can use to host GraphQL APIs and also add real-time and offline capabilities to your web and mobile apps. You can follow along if you have an AWS account – no GraphQL experience required!
Level: Beginner
Speaker: Rohan Deshpande - Sr. Software Dev Engineer, AWS Mobile Applications
Funnel Analysis with Apache Spark and DruidDatabricks
Every day, millions of advertising campaigns are happening around the world.
As campaign owners, measuring the ongoing campaign effectiveness (e.g “how many distinct users saw my online ad VS how many distinct users saw my online ad, clicked it and purchased my product?”) is super important.
However, this task (often referred to as “funnel analysis”) is not an easy task, especially if the chronological order of events matters.
One way to mitigate this challenge is combining Apache Druid and Apache DataSketches, to provide fast analytics on large volumes of data.
However, while that combination can answer some of these questions, it still can’t answer the question “how many distinct users viewed the brand’s homepage FIRST and THEN viewed product X page?”
In this talk, we will discuss how we combine Spark, Druid and DataSketches to answer such questions at scale.
GraphQL as an alternative approach to REST (as presented at Java2Days/CodeMon...luisw19
Originally designed by Facebook to allow its mobile clients to define exactly what data should be send back by an API and therefore avoid unnecessary roundtrips and data usage, GraphQL is a JSON based query language for Web APIs. Since it was open sourced by Facebook in 2015, it has undergone very rapid adoption and many companies have already switch to the GraphQL way of building APIs – see http://GraphQL.org/users.
However, with some many hundreds of thousands of REST APIs publicly available today (and many thousands others available internally), what are the implications of moving to GraphQL? Is it really worth the effort of replacing REST APIs specially if they’re successful and performing well in production? What are the pros/cons of using GraphQL? What tools / languages can be used for GraphQL? What about API Gateways? What about API design?
With a combination of rich content and hands-on demonstrations, attend this session for a point of view on how address these and many other questions, and most importantly get a better understanding and when/where/why/if GraphQL applies for your organisation or specific use case.
The Best Startup Investor Pitch Deck & How to Present to Angels & Venture Cap...J. Skyler Fernandes
Take the online video course on Udemy:
https://www.udemy.com/course/the-best-startup-investor-pitch-deck/?referralCode=A5ED0FBD65120A93A16E
3.5+hrs of video content, walking step by step each part of the pitch, with personal VC stories, examples, and advice.
The "Best" Startup Investor Pitch Deck is an aggregation of some of the best pitch decks and wisdom from some of the top angels, VCs, and entrepreneurs including my own person insight/experience. The slide deck includes a template for entrepreneurs to use to present to investors, with details on what should be addressed on each slide. There are also additional slides on how best to pitch to investors effectively, how to design and format slides, and what to do before the pitch.
Making Apache Spark Better with Delta LakeDatabricks
Delta Lake is an open-source storage layer that brings reliability to data lakes. Delta Lake offers ACID transactions, scalable metadata handling, and unifies the streaming and batch data processing. It runs on top of your existing data lake and is fully compatible with Apache Spark APIs.
In this talk, we will cover:
* What data quality problems Delta helps address
* How to convert your existing application to Delta Lake
* How the Delta Lake transaction protocol works internally
* The Delta Lake roadmap for the next few releases
* How to get involved!
GraphQL is a query language for APIs and a runtime for fulfilling those queries. It gives clients the power to ask for exactly what they need, which makes it a great fit for modern web and mobile apps. In this talk, we explain why GraphQL was created, introduce you to the syntax and behavior, and then show how to use it to build powerful APIs for your data. We will also introduce you to AWS AppSync, a GraphQL-powered serverless backend for apps, which you can use to host GraphQL APIs and also add real-time and offline capabilities to your web and mobile apps. You can follow along if you have an AWS account – no GraphQL experience required!
Level: Beginner
Speaker: Rohan Deshpande - Sr. Software Dev Engineer, AWS Mobile Applications
Funnel Analysis with Apache Spark and DruidDatabricks
Every day, millions of advertising campaigns are happening around the world.
As campaign owners, measuring the ongoing campaign effectiveness (e.g “how many distinct users saw my online ad VS how many distinct users saw my online ad, clicked it and purchased my product?”) is super important.
However, this task (often referred to as “funnel analysis”) is not an easy task, especially if the chronological order of events matters.
One way to mitigate this challenge is combining Apache Druid and Apache DataSketches, to provide fast analytics on large volumes of data.
However, while that combination can answer some of these questions, it still can’t answer the question “how many distinct users viewed the brand’s homepage FIRST and THEN viewed product X page?”
In this talk, we will discuss how we combine Spark, Druid and DataSketches to answer such questions at scale.
GraphQL as an alternative approach to REST (as presented at Java2Days/CodeMon...luisw19
Originally designed by Facebook to allow its mobile clients to define exactly what data should be send back by an API and therefore avoid unnecessary roundtrips and data usage, GraphQL is a JSON based query language for Web APIs. Since it was open sourced by Facebook in 2015, it has undergone very rapid adoption and many companies have already switch to the GraphQL way of building APIs – see http://GraphQL.org/users.
However, with some many hundreds of thousands of REST APIs publicly available today (and many thousands others available internally), what are the implications of moving to GraphQL? Is it really worth the effort of replacing REST APIs specially if they’re successful and performing well in production? What are the pros/cons of using GraphQL? What tools / languages can be used for GraphQL? What about API Gateways? What about API design?
With a combination of rich content and hands-on demonstrations, attend this session for a point of view on how address these and many other questions, and most importantly get a better understanding and when/where/why/if GraphQL applies for your organisation or specific use case.
The Best Startup Investor Pitch Deck & How to Present to Angels & Venture Cap...J. Skyler Fernandes
Take the online video course on Udemy:
https://www.udemy.com/course/the-best-startup-investor-pitch-deck/?referralCode=A5ED0FBD65120A93A16E
3.5+hrs of video content, walking step by step each part of the pitch, with personal VC stories, examples, and advice.
The "Best" Startup Investor Pitch Deck is an aggregation of some of the best pitch decks and wisdom from some of the top angels, VCs, and entrepreneurs including my own person insight/experience. The slide deck includes a template for entrepreneurs to use to present to investors, with details on what should be addressed on each slide. There are also additional slides on how best to pitch to investors effectively, how to design and format slides, and what to do before the pitch.
Making Apache Spark Better with Delta LakeDatabricks
Delta Lake is an open-source storage layer that brings reliability to data lakes. Delta Lake offers ACID transactions, scalable metadata handling, and unifies the streaming and batch data processing. It runs on top of your existing data lake and is fully compatible with Apache Spark APIs.
In this talk, we will cover:
* What data quality problems Delta helps address
* How to convert your existing application to Delta Lake
* How the Delta Lake transaction protocol works internally
* The Delta Lake roadmap for the next few releases
* How to get involved!
Flink Forward San Francisco 2022.
At Flink Forward, we get to hear creative, unique use cases, often on the bleeding edge of some of the most exciting current technologies. This talk will give you a chance to get to open up the hood on our driven and innovative Open Source community. I will cover what our community has been working on this past year, and how this work relates to our (Ververica's) exciting new Flink engineering roadmap! I will also go through some best practices and upcoming opportunities for getting involved in this community!
by
Caito Scherr
Building Streaming Data Pipelines with Google Cloud Dataflow and Confluent Cl...HostedbyConfluent
We will demonstrate how easy it is to use Confluent Cloud as the data source of your Beam pipelines. You will learn how to process the information that comes from Confluent Cloud in real time, make transformations on such information and feed it back to your Kafka topics and other parts of your architecture.
Trino: A Ludicrously Fast Query Engine - Pulsar Summit NA 2021StreamNative
You may be familiar with the Presto plugin used to run fast interactive queries over Pulsar using ANSI SQL and can be joined with other data sources. This plugin will soon get a rename to align with the rename of the PrestoSQL project to Trino. What is the purpose of this rename and what does it mean for those using the Presto plugin? We cover the history of the community shift from PrestoDB to PrestoSQL, as well as, the future plans for the Pulsar community to donate this plugin to the Trino project. One of the connector maintainers will then demo the connector and show what is possible when using Trino and Pulsar!
Searching for AI - Leveraging Solr for classic Artificial Intelligence tasksAlexandre Rafalovitch
Apache Solr was always built on strong Information Retrieval/Natural Language Processing foundation. And, in recent versions, even more Artificial Intelligence features, techniques and integrations were added to the Solr.
This presentation covers some classic (and hidden gems) AI elements that Solr supported for long time as well as the most recent features that are not even fully documented yet.
The presentation was made with references to Solr 7.4.
Alastria Digital Identity: the Spanish Blockchain solution for SSI - Carlos P...SSIMeetup
Alastria is an association to foster the implementation of a Spanish national blockchain whose nodes are run by Alastria members. Alastria gathers over 250 cross-industry members and was initiated by the some of the biggest corporations in Spain. Alastria ID proposes an implementation of the Self Sovereign Identity paradigm over a public-permissioned Blockchain and will be presented by Carlos Pastor, Alastria’s Digital Identity Commission Leader, in this webinar from SSIMeetup.org. Alastria ID vision is to become the cornerstone of a legally binding ID for members and final users, giving users complete control over their personal data. Alastria ID not only strives to be “GDPR compliant”, but also to become the best and easiest way to fulfill GDPR user rights, providing a full-fledged Identity management solution from identity creation to attestation and claim management, including consent as well as issuer revocation and user deletion rights.
These slides present how DBT, Coral, and Iceberg can provide a novel data management experience for defining SQL workflows. In this UX, users define their workflows as a cascade of SQL queries, which then get auto-materialized and incrementally maintained. Applications of this user experience include Declarative DAG workflows, streaming/batch convergence, and materialized views.
Apache Spark™ is a fast and general engine for large-scale data processing. Spark is written in Scala and runs on top of JVM, but Python is one of the officially supported languages. But how does it actually work? How can Python communicate with Java / Scala? In this talk, we’ll dive into the PySpark internals and try to understand how to write and test high-performance PySpark applications.
Operationalizing Machine Learning at Scale at StarbucksDatabricks
As ML-driven innovations are propelled by the Self-Service capabilities in the Enterprise Data and Analytics Platform, teams face a significant entry barrier and productivity issues in moving from POCs to Operating ML-powered apps at scale in production.
Squirreling Away $640 Billion: How Stripe Leverages Flink for Change Data Cap...Flink Forward
Flink Forward San Francisco 2022.
Being in the payments space, Stripe requires strict correctness and freshness guarantees. We rely on Flink as the natural solution for delivering on this in support of our Change Data Capture (CDC) infrastructure. We heavily rely on CDC as a tool for capturing data change streams from our databases without critically impacting database reliability, scalability, and maintainability. Data derived from these streams is used broadly across the business and powers many of our critical financial reporting systems totalling over $640 Billion in payment volume annually. We use many components of Flink’s flexible DataStream API to perform aggregations and abstract away the complexities of stream processing from our downstreams. In this talk, we’ll walk through our experience from the very beginning to what we have in production today. We’ll share stories around the technical details and trade-offs we encountered along the way.
by
Jeff Chao
A Thorough Comparison of Delta Lake, Iceberg and HudiDatabricks
Recently, a set of modern table formats such as Delta Lake, Hudi, Iceberg spring out. Along with Hive Metastore these table formats are trying to solve problems that stand in traditional data lake for a long time with their declared features like ACID, schema evolution, upsert, time travel, incremental consumption etc.
This presentation contains differences between Elasticsearch and relational Databases. Along with that it also has some Glossary Of Elasticsearch and its basic operation.
Dynamic Partition Pruning in Apache SparkDatabricks
In data analytics frameworks such as Spark it is important to detect and avoid scanning data that is irrelevant to the executed query, an optimization which is known as partition pruning. Dynamic partition pruning occurs when the optimizer is unable to identify at parse time the partitions it has to eliminate. In particular, we consider a star schema which consists of one or multiple fact tables referencing any number of dimension tables. In such join operations, we can prune the partitions the join reads from a fact table by identifying those partitions that result from filtering the dimension tables. In this talk we present a mechanism for performing dynamic partition pruning at runtime by reusing the dimension table broadcast results in hash joins and we show significant improvements for most TPCDS queries.
In this knolx session, we will come to know about Delta Lake and its features. Delta Lake is one of the greatest innovations by Databricks that makes existing data lakes more scalable and reliable. Delta Lake is an open source storage layer that brings reliability to data lakes. Delta Lake provides ACID transactions, scalable metadata handling, and unifies streaming and batch data processing. Delta Lake runs on top of our existing data lake and is fully compatible with Apache Spark APIs.
Exploring Java Heap Dumps (Oracle Code One 2018)Ryan Cuprak
Memory leaks are not always simple or easy to find. Heap dumps from production systems are often gigantic (4+ gigs) with millions of objects in memory. Simple spot checking with traditional tools is woefully inadequate in these situations, especially with real data. Leaks can be entire object graphs with enormous amounts of noise. This session will show you how to build custom tools using the Apache NetBeans Profiler/Heapwalker APIs. Using these APIs, you can read and analyze Java heaps programmatically to ask really hard questions. This gives you the power to analyze complex object graphs with tens of thousands of objects in seconds.
Oracle Machine Learning Overview and From Oracle Data Professional to Oracle ...Charlie Berger
DBAs spend too time with routine tasks leaving little time for innovation. Autonomous Databases free data professionals to extract more value from data. Oracle Machine Learning, in Autonomous Database, “moves the algorithms; not the data” for 100% in-database processing. Data professionals perform many supporting tasks for “data scientists”, typically 80% of the work. Come learn an evolutionary path for Oracle data professionals to leverage domain knowledge and data skills and add machine learning. See how to build and deploy predictive models inside the Database. Using examples, demos and sharing experiences, Charlie will show you how to discover new insights, make predictions and become an “Oracle Data Scientist” in just 6 weeks!
Sara Mae O’Brien Scott and Tatiana Baquero Cakici, Senior Consultants at Enterprise Knowledge (EK), presented “AI Fast Track to Search-Focused AI Solutions” at the Information Architecture Conference (IAC24) that took place on April 11, 2024 in Seattle, WA.
In their presentation, O’Brien-Scott and Cakici focused on what Enterprise AI is, why it is important, and what it takes to empower organizations to get started on a search-based AI journey and stay on track. The presentation explored the complexities of enterprise search challenges and how IA principles can be leveraged to provide AI solutions through the use of a semantic layer. O’Brien-Scott and Cakici showcased a case study where a taxonomy, an ontology, and a knowledge graph were used to structure content at a healthcare workforce solutions organization, providing personalized content recommendations and increasing content findability.
In this session, participants gained insights about the following:
Most common types of AI categories and use cases;
Recommended steps to design and implement taxonomies and ontologies, ensuring they evolve effectively and support the organization’s search objectives;
Taxonomy and ontology design considerations and best practices;
Real-world AI applications that illustrated the value of taxonomies, ontologies, and knowledge graphs; and
Tools, roles, and skills to design and implement AI-powered search solutions.
Flink Forward San Francisco 2022.
At Flink Forward, we get to hear creative, unique use cases, often on the bleeding edge of some of the most exciting current technologies. This talk will give you a chance to get to open up the hood on our driven and innovative Open Source community. I will cover what our community has been working on this past year, and how this work relates to our (Ververica's) exciting new Flink engineering roadmap! I will also go through some best practices and upcoming opportunities for getting involved in this community!
by
Caito Scherr
Building Streaming Data Pipelines with Google Cloud Dataflow and Confluent Cl...HostedbyConfluent
We will demonstrate how easy it is to use Confluent Cloud as the data source of your Beam pipelines. You will learn how to process the information that comes from Confluent Cloud in real time, make transformations on such information and feed it back to your Kafka topics and other parts of your architecture.
Trino: A Ludicrously Fast Query Engine - Pulsar Summit NA 2021StreamNative
You may be familiar with the Presto plugin used to run fast interactive queries over Pulsar using ANSI SQL and can be joined with other data sources. This plugin will soon get a rename to align with the rename of the PrestoSQL project to Trino. What is the purpose of this rename and what does it mean for those using the Presto plugin? We cover the history of the community shift from PrestoDB to PrestoSQL, as well as, the future plans for the Pulsar community to donate this plugin to the Trino project. One of the connector maintainers will then demo the connector and show what is possible when using Trino and Pulsar!
Searching for AI - Leveraging Solr for classic Artificial Intelligence tasksAlexandre Rafalovitch
Apache Solr was always built on strong Information Retrieval/Natural Language Processing foundation. And, in recent versions, even more Artificial Intelligence features, techniques and integrations were added to the Solr.
This presentation covers some classic (and hidden gems) AI elements that Solr supported for long time as well as the most recent features that are not even fully documented yet.
The presentation was made with references to Solr 7.4.
Alastria Digital Identity: the Spanish Blockchain solution for SSI - Carlos P...SSIMeetup
Alastria is an association to foster the implementation of a Spanish national blockchain whose nodes are run by Alastria members. Alastria gathers over 250 cross-industry members and was initiated by the some of the biggest corporations in Spain. Alastria ID proposes an implementation of the Self Sovereign Identity paradigm over a public-permissioned Blockchain and will be presented by Carlos Pastor, Alastria’s Digital Identity Commission Leader, in this webinar from SSIMeetup.org. Alastria ID vision is to become the cornerstone of a legally binding ID for members and final users, giving users complete control over their personal data. Alastria ID not only strives to be “GDPR compliant”, but also to become the best and easiest way to fulfill GDPR user rights, providing a full-fledged Identity management solution from identity creation to attestation and claim management, including consent as well as issuer revocation and user deletion rights.
These slides present how DBT, Coral, and Iceberg can provide a novel data management experience for defining SQL workflows. In this UX, users define their workflows as a cascade of SQL queries, which then get auto-materialized and incrementally maintained. Applications of this user experience include Declarative DAG workflows, streaming/batch convergence, and materialized views.
Apache Spark™ is a fast and general engine for large-scale data processing. Spark is written in Scala and runs on top of JVM, but Python is one of the officially supported languages. But how does it actually work? How can Python communicate with Java / Scala? In this talk, we’ll dive into the PySpark internals and try to understand how to write and test high-performance PySpark applications.
Operationalizing Machine Learning at Scale at StarbucksDatabricks
As ML-driven innovations are propelled by the Self-Service capabilities in the Enterprise Data and Analytics Platform, teams face a significant entry barrier and productivity issues in moving from POCs to Operating ML-powered apps at scale in production.
Squirreling Away $640 Billion: How Stripe Leverages Flink for Change Data Cap...Flink Forward
Flink Forward San Francisco 2022.
Being in the payments space, Stripe requires strict correctness and freshness guarantees. We rely on Flink as the natural solution for delivering on this in support of our Change Data Capture (CDC) infrastructure. We heavily rely on CDC as a tool for capturing data change streams from our databases without critically impacting database reliability, scalability, and maintainability. Data derived from these streams is used broadly across the business and powers many of our critical financial reporting systems totalling over $640 Billion in payment volume annually. We use many components of Flink’s flexible DataStream API to perform aggregations and abstract away the complexities of stream processing from our downstreams. In this talk, we’ll walk through our experience from the very beginning to what we have in production today. We’ll share stories around the technical details and trade-offs we encountered along the way.
by
Jeff Chao
A Thorough Comparison of Delta Lake, Iceberg and HudiDatabricks
Recently, a set of modern table formats such as Delta Lake, Hudi, Iceberg spring out. Along with Hive Metastore these table formats are trying to solve problems that stand in traditional data lake for a long time with their declared features like ACID, schema evolution, upsert, time travel, incremental consumption etc.
This presentation contains differences between Elasticsearch and relational Databases. Along with that it also has some Glossary Of Elasticsearch and its basic operation.
Dynamic Partition Pruning in Apache SparkDatabricks
In data analytics frameworks such as Spark it is important to detect and avoid scanning data that is irrelevant to the executed query, an optimization which is known as partition pruning. Dynamic partition pruning occurs when the optimizer is unable to identify at parse time the partitions it has to eliminate. In particular, we consider a star schema which consists of one or multiple fact tables referencing any number of dimension tables. In such join operations, we can prune the partitions the join reads from a fact table by identifying those partitions that result from filtering the dimension tables. In this talk we present a mechanism for performing dynamic partition pruning at runtime by reusing the dimension table broadcast results in hash joins and we show significant improvements for most TPCDS queries.
In this knolx session, we will come to know about Delta Lake and its features. Delta Lake is one of the greatest innovations by Databricks that makes existing data lakes more scalable and reliable. Delta Lake is an open source storage layer that brings reliability to data lakes. Delta Lake provides ACID transactions, scalable metadata handling, and unifies streaming and batch data processing. Delta Lake runs on top of our existing data lake and is fully compatible with Apache Spark APIs.
Exploring Java Heap Dumps (Oracle Code One 2018)Ryan Cuprak
Memory leaks are not always simple or easy to find. Heap dumps from production systems are often gigantic (4+ gigs) with millions of objects in memory. Simple spot checking with traditional tools is woefully inadequate in these situations, especially with real data. Leaks can be entire object graphs with enormous amounts of noise. This session will show you how to build custom tools using the Apache NetBeans Profiler/Heapwalker APIs. Using these APIs, you can read and analyze Java heaps programmatically to ask really hard questions. This gives you the power to analyze complex object graphs with tens of thousands of objects in seconds.
Oracle Machine Learning Overview and From Oracle Data Professional to Oracle ...Charlie Berger
DBAs spend too time with routine tasks leaving little time for innovation. Autonomous Databases free data professionals to extract more value from data. Oracle Machine Learning, in Autonomous Database, “moves the algorithms; not the data” for 100% in-database processing. Data professionals perform many supporting tasks for “data scientists”, typically 80% of the work. Come learn an evolutionary path for Oracle data professionals to leverage domain knowledge and data skills and add machine learning. See how to build and deploy predictive models inside the Database. Using examples, demos and sharing experiences, Charlie will show you how to discover new insights, make predictions and become an “Oracle Data Scientist” in just 6 weeks!
Sara Mae O’Brien Scott and Tatiana Baquero Cakici, Senior Consultants at Enterprise Knowledge (EK), presented “AI Fast Track to Search-Focused AI Solutions” at the Information Architecture Conference (IAC24) that took place on April 11, 2024 in Seattle, WA.
In their presentation, O’Brien-Scott and Cakici focused on what Enterprise AI is, why it is important, and what it takes to empower organizations to get started on a search-based AI journey and stay on track. The presentation explored the complexities of enterprise search challenges and how IA principles can be leveraged to provide AI solutions through the use of a semantic layer. O’Brien-Scott and Cakici showcased a case study where a taxonomy, an ontology, and a knowledge graph were used to structure content at a healthcare workforce solutions organization, providing personalized content recommendations and increasing content findability.
In this session, participants gained insights about the following:
Most common types of AI categories and use cases;
Recommended steps to design and implement taxonomies and ontologies, ensuring they evolve effectively and support the organization’s search objectives;
Taxonomy and ontology design considerations and best practices;
Real-world AI applications that illustrated the value of taxonomies, ontologies, and knowledge graphs; and
Tools, roles, and skills to design and implement AI-powered search solutions.
Introduction to Machine Learning - WeCloudDataWeCloudData
In this talk, WeCloudData introduces the lifecycle of machine learning and its tools/ecosystems. For more detail about WeCloudData's machine learning course please visit: https://weclouddata.com/data-science/
Exploring Data Modeling Techniques in Modern Data Warehousespriyanka rajput
This article delves deep into data modeling techniques in modern data warehouses, shedding light on their significance and various approaches. If you are aspiring to be a data analyst or data scientist, understanding data modeling is essential, making a Data Analytics Course in Bangalore, Lucknow, Bangalore, Pune, Delhi, Mumbai, Gandhinagar, and other cities across India an attractive proposition.
AUSOUG - NZOUG-GroundBreakers-Jun 2019 - AI and Machine LearningSandesh Rao
Autonomous Database is one of the hottest Oracle products where we have attempted to use Machine Learning for several aspects of the service. This presentation takes a view on our current state of Diagnostic methodology in the Autonomous Database Cloud services and how do we process this data to find anomalies in them to troubleshoot them at a scale of several petabytes a year and conduct AIOps. Some of the use cases we will cover are a Log Anomaly timeline which we reduce significant amounts of logs using semi-supervised machine learning techniques to reduce logs and match them in near real time. We will cover techniques to analyze database issues using Machine learning techniques like Kmeans , TFIDF, Random Forests, and z-scores to predict if a spike in the CPU is a normal or abnormal spike. We will also talk about RNN’s with LSTM/GRU as some of the applications of how to predict faults before they happen. Some of the other use cases are to use convolution filters to determine maintenance windows within the database workloads, determine best times to do database backups, security anomaly timelines and many others. This is a production service and this can be used if you have a customer SR/defect today. The service is much more extensive inside the Oracle Autonomous Database Cloud. This presentation will accompany several examples with how to apply these techniques, machine learning knowledge is preferred but not a prerequisite
Climbing the Ontology Mountain to Achieve a Successful Knowledge GraphEnterprise Knowledge
Tatiana Baquero Cakici, Senior KM Consultant, and Jennifer Doughty, Senior Solution Consultant from Enterprise Knowledge’s Data and Information Management (DIME) Division presented at the Taxonomy Boot Camp (KMWorld 2022) on November 17, 2022. KMWorld is the world’s leading knowledge management event that takes place every year in Washington, DC.
Their presentation “Climbing the Ontology Mountain to Achieve a Successful Knowledge Graph” focused on how ontologies have gained momentum as a strong foundation for resolving business challenges through semantic search solutions, recommendation engines, and AI strategies. Cakici and Doughty explained that taxonomists are now faced with the challenge of gaining knowledge and experience in designing and documenting complex solutions that involve the integration of taxonomies, ontologies, and knowledge graphs. They also emphasized that taxonomists are well poised to learn how to design user-centric ontologies, analyze and map data from various systems, and understand the technological architecture of knowledge graph solutions. After describing the key roles and responsibilities needed for a team to successfully implement Knowledge Graph projects, Cakici and Doughty shared practical ontology design considerations and best practices based on their own experience. Lastly, Cakici and Doughty reviewed the most common use cases for knowledge graphs and presented real world applications through a case study that illustrated ontology design and the value of knowledge graphs.
The PoolParty Semantic Classifier is a component of the Semantic Suite, which makes use of machine learning in combination with Knowledge Graphs.
We discuss the potential of the fusion of machine learning, neuronal networks, and knowledge graphs based on use cases and this concrete technology offering.
We introduce the term 'Semantic AI' that refers to the combined usage of various AI methods.
ANIn Gurugram April 2024 |Agile Adaptation: Driving Progress in Generative AI...AgileNetwork
Agile Network India - Gurugram
Title: Agile Adaptation: Driving Progress in Generative AI Projects by Sujata Bhutani
Date: 20th April 2024
Hosted by: The NorthCap University
This workshop presentation from Enterprise Knowledge team members Joe Hilger, Founder and COO, and Sara Nash, Technical Analyst, was delivered on June 8, 2020 as part of the Data Summit 2020 virtual conference. The 3-hour workshop provided an interdisciplinary group of participants with a definition of what a knowledge graph is, how it is implemented, and how it can be used to increase the value of your organization’s datas. This slide deck gives an overview of the KM concepts that are necessary for the implementation of knowledge graphs as a foundation for Enterprise Artificial Intelligence (AI). Hilger and Nash also outlined four use cases for knowledge graphs, including recommendation engines and natural language query on structured data.
Data Analytics Course In Bangalore-AugustDataMites
Data analytics courses are educational programs designed to teach individuals the skills and techniques needed to work with data, analyze it, and extract meaningful insights.
For More Details: https://datamites.com/data-analytics-certification-course-training-bangalore/
Data analytics courses are educational programs designed to teach individuals the skills and techniques needed to work with data, analyze it, and extract meaningful insights.
For More Details: https://datamites.com/data-analytics-certification-course-training-delhi/
Data analytics courses are educational programs designed to teach individuals the skills and techniques needed to work with data, analyze it, and extract meaningful insights.
For More Details: https://datamites.com/data-analytics-certification-course-training-mumbai/
Data Analytics Course Curriculum_ What to Expect and How to Prepare in 2023.pdfNeha Singh
In 2023, aspiring data analysts can expect comprehensive data analytics course curriculums covering essential topics like statistical analysis, data visualization, machine learning, and big data processing. To prepare for the course, brushing up on basic mathematics, programming, and data handling skills would be beneficial.
How to classify documents automatically using NLPSkyl.ai
About the webinar
Documents come in different shapes and sizes - From technical documents, customer support chat, emails, reviews to news articles - all of them contain information that is valuable to the business.
Managing these large volume data documents in a traditional manual way has been a complex and time-consuming task that requires enormous human efforts.
In this webinar, we will discuss how Machine learning can be used to identify and automatically label news articles into categories like business, politics, music, etc. This can be applied in another context like categorizing emails, reviews, and processing text documents, etc.
What you will learn
- How businesses are leveraging document classification to their advantage
- Best practice to automate machine learning models in hours not months
- Demo: Classify news articles into the right category using convolution neural network
Purpose of this presentation is to highlight how end to end machine learning looks like in real world enterprise. This is to provide insight to aspiring data scientist who have been through courses or education in ML that mostly focus on ML algorithms and not end to end pipeline.
Architecture and components mentioned in Slide 11 will be discussed in detailed in series of post on LinkedIn over the course of next few month
To get updates on this follow me on LinkedIn or search/follow hashtag #end2endDS. Post will be active in August 2019 and will be posted till September 2019
Similar to Scaling Knowledge Graph Architectures with AI (20)
Heather Hedden, Senior Consultant at Enterprise Knowledge, presented “Enterprise Knowledge Graphs: The Importance of Semantics” on May 9, 2024, at the annual Data Summit in Boston.
In her presentation, Hedden describes the components of an enterprise knowledge graph and provides further insight into the semantic layer – or knowledge model – component, which includes an ontology and controlled vocabularies, such as taxonomies, for controlled metadata. While data experts tend to focus on the graph database components (RDF triple store or a label property graph), Hedden emphasizes they should not overlook the importance of the semantic layer.
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
Enterprise Knowledge’s Urmi Majumder, Principal Data Architecture Consultant, and Fernando Aguilar Islas, Senior Data Science Consultant, presented "Driving Behavioral Change for Information Management through Data-Driven Green Strategy" on March 27, 2024 at Enterprise Data World (EDW) in Orlando, Florida.
In this presentation, Urmi and Fernando discussed a case study describing how the information management division in a large supply chain organization drove user behavior change through awareness of the carbon footprint of their duplicated and near-duplicated content, identified via advanced data analytics. Check out their presentation to gain valuable perspectives on utilizing data-driven strategies to influence positive behavioral shifts and support sustainability initiatives within your organization.
In this session, participants gained answers to the following questions:
- What is a Green Information Management (IM) Strategy, and why should you have one?
- How can Artificial Intelligence (AI) and Machine Learning (ML) support your Green IM Strategy through content deduplication?
- How can an organization use insights into their data to influence employee behavior for IM?
- How can you reap additional benefits from content reduction that go beyond Green IM?
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
Heather Hedden, Senior Consultant at Enterprise Knowledge, presented “The Role of Taxonomy and Ontology in Semantic Layers” at a webinar hosted by Progress Semaphore on April 16, 2024.
Taxonomies at their core enable effective tagging and retrieval of content, and combined with ontologies they extend to the management and understanding of related data. There are even greater benefits of taxonomies and ontologies to enhance your enterprise information architecture when applying them to a semantic layer. A survey by DBP-Institute found that enterprises using a semantic layer see their business outcomes improve by four times, while reducing their data and analytics costs. Extending taxonomies to a semantic layer can be a game-changing solution, allowing you to connect information silos, alleviate knowledge gaps, and derive new insights.
Hedden, who specializes in taxonomy design and implementation, presented how the value of taxonomies shouldn’t reside in silos but be integrated with ontologies into a semantic layer.
Learn about:
- The essence and purpose of taxonomies and ontologies in information and knowledge management;
- Advantages of semantic layers leveraging organizational taxonomies; and
- Components and approaches to creating a semantic layer, including the integration of taxonomies and ontologies
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
With the explosive popularity of ChatGPT, organizations are throwing massive budgets and executive attention at the implementation of AI technologies. Making these solutions work for the enterprise can deliver competitive advantage and open up new solutions and business opportunities that were never before possible. However, without the right Information Architecture (IA) foundations, these projects are bound to fail. In this presentation, Marino and Galdamez provided practical, actionable steps around IA that organizations can take in preparation for future AI solutions.
In this session, attendees:
- Reviewed key elements of IA and discovered how their successful design and implementation can lay the foundations for AI;
- Learned basic terminology surrounding AI, as well as different techniques and applications of AI in enterprise environments;
- Gained a deeper understanding of the feedback loops between IA and AI and the corresponding implications on user experience; and
- Received practical advice on IA design to facilitate its implementation and the success of AI efforts.
Heather Hedden, Senior Consultant at Enterprise Knowledge, presented "An Overview of Taxonomies and AI" on January 30th, 2024, in the inaugural webinar of the Artificial Intelligence webinar series: The promise and the perils,” hosted by the Knowledge & Information Management Group of CILIP, the library and information association of the UK. In her presentation, Heather explained, with examples, how both generative AI and other AI technologies support taxonomy development and use and how taxonomies can support AI applications.
Explore the presentation to learn:
Why both top-down and bottom-up methods are needed in taxonomy creation
What AI methods are used for auto-tagging and auto-classification with taxonomies
How AI methods can extract candidate terms for taxonomy creation
How generative AI can be used for certain bottom-up taxonomy development tasks
How AI can be used to analyze a taxonomy against a corpus of documents
How generative AI can be used in queries to analyze a taxonomy
What AI applications taxonomies can support
Nonprofit KM Journey to Success: Lessons and Learnings at Feeding AmericaEnterprise Knowledge
Sara Duane, Senior Consultant within EK’s Strategic Consulting practice, and EK client Tom Summerfelt, former Chief Research Officer at Feeding America, presented on November 7, 2023 at KMWorld. The talk, “Nonprofit KM Journey to Success: Lessons & Learnings at Feeding America” focused on best practices for designing and implementing KM strategies that directly align with nonprofit organizational goals.
Duane and Summerfelt used their first-hand experience developing a multi-year comprehensive KM Strategy for Feeding America to outline real-world considerations and examples of:
Unique KM challenges faced by organizations in the nonprofit space
Considerations for strategic priorities and KM roadmaps for nonprofits
How to describe the business impact of KM for nonprofits
EK presented with Kate Vilches, Knowledge Management Lead at Ulteig, on November 6, 2022 at the Taxonomy Boot Camp Conference, co-located with KMWorld, in Washington, D.C. The talk, “Taxonomy Roller Coasters: Techniques to Keep Stakeholders on the Ride,” focused on proven stakeholder management techniques during enterprise taxonomy development and launch activities.
Gray and Vilches used their firsthand experience to relate advice, share practical tools, and provide real-life examples to ensure successful stakeholder involvement, reinforcing three key themes for attendees:
How to select partners and build coalitions to ensure long term success;
Overview of the steps, stages, challenges, and thrills of defining and implementing an enterprise taxonomy; and
The importance and finesse of effective change management efforts to ensure that stakeholders begin and remain excited and involved throughout the project.
DGIQ - Case Studies_ Applications of Data Governance in the Enterprise (Final...Enterprise Knowledge
Thomas Mitrevski, Senior Data Management and Governance Consultant and
Lulit Tesfaye, Partner and Vice President of Knowledge and Data Services
presented “Case Studies: Applications of Data Governance in the Enterprise” on December 6th, 2023 at DGIQ in Washington D.C.
In this presentation, Thomas and Lulit detailed their experiences developing strategies for multiple enterprise-scale data initiatives and provided an understanding of common data governance and maturity needs. Thomas and Lulit based their talk on real-world examples and case studies and provided the audience with examples of achieving buy-in to invest in governance tools and processes, as well as the expected return on investment (ROI).
Check out the presentation below to learn:
How Leading Organizations are Benchmarking Their Data Governance Maturity
Why End-User Training was Imperative in Seeing Scaled Governance Program Adoption
Which Tools and Frameworks were Critical in Getting Started with Data Governance
How Organizations Achieved Success with Data Governance in Under 12 Weeks
What Successful Data Governance Implementation Roadmaps Really Look Like
This presentation was delivered by EK CEO Zach Wahl at the 2023 Midwest KM Symposium in Kent State, Ohio. The presentation defines Knowledge Management and its value. It also covers key industry trends and outcomes.
Building for the Knowledge Management Archetypes at Your CompanyEnterprise Knowledge
Building for the KM Archetypes at Your Company
Taylor Paschal, Knowledge and Information Management Consultant at Enterprise Knowledge, and Jessica Malloy, Senior Knowledge Manager at Harvard Business Publishing presented on April 19, 2023 at the APQC Conference in Houston, Texas on the topic of Building for the KM Archetypes at Your Company. In this presentation, Jessica and Taylor define common types of personalities that are often present when building a KM program. Jessica and Taylor prompted attendees to think through the root causes of various behaviors and the approaches for taking these into account when driving KM forward in round table discussions supported by this worksheet (link). Attendees left with the ability to:
Describe the importance of focusing on the unique culture of an organization when building and iterating on a KM program
Recognize organizational archetypes and know how to adapt their KM program to them
Conduct a cultural assessment of their own organization to ensure their KM program is meeting them where they are
Knowledge Graphs are Worthless, Knowledge Graph Use Cases are PricelessEnterprise Knowledge
At Knowledge Graph Forum 2022, Lulit Tesfaye and Sara Nash, Senior Consultant discuss the importance of establishing valuable and actionable use cases for knowledge graph efforts. The discussion draws on lessons learned from several knowledge graph development efforts to define how to diagnose a bad use case and outlined their impact on initiatives - including strained relationships with stakeholders, time spent reworking priorities, and team turnover. They also share guidance on how to navigate these scenarios and provide a checklist to assess a strong use case.
For KM practitioners, Agile frameworks have long been important for optimizing stakeholder value and satisfaction in KM initiatives. Over 20 years ago, a group of software developers revolutionized their field by introducing the Agile Manifesto to guide their industry in adopting Agile values, frameworks, and practices. However, until now, KM practitioners have lacked a formal framework demonstrating how to apply Agility to KM. In short, it is time to codify these Agile principles in a manner suited for the KM profession. Leveraging the original Agile Manifesto for inspiration, Andrew Politi and Megan Salerno introduced “The Agile KM Manifesto” at KM World 2022. The presentation is designed to initiate a conversation amongst KM practitioners across the industry about this initial version of the Agile KM Manifesto (the 'AKM'), and solicit feedback on future iterations.
Next, the presenters walked through three EK case studies demonstrating how the application of its principles could have saved significant time in those initiatives.
First, we described how a global non-profit approached EK to address duplicate and outdated content, and the lack of content creation standards.
Applicable AKM principle: "Content should only be available to users if it is new, essential, reliable, dynamic, and reusable. If these criteria are not met, the content must be cleaned-up or archived accordingly.”"
Next was a discussion of how national nuclear research laboratory struggled to share and discover knowledge from retiring employees and compartmentalized silos.
Applicable AKM principle: “Tacit knowledge and expertise should be proactively and formally captured and stored in the same manner as explicit knowledge.”
Finally, the presenters described how one of the largest multinational athletic apparel companies struggled to help geographically separated teams collectively and collaboratively reuse knowledge and create content across the globe, even functionally similar focus roles.
Applicable AKM principle: “All KM efforts must leverage a common language. Develop, socialize, and employ a common KM language so stakeholders don't speak past each other and can maintain consensus throughout your KM effort.”
Ultimately, this presentation served to introduce The AKM to the broader community, demonstrate its value, and solicit input from across the industry.
Road Maps & Roadblocks to Federal Electronic Records ManagementEnterprise Knowledge
Angela Pitts, Sr. Consultant at Enterprise Knowledge, and Dave Simmons, Sr. Records Officer at General Services Administration (GSA), presented a case study in federal electronic records management that detailed the success of the GSA's Enterprise Document Management Solution (EDMS). They detailed the strategies used to identify elements of organizational change management required to successfully transition standard functions of records management (RM)—capture, maintenance, disposal, transfer, assignment of metadata, and reporting—from manual, paper-based practices to more efficient and less costly electronic systems.
Records Management is a necessary component of successful Knowledge Management as it systematically manages valuable content created and owned by the business. With technological advancements, most agencies have seen the volume of document records increase exponentially because they are now frequently born and managed as digital content through the records lifecycle. Acknowledging the challenge of managing more content with fewer people, Angela and Dave explained how the design of GSA's lean and agile systems and workflows enabled the agency to reduce the resources and attention needed to manage content collections while maintaining legal compliance and quality standards.
Building an Innovative Learning Ecosystem at Scale with Graph TechnologiesEnterprise Knowledge
Todd Fahlberg of Enterprise Knowledge, and Amber Simpson, a Senior Manager at Walmart Academy, presented on November 9, 2022 at the KMWorld Conference in Washington, DC on the topic of Building an Innovative Learning Ecosystem at Scale with Graph Technologies. In this presentation, Todd and Amber share how they’re making it easier for Walmart’s learning organization to manage content used by 2.4 million global associates with a custom Digital Library. The presentation provides insight into the challenges they faced and the lessons they learned along the way, in addition to their approach to design and implement the Digital Library. Todd and Amber also detail how and why they used graph technologies to make certain their solution can continue to scale to meet the needs of Walmart’s massive workforce and evolving business needs.
Identifying Security Risks Using Auto-Tagging and Text AnalyticsEnterprise Knowledge
On Thursday, November 10, Joe Hilger and Sara Duane spoke at Text Analytics Forum about identifying secure and confidential information using auto-tagging. Information security continues to grow in importance in today's society. We hear stories all of the time about hackers accessing private information from companies and government agencies. Every organization struggles with employees who store confidential information on insecure network drives or cloud drives. Joe and Sara did a project with a federal research organization that used auto-tagging and text analytics to identify confidential information that needed to be moved to a secure location. During the presentation, we shared the approach we took to identify this information and how we made sure that the tagging and text analytics were accurate. Attendees learned best practices for designing a taxonomy for auto-tagging and tuning auto-tagging as well as ways to identify confidential information across the enterprise.
Zach Wahl and Sara Mae O'Brien-Scott spoke at the 2022 Taxonomy Boot Camp in Washington, D.C. on taxonomy's critical role in delivering what every end user now expects—a seamless and personalized experience. Personalization is harnessed by the most successful organizations to anchor their content experience by allowing users to connect with content based on key characteristics. O’Brien-Scott and Wahl provided an understanding of how taxonomy powers personalization by detailing real-world use cases and best practices for taxonomy design for personalization. They discussed the personalization maturity scale, including how taxonomy lays the groundwork for enabling cutting-edge solutions such as recommendation engines, automated content assembly, and omnichannel delivery. They also shared expected outcomes of personalization such as increased conversion rates, a decrease in employee turnover, and stronger user engagement.
JPL’s Institutional Knowledge Graph II: A Foundation for Constructing Enterpr...Enterprise Knowledge
Previously at KMWorld 2021, EK joined JPL to share the vision, approach, and delivery of the Institutional Knowledge Graph (IKG), a centrally maintained, ever-evolving knowledge graph identifying and describing JPL’s enterprise-wide concepts, such as people, organizations, projects, and facilities, and the relationships between them. Since August 2020, the IKG has offered a single source of enterprise information that other JPL applications can leverage to reduce redundancy and out-of-date or inaccurate data. In production for 2 years and now with several releases under its belt, the IKG is beginning to fulfill its promise as a foundational layer in the semantic pyramid for additional taxonomies and knowledge graphs to build upon.
At KM World 2022, Bess Schrader, Senior Solutions Consultant at EK, and Ann Bernath, Software Systems Engineer at JPL, shared a follow-up to the IKG journey including a description of the Enterprise Semantic Platform, a look at new taxonomies and knowledge graphs at JPL (enterprise-wide, others specific to engineering, technical, or science domains) and how they are beginning to leverage the IKG’s foundation of JPL concepts to enrich their dataset into a broader context. This presentation discussed different techniques to federate or synchronize multiple knowledge graphs and how these diverse integrations benefit not only the new datasets, but also the IKG as it continues to pursue its overarching dream--providing answers to questions such as, “Who did what when?”, “Who should you call?”, and “Where is the Robotics Lab?”
Learning 360: Crafting a Comprehensive View of Learning by Using a GraphEnterprise Knowledge
Chris Marino, a Principal Solution Consultant at Enterprise Knowledge (EK), was a featured speaker at this year's Data Architecture Online event organized by Dataversity. Marino presented his webinar "Learning 360: Crafting a Comprehensive View of Learning Content Using a Graph" on July 20, 2022. In his presentation, Marino took participants through the entire Graph development process, including planning, designing, and developing the new tool, highlighting benefits to the organization and lessons learned throughout the process.
Making KM Clickable: The Rapidly Changing State of Knowledge ManagementEnterprise Knowledge
Initially delivered for the Bangalore K-Community Zoom Meetup: “The Digital Edge: Tech Roadmaps and Impacts on KM on June 15th, this deck covers the key takeaways from the leading Knowledge Management book, 'Making Knowledge Management Clickable,' by Zach Wahl and Joe Hilger of Enterprise Knowledge. The presentation covers definitions and value of KM, offers best practices on KM systems, details key types of KM technologies, and discusses some of the common types of KM solutions such as KM Portals and Knowledge Graphs.
How to Quickly Prototype a Scalable Graph Architecture: A Framework for Rapid...Enterprise Knowledge
Sara Nash and Thomas Mitrevski discuss the toolkit to scope and execute knowledge graph prototypes successfully in a matter of weeks. The framework discussed includes the development of a foundational semantic model (e.g. taxonomies/ontologies) and resources and skill sets needed for successful initiatives so that knowledge graph products can scale, as well as the data architecture and tooling required (e.g., orchestration and storage) for enterprise-scale implementation. This presentation was originally delivered at KGC 2022 in Boston, MA.
State of ICS and IoT Cyber Threat Landscape Report 2024 previewPrayukth K V
The IoT and OT threat landscape report has been prepared by the Threat Research Team at Sectrio using data from Sectrio, cyber threat intelligence farming facilities spread across over 85 cities around the world. In addition, Sectrio also runs AI-based advanced threat and payload engagement facilities that serve as sinks to attract and engage sophisticated threat actors, and newer malware including new variants and latent threats that are at an earlier stage of development.
The latest edition of the OT/ICS and IoT security Threat Landscape Report 2024 also covers:
State of global ICS asset and network exposure
Sectoral targets and attacks as well as the cost of ransom
Global APT activity, AI usage, actor and tactic profiles, and implications
Rise in volumes of AI-powered cyberattacks
Major cyber events in 2024
Malware and malicious payload trends
Cyberattack types and targets
Vulnerability exploit attempts on CVEs
Attacks on counties – USA
Expansion of bot farms – how, where, and why
In-depth analysis of the cyber threat landscape across North America, South America, Europe, APAC, and the Middle East
Why are attacks on smart factories rising?
Cyber risk predictions
Axis of attacks – Europe
Systemic attacks in the Middle East
Download the full report from here:
https://sectrio.com/resources/ot-threat-landscape-reports/sectrio-releases-ot-ics-and-iot-security-threat-landscape-report-2024/
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Jeffrey Haguewood
Sidekick Solutions uses Bonterra Impact Management (fka Social Solutions Apricot) and automation solutions to integrate data for business workflows.
We believe integration and automation are essential to user experience and the promise of efficient work through technology. Automation is the critical ingredient to realizing that full vision. We develop integration products and services for Bonterra Case Management software to support the deployment of automations for a variety of use cases.
This video focuses on the notifications, alerts, and approval requests using Slack for Bonterra Impact Management. The solutions covered in this webinar can also be deployed for Microsoft Teams.
Interested in deploying notification automations for Bonterra Impact Management? Contact us at sales@sidekicksolutionsllc.com to discuss next steps.
Neuro-symbolic is not enough, we need neuro-*semantic*Frank van Harmelen
Neuro-symbolic (NeSy) AI is on the rise. However, simply machine learning on just any symbolic structure is not sufficient to really harvest the gains of NeSy. These will only be gained when the symbolic structures have an actual semantics. I give an operational definition of semantics as “predictable inference”.
All of this illustrated with link prediction over knowledge graphs, but the argument is general.
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
Key Trends Shaping the Future of Infrastructure.pdfCheryl Hung
Keynote at DIGIT West Expo, Glasgow on 29 May 2024.
Cheryl Hung, ochery.com
Sr Director, Infrastructure Ecosystem, Arm.
The key trends across hardware, cloud and open-source; exploring how these areas are likely to mature and develop over the short and long-term, and then considering how organisations can position themselves to adapt and thrive.
Accelerate your Kubernetes clusters with Varnish CachingThijs Feryn
A presentation about the usage and availability of Varnish on Kubernetes. This talk explores the capabilities of Varnish caching and shows how to use the Varnish Helm chart to deploy it to Kubernetes.
This presentation was delivered at K8SUG Singapore. See https://feryn.eu/presentations/accelerate-your-kubernetes-clusters-with-varnish-caching-k8sug-singapore-28-2024 for more details.
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...UiPathCommunity
💥 Speed, accuracy, and scaling – discover the superpowers of GenAI in action with UiPath Document Understanding and Communications Mining™:
See how to accelerate model training and optimize model performance with active learning
Learn about the latest enhancements to out-of-the-box document processing – with little to no training required
Get an exclusive demo of the new family of UiPath LLMs – GenAI models specialized for processing different types of documents and messages
This is a hands-on session specifically designed for automation developers and AI enthusiasts seeking to enhance their knowledge in leveraging the latest intelligent document processing capabilities offered by UiPath.
Speakers:
👨🏫 Andras Palfi, Senior Product Manager, UiPath
👩🏫 Lenka Dulovicova, Product Program Manager, UiPath
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf91mobiles
91mobiles recently conducted a Smart TV Buyer Insights Survey in which we asked over 3,000 respondents about the TV they own, aspects they look at on a new TV, and their TV buying preferences.
UiPath Test Automation using UiPath Test Suite series, part 3DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 3. In this session, we will cover desktop automation along with UI automation.
Topics covered:
UI automation Introduction,
UI automation Sample
Desktop automation flow
Pradeep Chinnala, Senior Consultant Automation Developer @WonderBotz and UiPath MVP
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
PHP Frameworks: I want to break free (IPC Berlin 2024)Ralf Eggert
In this presentation, we examine the challenges and limitations of relying too heavily on PHP frameworks in web development. We discuss the history of PHP and its frameworks to understand how this dependence has evolved. The focus will be on providing concrete tips and strategies to reduce reliance on these frameworks, based on real-world examples and practical considerations. The goal is to equip developers with the skills and knowledge to create more flexible and future-proof web applications. We'll explore the importance of maintaining autonomy in a rapidly changing tech landscape and how to make informed decisions in PHP development.
This talk is aimed at encouraging a more independent approach to using PHP frameworks, moving towards a more flexible and future-proof approach to PHP development.
Let's dive deeper into the world of ODC! Ricardo Alves (OutSystems) will join us to tell all about the new Data Fabric. After that, Sezen de Bruijn (OutSystems) will get into the details on how to best design a sturdy architecture within ODC.
Transcript: Selling digital books in 2024: Insights from industry leaders - T...BookNet Canada
The publishing industry has been selling digital audiobooks and ebooks for over a decade and has found its groove. What’s changed? What has stayed the same? Where do we go from here? Join a group of leading sales peers from across the industry for a conversation about the lessons learned since the popularization of digital books, best practices, digital book supply chain management, and more.
Link to video recording: https://bnctechforum.ca/sessions/selling-digital-books-in-2024-insights-from-industry-leaders/
Presented by BookNet Canada on May 28, 2024, with support from the Department of Canadian Heritage.
3. ⬢ Serves as implementation lead for knowledge graphs - ranging
from early design and prototyping to enterprise solutions
⬢ Expert in Knowledge Graph and Semantic Technologies
⬢ Established standards in design and delivery of semantic
recommender solutions
SARA
PRINCIPAL CONSULTANT, ENTERPRISE KNOWLEDGE
NASH
URMI
PRINCIPAL CONSULTANT, ENTERPRISE KNOWLEDGE
MAJUMDER
⬢ Expert in system architecture, design, and implementation
of semantic enterprise solutions
⬢ Leads the development of technical solutions in support of
a wide variety of both federal and commercial clients
ENTERPRISE KNOWLEDGE
4. ENTERPRISE KNOWLEDGE
A knowledge graph is a specialized
graph of the things we want to
describe and how they are related.
● Standardize data entities and
enrich data with context.
● Can be expanded by leveraging
various approaches, including
AI-driven entity and
relationship extraction.
Most of the data in our world is
unstructured.
Unstructured data:
1) Has no metadata
2) Can’t be captured neatly in
structured formats like XML, JSON,
or relational databases; and,
3) Lacks standardization, which
prevents establishing uniform
processes for analysis.
The Challenge The Knowledge Graph
5. Creating Knowledge from Unstructured Content
There is a vast amount of information embedded in documents, reports, records, process flow
diagrams, and more. There is an opportunity to extract this knowledge and make
meaningful connections to accelerate knowledge discovery across teams.
Extract Knowledge from Content Organize Knowledge in Logical Structure Get Insights from Data
ABC
Material
XYZ
Product
Process 1
Process Step
E1
Experiment
F2
Material
isInput
isOutput
E0
Experiment
isInput isInput
creates
What Materials were used
to make XYZ?
● ABC was used in Process 1
What are the experiments
in which ABC was used?
● E0
● E1
Which experiment of ABC
was used to make F2?
● E1
6. ENTERPRISE KNOWLEDGE
Natural Language
Processing (NLP)
model leveraged to
build a Knowledge
Graph (KG) for
providing coherent
and relevant learning
content
recommendations.
ML model was used to
facilitate KG
generating dynamic
automated
regulatory reporting,
and expediting
research and
publication processes.
Learning Enablement Safety Standards Regulatory Reporting and More
Discovery Analysis Research
Machine Learning
(ML) model was
trained to allow KG to
facilitate thorough
analyses of possible
risks, and help
planners plan the
best safety measures
for mitigation.
● Product Marketing
● E-Commerce
● Content Cleanup
● Data Discovery in
Research
Top Graph Use Cases: https://enterprise-knowledge.com/top-graph-use-cases-and-enterprise-applications-with-real-world-examples/
Success Stories
7. Source Data and
Content
Taxonomy/
Schema Storage
Entity and
Relationship
Extraction
Enterprise
Content and Data
Dedicated
Taxonomy/Ontology
Management System
Auto-tagging and/or
Extraction of Key
Knowledge
Enriched Content
Storage
Persistent Graph
Storage
Data Orchestration
Front End
Visualization /
UI
API
AI
Search
Chatbots/
Q&A
Data Visualization
and Reporting
Recommender
Systems
Solutions Architecture for Scalable Knowledge Graphs
8. Source Data and
Content
Taxonomy/
Schema Storage
Enterprise
Content and Data
Dedicated
Taxonomy/Ontology
Management System
Enriched Content
Storage
Persistent Graph
Storage
Data Orchestration
Front End
Visualization /
UI
API
Search
Chatbots/
Q&A
Data Visualization
and Reporting
Recommender
Systems
AI accelerates extracting entities and relationships at
scale from unstructured enterprise data. This is
increasingly possible due to advances in the natural
language processing space.
Entity and
Relationship
Extraction
Auto-tagging and/or
Extraction of Key
Knowledge
AI
Solutions Architecture for Scalable Knowledge Graphs
9. Transformational
Institutional
Unaware of
how AI is being
adopted across
organizations.
Hopeful about
the promise of
AI and its
impact on
business.
Some AI/ML
models are in
use for specific
use cases.
AI solutions are
supporting
shared use
cases across the
organization.
AI is part of
business DNA,
transforming
infrastructure
and processes
to improve
efficiency while
optimizing
costs.
Operational
Experimental
Not Ready
(Pre-AI)
ENTERPRISE KNOWLEDGE
AI Maturity Spectrum for the Enterprise
10. …with
Automated
Monitoring
and Retraining
Entity Extraction Maturity Spectrum
Regular
Expression
Based
(RegEx)
Auto
Classification
Custom ML
Model …for
Entity
Extraction
…with Active
Learning
Definition
Taxonomy driven
categorization of
content
Definition
Traditional supervised
learning approach for
text classification
Considerations
Highly dependent on
training data
Definition
Model is re-trained
periodically based on
human feedback
Considerations
Increased text
classification accuracy
Definition
Model is automatically
re-trained, tested, and
deployed
Considerations
Recommended for
large orgs with
established DataOps
processes
Definition
Use patterns of
characters and
operators to match
text
Considerations
Requires explicit
definition of rules,
and may lead to false
positives
Considerations
Limited to the
terms defined in the
taxonomy
Transformational
Institutional
Operational
Experimental
Not Ready
(Pre-AI)
11. RegEx Rule: Knowledge Graph
University is a Service because
[*]University is a Service.
This content on EK’s site is rich with knowledge
that can be extracted through different
approaches.
Auto-Classification Rule:
● EKGU is a synonym for Enterprise
Knowledge Graph University and this
article is about EKGU.
● Information Analyst is a Role, so
Information Analyst may take EKGU
Custom ML Model
● SPARQL and SHACL are
frameworks
● Taxonomy, ontology, and Knowledge
Graphs are semantic models
● Graph database is a tool
Entity Extraction in Action
12. …with
Automated
Monitoring
and Retraining
Source
Schema
Based
Rule Based
Custom ML
Model …for
Relationship
Extraction
…with Active
Learning
Definition
Custom rule set,
borrowing relationships
from standard formats
Considerations
Relies on maintaining
rules and may lead to
false positives
Definition
Traditional supervised
learning for classifying
text between two
entities
Considerations
Highly dependent on
training data
Relationship Extraction Maturity Spectrum
Definition
Model is automatically
re-trained, tested, and
deployed
Considerations
Recommended for
large orgs with
established DataOps
processes
Definition
Model is re-trained
periodically based on
human feedback
Considerations
Increased text
classification accuracy
Definition
Exploit the schema
(JSON, XML, etc.) of the
source system
Considerations
Requires explicit
mapping in data source
between entities to
assign relationships
Transformational
Institutional
Operational
Experimental
Not Ready
(Pre-AI)
13. AI Maturity Spectrum for the Enterprise Revisited
Transformational
Institutional
Operational
Experimental
Not Ready
(Pre-AI)
ENTERPRISE KNOWLEDGE
Use pattern
matching for
deterministic
entity and
relationship
extraction
Design a
starter
taxonomy &
use it for
taxonomy
driven graph
instantiation
Use pre-trained
ML models for
probabilistic
entity and
relationship
extraction
Fine-tune
pretrained ML
models based
on SME
feedback to
update graph
Monitor
information
extraction
quality to
automatically
retrain ML
model
14. Be iterative Start small, then iteratively refine and expand AI
integration.
Involve your SMEs SMEs can help validate AI performance and provide
feedback, ensuring accurate and relevant
improvements.
Define consumer-
facing use cases
Use cases should address pain points or challenges
faced by your target audience.
Invest in quality Invest in establishing robust AI models and
structured content and data that align with your use
cases.
In order to best incorporate AI capabilities into your Knowledge Graph
pipeline, there are several key factors to consider:
Key Considerations
15. ENTERPRISE KNOWLEDGE
Any Questions?
Thank you for listening.
We are happy to take any
questions at this time.
Sara Nash
snash@enterprise-knowledge.com
www.linkedin.com/in/sara-g-nash/
Urmi Majumder
umajumder@enterprise-knowledge.com
www.linkedin.com/in/urmim/
How prepared is your organization
for AI? Take EK’s AI Maturity
Assessment:
https://s.enterprise-knowledge.com/ekaiassessment