1. Embodied Methods for Quick and Accurate Insights About Documents Using LLMs
Ben Goosman
2. About Me
Ben Goosman
Photo credits: Weidong Yang, Piper Werle
3. Problem
Documents take a long time to read
Easy to get lost in the details
LLMs hallucinate
Lack of trust in AI
4. Solution
Don’t rely completely on the AI
Involve the human in as many steps as possible
Knowledge Map, not Knowledge Graph
Why Map?
5. Methods
- Generating
- Infrastructure for bulk document analysis
- Knowledge map with the POLE model
- Make the LLM explain itself
- Provide definitions to the LLM
- Use examples to get desired output
- Allow human to change query
- It’s ok not to label everything
- At first, observations are nodes, not edges
- Use shortcut relationships
- Exploring
- Neo4j Full Text search
- Apply Force Layout in 2d and 3d
- Find central nodes
- Use path finding
- Zoom in and read
- Expand using Cypher
- Question answering with the graph
- Find relevant documents
- Find relevant knowledge map
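The exploration steps above (find central nodes, use path finding) can be sketched on a toy in-memory graph. This is an illustrative sketch only: `degree_centrality` and `shortest_path` are my own stand-ins for what the talk does in Cypher against Neo4j (e.g. `shortestPath()`), not the talk's code.

```python
from collections import deque

# Toy in-memory knowledge map (node -> set of neighbours). In the talk this
# lives in Neo4j and the same questions are asked in Cypher.
graph = {
    "Lily": {"Village"},
    "Village": {"Lily", "Mountains", "Market"},
    "Mountains": {"Village"},
    "Market": {"Village", "Baker"},
    "Baker": {"Market"},
}

def degree_centrality(g):
    """Rank nodes by degree -- a cheap proxy for 'find central nodes'."""
    return sorted(g, key=lambda n: len(g[n]), reverse=True)

def shortest_path(g, start, goal):
    """Breadth-first search, the idea behind Cypher's shortestPath()."""
    queue, seen = deque([[start]]), {start}
    while queue:
        path = queue.popleft()
        if path[-1] == goal:
            return path
        for nxt in g[path[-1]] - seen:
            seen.add(nxt)
            queue.append(path + [nxt])
    return None

print(degree_centrality(graph)[0])           # the most connected entity
print(shortest_path(graph, "Lily", "Baker"))
```

Ranking by degree is the simplest centrality; Neo4j's Graph Data Science library offers richer ones (PageRank, betweenness) if the map grows.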
7. Knowledge map with the POLE model
Find relationships involving entities of types {labels} in the text provided.
A relationship has a Source, Target, Explanation as to why these are in relation, and a Short relationship.
One of the Source or Target can be of a type not in the list {labels}, but not both.
The definitions are {str(definitions)}. If there are no relationships, don't say anything.
Some examples of your output are below.
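The prompt above interpolates `{labels}` and `{str(definitions)}`; a minimal sketch of how it might be assembled as an f-string (the function name `build_extraction_prompt` and the trimmed definitions are mine, not the talk's):

```python
# Trimmed-down definitions for illustration; the deck's full dict is richer.
definitions = {
    "Person": "An individual human being.",
    "Location": "A specific place or position.",
}

def build_extraction_prompt(labels, definitions):
    """Fill the slide's prompt template with a label list and definitions."""
    return (
        f"Find relationships involving entities of types {labels} in the text "
        "provided. A relationship has a Source, Target, Explanation as to why "
        "these are in relation, and a Short relationship. "
        f"One of the Source or Target can be of a type not in the list {labels}, "
        "but not both. "
        f"The definitions are {str(definitions)}. "
        "If there are no relationships, don't say anything. "
        "Some examples of your output are below."
    )

prompt = build_extraction_prompt(["Person", "Location"], definitions)
print(prompt)
```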
8. Make the LLM explain itself
See: Chain of Thought reasoning
Explanation: Lily lived in the village nestled
in the mountains.
Short: LIVED_IN
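Once the model answers in the Source/Target/Explanation/Short format, the reply has to be parsed back into rows before it can be loaded into the graph. A hedged sketch (the regex and `parse_relationships` are my own; it assumes each field sits on its own `Key: value` line, and keeps the slides' `Name | Type` annotations as plain strings):

```python
import re

def parse_relationships(reply):
    """Parse 'Source/Target/Explanation/Short' blocks from an LLM reply."""
    pattern = re.compile(
        r"Source:\s*(?P<source>.+)\n"
        r"Target:\s*(?P<target>.+)\n"
        r"Explanation:\s*(?P<explanation>.+)\n"
        r"Short:\s*(?P<short>\S+)"
    )
    return [m.groupdict() for m in pattern.finditer(reply)]

reply = """\
Source: Bruno Pusterla | Person
Target: Italian Agricultural Confederation | Organization
Explanation: Bruno Pusterla is a top official of the Italian Agricultural Confederation.
Short: WORKS_FOR
"""
rows = parse_relationships(reply)
print(rows[0]["short"])  # WORKS_FOR
```

A real reply may wrap the Explanation over several lines, so production parsing would need a more forgiving pattern; this shows the shape of the round trip.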
9. Provide definitions to the LLM
definitions = {
    "Person": "An individual human being. This can include but is not limited to information about their name, age, gender, occupation, nationality, and relationships.",
    "Organization": "A structured body of people with a particular purpose, especially a business, society, association, etc. This can include elements such as its name, founders, founding date, purpose, key people, and locations.",
    "Location": "A specific place or position. This includes geopolitical places like countries, cities, and towns, or smaller, specific places like buildings or landmarks. Information can cover elements such as its name, geographical coordinates, population, and relevant features.",
    "Event": "An occurrence of interest happening at a particular place and time. It can be historical, current, or future. It usually involves people or organizations, and takes place at a specific location. Information can include elements such as its name, date, location, participants, purpose, and outcomes.",
}
10. Use examples to get desired output
Your task is to generate {example_count} few-shot examples to train an LLM to identify the relationships between entities of types {labels} in a text in order to create a Knowledge Graph.
The few-shot examples should have the following structure, but adapted for the entities and relationships in question.
The definitions of the types are {str(definitions)}.
Follow the example format below, where each relationship has a Source, Target, Explanation, and Short.
Source: Bruno Pusterla | Person
Target: Italian Agricultural Confederation | Organization
Explanation: Bruno Pusterla is a top official of the Italian Agricultural Confederation.
Short: WORKS_FOR
11. Allow human to change query
Find relationships matching the given query, in the text provided.
Follow the example format. Each relationship must have a Source, Target, Explanation, and Short.
If there are no matches, don't say anything.
12. It’s ok not to label everything
Focus on a few labels at a time, and label
everything else “Entity”
Not trying to be WikiData
Compromise between Knowledge Graph and Mind Map
More accurate results with GPT-4
“Chain-of-thought (CoT) prompting is a technique that allows large language models (LLMs) to solve a problem as a series of intermediate steps[27] before giving a final answer. Chain-of-thought prompting improves reasoning ability by inducing the model to answer a multi-step problem with steps of reasoning that mimic a train of thought.[28][17][29] It allows large language models to overcome difficulties with some reasoning tasks that require logical thinking and multiple steps to solve, such as arithmetic or commonsense reasoning questions.[30][31][32]” https://en.wikipedia.org/wiki/Prompt_engineering
Observation nodes create separation in the graph layout
Observation nodes can be linked to the Source and Chunk
Observation nodes can be skipped later
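One possible Cypher shape for the observation pattern above: each extracted relationship becomes an Observation node linked to both entities and to the Chunk it was found in, and a shortcut relationship can later bypass it. The label, relationship-type, and property names here (and the sample `chunk_id`) are my assumptions, not the talk's exact schema:

```python
# Store the extracted relationship as an Observation *node*, not an edge.
OBSERVATION_CYPHER = """
MERGE (s:Entity {name: $source})
MERGE (t:Entity {name: $target})
MERGE (c:Chunk {id: $chunk_id})
CREATE (o:Observation {short: $short, explanation: $explanation})
CREATE (o)-[:ABOUT_SOURCE]->(s)
CREATE (o)-[:ABOUT_TARGET]->(t)
CREATE (o)-[:FROM_CHUNK]->(c)
"""

# Parameters for one observation from the Lily example.
params = {
    "source": "Lily",
    "target": "the village",
    "short": "LIVED_IN",
    "explanation": "Lily lived in the village nestled in the mountains.",
    "chunk_id": "doc-1#chunk-3",  # illustrative id
}

# Later, Observation nodes can be skipped by adding a shortcut edge
# directly between the two entities.
SHORTCUT_CYPHER = """
MATCH (o:Observation)-[:ABOUT_SOURCE]->(s), (o)-[:ABOUT_TARGET]->(t)
CREATE (s)-[:RELATED {short: o.short}]->(t)
"""
```

With the official `neo4j` Python driver, each string would be run as `session.run(OBSERVATION_CYPHER, params)`.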