1 | © Copyright 2024 Zilliz
Stefan Webb
Developer Advocate, Zilliz
stefan.webb@zilliz.com
https://www.linkedin.com/in/stefan-webb
What Makes Deep Research?
A Dive into AI Agents
CONTENTS
01 New Kid on the Block: Deep Research
02 Demo of Zilliz’s DeepSearcher
03 What makes it “tick”?
04 Where to from here?
01
New Kid on the Block:
Deep Research
What was new about Deep Research?
● Wasn’t first in this category
○ Google’s Deep Research released in December 2024
● Focus on end-to-end training of chain-of-thought
reasoning with RL?
● Something else?
Research Agents
Iteration
“…learned to plan and
execute a multi-step
trajectory…”
“…backtracking and reacting
to real-time information…”
“…pivoting as needed in
reaction to information it
encounters…”
Search
“…trained using end-to-end
reinforcement learning on hard
browsing and reasoning tasks
across a range of domains…”
“…optimized for web browsing
and data analysis…”
Reasoning
“…fine-tuned on the upcoming
OpenAI o3 reasoning model…”
“…leverages reasoning to search,
interpret, and analyze massive
amounts of text…”
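
The three ingredients quoted above (iteration, search, reasoning) can be sketched as a loop. This is a minimal illustration, not how any particular product is implemented: `research_loop`, `toy_search`, and `toy_next_query` are hypothetical names, the "search engine" is a tiny in-memory dictionary, and the "reasoning" step is a hand-written rule standing in for a model call.

```python
from typing import Callable, List, Optional

def research_loop(
    question: str,
    search: Callable[[str], List[str]],
    next_query: Callable[[str, List[str]], Optional[str]],
    max_steps: int = 5,
) -> List[str]:
    """Search, inspect what came back, and decide whether to search again."""
    notes: List[str] = []
    query: Optional[str] = question
    for _ in range(max_steps):           # hard cap so the loop always ends
        if query is None:                # the "reasoning" step found no gap
            break
        notes.extend(search(query))      # search step
        query = next_query(question, notes)  # pivot based on new information
    return notes

# Hypothetical stand-ins for a search tool and a reasoning model.
CORPUS = {
    "deep research": ["Deep research agents iterate over search results."],
    "iteration": ["Iteration lets the agent pivot on new information."],
}

def toy_search(query: str) -> List[str]:
    return CORPUS.get(query, [])

def toy_next_query(question: str, notes: List[str]) -> Optional[str]:
    # Ask one follow-up if nothing about pivoting has been found yet.
    if not any("pivot" in note for note in notes):
        return "iteration"
    return None

notes = research_loop("deep research", toy_search, toy_next_query)
```

In a real agent, `next_query` would be a model call that reads the notes so far and proposes the next search, and `max_steps` acts as a budget on cost.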
02
Code Walkthrough
Zilliz’s DeepSearcher
About Milvus
Milvus is an open-source vector database for
GenAI projects. pip install on your laptop, plug into
popular AI dev tools, and push to production with
a single line of code.
33K
GitHub Stars
66M
Docker Pulls
400
Contributors
2.7K
Forks
Easy Setup
Pip-install to start coding in a notebook within seconds
Integration
Plug into OpenAI, LangChain, LlamaIndex, and many more
Reusable Code
Write once, and deploy with one line of code into the production
environment
Feature-rich
Dense & sparse embeddings, filtering, reranking and beyond
Milvus Users
Deployment Options
Milvus Lite
● Locally hosted
● Suitable for prototyping
and demos
Milvus Standalone
● Single remote/local server
● “Medium” scale
● Simplified setup,
maintenance, etc.
compared to cluster
Milvus Cluster
● Distributed system
● Many different types of
nodes
● Scales to 100s of billions
of vectors
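
From client code, the three options differ mainly in the connection target. The sketch below assumes the usual pymilvus convention that a local file path selects Milvus Lite and an HTTP address on port 19530 reaches a Standalone or Cluster server; `milvus_uri` is a hypothetical helper, and the file name, host, and port are placeholder values.

```python
def milvus_uri(mode: str, host: str = "localhost", port: int = 19530) -> str:
    """Return a connection URI for the given deployment mode (illustrative)."""
    if mode == "lite":
        # Assumption: Milvus Lite stores data in a local file.
        return "./milvus_demo.db"
    if mode in ("standalone", "cluster"):
        # Assumption: a server deployment is reached over HTTP.
        return f"http://{host}:{port}"
    raise ValueError(f"unknown deployment mode: {mode!r}")

# With pymilvus installed, the URI would typically be passed to the client:
#   from pymilvus import MilvusClient
#   client = MilvusClient(uri=milvus_uri("lite"))
lite_uri = milvus_uri("lite")
server_uri = milvus_uri("standalone")
```

Keeping the URI as the only thing that changes is what lets the same application code move from a notebook prototype to a server deployment.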
Retrieval-Augmented Generation
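
As a rough sketch of the pattern named on this slide: retrieve the passages most similar to the question, then place them in the prompt that goes to the language model. Everything here is a simplification for illustration. The bag-of-words `embed` stands in for a learned embedding model, the linear scan in `retrieve` stands in for a vector database query, and the prompt wording is an assumption.

```python
import math
import re
from collections import Counter
from typing import List

def embed(text: str) -> Counter:
    """Toy embedding: lowercase word counts (a real system uses a model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[token] * b[token] for token in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0

def retrieve(query: str, docs: List[str], k: int = 2) -> List[str]:
    """Return the k documents most similar to the query."""
    q = embed(query)
    return sorted(docs, key=lambda d: cosine(q, embed(d)), reverse=True)[:k]

def build_prompt(query: str, docs: List[str], k: int = 2) -> str:
    """Augment the question with retrieved context before generation."""
    context = "\n".join(f"- {d}" for d in retrieve(query, docs, k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"

DOCS = [
    "Milvus is an open-source vector database.",
    "GRPO is a reinforcement learning method.",
    "Palo Alto hosts a workshop.",
]
top = retrieve("What is Milvus?", DOCS, k=1)
prompt = build_prompt("What is Milvus?", DOCS, k=1)
```

The generation step itself is omitted: `prompt` would be sent to whichever language model the application uses.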
github.com/milvus-io/milvus zilliz.com/learn/generative-ai
Code Walkthrough
03 What makes it “tick”?
Agents — Tool Usage
● Ability to call functions
● Equivalently, write and run code (in sandbox)
● Search web
● Use of structured output / constrained
sampling
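
The points above can be sketched together: the model emits structured output (here, a JSON object naming a tool and its arguments), and the host program validates it and calls the matching function. The JSON shape, the `TOOLS` registry, and both example tools are assumptions made for illustration and are not tied to any specific provider's function-calling API.

```python
import json
from typing import Any, Callable, Dict

def add(a: float, b: float) -> float:
    return a + b

def word_count(text: str) -> int:
    return len(text.split())

# Registry of functions the agent is allowed to call.
TOOLS: Dict[str, Callable[..., Any]] = {"add": add, "word_count": word_count}

def dispatch(model_output: str) -> Any:
    """Parse a tool call of the form {"tool": ..., "args": {...}} and run it."""
    call = json.loads(model_output)
    if not isinstance(call, dict) or set(call) != {"tool", "args"}:
        raise ValueError("expected exactly the keys 'tool' and 'args'")
    name, args = call["tool"], call["args"]
    if name not in TOOLS:
        raise ValueError(f"unknown tool: {name}")
    if not isinstance(args, dict):
        raise ValueError("'args' must be a JSON object")
    return TOOLS[name](**args)

# A string like this would come from the model, not be hard-coded.
result = dispatch('{"tool": "add", "args": {"a": 2, "b": 3}}')
```

Constrained sampling makes the validation step cheaper in practice, because the model can only produce output that already matches the expected schema.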
Agents — Cognitive Models
Reasoning — Conditional Computation
Reasoning — GRPO Training
Reasoning — GRPO Training
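
Only the slide titles survive here, so the following is a hedged sketch of one core idea in GRPO (Group Relative Policy Optimization) rather than a reconstruction of the slide. Several responses are sampled for the same prompt, and each response's reward is normalized against the others in its group, which removes the need for a separate value model. The function name, the example rewards, and the choice of population standard deviation are assumptions; the policy-gradient update that consumes these advantages is omitted.

```python
from statistics import mean, pstdev
from typing import List

def group_relative_advantages(rewards: List[float], eps: float = 1e-8) -> List[float]:
    """Normalize each reward against the mean and spread of its own group."""
    mu = mean(rewards)
    sigma = pstdev(rewards)  # assumption: population std; implementations vary
    # eps keeps the division finite when every response gets the same reward.
    return [(r - mu) / (sigma + eps) for r in rewards]

# Hypothetical rewards for four sampled answers to one prompt
# (1.0 = correct final answer, 0.0 = incorrect).
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)
```

Responses that beat their group's average get a positive advantage and are reinforced; those below average get a negative one.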
04 Where to from here?
Challenges
● Cost
● Hallucinations (factual errors)
● Reasoning errors
● Open-source reasoning trace datasets
Where to from here?
● Cost
○ Special hardware (SambaNova, Cerebras)
○ Continuous CoT
● Barriers to entry
○ Milvus
■ “I Built a Deep Research with Open Source—and So Can You!”
■ “Introducing DeepSearcher: A Local Open Source Deep Research”
○ HuggingFace
■ “Open-R1: a fully open reproduction of DeepSeek-R1”
■ “Open-source DeepResearch – Freeing our search agents”
https://milvus.io/discord
https://github.com/milvus-io/milvus
https://x.com/milvusio
https://www.linkedin.com/company/the-milvus-project
LET’S STAY CONNECTED!
Stefan Webb
Developer Advocate, Zilliz
Unstructured
Data Podcast
Latest Episodes
• Inside the AI Revolution
• Prompt, Score, Repeat: Principled
RAG and Agent Design
🎙🎙🎙
Workshop
with Milvus
and OpenAI
Join us for a hands-on session with
OpenAI to learn about Agents!
🗓 March 20, 2025
⏰ 530  830 PM
📍Palo Alto
Book a free 11 session to get help with your production deployment
meetings.hubspot.com/chloe-williams1/milvus-office-hours
T H A N K Y O U
