Welcome to the final video of this week!
In this video, I’d like to share how language models
(LMs) are beginning to use tools and also discuss a cutting-edge topic: Agents, which involve
letting LMs decide for themselves what actions to take next. Let’s explore.
Example: Food Order Chatbot
Consider a food-order chatbot. If you say, "Send me a burger," the chatbot might respond with,
"Okay, it's on its way." However, to actually place the order and send it to you, the LM
needs to take action behind the scenes. Here’s what happens:
The LM outputs an internal response like:
"Order burger for user 9876 to be sent to this address."
It also generates the user-facing message:
"Okay, it's on its way."
An LM fine-tuned to output structured text like this can trigger a software application to
process the order. In this case, it would communicate with the restaurant’s ordering system to
deliver a burger to the specified user at their address.
Only the final line (“Okay, it’s on its way”) is displayed to the user. This is an example of tool
use by an LM, where the text output triggers actions like placing a restaurant order.
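As a rough illustration of how an application might wire this up, here is a minimal Python sketch that parses the LM's structured output and triggers a hypothetical place_order call. The two-line output format, the field names, and the place_order function are all assumptions for illustration, not part of any particular ordering system.

```python
import re

def place_order(user_id: str, item: str) -> None:
    """Hypothetical back-end call that would forward the order
    to the restaurant's ordering system."""
    print(f"Placing order: {item} for user {user_id}")

def handle_lm_output(lm_output: str) -> str:
    """Split the LM's structured output into an internal action line
    and a user-facing message (this two-line format is an assumption)."""
    internal_line, user_message = lm_output.strip().split("\n", 1)

    # Look for an order instruction such as
    # "Order burger for user 9876 to be sent to this address."
    match = re.match(r"Order (\w+) for user (\d+)", internal_line)
    if match:
        item, user_id = match.group(1), match.group(2)
        place_order(user_id, item)

    # Only the user-facing line is shown in the chat window.
    return user_message

reply = handle_lm_output(
    "Order burger for user 9876 to be sent to this address.\n"
    "Okay, it's on its way."
)
print(reply)  # -> Okay, it's on its way.
```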
Improving Reliability
Placing an incorrect order can be a costly mistake. To avoid this, a better user interface could
involve a verification dialog:
Before finalizing the order, the chatbot might display a prompt asking the user to
confirm:
“Is this order correct? Yes/No.”
This step allows the user to validate the action before the LM triggers it. For any safety-
critical or mission-critical actions, it’s essential to let the user confirm before the LM executes
potentially costly or erroneous tasks.
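As a minimal sketch of that confirmation step, reusing the hypothetical order format from above, the application could gate the back-end call on an explicit yes from the user:

```python
from typing import Callable

def confirm_and_place_order(user_id: str, item: str,
                            place_order: Callable[[str, str], None]) -> bool:
    """Ask the user to confirm before triggering the order; place_order
    stands in for the hypothetical back-end call sketched earlier."""
    answer = input(f"Order {item} for user {user_id}. Is this order correct? (yes/no) ")
    if answer.strip().lower() == "yes":
        place_order(user_id, item)
        return True
    print("Order cancelled.")
    return False
```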
Using Tools for Reasoning
LMs can also leverage tools for reasoning. For instance, if you prompt an LM with:
“How much would I have after 8 years if I deposit $100 in a bank account that pays 5%
interest?”
The LM might generate an answer like:
“You will have $147.40.”
While this response sounds plausible, the number is incorrect. LMs, even when instruction-
tuned, are not great at precise math. Instead of relying solely on the LM, a tool like a
calculator can be used to compute the correct result.
This mirrors how you or I would reach for a calculator to solve a similar problem. Rather than
having the LM output the answer directly, we can give it access to a calculator and have it
generate output like this:
"After compounding, you would have calculator: 100 × 1.05^8."
This output can be interpreted as a command to call an external calculator program, which
computes the correct answer, approximately $147.75. The calculated result can then be plugged
back into the text, providing the user with the correct figure.
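As a minimal sketch of that plumbing, the code below looks for a hypothetical "calculator:" marker in the LM's output, evaluates the expression, and splices the result back into the text. The marker syntax and the parsing are assumptions for illustration.

```python
import re

def run_calculator_tool(lm_output: str) -> str:
    """Evaluate a 'calculator: <expression>' call in the LM's output and
    splice the result back into the text (marker syntax is an assumption)."""
    match = re.search(r"calculator:\s*(.+)", lm_output)
    if not match:
        return lm_output  # no tool call found; return the text unchanged

    # Normalize the math notation into a Python expression.
    expression = match.group(1).rstrip(".").replace("×", "*").replace("^", "**")

    # eval with empty builtins is used purely for illustration;
    # a real system would use a proper expression parser.
    result = eval(expression, {"__builtins__": {}})

    # Replace the tool call with the computed value.
    return lm_output[:match.start()] + f"${result:.2f}."

print(run_calculator_tool(
    "After compounding, you would have calculator: 100 × 1.05^8."
))
# -> After compounding, you would have $147.75.
```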
By giving LMs the ability to call tools in their outputs, we can significantly extend their
reasoning or action-taking capabilities. Tool use is already an important part of many LM
applications. However, designers of these applications should carefully ensure that tools are
not triggered in ways that cause harm or irreversible damage.
Moving Beyond Tools: AI Agents
Going beyond tools, AI researchers are exploring agents, which extend LMs' capabilities
from triggering single actions to carrying out complex sequences of actions. This is an
exciting but experimental area at the cutting edge of AI research. While agents are not yet
mature enough for most critical applications, they hold tremendous potential.
For example, imagine you ask an agent, “Help me research Better Burger’s top competitors.”
The agent could use an LM as a reasoning engine to determine the steps needed to complete
the task:
1. Search for a list of top competitors.
2. Visit the websites of each competitor.
3. Write a summary based on the homepage content of each competitor.
To accomplish this, the agent might:
Trigger a web search tool with the query “Better Burger’s competitors.”
Visit the websites of the identified competitors and download their homepage content.
Use an LM to summarize the text found on these websites.
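To make those steps concrete, here is a heavily simplified Python sketch of such an agent flow. The llm, web_search, and fetch_homepage callables are stand-ins for real LM, search, and scraping APIs, and the overall wiring is an assumption for illustration, not a description of any particular agent framework.

```python
from typing import Callable

def research_competitors(company: str,
                         llm: Callable[[str], str],
                         web_search: Callable[[str], str],
                         fetch_homepage: Callable[[str], str]) -> str:
    """Simplified agent flow: the LM plans the steps, then each step
    is carried out with a tool (all callables are placeholders)."""
    # Step 1: use the LM as a reasoning engine to list competitors.
    plan = llm(f"List the top competitors of {company}, one per line.")
    competitors = [line.strip() for line in plan.splitlines() if line.strip()]

    summaries = []
    for name in competitors:
        # Step 2: search for and download each competitor's homepage.
        url = web_search(f"{name} official website")
        homepage_text = fetch_homepage(url)

        # Step 3: ask the LM to summarize the homepage content.
        summary = llm(f"Summarize this homepage in two sentences:\n{homepage_text}")
        summaries.append(f"{name}: {summary}")

    return "\n".join(summaries)
```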
While there have been impressive demonstrations of agents performing such tasks, the
technology is not yet ready for widespread use. However, as researchers improve its
capabilities, agents may become powerful tools for helping users carry out tasks in a safe and
responsible manner.
Conclusion
The future of AI could see LMs evolving into reasoning engines that not only decide on
sequences of actions but also execute them responsibly to assist users with their tasks.
Thank you, and congratulations on reaching the end of week two! With just one more week to
go in this course, we’ll next explore how generative AI is affecting companies. This includes
identifying generative AI use cases for businesses and examining its broader societal impact,
particularly on jobs. I look forward to seeing you next week!