- Video Walkthrough
- What You'll Learn
- Prerequisites
- Understanding AI Agents with Tools
- How Tool Calling Works
- Tool Chaining
- Run the Application
- Using the Application
- Key Concepts
- Available Tools
- When to Use Tool-Based Agents
- Tools vs RAG
- Next Steps
Watch this live session that explains how to get started with this module:
So far, you've learned how to have conversations with AI, structure prompts effectively, and ground responses in your documents. But there's still a fundamental limitation: language models can only generate text. They can't check the weather, perform calculations, query databases, or interact with external systems.
Tools change this. By giving the model access to functions it can call, you transform it from a text generator into an agent that can take actions. The model decides when it needs a tool, which tool to use, and what parameters to pass. Your code executes the function and returns the result. The model incorporates that result into its response.
- Completed Module 01 - Introduction (Azure OpenAI resources deployed)
- Completion of previous modules recommended (this module references RAG concepts from Module 03 in the Tools vs RAG comparison)
- `.env` file in root directory with Azure credentials (created by `azd up` in Module 01)
Note: If you haven't completed Module 01, follow the deployment instructions there first.
📝 Note: The term "agents" in this module refers to AI assistants enhanced with tool-calling capabilities. This is different from the Agentic AI patterns (autonomous agents with planning, memory, and multi-step reasoning) that we'll cover in Module 05: MCP.
Without tools, a language model can only generate text from its training data. Ask it for the current weather, and it has to guess. Give it tools, and it can call a weather API, perform calculations, or query a database — then weave those real results into its response.
Without tools the model can only guess — with tools it can call APIs, run calculations, and return real-time data.
An AI agent with tools follows a Reasoning and Acting (ReAct) pattern. The model doesn't just respond — it thinks about what it needs, acts by calling a tool, observes the result, and then decides whether to act again or deliver the final answer:
- Reason — The agent analyzes the user's question and determines what information it needs
- Act — The agent selects the right tool, generates the correct parameters, and calls it
- Observe — The agent receives the tool's output and evaluates the result
- Repeat or Respond — If more data is needed, the agent loops back; otherwise, it composes a natural language answer
The ReAct cycle — the agent reasons about what to do, acts by calling a tool, observes the result, and loops until it can deliver the final answer.
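To make the cycle concrete, here is a minimal plain-Java sketch of the loop. This is illustrative only — LangChain4j runs this loop for you inside the `@AiService` proxy, and the "model" here is a hard-coded stand-in so each phase is visible end to end:

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative sketch of the ReAct cycle. The reason/act/respond methods are
// deterministic stand-ins for the LLM and tool execution, not real API calls.
public class ReActLoop {

    record ToolCall(String toolName, String argument) {}

    public static String answer(String question) {
        List<String> transcript = new ArrayList<>();
        transcript.add("user: " + question);

        while (true) {
            ToolCall call = reason(transcript);       // Reason: decide next action
            if (call == null) {
                return respond(transcript);           // Respond: compose final answer
            }
            String observation = act(call);           // Act: execute the tool
            transcript.add("tool: " + observation);   // Observe, then loop
        }
    }

    // Stand-in for the LLM's decision: request weather data once, then stop.
    static ToolCall reason(List<String> transcript) {
        boolean hasObservation = transcript.stream().anyMatch(m -> m.startsWith("tool: "));
        return hasObservation ? null : new ToolCall("getCurrentWeather", "Seattle");
    }

    // Stand-in for local tool execution (a real tool would call an API).
    static String act(ToolCall call) {
        return "Weather in " + call.argument() + ": 22°C, cloudy";
    }

    // Stand-in for the model weaving the observation into an answer.
    static String respond(List<String> transcript) {
        return transcript.get(transcript.size() - 1).replace("tool: ", "");
    }

    public static void main(String[] args) {
        System.out.println(answer("What's the weather in Seattle?"));
    }
}
```

The key point the sketch captures: the loop terminates only when the model decides no further tool call is needed, which is what lets a single question trigger zero, one, or several tool invocations.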
This happens automatically. You define the tools and their descriptions. The model handles the decision-making about when and how to use them.
WeatherTool.java | TemperatureTool.java
You define functions with clear descriptions and parameter specifications. The model sees these descriptions in its system prompt and understands what each tool does.
```java
@Component
public class WeatherTool {

    @Tool("Get the current weather for a location")
    public String getCurrentWeather(@P("Location name") String location) {
        // Your weather lookup logic
        return "Weather in " + location + ": 22°C, cloudy";
    }
}
```
```java
@AiService
public interface Assistant {
    String chat(@MemoryId String sessionId, @UserMessage String message);
}

// Assistant is automatically wired by Spring Boot with:
// - ChatModel bean
// - All @Tool methods from @Component classes
// - ChatMemoryProvider for session management
```

The diagram below breaks down every annotation and shows how each piece helps the AI understand when to call the tool and what arguments to pass:
Anatomy of a tool definition — @Tool tells the AI when to use it, @P describes each parameter, and @AiService wires everything together at startup.
🤖 Try with GitHub Copilot Chat: Open `WeatherTool.java` and ask:
- "How would I integrate a real weather API like OpenWeatherMap instead of mock data?"
- "What makes a good tool description that helps the AI use it correctly?"
- "How do I handle API errors and rate limits in tool implementations?"
When a user asks "What's the weather in Seattle?", the model doesn't randomly pick a tool. It compares the user's intent against every tool description it has access to, scores each one for relevance, and selects the best match. It then generates a structured function call with the right parameters — in this case, setting location to "Seattle".
If no tool matches the user's request, the model falls back to answering from its own knowledge. If multiple tools match, it picks the most specific one.
The model evaluates every available tool against the user's intent and selects the best match — this is why writing clear, specific tool descriptions matters.
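As a crude analogy, the selection step can be pictured as scoring each description against the user's words. The real model matches intent semantically, not by keyword overlap — this sketch (tool names borrowed from this module, scoring logic entirely invented) only illustrates why specific descriptions beat vague ones:

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

// Crude analogy for tool selection: score each tool description against the
// user's message and pick the best match. An LLM does this semantically;
// keyword overlap just makes the idea visible.
public class ToolSelector {

    static final Map<String, String> TOOL_DESCRIPTIONS = Map.of(
        "getCurrentWeather", "get the current weather for a location",
        "celsiusToFahrenheit", "convert a temperature from celsius to fahrenheit");

    public static String selectTool(String userMessage) {
        String best = null;
        long bestScore = 0;
        for (var entry : TOOL_DESCRIPTIONS.entrySet()) {
            long score = overlap(userMessage, entry.getValue());
            if (score > bestScore) {
                bestScore = score;
                best = entry.getKey();
            }
        }
        return best;  // null means "no tool matches — answer from own knowledge"
    }

    static long overlap(String message, String description) {
        Set<String> words = new HashSet<>(Arrays.asList(message.toLowerCase().split("\\W+")));
        return Arrays.stream(description.split(" ")).filter(words::contains).count();
    }

    public static void main(String[] args) {
        System.out.println(selectTool("What's the weather in Seattle?"));
    }
}
```

A vague description like "does stuff with data" would score poorly against every intent — the same failure mode a real model hits when tool descriptions are unclear.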
Spring Boot auto-wires the declarative @AiService interface with all registered tools, and LangChain4j executes tool calls automatically. Behind the scenes, a complete tool call flows through six stages — from the user's natural language question all the way back to a natural language answer:
The end-to-end flow — the user asks a question, the model selects a tool, LangChain4j executes it, and the model weaves the result into a natural response.
If you ran the ToolIntegrationDemo in Module 00, you already saw this pattern in action — the Calculator tools were called the same way. The sequence diagram below shows exactly what happened under the hood during that demo:
The tool-calling loop from the Quick Start demo — AiServices sends your message and tool schemas to the LLM, the LLM replies with a function call like add(42, 58), LangChain4j executes the Calculator method locally, and feeds the result back for the final answer.
🤖 Try with GitHub Copilot Chat: Open `AgentService.java` and ask:
- "How does the ReAct pattern work and why is it effective for AI agents?"
- "How does the agent decide which tool to use and in what order?"
- "What happens if a tool execution fails - how should I handle errors robustly?"
The model receives the weather data and formats it into a natural language response for the user.
This module uses LangChain4j's Spring Boot integration with declarative @AiService interfaces. At startup Spring Boot discovers every @Component that contains @Tool methods, your ChatModel bean, and the ChatMemoryProvider — then wires them all into a single Assistant interface with zero boilerplate.
The @AiService interface ties together the ChatModel, tool components, and memory provider — Spring Boot handles all the wiring automatically.
Here's the full request lifecycle as a sequence diagram — from the HTTP request through the controller, service, and auto-wired proxy, all the way to the tool execution and back:
The complete Spring Boot request lifecycle — HTTP request flows through the controller and service to the auto-wired Assistant proxy, which orchestrates the LLM and tool calls automatically.
Key benefits of this approach:
- Spring Boot auto-wiring — ChatModel and tools automatically injected
- @MemoryId pattern — Automatic session-based memory management
- Single instance — Assistant created once and reused for better performance
- Type-safe execution — Java methods called directly with type conversion
- Multi-turn orchestration — Handles tool chaining automatically
- Zero boilerplate — No manual `AiServices.builder()` calls or memory HashMaps
Alternative approaches (manual AiServices.builder()) require more code and miss Spring Boot integration benefits.
Tool Chaining — The real power of tool-based agents shows when a single question requires multiple tools. Ask "What's the weather in Seattle in Fahrenheit?" and the agent automatically chains two tools: first it calls getCurrentWeather to get the temperature in Celsius, then it passes that value to celsiusToFahrenheit for conversion — all in a single conversation turn.
Tool chaining in action — the agent calls getCurrentWeather first, then pipes the Celsius result into celsiusToFahrenheit, and delivers a combined answer.
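The two tools involved can be sketched as plain Java so the chaining is easy to follow. In the real `WeatherTool.java` and `TemperatureTool.java` these methods carry `@Tool` and `@P` annotations; the mock value and the exact method signatures below are assumptions for illustration:

```java
// Plain-Java sketch of the two chained tools. In the module's code these are
// annotated @Tool methods that LangChain4j calls on the agent's behalf.
public class ToolChainingSketch {

    // Tool 1: returns the temperature in Celsius (mock data)
    public static double getCurrentWeatherCelsius(String location) {
        return 22.0;
    }

    // Tool 2: standard Celsius-to-Fahrenheit conversion
    public static double celsiusToFahrenheit(double celsius) {
        return celsius * 9.0 / 5.0 + 32.0;
    }

    public static void main(String[] args) {
        // In one conversation turn, the agent makes two tool calls:
        double celsius = getCurrentWeatherCelsius("Seattle");  // call 1
        double fahrenheit = celsiusToFahrenheit(celsius);      // call 2, fed by call 1's result
        System.out.printf("Seattle: %.1f°F%n", fahrenheit);
    }
}
```

The model never sees Java — it sees the first tool's result ("22") in the conversation, recognizes the user asked for Fahrenheit, and emits a second function call with that value as the argument.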
Graceful Failures — Ask for weather in a city that's not in the mock data. The tool returns an error message, and the AI explains it can't help rather than crashing. Tools fail safely. The diagram below contrasts the two approaches — with proper error handling, the agent catches the exception and responds helpfully, while without it the entire application crashes:
When a tool fails, the agent catches the error and responds with a helpful explanation instead of crashing.
This happens in a single conversation turn. The agent orchestrates multiple tool calls autonomously.
Verify deployment:
Ensure the .env file exists in the root directory with Azure credentials (created during Module 01). Run this from the module directory (04-tools/):
Bash:

```bash
cat ../.env   # Should show AZURE_OPENAI_ENDPOINT, API_KEY, DEPLOYMENT
```

PowerShell:

```powershell
Get-Content ..\.env   # Should show AZURE_OPENAI_ENDPOINT, API_KEY, DEPLOYMENT
```

Start the application:
Note: If you already started all applications using `./start-all.sh` from the root directory (as described in Module 01), this module is already running on port 8084. You can skip the start commands below and go directly to http://localhost:8084.
Option 1: Using Spring Boot Dashboard (Recommended for VS Code users)
The dev container includes the Spring Boot Dashboard extension, which provides a visual interface to manage all Spring Boot applications. You can find it in the Activity Bar on the left side of VS Code (look for the Spring Boot icon).
From the Spring Boot Dashboard, you can:
- See all available Spring Boot applications in the workspace
- Start/stop applications with a single click
- View application logs in real-time
- Monitor application status
Simply click the play button next to "tools" to start this module, or start all modules at once.
Here's what the Spring Boot Dashboard looks like in VS Code:
The Spring Boot Dashboard in VS Code — start, stop, and monitor all modules from one place
Option 2: Using shell scripts
Start all web applications (modules 01-04):
Bash:

```bash
cd ..   # From root directory
./start-all.sh
```

PowerShell:

```powershell
cd ..   # From root directory
.\start-all.ps1
```

Or start just this module:

Bash:

```bash
cd 04-tools
./start.sh
```

PowerShell:

```powershell
cd 04-tools
.\start.ps1
```

Both scripts automatically load environment variables from the root `.env` file and will build the JARs if they don't exist.
Note: If you prefer to build all modules manually before starting:
Bash:

```bash
cd ..   # Go to root directory
mvn clean package -DskipTests
```

PowerShell:

```powershell
cd ..   # Go to root directory
mvn clean package -DskipTests
```
Open http://localhost:8084 in your browser.
To stop:
Bash:

```bash
./stop.sh                # This module only
# Or
cd .. && ./stop-all.sh   # All modules
```

PowerShell:

```powershell
.\stop.ps1               # This module only
# Or
cd ..; .\stop-all.ps1    # All modules
```

The application provides a web interface where you can interact with an AI agent that has access to weather and temperature conversion tools. Here's what the interface looks like — it includes quick-start examples and a chat panel for sending requests:
The AI Agent Tools interface - quick examples and chat interface for interacting with tools
Start with a straightforward request: "Convert 100 degrees Fahrenheit to Celsius". The agent recognizes it needs the temperature conversion tool, calls it with the right parameters, and returns the result. Notice how natural this feels - you didn't specify which tool to use or how to call it.
Now try something more complex: "What's the weather in Seattle and convert it to Fahrenheit?" Watch the agent work through this in steps. It first gets the weather (which returns Celsius), recognizes it needs to convert to Fahrenheit, calls the conversion tool, and combines both results into one response.
The chat interface maintains conversation history, allowing you to have multi-turn interactions. You can see all previous queries and responses, making it easy to track the conversation and understand how the agent builds context over multiple exchanges.
Multi-turn conversation showing simple conversions, weather lookups, and tool chaining
Try various combinations:
- Weather lookups: "What's the weather in Tokyo?"
- Temperature conversions: "What is 25°C in Kelvin?"
- Combined queries: "Check the weather in Paris and tell me if it's above 20°C"
Notice how the agent interprets natural language and maps it to appropriate tool calls.
The agent alternates between reasoning (deciding what to do) and acting (using tools). This pattern enables autonomous problem-solving rather than just responding to instructions.
The quality of your tool descriptions directly affects how well the agent uses them. Clear, specific descriptions help the model understand when and how to call each tool.
The @MemoryId annotation enables automatic session-based memory management. Each session ID gets its own ChatMemory instance managed by the ChatMemoryProvider bean, so multiple users can interact with the agent simultaneously without their conversations mixing together. The following diagram shows how multiple users are routed to isolated memory stores based on their session IDs:
Each session ID maps to an isolated conversation history — users never see each other's messages.
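Conceptually, the provider gives each session ID its own isolated message store. This stdlib-only sketch shows the isolation property; LangChain4j's real `ChatMemory` implementations also handle eviction (e.g. a message window), which is omitted here:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Conceptual sketch of per-session memory: one isolated message list per
// session ID, created lazily on first use. Not the real ChatMemoryProvider API.
public class SessionMemorySketch {

    private final Map<String, List<String>> memoryBySession = new ConcurrentHashMap<>();

    public void addMessage(String sessionId, String message) {
        memoryBySession.computeIfAbsent(sessionId, id -> new ArrayList<>()).add(message);
    }

    public List<String> history(String sessionId) {
        return memoryBySession.getOrDefault(sessionId, List.of());
    }

    public static void main(String[] args) {
        SessionMemorySketch memory = new SessionMemorySketch();
        memory.addMessage("alice", "What's the weather in Tokyo?");
        memory.addMessage("bob", "Convert 25°C to Fahrenheit");
        System.out.println(memory.history("alice"));  // only Alice's messages
    }
}
```

With `@MemoryId`, LangChain4j performs this lookup for you: the session ID passed to `chat(...)` selects which conversation history is sent to the model.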
Tools can fail — APIs timeout, parameters might be invalid, external services go down. Production agents need error handling so the model can explain problems or try alternatives rather than crashing the entire application. When a tool throws an exception, LangChain4j catches it and feeds the error message back to the model, which can then explain the problem in natural language.
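One common pattern is to catch failures inside the tool and return a descriptive error string, so you control exactly what text the model sees. A minimal sketch, with mock city data mirroring this module's `WeatherTool` (the lookup logic and message wording are assumptions):

```java
// Sketch of defensive tool code: failures become descriptive strings the model
// can explain to the user, instead of propagating exceptions.
public class SafeWeatherTool {

    public static String getCurrentWeather(String location) {
        try {
            return lookupWeather(location);
        } catch (Exception e) {
            // The model receives this text as the tool result and can
            // apologize or suggest alternatives instead of crashing.
            return "Error: weather data for '" + location + "' is unavailable ("
                   + e.getMessage() + ")";
        }
    }

    // Mock lookup — a real implementation would call a weather API here.
    static String lookupWeather(String location) {
        if (location.equalsIgnoreCase("Seattle")) {
            return "Weather in Seattle: 22°C, cloudy";
        }
        throw new IllegalArgumentException("unknown city");
    }

    public static void main(String[] args) {
        System.out.println(getCurrentWeather("Atlantis"));
    }
}
```

Letting the exception escape also works — as noted above, LangChain4j relays the exception's message back to the model — but catching it yourself lets you choose wording the model can act on.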
The diagram below shows the broad ecosystem of tools you can build. This module demonstrates weather and temperature tools, but the same @Tool pattern works for any Java method — from database queries to payment processing.
Any Java method annotated with @Tool becomes available to the AI — the pattern extends to databases, APIs, email, file operations, and more.
Not every request needs tools. The decision comes down to whether the AI needs to interact with external systems or can answer from its own knowledge. The following guide summarizes when tools add value and when they're unnecessary:
A quick decision guide — tools are for real-time data, calculations, and actions; general knowledge and creative tasks don't need them.
Modules 03 and 04 both extend what the AI can do, but in fundamentally different ways. RAG gives the model access to knowledge by retrieving documents. Tools give the model the ability to take actions by calling functions. The diagram below compares these two approaches side by side — from how each workflow operates to the trade-offs between them:
RAG retrieves information from static documents — Tools execute actions and fetch dynamic, real-time data. Many production systems combine both.
In practice, many production systems combine both approaches: RAG for grounding answers in your documentation, and Tools for fetching live data or performing operations.
Next Module: 05-mcp - Model Context Protocol (MCP)
Navigation: ← Previous: Module 03 - RAG | Back to Main | Next: Module 05 - MCP →