Function Calling
Enable models to call functions and use tools during generation.
Overview
Function calling (also called tool use) allows models to invoke external functions to perform actions or retrieve information. multi-llm-ts handles tool orchestration automatically across all supported providers (OpenAI, Anthropic, Google, Ollama, Groq, Mistral AI, and more).
There are two ways to provide tools to a model: plugins and delegates.
Plugins vs Delegates
| | Plugins | Delegates |
|---|---|---|
| Registration | Global on engine via addPlugin() | Per-request via toolExecutionDelegate option |
| Definition | Class-based (Plugin, CustomToolPlugin, MultiToolPlugin) | Plain object with getTools() and execute() |
| Tool set | Fixed once registered | Can change on every request |
| Status updates | Built-in via get*Description() methods | Not supported |
| Streaming progress | executeWithUpdates() for fine-grained progress | Not supported |
| Priority | Higher; checked first when resolving a tool call | Lower; used as fallback |
| Best for | Static, well-known tools with rich UX | Dynamic, external, or per-user tools |
Both approaches can be combined: register your core tools as plugins and inject context-specific tools via a delegate on each request. When a plugin and a delegate provide a tool with the same name, the plugin takes priority.
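For example, core tools can be registered once while a per-request delegate carries user-specific tools. A sketch; `buildDelegateFor` and `currentUser` are stand-ins for your own code (`SearchPlugin` is used as an example plugin below):

```ts
// Registered once: plugins are checked first when a tool call is resolved
model.addPlugin(new SearchPlugin())

// Built per request: the delegate is the fallback for tools no plugin provides
const response = await model.complete(messages, {
  toolExecutionDelegate: buildDelegateFor(currentUser)  // stand-in factory
})
```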
Using Plugins
Add plugins to your model and they're invoked automatically when needed:
```ts
import { igniteModel, loadModels, Message } from 'multi-llm-ts'
import { WeatherPlugin, SearchPlugin } from './plugins'
const models = await loadModels('openai', config)
const model = igniteModel('openai', models.chat[0], config)
// Register tools as plugins
model.addPlugin(new WeatherPlugin())
model.addPlugin(new SearchPlugin())
// Model will call tools as needed
const response = await model.complete([
new Message('user', 'What is the weather in Paris?')
])
// Behind the scenes:
// 1. Model decides to call get_weather tool
// 2. Plugin executes with parameters {location: "Paris"}
// 3. Result returned to model
// 4. Model generates natural language response
```
See the Plugins guide for how to create Plugin, CustomToolPlugin, and MultiToolPlugin classes.
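As a reference point, a minimal plugin might look like the sketch below. The method names follow the pattern used elsewhere on this page (getParameters() mirrors the delegate's tool definition shape); check the Plugins guide for the authoritative base-class contract:

```ts
import { Plugin } from 'multi-llm-ts'

// A sketch only: see the Plugins guide for the exact Plugin API
class WeatherPlugin extends Plugin {
  getName(): string {
    return 'get_weather'
  }
  getDescription(): string {
    return 'Get the current weather for a location'
  }
  getParameters() {
    return [
      { name: 'location', type: 'string', description: 'City name', required: true }
    ]
  }
  async execute(context: any, args: any) {
    return await fetchWeather(args.location)   // placeholder for your data source
  }
}
```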
Using Delegates
For dynamic tools that don't need plugin classes, pass a toolExecutionDelegate in the options:
```ts
import { igniteModel, loadModels, Message, ToolExecutionDelegate } from 'multi-llm-ts'
const models = await loadModels('openai', config)
const model = igniteModel('openai', models.chat[0], config)
const delegate: ToolExecutionDelegate = {
getTools() {
return [
{
name: 'lookup_user',
description: 'Look up a user by email',
parameters: [
{ name: 'email', type: 'string', description: 'User email', required: true }
]
}
]
},
async execute(context, tool, args) {
const user = await db.users.findByEmail(args.email)
return { name: user.name, role: user.role }
}
}
const response = await model.complete(messages, {
toolExecutionDelegate: delegate
})
```
getTools() can also be async, which is useful when tool definitions are loaded from an API or database. See the Tool Execution Delegate guide for advanced patterns like per-user tool sets, agent framework integration, and dynamic discovery.
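For instance, a delegate that loads its definitions remotely might look like this. A sketch; fetchToolDefinitions and callRemoteTool are placeholders for your own transport:

```ts
import { ToolExecutionDelegate } from 'multi-llm-ts'

const remoteDelegate: ToolExecutionDelegate = {
  // Definitions are fetched at request time, so the tool set can change per call
  async getTools() {
    return await fetchToolDefinitions()       // placeholder: your API or database
  },
  async execute(context, tool, args) {
    return await callRemoteTool(tool, args)   // placeholder: dispatch to the remote tool
  }
}

const response = await model.complete(messages, {
  toolExecutionDelegate: remoteDelegate
})
```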
With Streaming
Tool execution is visible during streaming regardless of which approach you use:
```ts
const stream = model.generate([
new Message('user', 'What is the weather in Paris?')
])
for await (const chunk of stream) {
if (chunk.type === 'content') {
console.log('Text:', chunk.text)
} else if (chunk.type === 'tool') {
console.log(`Tool: ${chunk.name} [${chunk.state}]`)
console.log(`Status: ${chunk.status}`)
}
}
// Output:
// Tool: get_weather [preparing]
// Status: Preparing weather lookup...
// Tool: get_weather [running]
// Status: Fetching weather for Paris...
// Tool: get_weather [completed]
// Status: Weather: 18°C, partly cloudy
// Text: The weather in Paris is currently 18°C and partly cloudy.
```
Tool states: During streaming, tools emit state updates (preparing, running, completed, canceled, error). See Streaming for details.
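In a UI, each state usually maps to a different treatment. A sketch, assuming placeholder helpers showSpinner, hideSpinner, and showError:

```ts
for await (const chunk of stream) {
  if (chunk.type !== 'tool') continue
  switch (chunk.state) {
    case 'preparing':
    case 'running':
      showSpinner(chunk.name, chunk.status)   // placeholder UI helper
      break
    case 'completed':
      hideSpinner(chunk.name)                 // placeholder UI helper
      break
    case 'canceled':
    case 'error':
      showError(chunk.name, chunk.status)     // placeholder UI helper
      break
  }
}
```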
Multi-Turn Conversations with Tool Calls
Important: When building multi-turn conversations, track tool calls in message history to maintain proper context.
Tracking Tool Calls in Streaming
Collect tool calls during streaming and add them to the assistant's message:
```ts
import { LlmChunkTool } from 'multi-llm-ts'
const conversation = [
new Message('user', 'What is the weather in Paris and London?')
]
// Collect response and tool calls
const assistantMessage = new Message('assistant', '')
const toolCalls: LlmChunkTool[] = []
const stream = model.generate(conversation)
for await (const chunk of stream) {
if (chunk.type === 'content') {
assistantMessage.appendText(chunk)
} else if (chunk.type === 'tool' && chunk.state === 'completed') {
// Track completed tool calls
toolCalls.push(chunk)
}
}
// Store tool call information in message
if (toolCalls.length > 0) {
assistantMessage.toolCalls = toolCalls.map(tc => ({
id: tc.id,
function: tc.name,
args: tc.call?.params,
result: tc.call?.result
}))
}
// Add to conversation
conversation.push(assistantMessage)
// Continue conversation with context
conversation.push(new Message('user', 'Which one is warmer?'))
const response = await model.complete(conversation)
// Model has access to previous tool calls and can reference them
```
Why Track Tool Calls?
Tracking tool calls in conversation provides:
- Context for follow-up questions: Model can reference previous tool results
- Conversation continuity: Maintains full history of what information was fetched
- Debugging and logging: Complete audit trail of tool usage
- Multi-turn interactions: Enables complex workflows spanning multiple turns
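The chunk-to-tool-call mapping shown above recurs on every turn, so it can be factored into a small helper (toToolCalls is a name introduced here, not part of the library):

```ts
import { LlmChunkTool } from 'multi-llm-ts'

// Convert completed tool chunks into the records stored on the assistant message
function toToolCalls(chunks: LlmChunkTool[]) {
  return chunks.map(tc => ({
    id: tc.id,
    function: tc.name,
    args: tc.call?.params,
    result: tc.call?.result
  }))
}

// Usage: assistantMessage.toolCalls = toToolCalls(completedChunks)
```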
Message Structure with Tool Calls
```ts
interface Message {
role: 'user' | 'assistant' | 'system'
content: string
toolCalls: LlmToolCall[] // Tool calls made by assistant
attachments: Attachment[]
}
interface LlmToolCall {
id: string // Unique call identifier
function: string // Tool name
args: any // Tool parameters
result?: any // Tool result (optional)
}
```
Complete Example
Full multi-turn conversation with tool tracking:
```ts
import { igniteModel, loadModels, Message, LlmChunkTool } from 'multi-llm-ts'
import { WeatherPlugin, CalculatorPlugin } from './plugins'
// Setup
const models = await loadModels('openai', config)
const model = igniteModel('openai', models.chat[0], config)
model.addPlugin(new WeatherPlugin())
model.addPlugin(new CalculatorPlugin())
// Start conversation
const conversation = [
new Message('system', 'You are a helpful assistant'),
new Message('user', 'What is the weather in Paris?')
]
// Turn 1: Weather query
const assistantMsg1 = new Message('assistant', '')
const toolCalls1: LlmChunkTool[] = []
for await (const chunk of model.generate(conversation)) {
if (chunk.type === 'content') {
assistantMsg1.appendText(chunk)
} else if (chunk.type === 'tool' && chunk.state === 'completed') {
toolCalls1.push(chunk)
}
}
if (toolCalls1.length > 0) {
assistantMsg1.toolCalls = toolCalls1.map(tc => ({
id: tc.id,
function: tc.name,
args: tc.call?.params,
result: tc.call?.result
}))
}
conversation.push(assistantMsg1)
console.log(assistantMsg1.content)
// Output: "The weather in Paris is currently 18°C and partly cloudy."
// Turn 2: Follow-up with calculation
conversation.push(new Message('user', 'What is that in Fahrenheit?'))
const response2 = await model.complete(conversation)
console.log(response2.content)
// Model uses previous weather result and calculates: "That's 64.4°F"
```
Best Practices
- Track tool calls: Always maintain tool call history in multi-turn conversations
- Provide clear tool descriptions: Help models understand when to use each tool
- Return structured data: Use objects with clear field names
- Handle errors gracefully: Let models know what went wrong
- Validate inputs: Check parameters in plugin implementations (see the sketch after this list)
- Monitor tool usage: Track which tools are called and how often
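For the validation and error-handling points above, a plugin's execute() can validate first and return failures as structured data rather than throwing, so the model can explain or retry. A sketch, following the WeatherPlugin example earlier (fetchWeather remains a placeholder):

```ts
async execute(context: any, args: any) {
  // Validate inputs before doing any work
  if (typeof args.location !== 'string' || args.location.trim() === '') {
    return { error: 'Missing or invalid "location" parameter' }
  }
  try {
    return await fetchWeather(args.location)   // placeholder data source
  } catch (err) {
    // Surface the failure as data so the model knows what went wrong
    return { error: `Weather lookup failed: ${(err as Error).message}` }
  }
}
```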
Next Steps
- Create custom Plugins for your use case
- Use Tool Execution Delegate for dynamic external tools
- Implement Tool Validation for security
- Learn about Abort Operations for cancellation
- Review Streaming for real-time updates