.Net: Bug: Azure-hosted Open-Source Models (e.g., Mistral Nemo) fail with Kernel Functions in .NET #9933
Comments
@markwallace-microsoft can you take a look?
@GregorBiswanger I created a sample for this and tested with Mistral Nemo and didn't see any issues. Take a look here: #9954
@markwallace-microsoft It is important to know whether you tried Mistral Nemo via Azure AI, not with Ollama or locally... so it was really via Azure?
After further investigation, I’ve identified that the issue specifically occurs when streaming is enabled for Azure-hosted open-source models (e.g., Mistral Nemo) using Semantic Kernel in .NET. Here’s what I tested:
Code that does not work with streaming:

#pragma warning disable SKEXP0070
using Microsoft.Extensions.DependencyInjection;
using Microsoft.SemanticKernel;
using Microsoft.SemanticKernel.ChatCompletion;
using Microsoft.SemanticKernel.Connectors.AzureAIInference;
using OllamaApiFacade.Extensions;
using SemanticFlow.DemoWebApi;
var azureKeyVaultHelper = new AzureKeyVaultHelper("https://XXX.vault.azure.net");
var endpoint = await azureKeyVaultHelper.GetSecretAsync("AZURE-MINISTRAL-NEMO-ENDPOINT");
var apiKey = await azureKeyVaultHelper.GetSecretAsync("AZURE-MINISTRAL-NEMO-KEY");
// Kernel setup
var builder = Kernel.CreateBuilder()
.AddAzureAIInferenceChatCompletion("Mistral-Nemo-oclsi", apiKey, new Uri(endpoint));
// Use Burp Suite proxy to analyze backend communication
// builder.Services.AddProxyForDebug();
var kernel = builder.Build();
// Import plugin functions
kernel.ImportPluginFromFunctions("HelperFunctions",
[
kernel.CreateFunctionFromMethod(() => new List<string> { "Squirrel Steals Show", "Dog Wins Lottery" },
"GetLatestNewsTitles", "Retrieves latest news titles."),
kernel.CreateFunctionFromMethod(() => DateTime.UtcNow.ToString("R"),
"GetCurrentUtcDateTime", "Retrieves the current date time in UTC."),
kernel.CreateFunctionFromMethod((string cityName, string currentDateTime) =>
{
if (string.IsNullOrEmpty(cityName) || string.IsNullOrEmpty(currentDateTime))
{
throw new ArgumentException("cityName and currentDateTime are required.");
}
return cityName switch
{
"Boston" => "61 and rainy",
"London" => "55 and cloudy",
"Miami" => "80 and sunny",
"Paris" => "60 and rainy",
"Tokyo" => "50 and sunny",
"Sydney" => "75 and sunny",
"Tel Aviv" => "80 and sunny",
_ => "31 and snowing",
};
}, "GetWeatherForCity", "Gets the current weather for the specified city, using the city name and current UTC date/time.")
]);
// Execution settings with automatic function calling (used with the streaming call below)
var settings = new AzureAIInferencePromptExecutionSettings { FunctionChoiceBehavior = FunctionChoiceBehavior.Auto() };
var chatHistory = new ChatHistory();
chatHistory.AddUserMessage("What is the weather in Tokyo based on the current date and time?");
// Streaming with IChatCompletionService
var chatCompletionService = kernel.Services.GetRequiredService<IChatCompletionService>();
try
{
// Stream the response
await foreach (var message in chatCompletionService.GetStreamingChatMessageContentsAsync(chatHistory, settings, kernel))
{
if (message.Role.HasValue)
{
Console.Write($"{message.Role.Value}: ");
}
if (!string.IsNullOrEmpty(message.Content))
{
Console.Write(message.Content);
}
}
Console.WriteLine("\nStreaming completed.");
}
catch (Exception ex)
{
// Log any exceptions
Console.WriteLine($"Error: {ex.Message}");
}
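For contrast, the non-streaming counterpart of the same call, which this comment reports as the working path, would look like the following minimal sketch (reusing the kernel, settings, and chatHistory from the program above; GetChatMessageContentAsync is the non-streaming extension method on IChatCompletionService):

// Non-streaming counterpart (sketch, same setup as above)
var result = await chatCompletionService.GetChatMessageContentAsync(chatHistory, settings, kernel);
Console.WriteLine(result.Content);

Cheers,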
I’ve discovered something new that might narrow down the issue further. It appears the problem isn’t just related to streaming but to Chat Completions in general. Using Burp Suite, I analyzed the request JSON being sent and noticed that it contains two separate "tools" entries: the first holds the registered functions, while the second is an empty array.
This leads me to believe there’s a bug in how Chat Completions serialize these duplicate entries into the request JSON. Here’s the exact request being sent (identical even with streaming enabled):

POST /chat/completions?api-version=2024-05-01-preview HTTP/1.1
Host: mistral-nemo-oclsi.swedencentral.models.ai.azure.com
Accept: application/json
...
Content-Type: application/json
Content-Length: 936
Connection: keep-alive
{
"messages": [
{
"content": [
{
"text": "What is the weather in Tokyo based on the current date and time?",
"type": "text"
}
],
"role": "user"
}
],
"model": "Mistral-Nemo-oclsi",
"tools": [
{
"type": "function",
"function": {
"name": "HelperFunctions-GetLatestNewsTitles",
"description": "Retrieves latest news titles.",
"parameters": {
"type": "object",
"required": [],
"properties": {}
}
}
},
{
"type": "function",
"function": {
"name": "HelperFunctions-GetCurrentUtcDateTime",
"description": "Retrieves the current date time in UTC.",
"parameters": {
"type": "object",
"required": [],
"properties": {}
}
}
},
{
"type": "function",
"function": {
"name": "HelperFunctions-GetWeatherForCity",
"description": "Gets the current weather for the specified city, using the city name and current UTC date/time.",
"parameters": {
"type": "object",
"required": ["cityName", "currentDateTime"],
"properties": {
"cityName": {
"type": "string"
},
"currentDateTime": {
"type": "string"
}
}
}
}
}
],
"tool_choice": "auto",
"stop": [],
"tools": []
}

Key observation:
While this structure doesn’t seem to break local solutions like LM Studio, Azure AI appears to mishandle it by honoring only the last (empty) "tools" entry, so the model never receives the registered functions. I now believe the bug lies in how Chat Completions build these entries in general. Let me know if additional details or logs are needed!
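To illustrate the suspected last-entry-wins behavior, here is a minimal, self-contained sketch (not from this issue; it relies on System.Text.Json’s documented rule that GetProperty matches the last definition when a property name is duplicated). A backend with the same semantics would see an empty tool list:

using System;
using System.Text.Json;

class DuplicateToolsDemo
{
    static void Main()
    {
        // Simplified payload with the same duplicated "tools" property
        // as the captured request above.
        const string json = """
        {
            "tools": [ { "type": "function" } ],
            "tool_choice": "auto",
            "tools": []
        }
        """;

        using var doc = JsonDocument.Parse(json);

        // GetProperty matches the last definition of a duplicated
        // property, so this prints 0 (the empty array), not 1.
        JsonElement tools = doc.RootElement.GetProperty("tools");
        Console.WriteLine(tools.GetArrayLength());
    }
}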
Thanks for the detailed investigation @GregorBiswanger. I suspect this is an issue in the Azure SDK and will continue to investigate.
@markwallace-microsoft In combination with IChatCompletionService and PromptExecutionSettings instead of AzureAIInferencePromptExecutionSettings, it works: the tool entries are no longer duplicated in the request, and the function call then works perfectly.
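For reference, a minimal sketch of that workaround (same usings and kernel setup as the program above; the base PromptExecutionSettings class also exposes FunctionChoiceBehavior):

// Workaround sketch: use the base PromptExecutionSettings instead of
// AzureAIInferencePromptExecutionSettings to avoid the duplicated
// "tools" entry in the request.
var settings = new PromptExecutionSettings
{
    FunctionChoiceBehavior = FunctionChoiceBehavior.Auto()
};

var chatCompletionService = kernel.Services.GetRequiredService<IChatCompletionService>();
var chatHistory = new ChatHistory();
chatHistory.AddUserMessage("What is the weather in Tokyo based on the current date and time?");

// With these settings, the streamed response includes the function call results.
await foreach (var message in chatCompletionService.GetStreamingChatMessageContentsAsync(chatHistory, settings, kernel))
{
    Console.Write(message.Content);
}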
Describe the bug
Semantic Kernel functions in .NET using Microsoft.SemanticKernel.Connectors.AzureAIInference do not work with Azure-hosted open-source models like Mistral Nemo, even though these models support function calls. The same setup works fine with GPT models, and in Python it works with Azure AI without any issues. This seems to be a .NET-specific issue with Semantic Kernel.
To Reproduce
Steps to reproduce the behavior:
1. Set up a kernel with Microsoft.SemanticKernel.Connectors.AzureAIInference against an Azure-hosted open-source model (e.g., Mistral Nemo).
2. Register one or more kernel functions.
3. Configure the prompt execution settings with FunctionChoiceBehavior.Auto().
4. Invoke chat completion; the expected function call fails.
Expected behavior
The Semantic Kernel Function should execute successfully using Function Calls with the Azure-hosted open-source model, similar to how it works with GPT models or with Python implementations.
Screenshots
Not applicable
Platform
Additional context
Please provide guidance or a fix to enable Semantic Kernel Functions to work with Azure-hosted open-source models.