
.Net: How does Semantic Kernel support load balancing by using multiple Azure OpenAI endpoints? #5774

This article explains how to use Azure API Management to load balance requests to multiple instances of the Azure OpenAI Service.

You will need to customise the OpenAIClient that Semantic Kernel uses to call the Azure OpenAI Service. This sample shows how to do this.
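As a rough illustration of what that customisation can look like, here is a minimal sketch, not the linked sample itself. It assumes the Semantic Kernel 1.x Azure OpenAI connector together with the `Azure.AI.OpenAI` 1.0.0-beta package (the versions current when this thread was written; later releases renamed the client types). The gateway URL, deployment name, key values, and the use of the `Ocp-Apim-Subscription-Key` header are all placeholders or assumptions about how your API Management instance is configured.

```csharp
using System;
using System.Net.Http;
using Azure;
using Azure.AI.OpenAI;
using Azure.Core.Pipeline;
using Microsoft.SemanticKernel;

// Hypothetical values: replace with your own API Management gateway and deployment.
var apimEndpoint = new Uri("https://my-apim-instance.azure-api.net/");
var deploymentName = "my-chat-deployment";
var apimSubscriptionKey = "<apim-subscription-key>";
var apiKey = "<api-key-expected-by-the-gateway>";

// Send the API Management subscription key on every request.
// Assumption: the gateway requires a subscription key in this header.
var httpClient = new HttpClient();
httpClient.DefaultRequestHeaders.Add("Ocp-Apim-Subscription-Key", apimSubscriptionKey);

// Route the Azure SDK pipeline through the customised HttpClient.
var clientOptions = new OpenAIClientOptions
{
    Transport = new HttpClientTransport(httpClient),
};

// Point the client at the gateway instead of a single Azure OpenAI resource;
// the gateway is then responsible for spreading calls across the backends.
var openAIClient = new OpenAIClient(apimEndpoint, new AzureKeyCredential(apiKey), clientOptions);

// Hand the pre-configured client to Semantic Kernel.
var kernel = Kernel.CreateBuilder()
    .AddAzureOpenAIChatCompletion(deploymentName, openAIClient)
    .Build();
```

With this arrangement the kernel only knows about one endpoint, the gateway, so the load-balancing policy lives entirely in API Management and can be changed without touching application code.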

Answer selected by sophialagerkranspandey
Category: Q&A
Labels: question, .NET
2 participants
This discussion was converted from issue #5763 on April 04, 2024 13:48.