.Net: How does semantic kernel support load balancing by using multiple Azure OpenAI endpoints? #5774

vishnugms · 2024-04-03T18:16:34Z

How does Semantic Kernel support load balancing across multiple Azure OpenAI endpoints? I would like to route my chat completion requests to different registered endpoints depending on the load.

Please assist.

Answered by markwallace-microsoft

markwallace-microsoft (Maintainer) · 2024-04-04T08:28:11Z

This article explains how to use Azure API Management to load balance requests across multiple instances of the Azure OpenAI Service.

You will need to customise the OpenAIClient that Semantic Kernel uses to call the Azure OpenAI Service. This sample shows how to do this.
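
For readers without access to the linked sample, here is a minimal sketch of what customising the client might look like. It assumes the Semantic Kernel 1.x and Azure.AI.OpenAI 1.0.0-beta packages that were current when this thread was written (newer releases may expose a different client type), and it assumes the API Management gateway expects a subscription key in the `Ocp-Apim-Subscription-Key` header. The `ApimKernelFactory` class, the `BuildKernel` method, and all parameter names are hypothetical and chosen for illustration only.

```csharp
using System;
using System.Net.Http;
using Azure;
using Azure.AI.OpenAI;
using Azure.Core.Pipeline;
using Microsoft.SemanticKernel;

// Hypothetical helper: builds a Kernel whose chat completion calls go to an
// API Management gateway instead of a single Azure OpenAI resource.
public static class ApimKernelFactory
{
    public static Kernel BuildKernel(
        Uri gatewayEndpoint,     // assumption: URL of the API Management gateway
        string apiKey,           // key sent by the SDK in the "api-key" header
        string subscriptionKey,  // assumption: the gateway requires a subscription key
        string deploymentName)   // deployment name shared by the backend instances
    {
        // Custom HttpClient so that every request carries the gateway header.
        var httpClient = new HttpClient();
        httpClient.DefaultRequestHeaders.Add("Ocp-Apim-Subscription-Key", subscriptionKey);

        // Tell the Azure SDK pipeline to send requests through that HttpClient.
        var clientOptions = new OpenAIClientOptions
        {
            Transport = new HttpClientTransport(httpClient),
        };

        // The client targets the gateway; the gateway decides which backend serves the call.
        var openAIClient = new OpenAIClient(
            gatewayEndpoint, new AzureKeyCredential(apiKey), clientOptions);

        // Register the customised client with Semantic Kernel.
        IKernelBuilder builder = Kernel.CreateBuilder();
        builder.AddAzureOpenAIChatCompletion(deploymentName, openAIClient);
        return builder.Build();
    }
}
```

With this arrangement the application only knows about one endpoint, and the choice of backend instance is made by the gateway policy rather than by Semantic Kernel. The same deployment name is assumed to exist on every backend instance so that the gateway can forward requests unchanged.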
