Troubleshooting performance latency issues
Latency in BMC HelixGPT's performance might occur due to the following reasons:
- Changes in parameter values in the default skill configuration.
- A proxy or gateway is configured between BMC HelixGPT and the LLM provider.
- Changes in the LLM model or client configuration parameters.
Issue symptoms
End users experience latency in responses from BMC HelixGPT. Sometimes,BMC HelixGPT takes more than a minute to generate a response.
Issue scope
This issue can occur in the following scenarios:
- Change in the default skill configuration parameters.
- A proxy or gateway is configured between BMC HelixGPT and the LLM provider.
- Lack of optimization of client and LLM model configuration parameters for specific requirements.
Resolution
- For agentic skills, we recommend that the parameter value for numberOfDocumentsToReturn is set to 5 or less.
Increasing the default value can sometimes affect performance. For more information, see Updating the configuration parameters of a skill. - If a proxy or gateway is configured between BMC HelixGPT and the LLM provider, make sure to allocate sufficient memory and CPU to the proxy server, optimize its configuration, and enable resource usage monitoring.
Optimize the client and LLM model configuration parameters for your specific requirements. To optimize the model configurations, based on your environment requirements, perform any or all of the following steps:
Latency caused by LLM provider rate limitingIn some environments, latency might occur due to rate-limiting configurations on the LLM provider side.
For example, if the configured requests per minute or tokens per minute limits are exceeded, the provider might return HTTP 429 (Too Many Requests) errors, causing delayed responses in BMC HelixGPT.Review the provider configuration and adjust the rate-limiting parameters as required. For more information, see the knowledge article BMC HelixGPT - Using HelixGPT Assistant Is Returning Slow Results.