Use appropriate wait time for retry based on the error message. #14

Closed
ekzhu opened this issue May 11, 2023 · 7 comments · May be fixed by microsoft/FLAML#1045
Labels
enhancement New feature or request llm issues related to LLM

Comments

@ekzhu
Collaborator

ekzhu commented May 11, 2023

[flaml.autogen.oai.completion: 05-11 00:50:35] {217} INFO - retrying in 10 seconds...
Traceback (most recent call last):
  File "[...\.venv\Lib\site-packages\flaml\autogen\oai\completion.py]", line 193, in _get_response
    response = openai_completion.create(request_timeout=request_timeout, **config)
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "[...\.venv\Lib\site-packages\openai\api_resources\completion.py]", line 25, in create
    return super().create(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "[...\.venv\Lib\site-packages\openai\api_resources\abstract\engine_api_resource.py]", line 153, in create
    response, _, api_key = requestor.request(
                           ^^^^^^^^^^^^^^^^^^
  File "[...\.venv\Lib\site-packages\openai\api_requestor.py]", line 226, in request
    resp, got_stream = self._interpret_response(result, stream)
                       ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "[...\.venv\Lib\site-packages\openai\api_requestor.py]", line 620, in _interpret_response
    self._interpret_response_line(
  File "[...\.venv\Lib\site-packages\openai\api_requestor.py]", line 683, in _interpret_response_line
    raise self.handle_error_response(
openai.error.RateLimitError: Requests to the Completions_Create Operation under Azure OpenAI API version 2022-12-01 have exceeded call rate limit of your current OpenAI S0 pricing tier. Please retry after 59 seconds. Please go here: https://aka.ms/oai/quotaincrease if you would like to further increase the default rate limit.
[flaml.autogen.oai.completion: 05-11 00:50:45] {217} INFO - retrying in 10 seconds...

The error message says "Please retry after 59 seconds", but FLAML keeps retrying in 10-second intervals.
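For illustration, a minimal sketch (not FLAML code; the helper name and fallback are hypothetical) of how the suggested wait could be parsed out of the error message instead of using the fixed interval:

```python
import re

def retry_after_seconds(error_message: str, default: float = 10.0) -> float:
    # Both messages in this thread embed the hint as "retry after 59 seconds"
    # or "try again in 20s"; fall back to the fixed interval otherwise.
    match = re.search(
        r"(?:retry after|try again in)\s*(\d+)\s*s", error_message, re.IGNORECASE
    )
    return float(match.group(1)) if match else default
```

Sleeping for `retry_after_seconds(str(err))` instead of the fixed 10 seconds would then honor the server's hint.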

@sonichi
Collaborator

sonichi commented May 11, 2023

Thanks. It would be nice to adjust the retry time according to the error message.
One workaround for now: set flaml.oai.retry_time = 60 in your code, if 60 seconds is the most common retry time required.
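As a snippet, assuming the module-level attribute mentioned above:

```python
import flaml

# Workaround: raise the fixed retry interval to match the 59-second
# wait the Azure error message asks for.
flaml.oai.retry_time = 60
```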

@sonichi sonichi added enhancement New feature or request good first issue Good for newcomers labels May 11, 2023
@sonichi sonichi transferred this issue from microsoft/FLAML Sep 23, 2023
@Pavel-hb

I also get this error:


[autogen.oai.completion: 09-27 12:50:43] {236} INFO - retrying in 10 seconds...
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/dist-packages/autogen/oai/completion.py", line 206, in _get_response
    response = openai_completion.create(**config)
  File "/usr/local/lib/python3.10/dist-packages/openai/api_resources/chat_completion.py", line 25, in create
    return super().create(*args, **kwargs)
  File "/usr/local/lib/python3.10/dist-packages/openai/api_resources/abstract/engine_api_resource.py", line 155, in create
    response, _, api_key = requestor.request(
  File "/usr/local/lib/python3.10/dist-packages/openai/api_requestor.py", line 299, in request
    resp, got_stream = self._interpret_response(result, stream)
  File "/usr/local/lib/python3.10/dist-packages/openai/api_requestor.py", line 710, in _interpret_response
    self._interpret_response_line(
  File "/usr/local/lib/python3.10/dist-packages/openai/api_requestor.py", line 775, in _interpret_response_line
    raise self.handle_error_response(
openai.error.RateLimitError: Rate limit reached for default-gpt-3.5-turbo in organization org-6oXqS68sE8UL8MSONa3w2IpY on requests per min. Limit: 3 / min. Please try again in 20s. Contact us through our help center at help.openai.com if you continue to have issues. Please add a payment method to your account to increase your rate limit. Visit https://platform.openai.com/account/billing to add a payment method.
INFO:autogen.oai.completion:retrying in 10 seconds...
(the same traceback and RateLimitError repeat identically on the next retry)

@RobertWeaver

Is there a configuration option to set the maximum number of requests that can be made per minute? That would help avoid hitting the requests-per-minute rate limit. 😅
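For example, something like this hypothetical client-side throttle (not an existing autogen option, just a sketch of what I mean):

```python
import time

class RequestThrottle:
    """Enforce a minimum gap between API calls so that at most
    `rpm` requests are issued per minute."""

    def __init__(self, rpm: int):
        self.min_interval = 60.0 / rpm
        self._last_call = 0.0

    def wait(self) -> None:
        # Sleep just long enough to respect the configured rate.
        sleep_for = self._last_call + self.min_interval - time.monotonic()
        if sleep_for > 0:
            time.sleep(sleep_for)
        self._last_call = time.monotonic()


throttle = RequestThrottle(rpm=3)  # the free-tier limit from the log above
# call throttle.wait() before each completion request
```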

@sonichi
Collaborator

sonichi commented Sep 30, 2023

@Pavel-hb @RobertWeaver I added some answers in #53. Could you take a look and let me know if they answer your questions?

@sonichi sonichi added llm issues related to LLM and removed good first issue Good for newcomers labels Oct 22, 2023
@raghavgrover13

raghavgrover13 commented Nov 22, 2023

I also observed the same behavior: the retry happens every 10 seconds, when it should be based on the time given in the error message.
Also, is there a way to dynamically reduce max tokens? I have seen instances with GPT-3.5 Turbo 16k where the output of one chat goes beyond the token limit. Should one calculate the token count of the system and user prompts and then set max tokens to the model's token limit minus the tokens already used by those prompts?
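Roughly what I mean, sketched with tiktoken (approximate, since the per-message overhead of the chat format is ignored; the function name and margin are just illustrative):

```python
import tiktoken

def dynamic_max_tokens(messages, model="gpt-3.5-turbo-16k",
                       model_limit=16385, margin=16):
    # Count tokens already spent on the system and user prompts, then
    # leave the remainder (minus a small safety margin) for the completion.
    enc = tiktoken.encoding_for_model(model)
    used = sum(len(enc.encode(m["content"])) for m in messages)
    return max(model_limit - used - margin, 0)
```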

@sonichi
Collaborator

sonichi commented Dec 3, 2023

The retry time behavior has been changed in v0.2. https://microsoft.github.io/autogen/docs/Installation#python
Dynamically reducing max tokens is an idea @kevin666aa may want to take note of.

@yiranwu0
Collaborator

yiranwu0 commented Dec 4, 2023

> The retry time behavior has been changed in v0.2. https://microsoft.github.io/autogen/docs/Installation#python Dynamically reducing max tokens is an idea @kevin666aa may want to take note of.

The "calculate the token count in system and user prompt" are already the features of CompressibeAgent.

I think "dynamically reducing max tokens" is not necessary to have. The similar thing is achieved by the "TERMINATE" mode of CompressibeAgent: when the token count is smaller than max token limit, the completion will be allowed. OpenAI will automatically return when max token limit of the model is reached. After that, the next completion will be terminated due to token count limit. So, no retry due to token limit will be happen.

@raghavgrover13 Can you checkout https://github.com/microsoft/autogen/blob/main/notebook/agentchat_compression.ipynb to see if it helps with your problem?
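For reference, a rough sketch of what using it could look like, based on that notebook (check the notebook for the exact compress_config options; config_list stands in for your usual LLM config):

```python
from autogen.agentchat.contrib.compressible_agent import CompressibleAgent

assistant = CompressibleAgent(
    name="assistant",
    llm_config={"config_list": config_list},
    # TERMINATE mode: stop instead of retrying once the context limit is hit.
    compress_config={"mode": "TERMINATE"},
)
```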

@ekzhu ekzhu closed this as not planned (won't fix, can't repro, duplicate, stale) Mar 28, 2024