
Throttling of batch API requests to avoid excessive rate limit errors -> faster batch processing #369

Open
pchalasani opened this issue Jan 17, 2024 · 0 comments
Labels
enhancement New feature or request

Comments

@pchalasani

Add a variant of this OpenAI cookbook example to proactively throttle API requests. The current Langroid chat and achat methods do have the usual retry-with-exponential-backoff logic, but this approach is "blind" in the sense that there is no proactive throttling. As a result, when we run large batch jobs (e.g. using run_batch_tasks), too much time may be wasted hitting rate limits and retrying. A proactive throttling approach like the one in the script below should work better.

https://github.com/openai/openai-cookbook/blob/main/examples/api_request_parallel_processor.py
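The core idea of proactive throttling can be sketched with a simple token-bucket limiter that waits for capacity *before* sending a request, instead of sending and retrying on 429s. This is a minimal illustration, not Langroid's implementation: the `RateLimiter` class and `call_api` placeholder below are hypothetical names, and a real version (like the cookbook script) would also track tokens-per-minute capacity, not just requests-per-minute.

```python
import asyncio
import time


class RateLimiter:
    """Token-bucket limiter: each request waits until capacity is
    available, so we rarely hit the server-side rate limit at all."""

    def __init__(self, max_requests_per_minute: float):
        self.capacity = max_requests_per_minute
        self.available = max_requests_per_minute
        self.last_refill = time.monotonic()

    async def acquire(self) -> None:
        while True:
            now = time.monotonic()
            # Replenish the bucket in proportion to elapsed time.
            self.available = min(
                self.capacity,
                self.available + (now - self.last_refill) * self.capacity / 60.0,
            )
            self.last_refill = now
            if self.available >= 1:
                self.available -= 1
                return
            # Not enough capacity: sleep briefly instead of firing a
            # doomed request and paying the retry/backoff penalty.
            await asyncio.sleep(0.05)


async def call_api(limiter: RateLimiter, i: int) -> int:
    # Placeholder for the real (a)chat call.
    await limiter.acquire()
    return i


async def run_all(n: int) -> list:
    limiter = RateLimiter(max_requests_per_minute=600)
    # gather preserves input order, so results line up with requests.
    return await asyncio.gather(*(call_api(limiter, i) for i in range(n)))


results = asyncio.run(run_all(20))
```

With all tasks sharing one limiter, concurrency stays high while the aggregate request rate stays under the configured ceiling, which is exactly what blind retry-with-backoff cannot guarantee for large batches.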

@pchalasani pchalasani added the enhancement New feature or request label Jan 17, 2024