
Add exception catching for when Redis server is down #1153

Closed
corynezin opened this issue Oct 25, 2019 · 3 comments · Fixed by #1387 · May be fixed by #1261

Comments

@corynezin
Contributor

After my Redis server went down briefly, it seems all of my workers were killed as a result of this error:

10:33:55 Worker rq:worker:7870f7200ab64bdfae843ce5f4dc5ae0: found an unhandled exception, quitting...
Traceback (most recent call last):
  File "/usr/local/lib/python3.6/site-packages/rq/worker.py", line 470, in work
    result = self.dequeue_job_and_maintain_ttl(timeout)
  File "/usr/local/lib/python3.6/site-packages/rq/worker.py", line 521, in dequeue_job_and_maintain_ttl
    job_class=self.job_class)
  File "/usr/local/lib/python3.6/site-packages/rq/queue.py", line 469, in dequeue_any
    result = cls.lpop(queue_keys, timeout, connection=connection)
  File "/usr/local/lib/python3.6/site-packages/rq/queue.py", line 441, in lpop
    result = connection.blpop(queue_keys, timeout)
  File "/usr/local/lib/python3.6/site-packages/redis/client.py", line 1618, in blpop
    return self.execute_command('BLPOP', *keys)
  File "/usr/local/lib/python3.6/site-packages/redis/client.py", line 839, in execute_command
    return self.parse_response(conn, command_name, **options)
  File "/usr/local/lib/python3.6/site-packages/redis/client.py", line 853, in parse_response
    response = connection.read_response()
  File "/usr/local/lib/python3.6/site-packages/redis/sentinel.py", line 55, in read_response
    return super(SentinelManagedConnection, self).read_response()
  File "/usr/local/lib/python3.6/site-packages/redis/connection.py", line 699, in read_response
    response = self._parser.read_response()
  File "/usr/local/lib/python3.6/site-packages/redis/connection.py", line 309, in read_response
    response = self._buffer.readline()
  File "/usr/local/lib/python3.6/site-packages/redis/connection.py", line 241, in readline
    self._read_from_socket()
  File "/usr/local/lib/python3.6/site-packages/redis/connection.py", line 186, in _read_from_socket
    raise ConnectionError(SERVER_CLOSED_CONNECTION_ERROR)

I would like them to stay alive, even if they are not working.

Could we add a try/except block around lines like this one?

rq/queue.py, line 441 (at 75644ba):

    result = connection.blpop(queue_keys, timeout)
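A minimal sketch of what such guarding might look like (blpop_with_retry is a hypothetical helper, and the 5-second delay is an illustrative choice, not rq's actual behavior):

    import time

    import redis

    def blpop_with_retry(connection, queue_keys, timeout, retry_delay=5):
        # Poll the queues the way rq does, but survive a temporary Redis
        # outage by sleeping and retrying instead of letting the
        # ConnectionError escape and kill the worker.
        while True:
            try:
                return connection.blpop(queue_keys, timeout)
            except redis.exceptions.ConnectionError:
                time.sleep(retry_delay)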

@selwin
Collaborator

selwin commented Dec 8, 2019

Using a process manager like systemd would help you with this.

I think changing the behavior to have workers sleep for a few seconds before retrying would also be good. Please open a PR for this.
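In the meantime, a worker can be kept alive without changing rq itself by wrapping the work loop from the outside (a rough sketch; run_worker_forever and the 5-second interval are assumptions, not rq's API):

    import time

    import redis

    def run_worker_forever(worker, retry_interval=5):
        # Re-enter Worker.work() whenever the Redis connection drops,
        # instead of letting the worker process exit.
        while True:
            try:
                worker.work()
                break  # work() returned normally, i.e. a clean shutdown
            except redis.exceptions.ConnectionError:
                time.sleep(retry_interval)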

@corynezin
Contributor Author

I did consider systemd, but it was too much of a pain to set up on Docker, where my application is running. I will look into opening a PR.

@mdawar
Contributor

mdawar commented Jun 23, 2020

> I did consider systemd, but it was too much of a pain to set up on Docker, where my application is running. I will look into opening a PR.

If you're using Docker, why not rely on Docker's restart policy? For example, --restart on-failure or --restart unless-stopped; this way the worker containers will be restarted by Docker on failure.
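For instance (the image and container names here are placeholders):

    docker run -d --name rq-worker --restart unless-stopped my-rq-worker-image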

Asrst added a commit to Asrst/rq that referenced this issue Dec 3, 2020
solves issues rq#1153, rq#998
rq workers not auto-reconnecting to the Redis server in case it is down/restarted.