Score:1

How to handle in-progress request when applying rolling update?

jp flag

Given a set of web servers subject to rolling update, such as through kubernetes rolling updates, if a request is issued to one such termination-pending web server milliseconds before a SIGTERM signal is issued to said web server,

  1. Should the server signal the client that it is being SIGTERM'd and tell the client to "try again" using a different (or same) network address (with a potential delay)?
  2. Else, could the server redirect automatically the request to another pod/instance of the webserver that has been rolled up already?
  3. In the specific case of kubernetes, could the request be sent back to the service and let it know to send it back once at least one of the pods has been rolled out?
Score:2
jp flag

When a pod is terminating it has some time (by default 30 seconds) to complete the request when it receives SIGTERM and before it gets SIGKILL. You can configure longer timeouts. There is also preStop hook is called before SIGTERM is sent to a pod. See Kubernetes best practices: terminating with grace blog post for details.

Alternatively, you can configure load balancers to retry failed requests but this work only for idempotent requests.

Philippe Hebert avatar
jp flag
Thanks for the answer, @AlexD! I was aware that incoming network activity is stopped up to `terminationGracePeriodSeconds` seconds before the issuing of `SIGTERM`, which is by default `30s`. This indeed covers most use cases as it's really rare to have requests longer than `30s`. This being said, in the case where the request is a long-winded request and it cannot resolve in said amount of time, what would you suggest the strategy be? You can assume that the request is bounded by a transaction / is indempotent when returned early.
jp flag
@PhilippeHebert it really depends. What is the cost of failing the request? What options do we have to retry this request? How long can we delay the request and what is the cost of the delaying of the successful completion of the request? Too many variables.
Philippe Hebert avatar
jp flag
Those are very relevant questions; I suggest you add them as part of your answer, alongside "Is the request idempotent?". For the sake of completeness, if the handling of these requests can be categorized in a few approaches, I would also suggest describing these approaches as well.
mangohost

Post an answer

Most people don’t grasp that asking a lot of questions unlocks learning and improves interpersonal bonding. In Alison’s studies, for example, though people could accurately recall how many questions had been asked in their conversations, they didn’t intuit the link between questions and liking. Across four studies, in which participants were engaged in conversations themselves or read transcripts of others’ conversations, people tended not to realize that question asking would influence—or had influenced—the level of amity between the conversationalists.