Add readiness check to HTTP server #1855

kylemclaren · 2024-08-06T13:42:59Z

This differs from the /health-check endpoint in that it only returns 200 if the health status is READY — i.e. ready to make a prediction.
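To make the distinction concrete, here is a minimal sketch of the two behaviours. It is not Cog's actual implementation: the `Health` enum values other than READY, both function names, and the 503 status for the not-ready case are assumptions made for illustration.

```python
from enum import Enum


class Health(Enum):
    # Hypothetical status values; only READY is named in this PR.
    STARTING = "STARTING"
    READY = "READY"
    BUSY = "BUSY"


def health_check_response(status: Health) -> tuple[int, dict]:
    # Health-check style: always answers 200 and reports the status in the body.
    return 200, {"status": status.value}


def readiness_response(status: Health) -> tuple[int, dict]:
    # Readiness style: 200 only when READY. The 503 is an assumed choice.
    if status is Health.READY:
        return 200, {"status": "ready"}
    return 503, {"status": "not ready"}
```

A proxy that only looks at the status code can act on `readiness_response`, whereas with `health_check_response` it would have to parse the body.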

In my particular use-case, this will be useful when running Cog on Fly.io GPUs (see here, here and here) with cold start requests. In my testing, I would regularly hit 409 - Conflict ("Already running a prediction") when sending an HTTP request to Cog while the VM was starting. This occurs even with the idempotent endpoint. I am not sure if this is intended behaviour, but it's not working for my use-case.

A readiness endpoint would allow me to set a health check to only forward the request to the Machine when it's ready to make a prediction:

  [[http_service.checks]]
    grace_period = "60s"
    interval = "30s"
    method = "GET"
    timeout = "5s"
    path = "/ready"
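
For anyone who wants the same gating without a proxy-level check, the polling behaviour can be approximated client-side. This is only a sketch: `probe_ready` and `wait_until_ready` are hypothetical helpers, the `/ready` path is the one proposed in this PR, and treating any non-2xx response or connection error as "not ready" is an assumption.

```python
import time
import urllib.error
import urllib.request


def probe_ready(base_url: str, timeout: float = 5.0) -> bool:
    # GET <base_url>/ready; a 2xx answer counts as ready, anything else does not.
    try:
        with urllib.request.urlopen(f"{base_url}/ready", timeout=timeout) as resp:
            return 200 <= resp.status < 300
    except (urllib.error.URLError, OSError):
        # Covers non-2xx responses (HTTPError), refused connections and timeouts.
        return False


def wait_until_ready(probe, interval: float = 30.0, max_wait: float = 300.0,
                     sleep=time.sleep) -> bool:
    # Call `probe` every `interval` seconds until it returns True
    # or at least `max_wait` seconds have been spent sleeping.
    waited = 0.0
    while True:
        if probe():
            return True
        if waited >= max_wait:
            return False
        sleep(interval)
        waited += interval
```

Usage would look like `wait_until_ready(lambda: probe_ready("http://localhost:5000"))` before sending the prediction request; the URL here is a placeholder.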

Happy to elaborate further!

add readiness check

d486018
