Restart snapshotter gracefully even if there are running containers #151

luodw · 2020-09-22T15:39:40Z

recently, I have tried containerd lazy load feature. stargz-snapshotter use fusefs to fetch file data on demand, and in fuse userspace handler, if file is cached locally, it will return directly, if not cached locally, it will fetch from docker registry.

my question is, if stargz-snapshotter restart abnormally or when upgrade, the fuse connections will break, so container process read will failed. Is there some good practice?

ktock · 2020-09-23T02:29:05Z

@luodw Thanks for the question! Though we have graceful shutdown on SIGINT (#26), recovery on abnormal shutdown / support for service restart are in progress (#134). Very welcome for contribution.

luodw · 2020-09-23T06:49:47Z

@luodw Thanks for the question! Though we have graceful shutdown on SIGINT (#26), recovery on abnormal shutdown / support for service restart are in progress (#134). Very welcome for contribution.

Thanks for your reply, I got it.

ktock · 2020-09-24T01:26:19Z

@luodw Can you check if the master version (contains the patch #134) fixes this issue?

luodw · 2020-09-24T08:45:28Z

@luodw Can you check if the master version (contains the patch #134) fixes this issue?

I hava tried the latest master branch (containes the patch #134 ), but when I 'kill -9 ', and restart right now, the container still has err

The follow steps reproduce the issue

ctr-remote images rpull docker.io/stargz/golang:1.12.9-esgz
ctr-remote run --rm -t --snapshotter=stargz docker.io/stargz/golang:1.12.9-esgz test /bin/bash
kill -9 and restart right now
run some commands in container

ktock · 2020-09-25T02:40:23Z

Currently, you need to re-run containers too. And I agree with that the snapshotter needs to be able to gracefully restart even if there are running containers.

luodw · 2020-09-25T02:52:07Z

Currently, you need to re-run containers too. And I agree with that the snapshotter needs to be able to gracefully restart even if there are running containers.

Ok，I also think the ideal usage is when snapshotter restarts, the running containers can still run normally.

amrmahdi · 2021-01-14T20:28:44Z

@ktock can you describe what is required to do an update/restart to the snapshotter in a running cluster for instance? How do you do that today?

ktock · 2021-01-15T00:35:55Z

Currently, we need to kill all containers running on that node before restarting this snapshotter and re-deploy these containers after the snapshotter restarts.

One of the idea to solve this issue is spawning the FUSE server as a separated process instead of goroutine as done today.

luodw closed this as completed Sep 23, 2020

luodw reopened this Sep 23, 2020

ktock changed the title ~~If stargz-snapshotter daemon process restart?~~ Restart snapshotter gracefully even if there are running containers Sep 25, 2020

ktock added the enhancement New feature or request label Sep 25, 2020

ktock added the help wanted Extra attention is needed label Oct 7, 2020

ktock mentioned this issue May 10, 2021

restore error blocked stargz restart #314

Open

ilyee mentioned this issue May 11, 2021

Graceful restart using separate FUSE manager daemon #318

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Restart snapshotter gracefully even if there are running containers #151

Restart snapshotter gracefully even if there are running containers #151

luodw commented Sep 22, 2020

ktock commented Sep 23, 2020

luodw commented Sep 23, 2020

ktock commented Sep 24, 2020

luodw commented Sep 24, 2020 •

edited

Loading

ktock commented Sep 25, 2020

luodw commented Sep 25, 2020

amrmahdi commented Jan 14, 2021

ktock commented Jan 15, 2021

Restart snapshotter gracefully even if there are running containers #151

Restart snapshotter gracefully even if there are running containers #151

Comments

luodw commented Sep 22, 2020

ktock commented Sep 23, 2020

luodw commented Sep 23, 2020

ktock commented Sep 24, 2020

luodw commented Sep 24, 2020 • edited Loading

ktock commented Sep 25, 2020

luodw commented Sep 25, 2020

amrmahdi commented Jan 14, 2021

ktock commented Jan 15, 2021

luodw commented Sep 24, 2020 •

edited

Loading