
So, for context, my experience is limited to trying to get a MariaDB Galera cluster running, specifically using the Bitnami image. So my issues might not apply to every single stateful app out there. I'm also running all of this on vSphere in our own data center, not in the cloud.

Swarm does not support dependencies between services. See [0]. It also does not support deploying replicas one at a time. See [1] where I'm asking for that support.

In the case of Galera, you need a master node to be up and running before you add any new nodes. I'm pretty sure that when you're initializing any kind of stateful clustered app, you'd want to start one node at a time to be safe. You can't do that in Swarm using a replicated service: all replicas start at the same time.

Using a service per instance might work, but you need to be sure you have storage figured out so that when you update your stack to add a new service, the initial service comes back up with the data it was initialized with. (When you redeploy a stack to add a new service, the existing services get restarted too. If I'm remembering what I found correctly.)
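For reference, a minimal sketch of the service-per-instance layout I mean, one service and one volume per Galera node (the Bitnami env var names here are from their docs as I remember them; treat this as an untested sketch, not a working config):

```yaml
services:
  galera-1:
    image: bitnami/mariadb-galera:10.11
    environment:
      # The first service bootstraps the cluster
      - MARIADB_GALERA_CLUSTER_BOOTSTRAP=yes
    volumes:
      - galera-1-data:/bitnami/mariadb
  galera-2:
    image: bitnami/mariadb-galera:10.11
    environment:
      # Later services join via the first one
      - MARIADB_GALERA_CLUSTER_ADDRESS=gcomm://galera-1,galera-2
    volumes:
      - galera-2-data:/bitnami/mariadb

volumes:
  galera-1-data:
  galera-2-data:
```

The catch described above still applies: adding galera-3 later means redeploying the stack, which touches the existing services too.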

Then there's updating services/replicas. You cannot have Swarm kill a service/replica until after the replacement is actually up and running. Which means you'll need to create a new volume every time you need to upgrade, otherwise you'll end up with two instances of your app using the same data.

To complicate things, as far as I can tell, Swarm doesn't yet support CSI plugins. So you're pretty much stuck with local or NFS storage. If you're using local storage when deploying new replicas/services, you'd better hope the first replica/service starts up on the same node it was on before...

All that combined means I haven't figured out how I can run a Galera cluster on Swarm. Even if I use a service per instance, updates are going to fail unless I do some deep customization on the Galera image I'm using to make it use unique data directories per startup. Even if I succeed in that, I'll still have to figure out how to clean out old data... I mean, I could manually add a new volume and service, then manually remove the old volume and service for each instance of Galera I'm running. But at that point, why bother with containers?

Anyway, I'm pretty sure I've done my research and am correct on all of this, but I'd be happy to be proven wrong. Swarm/Portainer/Traefik is a really really nice stack...

[0] https://github.com/moby/moby/issues/31333 [1] https://github.com/moby/moby/issues/43937



If you are interested in making this work within these constraints, I am sure there is a way to work around all of these issues.

About [0]/[1]: I guess you are right that this doesn't work out of the box, but it could possibly be worked around with a custom entrypoint that behaves differently depending on which slot the task is running in.
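Something like this is what I have in mind: Swarm expands service templates like `{{.Task.Slot}}` in environment values, so a wrapper entrypoint can branch on the replica's slot number (untested sketch; `TASK_SLOT` is a name I made up, and the Bitnami-specific handoff would need checking):

```shell
#!/bin/sh
# Sketch of a wrapper entrypoint. The service definition would set:
#   environment:
#     - TASK_SLOT={{.Task.Slot}}
# which Swarm templates to the replica's slot number (1, 2, ...).

TASK_SLOT="${TASK_SLOT:-1}"

if [ "$TASK_SLOT" = "1" ]; then
  # Slot 1 acts as the bootstrap node
  export MARIADB_GALERA_CLUSTER_BOOTSTRAP=yes
  echo "slot $TASK_SLOT: bootstrapping cluster"
else
  export MARIADB_GALERA_CLUSTER_BOOTSTRAP=no
  echo "slot $TASK_SLOT: joining existing cluster"
fi

# Then hand off to the image's real entrypoint, e.g.:
# exec /opt/bitnami/scripts/mariadb-galera/entrypoint.sh "$@"
```

Whether the Bitnami image tolerates the non-slot-1 replicas starting before slot 1 is healthy is a separate question, of course.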

> (Since when you restart a stack to add the new service, the old service will also get restarted. If I'm remembering what I found correctly.)

Are you sure the Docker Image digest did not change? Have you tried pinning an actual Docker Image digest?
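For clarity, what I mean by pinning is referencing the image by digest in the compose file, so a rebuild pushed under the same tag can't change what the service resolves to (the digest below is a placeholder; `docker images --digests` shows the real one):

```yaml
services:
  galera:
    # With @sha256:... present, the tag is informational only
    image: bitnami/mariadb-galera:10.11@sha256:<digest-goes-here>
```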

> Then there's updating services/replicas. You cannot have Swarm kill a service/replica until after the replacement is actually up and running. Which means you'll need to create a new volume every time you need to upgrade, otherwise you'll end up with two instances of your app using the same data.

Is this true even with "oder: stop-first"?

> To complicate things, as far as I can tell, Swarm doesn't yet support CSI plugins. So you're pretty much stuck with local or nfs storage. If you're using local storage when deploying new replicas/services, you better hope the first replica/service starts up on same node it was on before...

True, but there are still some volume plugins that work around this, and local storage should work if you use labels to pin the replicas to nodes.


Finally have time to look into your suggestions. Hopefully you check your comments every once in a while...

> Are you sure the Docker Image digest did not change? Have you tried pinning an actual Docker Image digest?

Mostly sure. Many of my tests only changed the docker-compose file, not the actual image. So even though GitLab was rebuilding the image, the image digest would not have changed. I'll try to find time to pin the digest just to double check, though.

> Is this true even with "oder: stop-first"?

Er, did you mean "order"? I only see `--update-order` as a flag on the `docker service update` command. I do not see it in the docker-compose specification. So far all my tests have been through Portainer's stack deployment feature. So all changes are in my docker-compose file.

Maybe it would just work if I stuck it in the deploy.update section? I'll try.

> True, but there are still some volume plugins that work around and local storage should work if you use labels to pin the replicas to nodes.

I have tried pinning specific services to specific nodes to make local storage work. And I've used labels to force only one replica per node when using replicas.
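For what it's worth, the pinning looks roughly like this in my compose files (the `galera` label name is just my own convention):

```yaml
services:
  galera-1:
    deploy:
      placement:
        constraints:
          # The node must be labelled first, e.g.:
          #   docker node update --label-add galera=node1 <node-name>
          - node.labels.galera == node1
```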

What volume plugins are you thinking of? I haven't found any that seem to be maintained, outside of local storage and NFS. And maybe some that would work if I were on some cloud host...

Anyway, thanks for giving me a couple things to try. :)


About the order: I mean the `order` in the deploy section:

```
deploy:
  mode: replicated
  replicas: 3
  update_config:
    order: stop-first
    parallelism: 1
  rollback_config:
    order: stop-first
    parallelism: 1
  restart_policy:
    condition: on-failure
```

For the volume plugins:

We are using Hetzner, and this one works great: https://github.com/costela/docker-volume-hetzner . Also, there exists one for glusterfs (https://github.com/chrisbecke/glusterfs-volume).


Thanks!

I also found the docs: https://docs.docker.com/compose/compose-file/deploy/#update_... Not sure how I missed that when I was looking at it before... :\

I'll look into those plugins.


I learned about a lot of things by watching videos by Bret Fisher. He has a lot of good resources on running Docker Swarm.



