When you build your multi-stage Dockerfile with
docker build -t myimage .
it produces the final image tagged myimage
, and also intermediate images. To be completely clear we are talking here not about containers, but about images. It looks like this:
See these <none>
images? These are what I'm talking about.
Now this "issue" has been discussed to some extent here and here.
Here are some relevant parts:
If these intermediate images would be purged/pruned automatically, the build cache would be gone with each build, therefore forcing you to rebuild the entire image each time.
So okay, it does not make sense to prune then automatically.
Some people do this:
For now, I'm using
docker image prune -f
after mydocker build -t app .
command to cleanup those intermediate images.
But unfortunately this is not something I can do. As one discussion participant commented:
It removes "all dangling images", so in shared environments (like Jenkins slave) it's more akin to shooting oneself in the foot. :)
And this is a scenario I found myself in.
So nothing to be "fixed" on Docker side. But how can I remove those extra images, from a single particular build only?
Update
After reading very nice answer from d4nyll below, which is a big step forward, I'd like to add some more constraints to the question ;) First, let me sum up the answer:
- One can use ARG to pass a build id from CI/CD to
Dockerfile
builder - Then one can use LABEL syntax to add build id metadata to the stage images being built
- Then one can use the
--filter
option of docker image prune command to remove only the images with the current build id
This is a big step forward, but I'm still struggling into how to fit it into my usage scenario without adding unnecessary complexity.
In my case a requirement is that application developers who author the Dockerfiles and check them into the source control system are responsible for making sure that their Dockerfiles build the image to their satisfaction. They are not required to craft all their Dockerfiles in a specific way, "so our CI/CD process does not break". They simply have to provide a Dockerfile
that produce correct docker image.
Thus, I'm not really in a position to request them to add stuff in the Dockerfile
for every single application, just for the sake of CI/CD pipeline. This is something that CI/CD pipeline is expected to handle all by itself.
The only way I can see making this work is to write a Dockerfile
parser, that will detect multi-staged build and inject a label per stage and then build that modified Dockerfile
. This is a complexity that I'm very hesitant to add to the CI/CD pipeline.
Do I have a better (read simpler) options?
question from:https://stackoverflow.com/questions/50126741/how-to-remove-intermediate-images-from-a-build-after-the-build