-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Can't rebuild container image from Dockerfile #3
Comments
I can confirm this recipe is not working, though I'm not sure we are having the same error. The error I encountered is The reason why it worked in the past but does not now is probably that NVIDIA keeps updating the base image @khinsen Except for the above reason, another possibility is out-of-memory. By default, this recipe uses all cores (w/ hyperthreading) to build AMGX, so it requires a huge memory space. But if you didn't experience laggy responses when the failure happened, then out-of-memory is probably not the cause. |
@piyueh No laggy responses indeed. I have 8 GB of memory, but I don't know how much of that can be attributed to Docker. Your explanation sounds quite possible, and it's the main issue I see with using containers for reproducibility: most container images are themselves not reproducible. In this case, the base images keeps changing, but on top of that, the Dockerfile does Another potential issue is that the NVIDIA Cuda images comes with the statement that "The NVIDIA Container Toolkit for Docker is required to run CUDA images." I definitely don't have that toolkit. But I am not trying to run the image, I am just trying to build it. Does this also require the NVIDIA toolkit? |
@khinsen You are completely right, commands such as As for the issue reported here, I think @piyueh gave the right answer.
which was the consequence of bumping the CUDA version in the base image ( Of course, there are ways to update the Dockerfile to make the build successful, but that does not resolve the reproducibility issue with images. Also, you do not need a NVIDIA GPU device or the CUDA Toolkit installed on the host to build the image. |
Given the importance of both CUDA and reproducibility in scientific computing, I wonder if NVIDIA could be convinced to provide reproducible Docker images, at least from time to time, much like Ubuntu's LTS releases. Technically, one option would be to use Debian's Debuerreotype as the lowest-layer image on which to build. Alternatively, at least archive their images in a more permanent way, e.g. on Zenodo. |
I made an attempt to rebuild locally the container image for
Dockerfile-0.5.1-GPU-OpenMPI-xenial-devel
, using the command line fromdocker/README.md
:The build is aborted with the following error message:
I am using Docker Desktop 3.3.3(64133) under macOS, on a computer that has no NVIDIA GPU nor any NVIDIA software installed. If NVIDIA GPUs and/or drivers are a requirement for building the image, it would be nice to indicate this in the README.
The text was updated successfully, but these errors were encountered: