Guillaume Eynard-Bontemps, CNES (Centre National d’Etudes Spatiales - French Space Agency)
2020-11-16
Thanks to Florient Chouteau and Dennis Wilson
for their work on this subject.
I took most of the content from theirs:
Data Science is about reproducibility
So… containers ?
What’s a container?
Docker is a solution that standardizes packaging and execution of software in isolated environments (containers) that share resources and can communicate between themselves
Build, Share, and Run Any App, Anywhere
Docker is some fancy tech over linux kernel capabilities (containers)
Docker:
Python:
FROM python:3.7
ENV MYVAR="HELLO"
RUN pip install torch
COPY my-conf.txt /app/my-conf.txt
ADD my-file.txt /app/my-file.txt
EXPOSE 9000
WORKDIR "/WORKDIR"
USER MYUSER
ENTRYPOINT ["/BIN/BASH"]
CMD ["ECHO” , "${MYVAR}"]
“Docker for HPC”
“Rootless Docker for Redhat”
What’s Docker typical workflow?