Skip to content
This repository was archived by the owner on Oct 24, 2023. It is now read-only.

fix: retry more aggressively when install the gpu deb package. (#3065)#3068

Merged
andyliuliming merged 1 commit intoAzure:aks-release-v0.47.0-1from
andyliuliming:andliu/pickaggressiveretry
Apr 14, 2020
Merged

fix: retry more aggressively when install the gpu deb package. (#3065)#3068
andyliuliming merged 1 commit intoAzure:aks-release-v0.47.0-1from
andyliuliming:andliu/pickaggressiveretry

Conversation

@andyliuliming
Copy link
Copy Markdown
Member

Reason for Change:

Issue Fixed:

Requirements:

Notes:

@xuto2
Copy link
Copy Markdown
Contributor

xuto2 commented Apr 14, 2020

/lgtm

@andyliuliming andyliuliming merged commit 925bec3 into Azure:aks-release-v0.47.0-1 Apr 14, 2020
wait_for_apt_locks
retrycmd_if_failure 30 5 3600 apt-get -o Dpkg::Options::="--force-confold" install -y nvidia-container-runtime="${NVIDIA_CONTAINER_RUNTIME_VERSION}+${NVIDIA_DOCKER_SUFFIX}" || exit $ERR_GPU_DRIVERS_INSTALL_TIMEOUT
{{/* if the unattened upgrade is turned on, and it may takes 10 min to finish the installation, and we use the 1 second just to try to get the lock more aggressively */}}
retrycmd_if_failure 600 1 3600 apt-get -o Dpkg::Options::="--force-confold" install -y nvidia-container-runtime="${NVIDIA_CONTAINER_RUNTIME_VERSION}+${NVIDIA_DOCKER_SUFFIX}" || exit $ERR_GPU_DRIVERS_INSTALL_TIMEOUT
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please add: lsof /var/lib/dpkg/lock-frontend and lsof /var/lib/dpkg/lock when dpkg failed so that we log who is holding the lock.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants