Tech Support Advisory: Yum updates fail from slurm package conflicts
Posted on February 2, 2021
When performing a yum update or dnf update on your system, the update may fail with messages about conflicts between Slurm packages. This is caused by the addition of new Slurm packages in upstream repos that collide with custom packages installed by ACT.
The errors may look like some of the following:
Transaction check error:
file /usr/share/man/man3/Slurm.3pm.gz from install of slurm-perlapi-20.11.2-2.el7.x86_64 conflicts with file from package slurm-18.08.9-1.el7.x86_64
file /usr/share/man/man3/Slurm::Bitstr.3pm.gz from install of slurm-perlapi-20.11.2-2.el7.x86_64 conflicts with file from package slurm-18.08.9-1.el7.x86_64
…
or
Problem: cannot install both slurm-20.11.2-2.el8.x86_64 and slurm-20.02.3-1.el8.x86_64
package slurm-libpmi-20.02.3-1.el8.x86_64 requires libslurmfull.so()(64bit), but none of the providers can be installed
package slurm-libpmi-20.02.3-1.el8.x86_64 requires slurm(x86-64) = 20.02.3-1.el8, but none of the providers can be installed
cannot install the best update candidate for package slurm-20.02.3-1.el8.x86_64
problem with installed package slurm-libpmi-20.02.3-1.el8.x86_64
…
To work around this problem, we recommend that customers who want to keep ACT’s Slurm integration exclude Slurm packages from repo updates in the package manager configuration. This can be done by running the following command as root:
echo "exclude=slurm*" >> /etc/yum.conf
Once excluded, your yum/dnf updates will work again.
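Running the command above more than once appends a duplicate line each time. As a minimal sketch, the same change can be made idempotent with a small shell function. The function name add_slurm_exclude is hypothetical and not part of any ACT tooling, and the sketch assumes, as the command above does, that [main] is the last section in the file:

```shell
# Hypothetical helper: append "exclude=slurm*" to a yum/dnf config file
# only if that exact line is not already present.
# Assumes [main] is the last section of the file, as in a stock yum.conf.
add_slurm_exclude() {
    conf="${1:-/etc/yum.conf}"
    # -q: quiet, -x: match the whole line, -F: treat the pattern as a fixed string
    if grep -qxF 'exclude=slurm*' "$conf" 2>/dev/null; then
        echo "exclude already present in $conf"
    else
        echo 'exclude=slurm*' >> "$conf"
        echo "exclude added to $conf"
    fi
}
```

Run it as root with no arguments to target /etc/yum.conf, or pass another path to try it on a copy first. If the file already contains a different exclude= line, it is probably safer to merge the entries by hand than to rely on this sketch.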
Background/Cause
In response to a bug report[1], a new package, slurm-20.11.2-2, was added to the EPEL 7[2] and 8[3] repositories on January 23rd, 2021. The Slurm packages provided by ACT are built on your system from source downloaded from SchedMD, the developers of Slurm. The primary package name, slurm, is the same for both, so the package manager attempts to install the newer version. The two sets of packages are structured differently, however, so the package manager aborts the update to keep the system safe.
Keeping the ACT version of Slurm
We haven’t fully evaluated the new package provided by EPEL, so we can’t recommend that anybody switch to it at this time unless they are confident in Slurm administration and fully willing to handle management on their own. We have a few concerns with the EPEL-provided version and will take a closer look at whether using it is worthwhile.
Some of the configuration choices we’ve made with Slurm are targeted at the majority of our customer base. For instance, the Slurm configuration files reside in /opt/slurm (/act/slurm on EL 7 installations), which is NFS mounted across the cluster. With this layout, a single file modification is instantly visible to all nodes, making management simpler and less error prone.
Another concern is how the repo-provided Slurm package manages updates. Minor updates within major Slurm versions are relatively safe to perform. However, major version updates require more scrutiny. In the past, these updates have required backing up and upgrading the backend database (MariaDB), or have added or deprecated settings in the config. Every Slurm update should begin with a review of the Slurm documentation, so we recommend against a blind update from a repo unless appropriate safeguards are added to the RPMs. See also the “Upgrade” section of Slurm’s Quick Start Administration Guide[4] for more detail.
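To illustrate the distinction between minor and major updates, here is a rough sketch that classifies a version change. It assumes Slurm's version scheme, in which the first two dot-separated numbers (for example 20.11) identify the major release and the third is the maintenance level. The function names are hypothetical and not part of Slurm or ACT tooling:

```shell
# Hypothetical helpers for illustration only.
# Print the major release (first two dot-separated fields) of a version string,
# e.g. "20.11.2" yields "20.11".
slurm_major() {
    echo "$1" | cut -d. -f1,2
}

# Return 0 (true) when moving from version $1 to version $2 crosses a major release.
is_major_upgrade() {
    [ "$(slurm_major "$1")" != "$(slurm_major "$2")" ]
}
```

For example, is_major_upgrade 20.02.3 20.11.2 succeeds, which signals that the upgrade documentation should be reviewed first, while is_major_upgrade 20.11.1 20.11.2 does not. A check like this only flags the kind of update; it does not replace reading the release notes.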
Contact your support team at Advanced Clustering Technologies if you would like to discuss Slurm packages and managing updates.
[1] – https://bugzilla.redhat.com/show_bug.cgi?id=1912491
[2] – https://lists.fedoraproject.org/archives/list/[email protected]/message/2NOE6TYZKUCRYEY4Z754IGHAORRZG6SC/
[3] – https://lists.fedoraproject.org/archives/list/[email protected]/message/KPO7IONLWK627CTAG2JWBEIOE7LCK7BQ/
[4] – https://slurm.schedmd.com/quickstart_admin.html