Re-imaging a compute node back to a working state
If you accidentally misconfigure software on a cluster compute node, you can always revert it to a working image. To prepare a node for imaging, first set it to boot into the cloner3 image the next time it powers on:
$ act_netboot -n <node name> -set=cloner3
Next, reboot the machine, either at the node itself or remotely with:
$ act_powerctl -n <node name> reboot
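The two steps above can be combined into a small shell helper. This is only a sketch: the function name `reimage_node` and the `DRY_RUN` variable are hypothetical and not part of the ACT utilities, and the `act_netboot` and `act_powerctl` invocations are copied as written in this article, so check them against your installed versions before relying on it.

```shell
#!/bin/sh
# Hypothetical helper (not part of the ACT utilities): queue a node for
# re-imaging with cloner3, then reboot it remotely.
# Set DRY_RUN=1 to print the commands instead of running them.
reimage_node() {
    node="$1"
    if [ -z "$node" ]; then
        echo "usage: reimage_node <node name>" >&2
        return 1
    fi
    # Commands are taken verbatim from the steps above.
    for cmd in "act_netboot -n $node -set=cloner3" \
               "act_powerctl -n $node reboot"; do
        if [ "${DRY_RUN:-0}" = "1" ]; then
            echo "$cmd"
        else
            # Stop if a step fails, so a node is never rebooted
            # without its netboot target having been set.
            $cmd || return 1
        fi
    done
}
```

Running it with `DRY_RUN=1` set first lets you review the exact commands before anything touches the node.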
Remember: every time you change software or configuration on your compute nodes, you should update their backup image. To update an image, log in to the compute node whose image you want to update and run:
$ /act/cloner/bin/cloner --server=<head hostname> --image=<image name> --update