本文介绍了运行多个辅助守护程序SLURM的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我想在一台机器上运行多个工作守护程序。根据 damienfrancois 的回答slurm群集的最小计算机数> slurm群集的最小计算机数是多少可以做到。问题是当前我只能在一台计算机上执行1个工作守护程序。例如,

I want to run multiple worker daemons on single machine. As per damienfrancois's answer on what is the minimum number of computers for a slurm cluster it can be done. Problem is currently I am able to execute only 1 worker daemon on one machine. for example

当我运行

sudo slurmd -N linux1 -cDvv
sudo slurmd -N linux2 -cDvv

当我运行linux2时,linux1掉线了。是否可以在一台计算机上运行多个辅助守护程序?
这是我的文件

linux1 goes down when I run linux2. Is it possible to run multiple worker daemons on one machine?Here is my slurm.conf file

推荐答案

由于您的意图似乎只是测试Slurm的行为,因此建议您使用前端模式可以在同一台计算机上创建虚拟计算节点。

as your intention seems to be just testing the behavior of Slurm, I would recommend you to use the front-end mode, where you can create dummy computation nodes in the same machine.

在其,您有更多详细信息,但基本上您必须配置安装才能使用此模式:

In their FAQ, you have more details, but basically you must configure your installation to work with this mode:

./configure --enable-front-end

并在中配置节点slurm.conf

NodeName=test[1-100] NodeHostName=localhost

在该指南中,他们还解释了如何通过更改端口在同一节点中启动多个真实守护进程,但出于测试目的

In that guide, they also explain how to launch more than one real daemons in the same node by changing the ports, but for my testing purposes it was not necessary.

祝你好运!

这篇关于运行多个辅助守护程序SLURM的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持!

06-29 16:19