Following this guide:
https://jabriffa.wordpress.com/2015/02/11/installing-torquepbs-job-scheduler-on-ubuntu-14-04-lts/
I installed TORQUE on Ubuntu 16-04-lts (he claims the process works the same on 16.04)
A short summary of his install instructions so this can be self sufficient:
apt-get install torque-server torque-client torque-mom torque-pam
/etc/init.d/torque-mom stop
/etc/init.d/torque-scheduler stop
/etc/init.d/torque-server stop
pbs_server -t create
killall pbs_server
echo SERVER.DOMAIN > /etc/torque/server_name
echo SERVER.DOMAIN > /var/spool/torque/server_priv/acl_svr/acl_hosts
echo [email protected] > /var/spool/torque/server_priv/acl_svr/operators
echo [email protected] > /var/spool/torque/server_priv/acl_svr/managers
echo "SERVER.DOMAIN np=4" > /var/spool/torque/server_priv/nodes
echo SERVER.DOMAIN > /var/spool/torque/mom_priv/config
/etc/init.d/torque-server start
/etc/init.d/torque-scheduler start
/etc/init.d/torque-mom start
# set scheduling properties
qmgr -c 'set server scheduling = true'
qmgr -c 'set server keep_completed = 300'
qmgr -c 'set server mom_job_sync = true
After following his directions up to:
qmgr -c 'set server scheduling = true'
I get the error message
qmgr obj=master.node svr=master.node: Unauthorized Request
I grep the logs as he mentions and find this unhelpful snippet: grep Unauthorized /var/spool/torque/server_logs/*
08/25/2018 15:48:43;0080;PBS_Server;Req;req_reject;Reject reply code=15007(Unauthorized Request ), aux=0, type=Manager, from [email protected]
This is my hostname:
master
This is my hosts file:
127.0.1.1 master master
127.0.0.1 localhost
10.136.7.155 master.node
10.136.7.155 master
10.136.65.29 slave1
10.136.73.247 slave2
10.136.44.128 slave3
This is how I configured the various config files:
echo master.node > /etc/torque/server_name
echo master.node > /var/spool/torque/server_priv/acl_svr/acl_hosts
echo [email protected] > /var/spool/torque/server_priv/acl_svr/operators
echo [email protected] > /var/spool/torque/server_priv/acl_svr/managers
echo "master.node np=4" > /var/spool/torque/server_priv/nodes
echo master.node > /var/spool/torque/mom_priv/config
Each time I fiddle with it I restart the various daemons with:
/etc/init.d/torque-server restart
/etc/init.d/torque-scheduler restart
/etc/init.d/torque-mom restart
I am currently running as root.
I am at a total loss as to what TORQUE wants here. Why am I unauthorized?
Also qmgr thinks there are no nodes despite having a /var/spool/torque/server_priv/nodes file. Why?
Qmgr: list node
No Active Nodes, nothing done.