SGE error: can't open output file


I'm trying to run PsN with run_on_sge but keep getting this error when I submit multiple runs simultaneously. I'm also getting "error: can't chdir to directory".

I suspect the PsN scripts may not be creating the run directories before submitting the job but am not certain. I'm also a little unsure of how to test this.

The nodes have access to the NFS-mounted directories in question and the permissions look fine. Any pointers appreciated!


There is 1 answer

Michael Wilson

Note to self: don't mess up the UIDs when creating user accounts on the compute nodes.

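The UIDs did not match between three compute nodes and the head node, which resulted in intermittent failures with the error mentioned. Whenever a job was dispatched to a node with a mismatched UID, it failed with what now seems to be a pretty obvious error message. NFS typically checks permissions by numeric UID, so on those nodes the job presumably ran as a user that did not own the run directory and could neither chdir into it nor open the output file.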
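For anyone wanting to check for the same problem, here is a rough sketch of how the UIDs could be compared across nodes. It assumes passwordless ssh from the head node to each compute node and that `id -u <user>` is available there; the host names and user name in the usage note are hypothetical placeholders, and none of this is part of PsN or SGE.

```python
import subprocess


def remote_uid(host, user):
    """Return the numeric UID of `user` on `host` by running `id -u` over ssh.

    Assumes passwordless ssh access to `host` (an assumption, not a given).
    """
    result = subprocess.run(
        ["ssh", host, "id", "-u", user],
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if ssh or id fails
    )
    return int(result.stdout.strip())


def find_uid_mismatches(uids_by_host, reference_host):
    """Return {host: uid} for every host whose UID differs from the reference host's."""
    expected = uids_by_host[reference_host]
    return {host: uid for host, uid in uids_by_host.items() if uid != expected}


def check_cluster(hosts, user, reference_host):
    """Collect the UID of `user` on each host and report those that disagree
    with `reference_host` (which must be one of `hosts`)."""
    uids = {host: remote_uid(host, user) for host in hosts}
    return find_uid_mismatches(uids, reference_host)
```

Calling something like `check_cluster(["head", "node01", "node02", "node03"], "psnuser", "head")` should return an empty dict when all UIDs agree, and otherwise a dict of the nodes whose UID differs from the head node's. The comparison is kept in its own function so it can be checked without any ssh access.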