
Re: [HTCondor-users] condor 8.6.5



Ok, here's a little more information.  The job fails if the submit script and program are in (and run from) my home directory or the general NFS-shared scratch space.  Is there a part of the documentation that describes how the filesystem and permissions should be set up?  I'm using the basic condor configuration that comes via yum.


More information on NFS:


[nmoore@pilgrim ~]$ condor_q -hold


-- Schedd: pilgrim : <199.17.158.20:9618?... @ 08/04/17 17:31:34
 ID      OWNER          HELD_SINCE  HOLD_REASON
   2.0   nmoore          8/4  10:47 Error from slot1@pilgrim: Failed to open '/home/nmoore/condor_sub/job.out' as standard output: Permission denied (errno 13)
   3.0   nmoore          8/4  10:47 Error from slot2@pilgrim: Failed to open '/home/nmoore/condor_sub/job.out' as standard output: Permission denied (errno 13)
   4.0   nmoore          8/4  10:47 Error from slot1@pilgrim: Failed to open '/home/nmoore/condor_sub/job.out' as standard output: Permission denied (errno 13)
   5.0   nmoore          8/4  10:47 Error from slot2@pilgrim: Failed to open '/home/nmoore/condor_sub/job-2.out' as standard output: Permission denied (errno 13)
   6.0   nmoore          8/4  11:50 Error from slot1@pilgrim: Failed to open '/home/nmoore/condor_sub/job-3.out' as standard output: Permission denied (errno 13)
   7.0   nmoore          8/4  15:02 Error from slot2@pilgrim: Failed to open '/data-shared/condor-sub/job.out' as standard output: Permission denied (errno 13)

6 jobs; 0 completed, 0 removed, 0 idle, 0 running, 6 held, 0 suspended
[nmoore@pilgrim ~]$ df -h
Filesystem                    Size  Used Avail Use% Mounted on
/dev/sda2                      56G  7.3G   49G  14% /
devtmpfs                      1.9G     0  1.9G   0% /dev
tmpfs                         1.9G   84K  1.9G   1% /dev/shm
tmpfs                         1.9G   57M  1.9G   3% /run
tmpfs                         1.9G     0  1.9G   0% /sys/fs/cgroup
/dev/sda5                      43G  386M   43G   1% /data-local
/dev/sda1                     3.8G  210M  3.6G   6% /boot
199.17.158.25:/data-shared-2  298G  7.9G  291G   3% /data-shared-2
199.17.158.2:/data-shared     587G   46G  511G   9% /data-shared
tmpfs                         383M   16K  383M   1% /run/user/42
tmpfs                         383M     0  383M   0% /run/user/1001
199.17.158.2:/home/nmoore     587G  9.6G  548G   2% /home/nmoore
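
One way to test whether writing to NFS from the slot is the problem is to have HTCondor transfer the files instead. This is only a minimal sketch of a submit description file: the executable name job.sh is hypothetical (the actual submit script is not shown in this thread), and the other file names are placeholders.

```
# Hypothetical submit file; job.sh is an assumed name, not taken from this thread.
executable              = job.sh
output                  = job.out
error                   = job.err
log                     = job.log
# Run in the slot's scratch directory and copy output back when the job exits,
# rather than opening /home/nmoore/condor_sub/job.out directly from the slot.
should_transfer_files   = YES
when_to_transfer_output = ON_EXIT
queue
```

If a job submitted this way completes, that would suggest the hold comes from the account the job runs under on the slot lacking write access to the NFS directory, rather than from the program itself.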


The condor config files are as follows.

On the worker node:

[nmoore@pilgrim ~]$ cat /etc/condor/condor_config.local
CONDOR_HOST = greylag.physics.winona.edu
DAEMON_LIST = MASTER, STARTD, SCHEDD
ALLOW_WRITE = $(ALLOW_WRITE), $(CONDOR_HOST)

And on the scheduler node:

[nmoore@greylag ~]$ cat /etc/condor/condor_config.local
ALLOW_WRITE = $(ALLOW_WRITE), 199.17.158.6, 199.17.158.20, 199.17.158.21, 199.17.158.22, 199.17.158.23, 199.17.158.24, 199.17.158.25, sj5947pw72.workstations.winona.edu
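
Neither local file above sets FILESYSTEM_DOMAIN or UID_DOMAIN, so both would come from the packaged defaults unless another config file overrides them. As a sketch only, setting them explicitly on every machine might look like the following. It assumes all nodes see the same NFS mounts and the same user accounts, and physics.winona.edu is used purely as an illustrative value.

```
# Assumed values for illustration; use whatever domain actually matches the site.
UID_DOMAIN        = physics.winona.edu
FILESYSTEM_DOMAIN = physics.winona.edu
```

Running condor_config_val FILESYSTEM_DOMAIN and condor_config_val UID_DOMAIN on each machine shows the values currently in effect, and condor_reconfig picks up changes to the local config file.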





From: HTCondor-users <htcondor-users-bounces@xxxxxxxxxxx> on behalf of Moore, Nathan T <nmoore@xxxxxxxxxx>
Sent: Friday, August 4, 2017 2:47 PM
To: HTCondor-Users Mail List
Subject: Re: [HTCondor-users] condor 8.6.5
 

Home is an NFS filesystem shared via autofs.


I did the install via yum, so I'm not sure where FILESYSTEM_DOMAIN is set.



From: HTCondor-users <htcondor-users-bounces@xxxxxxxxxxx> on behalf of Ben Cotton <ben.cotton@xxxxxxxxxxxxxxxxxx>
Sent: Friday, August 4, 2017 1:17:17 PM
To: HTCondor-Users Mail List
Subject: Re: [HTCondor-users] condor 8.6.5
 
Hi Nathan,

> File permissions error?
>
Looks like it. Do your machines all have the same FILESYSTEM_DOMAIN
setting? Is /home an NFS share?


--
Ben Cotton
Technical Marketing Manager

Cycle Computing
Better Answers. Faster.

http://www.cyclecomputing.com
twitter: @cyclecomputing
_______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/htcondor-users/