
Re: [Condor-users] Problem with sshd.sh, ssh-keygen and Parallel Universe



Hi James,
I don't know what happened, but the ssh-keygen problem is now solved! All I did was restart Condor on the node, after I noticed that the node remained claimed even after the job had finished. Now I get an error from the MPI script that I modified to run Intel mpirun. I will go debug it ....
Thanks
Pierpaolo
On 16 Sep 2009, at 17:34, Burnash, James wrote:

The thing is ... what user is Condor running as, and does that user have permission to navigate down to the tmp directory?

For example, what are the permissions on /home/condor/local/execute/dir_3412?

Also, sometimes errors like this pop up because filesystems are full - probably something you've already checked, but if not it can cause weird problems ...
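The checks suggested above could be scripted roughly as follows. This is only a sketch: show_path_perms is a hypothetical helper name, not part of Condor, and it simply lists the owner and mode of every directory from the given path up to /, so a component that the job's user cannot traverse stands out.

```shell
# show_path_perms is a hypothetical helper, not a Condor tool.
# It prints owner and mode for the given path and each parent up to /.
show_path_perms() {
    p=$1
    while :; do
        ls -ld "$p"
        # Stop at the filesystem root (or at "." for a relative path).
        case "$p" in
            /|.) break ;;
        esac
        p=$(dirname "$p")
    done
}
```

For example, show_path_perms /home/condor/local/execute/dir_3412 would list each level of that path, and df -h /home/condor/local/execute would show whether the filesystem holding it is full.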

James Burnash
Unix SA
x2248

-----Original Message-----
From: condor-users-bounces@xxxxxxxxxxx [mailto:condor-users-bounces@xxxxxxxxxxx] On Behalf Of Pierpaolo Minelli
Sent: Wednesday, September 16, 2009 11:13 AM
To: Condor-Users Mail List
Subject: Re: [Condor-users] Problem with sshd.sh, ssh-keygen and Parallel Universe

Hi James,

thanks, but even after changing the permissions on the 'tmp' directory like this:

drwxrwxrwx 2 nobody nobody   4096 Sep 16 17:07 tmp

it still does not work ....

Pierpaolo

On 16 Sep 2009, at 16:40, Burnash, James wrote:

Hi Pierpaolo,

Is the condor_master daemon running as user condor, or as user nobody?
If it is running as nobody, these would seem OK, but if it is running
as user condor, those permissions will prevent processes owned by that
user from writing into that directory structure.

James Burnash
Unix SA

-----Original Message-----
From: condor-users-bounces@xxxxxxxxxxx [mailto:condor-users-bounces@xxxxxxxxxxx] On Behalf Of Pierpaolo Minelli
Sent: Wednesday, September 16, 2009 10:11 AM
To: Condor-Users Mail List
Subject: Re: [Condor-users] Problem with sshd.sh, ssh-keygen and
Parallel Universe

Hi James,
the whole directory path given to the ssh-keygen program exists, and
these are the permissions in the directory:

-rwxr-xr-x 1 nobody nobody 499183 Sep 16 15:54 a.out
-rwx------ 1 nobody nobody     50 Sep 16 15:57 chirp.config
-rwxr-xr-x 1 nobody nobody   1745 Sep 16 15:54 condor_exec.exe
-rw-r--r-- 1 nobody nobody     12 Sep 16 15:54 mpd.hosts
-rw-r--r-- 1 nobody nobody    697 Sep 16 15:57 mpi.err.0.0
-rw-r--r-- 1 nobody nobody     36 Sep 16 15:57 mpi.out.0.0
drwxr-xr-x 2 nobody nobody   4096 Sep 16 15:57 tmp

The tmp directory seems to have the right permissions.

Pierpaolo

On 16 Sep 2009, at 14:31, Burnash, James wrote:

Hi Pierpaolo,

I can get an error return code of 1 if the whole directory path
given to the ssh-keygen program doesn't exist, or is not writable
(permissions):

e.g.

ssh-keygen -q -f /tmp/junk/hostkey -t rsa -N ''

returns 1,

but

ssh-keygen -q -f /tmp/hostkey -t rsa -N ''

returns 0
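To tell the two failure causes apart before calling ssh-keygen, a pre-check along these lines might help. It is a sketch only: check_key_dir is a hypothetical helper, not part of Condor or OpenSSH, and it must be run as the same user the job runs as for the writability test to mean anything.

```shell
# check_key_dir is a hypothetical helper, not part of Condor or OpenSSH.
# Given the key file path ssh-keygen would be asked to write ($1), it
# reports whether the parent directory exists and is writable.
check_key_dir() {
    dir=$(dirname "$1")
    if [ ! -d "$dir" ]; then
        echo "missing directory: $dir"
        return 1
    fi
    if [ ! -w "$dir" ]; then
        echo "not writable by $(id -un): $dir"
        return 1
    fi
    echo "ok: $dir"
}
```

With the paths from the example above, check_key_dir /tmp/junk/hostkey should report the missing directory when /tmp/junk does not exist, while check_key_dir /tmp/hostkey should pass on a normal system.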

James Burnash

-----Original Message-----
From: condor-users-bounces@xxxxxxxxxxx [mailto:condor-users-bounces@xxxxxxxxxxx] On Behalf Of Pierpaolo Minelli
Sent: Wednesday, September 16, 2009 7:46 AM
To: condor-users@xxxxxxxxxxx
Subject: [Condor-users] Problem with sshd.sh, ssh-keygen and
Parallel Universe

Hi,

I have a problem using the MPI wrapper script on my cluster (CentOS 5.3).
Sorry for my English.
This is my cmd file:

universe = parallel
executable = mp2script
arguments = a.out
machine_count = 2
log = mpi.log.$(Process)
output = mpi.out.$(NODE).$(Process)
error = mpi.err.$(NODE).$(Process)
notification = error
should_transfer_files = yes
when_to_transfer_output = on_exit
transfer_input_files = a.out, mpd.hosts
queue

When I submit this cmd file, I get this error file (after inserting
'set -x' in mp2script):

++ trap sshd_cleanup 15
+++ condor_config_val CONDOR_SSHD
++ SSHD=/usr/sbin/sshd
+++ condor_config_val CONDOR_SSH_KEYGEN
++ KEYGEN=/usr/bin/ssh-keygen
+++ condor_config_val libexec
++ CONDOR_CHIRP=/home/condor/releases/imip8.ba.imip.cnr.it/libexec
++ CONDOR_CHIRP=/home/condor/releases/imip8.ba.imip.cnr.it/libexec/condor_chirp
++ PORT=4444
++ _CONDOR_REMOTE_SPOOL_DIR=/home/condor/local/spool/cluster62.proc0.subproc0
++ _CONDOR_PROCNO=0
++ _CONDOR_NPROCS=2
++ mkdir /home/condor/local/execute/dir_3412/tmp
++ hostkey=/home/condor/local/execute/dir_3412/tmp/hostkey
++ /bin/rm -f /home/condor/local/execute/dir_3412/tmp/hostkey /home/condor/local/execute/dir_3412/tmp/hostkey.pub
++ /usr/bin/ssh-keygen -q -f /home/condor/local/execute/dir_3412/tmp/hostkey -t rsa -N ''
++ '[' 1 -ne 0 ']'
++ echo ssh keygenerator /usr/bin/ssh-keygen returned error 0 exiting
++ exit -1
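Note that the trace prints "returned error 0" although the test just before it saw a status of 1. One plausible explanation (an assumption, since the script source is not quoted here) is that the script reads $? again after the '[' test, which by then holds the status of the test itself. A sketch of capturing the status first, using a hypothetical helper named run_and_report:

```shell
# run_and_report is a hypothetical helper, not taken from sshd.sh.
# It runs the given command, saves the exit status immediately, and
# reports the saved value rather than a later, overwritten $?.
run_and_report() {
    "$@"
    rc=$?
    if [ "$rc" -ne 0 ]; then
        echo "$1 returned error $rc, exiting" >&2
    fi
    return "$rc"
}
```

Called as run_and_report /usr/bin/ssh-keygen -q -f "$hostkey" -t rsa -N '', this would report the real status of ssh-keygen instead of 0.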

ssh-keygen does not seem to work properly, but I don't know why ....

Can someone help me solve this problem?

Best regards
Pierpaolo
_______________________________________________
Condor-users mailing list
To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx
with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/condor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/condor-users/


DISCLAIMER:
This e-mail, and any attachments thereto, is intended only for use
by the addressee(s) named herein and may contain legally privileged
and/or confidential information. If you are not the intended
recipient of this e-mail, you are hereby notified that any
dissemination, distribution or copying of this e-mail, and any
attachments thereto, is strictly prohibited. If you have received
this in error, please immediately notify me and permanently delete
the original and any copy of any e-mail and any printout thereof. E-
mail transmission cannot be guaranteed to be secure or error-free.
The sender therefore does not accept liability for any errors or
omissions in the contents of this message which arise as a result of
e-mail transmission.
NOTICE REGARDING PRIVACY AND CONFIDENTIALITY Knight Capital Group
may, at its discretion, monitor and review the content of all e-mail
communications. http://www.knight.com