[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] Condor-C Detected Down Grid Resource [Sec=Unclassified]




Can you query the remote schedd from one of the laptops?

condor_q -pool erm-43880.yyy.zzz -name erm-43880.yyy.zzz

--Dan

Troy Robertson wrote:

I’m having trouble converting over to Condor-C, the manual instructions look simple enough but there doesn’t seem to be any tutorial or more in-depth how-to on the subject on the website anywhere.

I am trying to do this as we have laptop users who want to submit jobs to the pool and be able to take their laptop home at night and come back in the morning and receive results.

We have a linux pool of execute machines with a linux central manager. Submit machines are all Windows.

I would like to submit to the central manager of the condor pool (erm-43880.yyy.zzz).

I have collector daemon running on submit and schedd running on remote central manager.

I have installed Condor as Personal pool on submit machines.

I have modified submit config with:

CONDOR_GAHP=$(SBIN)/condor_c-gahp

C_GAHP_LOG=/tmp/CGAHPLog.$(USERNAME)

C_GAHP_WORKER_THREAD_LOG=/tmp/CGAHPWorkerLog.$(USERNAME)

And added to central manager and execute machines:

SEC_DEFAULT_NEGOTIATION = OPTIONAL

SEC_DEFAULT_AUTHENTICATION_METHODS = CLAIMTOBE

Submit file:

universe = grid

Executable = hello

output = hello_output.txt

error = hello_error.txt

log = hello_log.txt

notification = never

grid_resource = condor erm-43880.yyy.zzz erm-43880.yyy.zzz

+remote_universe = vanilla

+remote_requirements = True

+remote_ShouldTransferFiles = "YES"

+remote_whentotransferoutput = "ON_EXIT"

Queue

Job log contains:

000 (046.000.000) 06/03 13:30:46 Job submitted from host: <147.66.11.17:14672>

...

020 (046.000.000) 06/03 13:31:09 Detected Down Globus Resource

RM-Contact: erm-43880

...

026 (046.000.000) 06/03 13:31:09 Detected Down Grid Resource

GridResource: condor erm-43880 erm-43880

I can see the grid manager process and gahp and gahp_worker processes start up but the jobs just sit there idle.

Remote central manager logs contain no indication that a job is being submitted.

Can anyone please help?

___________________________________________________________________________

Australian Antarctic Division - Commonwealth of Australia
IMPORTANT: This transmission is intended for the addressee only. If you are not the intended recipient, you are notified that use or dissemination of this communication is strictly prohibited by Commonwealth law. If you have received this transmission in error, please notify the sender immediately by e-mail or by telephoning +61 3 6232 3209 and
DELETE the message.
Visit our web site at http://www.antarctica.gov.au/
___________________________________________________________________________

------------------------------------------------------------------------

_______________________________________________
Condor-users mailing list
To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/condor-users

The archives can be found at: https://lists.cs.wisc.edu/archive/condor-users/