
Re: [Condor-users] Condor-C Detected Down GridResource




If the schedd is not advertised to the collector that you specify as the last argument in grid_resource, then Condor-C will not be able to contact the remote schedd. Have you specified the wrong collector? Or is there some problem preventing the schedd from advertising itself to that collector?

You can see all of the schedd ClassAds known to a collector with this command:

condor_status -pool <collector address> -schedd
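
As a minimal sketch of how the pieces fit together, the helper below splits a Condor-C grid_resource value of the form "condor <schedd name> <collector address>" and prints the two diagnostic commands used in this thread. The function name diag_commands is hypothetical, and it only prints the commands; it does not run them or contact any daemon.

```shell
# Sketch only: build the diagnostic commands for a Condor-C grid_resource value.
# diag_commands is a hypothetical helper name, not part of Condor.
diag_commands() {
    # $1 = grid_resource value: "condor <schedd name> <collector address>"
    set -- $1    # intentionally unquoted: split the value into words
    if [ "$#" -ne 3 ] || [ "$1" != "condor" ]; then
        echo "expected: condor <schedd name> <collector address>" >&2
        return 1
    fi
    schedd=$2
    collector=$3
    # List every schedd ClassAd the collector knows about.
    echo "condor_status -pool $collector -schedd"
    # Query the specific schedd through that collector.
    echo "condor_q -pool $collector -name $schedd"
}
```

For example, diag_commands "condor erm-43880.yyy.zzz erm-43880.yyy.zzz" prints a condor_status line and a condor_q line that both use erm-43880.yyy.zzz as the pool, which can then be run by hand from the submit machine.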

--Dan

Troy Robertson wrote:

Hi Dan,

I get:
"Error: Collector has no record of schedd/submitter"

I assume this is because the manager has not received any submitted jobs?


-----Original Message-----
From: condor-users-bounces@xxxxxxxxxxx [mailto:condor-users-bounces@xxxxxxxxxxx] On Behalf Of Dan Bradley
Sent: Wednesday, 4 June 2008 12:52 AM
To: Condor-Users Mail List
Subject: Re: [Condor-users] Condor-C Detected Down GridResource [Sec=Unclassified]



Can you query the remote schedd from one of the laptops?

condor_q -pool erm-43880.yyy.zzz -name erm-43880.yyy.zzz

--Dan

Troy Robertson wrote:

I'm having trouble converting over to Condor-C. The manual instructions look simple enough, but there doesn't seem to be any tutorial or more in-depth how-to on the subject anywhere on the website.
I am trying to do this because we have laptop users who want to submit jobs to the pool, take their laptops home at night, and come back in the morning to receive the results.

We have a Linux pool of execute machines with a Linux central manager.
Submit machines are all Windows.

I would like to submit to the central manager of the Condor pool (erm-43880.yyy.zzz).

I have a collector daemon running on the submit machine and a schedd running on the remote central manager.

I have installed Condor as a personal pool on the submit machines.

I have modified submit config with:

CONDOR_GAHP=$(SBIN)/condor_c-gahp

C_GAHP_LOG=/tmp/CGAHPLog.$(USERNAME)

C_GAHP_WORKER_THREAD_LOG=/tmp/CGAHPWorkerLog.$(USERNAME)

And added to central manager and execute machines:

SEC_DEFAULT_NEGOTIATION = OPTIONAL

SEC_DEFAULT_AUTHENTICATION_METHODS = CLAIMTOBE

Submit file:

universe = grid

Executable = hello

output = hello_output.txt

error = hello_error.txt

log = hello_log.txt

notification = never

grid_resource = condor erm-43880.yyy.zzz erm-43880.yyy.zzz

+remote_universe = vanilla

+remote_requirements = True

+remote_ShouldTransferFiles = "YES"

+remote_whentotransferoutput = "ON_EXIT"

Queue

Job log contains:

000 (046.000.000) 06/03 13:30:46 Job submitted from host: <147.66.11.17:14672>

...

020 (046.000.000) 06/03 13:31:09 Detected Down Globus Resource

RM-Contact: erm-43880

...

026 (046.000.000) 06/03 13:31:09 Detected Down Grid Resource

GridResource: condor erm-43880 erm-43880
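
To check which resource string the job log reports as down, a small filter like the one below can help. It is a sketch that assumes the event layout shown in the log above (an 026 event header, a GridResource: detail line, and "..." between events); the function name down_resources is hypothetical.

```shell
# Sketch only: print the GridResource line of each 026
# "Detected Down Grid Resource" event found in a job log file.
# down_resources is a hypothetical helper name, not a Condor tool.
down_resources() {
    # $1 = path to the job log (hello_log.txt in the submit file above)
    awk '
        /^026 /       { want = 1; next }   # start of a "down grid resource" event
        /^\.\.\.$/    { want = 0 }         # "..." ends the current event
        want && /GridResource:/ {
            sub(/^[ \t]+/, "")             # drop leading indentation
            print
            want = 0
        }
    ' "$1"
}
```

Run against the log excerpt above, down_resources hello_log.txt would print the single line "GridResource: condor erm-43880 erm-43880", which can be compared with the grid_resource line in the submit file.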

I can see the grid manager process and the gahp and gahp_worker processes start up, but the jobs just sit there idle.

Remote central manager logs contain no indication that a job is being submitted.

Can anyone please help?


___________________________________________________________________________

Australian Antarctic Division - Commonwealth of Australia
IMPORTANT: This transmission is intended for the addressee only. If you are not the
intended recipient, you are notified that use or dissemination of this communication is
strictly prohibited by Commonwealth law. If you have received this transmission in error,
please notify the sender immediately by e-mail or by telephoning +61 3 6232 3209 and
DELETE the message.
Visit our web site at http://www.antarctica.gov.au/
___________________________________________________________________________

_______________________________________________
Condor-users mailing list
To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/condor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/condor-users/

