
Re: [Condor-users] Condor-C Detected Down GridResource [Sec=Unclassified] [Sec=Unclassified]



Hi Dan,

I get:
"Error: Collector has no record of schedd/submitter"

I assume this is because the manager has not received any submitted jobs?
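
One way to test that assumption would be to ask the collector which schedd ads it actually holds. A minimal sketch, assuming the pool and schedd host names used in this thread; the function name schedd_known is made up for illustration, and the exact match on Name assumes the schedd advertises itself under its plain host name:

```shell
# Hypothetical helper: succeed if the collector at $1 holds a schedd ad named $2.
schedd_known() {
    pool=$1
    name=$2
    # -schedd lists schedd ads; -format prints only each ad's Name attribute.
    condor_status -pool "$pool" -schedd -format '%s\n' Name | grep -Fxq "$name"
}
```

Called as "schedd_known erm-43880.yyy.zzz erm-43880.yyy.zzz && echo found". If it fails, either the collector could not be reached or it holds no schedd ad under that name.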


> -----Original Message-----
> From: condor-users-bounces@xxxxxxxxxxx 
> [mailto:condor-users-bounces@xxxxxxxxxxx] On Behalf Of Dan Bradley
> Sent: Wednesday, 4 June 2008 12:52 AM
> To: Condor-Users Mail List
> Subject: Re: [Condor-users] Condor-C Detected Down 
> GridResource [Sec=Unclassified]
> 
> 
> 
> Can you query the remote schedd from one of the laptops?
> 
> condor_q -pool erm-43880.yyy.zzz -name erm-43880.yyy.zzz
> 
> --Dan
> 
> Troy Robertson wrote:
> 
> > I'm having trouble converting over to Condor-C, the manual
> > instructions look simple enough but there doesn't seem to be any 
> > tutorial or more in-depth how-to on the subject on the website
> > anywhere.
> >
> > I am trying to do this as we have laptop users who want to submit
> > jobs to the pool and be able to take their laptop home at night and
> > come back in the morning and receive results.
> >
> > We have a linux pool of execute machines with a linux central
> > manager. Submit machines are all Windows.
> >
> > I would like to submit to the central manager of the condor pool
> > (erm-43880.yyy.zzz).
> >
> > I have collector daemon running on submit and schedd running on
> > remote central manager.
> >
> > I have installed Condor as Personal pool on submit machines.
> >
> > I have modified submit config with:
> >
> >CONDOR_GAHP=$(SBIN)/condor_c-gahp
> >
> >C_GAHP_LOG=/tmp/CGAHPLog.$(USERNAME)
> >
> >C_GAHP_WORKER_THREAD_LOG=/tmp/CGAHPWorkerLog.$(USERNAME)
> >
> > And added to central manager and execute machines:
> >
> >SEC_DEFAULT_NEGOTIATION = OPTIONAL
> >
> >SEC_DEFAULT_AUTHENTICATION_METHODS = CLAIMTOBE
> >
> > Submit file:
> >
> > universe = grid
> >
> > Executable = hello
> >
> > output = hello_output.txt
> >
> > error = hello_error.txt
> >
> > log = hello_log.txt
> >
> > notification = never
> >
> > grid_resource = condor erm-43880.yyy.zzz erm-43880.yyy.zzz
> >
> > +remote_universe = vanilla
> >
> > +remote_requirements = True
> >
> > +remote_ShouldTransferFiles = "YES"
> >
> > +remote_whentotransferoutput = "ON_EXIT"
> >
> > Queue
> >
> > Job log contains:
> >
> > 000 (046.000.000) 06/03 13:30:46 Job submitted from host:
> > <147.66.11.17:14672>
> >
> > ...
> >
> > 020 (046.000.000) 06/03 13:31:09 Detected Down Globus Resource
> >
> > RM-Contact: erm-43880
> >
> > ...
> >
> > 026 (046.000.000) 06/03 13:31:09 Detected Down Grid Resource
> >
> > GridResource: condor erm-43880 erm-43880
> >
> > I can see the grid manager process and gahp and gahp_worker
> > processes start up but the jobs just sit there idle.
> >
> > Remote central manager logs contain no indication that a job is
> > being submitted.
> >
> > Can anyone please help?
> >
> > ___________________________________________________________________________
> >
> > Australian Antarctic Division - Commonwealth of Australia
> > IMPORTANT: This transmission is intended for the addressee only. If
> > you are not the intended recipient, you are notified that use or
> > dissemination of this communication is strictly prohibited by
> > Commonwealth law. If you have received this transmission in error,
> > please notify the sender immediately by e-mail or by telephoning
> > +61 3 6232 3209 and DELETE the message.
> > Visit our web site at http://www.antarctica.gov.au/
> > ___________________________________________________________________________
> >
> >------------------------------------------------------------------------
> >
> >_______________________________________________
> >Condor-users mailing list
> >To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a
> >subject: Unsubscribe
> >You can also unsubscribe by visiting
> >https://lists.cs.wisc.edu/mailman/listinfo/condor-users
> >
> >The archives can be found at:
> >https://lists.cs.wisc.edu/archive/condor-users/
> >  
> >