[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Condor-users] Globus - CondorG setup



Hi, I have set up a condor pool, a linux central manager which can execute and submit and there are a combination of Linux and Windows machines in the pool which can execute and submit jobs. 

I am now trying to use the Globus grid manager. I have been through the tutorial at https://bi.offis.de/wisent/tiki-index.php?page=Condor-GT4-Admin. I have installed globus on the central manager itself and am attempting to submit from there also. The certificates have been created for myself and the HIGH/LOW PORT macros have been set. 

I am having trouble though, it seems as though the globus server, I think GRAM is never actually being started, hence the job never leaves the idle state; this is part of the gridmanager log: 

GAHP[4439] <- 'GT4_GRAM_PING 4 https://10.1.207.26/wsrf/services/ManagedJobFactoryService'
02/12 10:36:35 [4432] GAHP[4439] -> 'S'
02/12 10:36:35 [4432] GAHP[4439] (stderr) -> AxisFault
02/12 10:36:35 [4432] GAHP[4439] (stderr) ->  faultCode: {http://schemas.xmlsoap.org/soap/envelope/}Server.userException
02/12 10:36:35 [4432] GAHP[4439] (stderr) ->  faultSubcode: 
02/12 10:36:35 [4432] GAHP[4439] (stderr) ->  faultString: java.net.ConnectException: Connection refused
02/12 10:36:35 [4432] GAHP[4439] (stderr) ->  faultActor: 
02/12 10:36:35 [4432] GAHP[4439] (stderr) ->  faultNode: 
02/12 10:36:35 [4432] GAHP[4439] (stderr) ->  faultDetail: 
02/12 10:36:35 [4432] GAHP[4439] (stderr) -> {http://xml.apache.org/axis/}stackTrace:java.net.ConnectException: Connection refused
02/12 10:36:35 [4432] GAHP[4439] (stderr) -> at java.net.PlainSocketImpl.socketConnect(Native Method)
02/12 10:36:35 [4432] GAHP[4439] (stderr) -> at java.net.AbstractPlainSocketImpl.doConnect(AbstractPlainSocketImpl.java:310)
02/12 10:36:35 [4432] GAHP[4439] (stderr) -> at java.net.AbstractPlainSocketImpl.connectToAddress(AbstractPlainSocketImpl.java:176)
02/12 10:36:35 [4432] GAHP[4439] (stderr) -> at java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:163)
02/12 10:36:35 [4432] GAHP[4439] (stderr) -> at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:384)
02/12 10:36:35 [4432] GAHP[4439] (stderr) -> at java.net.Socket.connect(Socket.java:542)
02/12 10:36:35 [4432] GAHP[4439] (stderr) -> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
02/12 10:36:35 [4432] GAHP[4439] (stderr) -> at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
02/12 10:36:35 [4432] GAHP[4439] (stderr) -> at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
02/12 10:36:35 [4432] GAHP[4439] (stderr) -> at java.lang.reflect.Method.invoke(Method.java:616).

My condor submit file looks like :

######
universe = grid
grid_resource = gt4 https://10.1.207.26/wsrf/services/ManagedJobFactoryService Condor
executable = helloworld.bat
requirements = OpSys == "MSWin32_NT51" && Arch == "X86"

output = hellowin.out
error = hellowin.error
log = hellowin.log

should_transfer_files = YES
when_to_transfer_output = ON_EXIT

Queue 
######

I would appreciate any help from anyone with regards what is going on.

Thanks,

Kunal



Got a cool Hotmail story? Tell us now