
Re: [Condor-users] Condor 7.0.2 in combination with Windows XP and Vista



Hello,

Here is some more information:

CollectorLog:

6/19 13:31:38 ******************************************************
6/19 13:31:38 ** condor_collector.exe (CONDOR_COLLECTOR) STARTING UP
6/19 13:31:38 ** C:\condor\bin\condor_collector.exe
6/19 13:31:38 ** $CondorVersion: 7.0.2 Jun  9 2008 BuildID: 89891 $
6/19 13:31:38 ** $CondorPlatform: INTEL-WINNT50 $
6/19 13:31:38 ** PID = 1560
6/19 13:31:38 ** Log last touched 6/19 10:36:08
6/19 13:31:38 ******************************************************
6/19 13:31:38 Using config source: C:\condor\condor_config
6/19 13:31:38 Using local config sources: 
6/19 13:31:38    C:\condor/condor_config.local
6/19 13:31:38 DaemonCore: Command Socket at <192.168.158.167:9618>
6/19 13:31:41 In ViewServer::Init()
6/19 13:31:41 In CollectorDaemon::Init()
6/19 13:31:41 In ViewServer::Config()
6/19 13:31:41 In CollectorDaemon::Config()
6/19 13:31:41 enable: Creating stats hash table
6/19 13:31:43 (Sending 0 ads in response to query)
6/19 13:31:43 Got QUERY_STARTD_PVT_ADS
6/19 13:31:43 (Sending 0 ads in response to query)
6/19 13:31:43 NegotiatorAd  : Inserting ** "< demo4 >"
6/19 13:31:43 stats: Inserting new hashent for 'Negotiator':'demo4':'192.168.158.167'
6/19 13:31:46 MasterAd     : Inserting ** "< demo4 >"
6/19 13:31:46 stats: Inserting new hashent for 'Master':'demo4':'192.168.158.167'
6/19 13:31:48 ScheddAd     : Inserting ** "< demo4 , 192.168.158.167 >"
6/19 13:31:48 stats: Inserting new hashent for 'Schedd':'demo4':'192.168.158.167'
6/19 13:31:48 DaemonCore: Can't receive command request from 192.168.158.167 (perhaps a timeout?)
6/19 13:31:50 SubmittorAd  : Inserting ** "< inpho@demo4demo4 , 192.168.158.167 >"
6/19 13:31:50 stats: Inserting new hashent for 'Submittor':'inpho@demo4':'192.168.158.167'
6/19 13:31:59 WARNING:  No master ad for < slot1@demo4 >
6/19 13:31:59 StartdAd     : Inserting ** "< slot1@demo4 , 192.168.158.167 >"
...
6/19 13:32:02 stats: Inserting new hashent for 'StartdPvt':'slot4@demo4':'192.168.158.167'
6/19 13:34:17 DC_AUTHENTICATE: attempt to open invalid session demo4:6968:1213903454:28, failing.
6/19 13:34:17 DC_AUTHENTICATE: attempt to open invalid session demo4:6968:1213903454:28, failing.
6/19 13:34:20 DC_AUTHENTICATE: attempt to open invalid session demo4:6968:1213903461:30, failing.
6/19 13:34:21 WARNING:  No master ad for < slot2@demo1 >


MasterLog:

6/19 13:31:27 ******************************************************
6/19 13:31:27 ** Condor (CONDOR_MASTER) STARTING UP
6/19 13:31:27 ** C:\condor\bin\condor_master.exe
6/19 13:31:27 ** $CondorVersion: 7.0.2 Jun  9 2008 BuildID: 89891 $
6/19 13:31:27 ** $CondorPlatform: INTEL-WINNT50 $
6/19 13:31:27 ** PID = 1780
6/19 13:31:27 ** Log last touched 6/19 10:36:08
6/19 13:31:27 ******************************************************
6/19 13:31:27 Using config source: C:\condor\condor_config
6/19 13:31:27 Using local config sources: 
6/19 13:31:27    C:\condor/condor_config.local
6/19 13:31:27 DaemonCore: Command Socket at <192.168.158.167:1029>
6/19 13:31:37 Authorized application C:\condor/bin/condor_schedd.exe is now enabled in the firewall.
6/19 13:31:37 Authorized application C:\condor/bin/condor_startd.exe is now enabled in the firewall.
6/19 13:31:37 Authorized application C:\condor/bin\condor_dagman.exe is now enabled in the firewall.
6/19 13:31:37 Started DaemonCore process "C:\condor/bin/condor_collector.exe", pid and pgroup = 1560
6/19 13:31:40 Started DaemonCore process "C:\condor/bin/condor_negotiator.exe", pid and pgroup = 1920
6/19 13:31:40 Started DaemonCore process "C:\condor/bin/condor_schedd.exe", pid and pgroup = 1932
6/19 13:31:41 Started DaemonCore process "C:\condor/bin/condor_startd.exe", pid and pgroup = 1896
6/19 14:31:41 Preen pid is 3832
6/19 14:31:42 Child 3832 died, but not a daemon -- Ignored



SchedLog:

6/19 13:31:42 (pid:1932) ******************************************************
6/19 13:31:42 (pid:1932) ** condor_schedd.exe (CONDOR_SCHEDD) STARTING UP
6/19 13:31:42 (pid:1932) ** C:\condor\bin\condor_schedd.exe
6/19 13:31:42 (pid:1932) ** $CondorVersion: 7.0.2 Jun  9 2008 BuildID: 89891 $
6/19 13:31:42 (pid:1932) ** $CondorPlatform: INTEL-WINNT50 $
6/19 13:31:42 (pid:1932) ** PID = 1932
6/19 13:31:42 (pid:1932) ** Log last touched 6/19 10:36:08
6/19 13:31:42 (pid:1932) ******************************************************
6/19 13:31:42 (pid:1932) Using config source: C:\condor\condor_config
6/19 13:31:42 (pid:1932) Using local config sources: 
6/19 13:31:42 (pid:1932)    C:\condor/condor_config.local
6/19 13:31:42 (pid:1932) DaemonCore: Command Socket at <192.168.158.167:1042>
6/19 13:31:43 (pid:1932) History file rotation is enabled.
6/19 13:31:43 (pid:1932)   Maximum history file size is: 20971520 bytes
6/19 13:31:43 (pid:1932)   Number of rotated history files is: 2
6/19 13:31:43 (pid:1932) my_popen: CreateProcess failed
6/19 13:31:43 (pid:1932) Failed to execute C:\condor/bin/condor_shadow.std.exe, ignoring
6/19 13:31:44 (pid:1932) About to rotate ClassAd log C:\condor/spool/job_queue.log


StartLog:

6/19 13:31:42 ******************************************************
6/19 13:31:42 ** condor_startd.exe (CONDOR_STARTD) STARTING UP
6/19 13:31:42 ** C:\condor\bin\condor_startd.exe
6/19 13:31:42 ** $CondorVersion: 7.0.2 Jun  9 2008 BuildID: 89891 $
6/19 13:31:42 ** $CondorPlatform: INTEL-WINNT50 $
6/19 13:31:42 ** PID = 1896
6/19 13:31:42 ** Log last touched 6/19 10:36:08
6/19 13:31:42 ******************************************************
6/19 13:31:42 Using config source: C:\condor\condor_config
6/19 13:31:42 Using local config sources: 
6/19 13:31:42    C:\condor/condor_config.local
6/19 13:31:42 DaemonCore: Command Socket at <192.168.158.167:1043>
6/19 13:31:42 MachAttributes::publish: failed to get Windows version information
6/19 13:31:44 my_popen: CreateProcess failed
6/19 13:31:44 Failed to execute C:\condor/bin/condor_starter.std.exe, ignoring
6/19 13:31:44 slot1: New machine resource allocated
6/19 13:31:44 slot2: New machine resource allocated
6/19 13:31:44 slot3: New machine resource allocated
6/19 13:31:44 slot4: New machine resource allocated
6/19 13:31:49 no loadavg samples this minute, maybe thread died???
6/19 13:31:49 About to run initial benchmarks.
6/19 13:31:55 Completed initial benchmarks.
6/19 13:31:55 slot2: State change: IS_OWNER is false
6/19 13:31:55 slot2: Changing state: Owner -> Unclaimed
6/19 13:31:55 slot3: State change: IS_OWNER is false
6/19 13:31:55 slot3: Changing state: Owner -> Unclaimed
6/19 13:31:55 slot4: State change: IS_OWNER is false
6/19 13:31:55 slot4: Changing state: Owner -> Unclaimed
6/19 13:31:55 slot1: State change: IS_OWNER is false
6/19 13:31:55 slot1: Changing state: Owner -> Unclaimed


Submit file:

Executable = C:\WINDOWS\system32\cmd.exe
Requirements = (Inpho_ApplicationsMaster51_Directory =!= UNDEFINED && Inpho_ApplicationsMaster51_Installed =?= True && Inpho_ApplicationsMaster51_OrthoMaster_Installed =?= True) 
Priority = 0
Universe = Vanilla
Output = offingen_with_subblocks_complete_$(Cluster)_$(Process).out
Error = offingen_with_subblocks_complete_$(Cluster)_$(Process).out
Log = offingen_with_subblocks_complete_$(Cluster)_$(Process).log
Getenv = True
Environment = ERMAPPER=dummy

leave_in_queue = True
on_exit_remove = (ExitCode != -20)
Initialdir = \\DEMO4\I$\inpho_data\Offingen\Project\condor\job0
Arguments = " /C call '\\DEMO4\I$\inpho_data\Offingen\Project\condor\job0\offingen_with_subblocks_complete_151.bat' $$(Inpho_ApplicationsMaster51_Directory)"
Queue
...
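
A small Python sketch for checking what the submit file above asks Condor to do with the executable. The starter log quoted further down starts C:\condor\execute\dir_1008\condor_exec.exe, so cmd.exe appears to be copied over from the submit machine. The helper names here are made up for illustration, and the sketch assumes that transfer_executable defaults to True when the submit file does not set it.

```python
def parse_submit_description(text):
    """Collect 'name = value' commands; names are treated case-insensitively."""
    commands = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue  # skip blank lines, comments, and bare commands like Queue
        name, value = line.split("=", 1)  # split on the first '=' only
        commands[name.strip().lower()] = value.strip()
    return commands


def executable_is_transferred(commands):
    # Assumption: an absent transfer_executable means the file is transferred.
    value = commands.get("transfer_executable", "true")
    return value.lower() != "false"


# Shortened copy of the submit file above, for demonstration only.
submit_text = r"""
Executable = C:\WINDOWS\system32\cmd.exe
Universe = Vanilla
on_exit_remove = (ExitCode != -20)
Queue
"""

commands = parse_submit_description(submit_text)
print(commands["executable"], executable_is_transferred(commands))
```

If the result is True, the binary that runs on the execute node is the submit machine's copy of cmd.exe, not the one already installed there.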

Best regards

Thomas Laue
 
-- 
INPHO GmbH   *   Smaragdweg 1   *   70174 Stuttgart   *   Germany
phone: +49 711 2288 10  *  fax: +49 711 2288 111  *  web: www.inpho.de
place of business: Stuttgart    *   managing director: Johannes Saile
commercial register: Stuttgart, HRB 9586
Leader in Photogrammetry and Digital Surface Modelling
Please visit www.inpho.de 



-----Original Message-----
From: condor-users-bounces@xxxxxxxxxxx [mailto:condor-users-bounces@xxxxxxxxxxx] On Behalf Of Matt Hope
Sent: Thursday, June 19, 2008 15:20
To: Condor-Users Mail List
Subject: Re: [Condor-users] Condor 7.0.2 in combination with Windows XP and Vista

On Thu, Jun 19, 2008 at 1:55 PM, Thomas Laue <Thomas.Laue@xxxxxxxx> wrote:
> Hello,
>
> I have encountered a problem after upgrading a small Condor pool from version 6.8.8 to 7.0.2. The pool consists of 2 computers equipped with Windows XP and a central manager machine equipped with Vista. I installed version 7.0.2 on the central manager and the two processing nodes. Everything was working smoothly with Condor 6.8.8.
>
> Now, all jobs submitted to the processing nodes are cancelled after a few seconds on the XP systems. I found the following messages in the StarterLog.slot1 file (similar ones for other slots).
>
> 6/19 13:31:34 ******************************************************
> 6/19 13:31:34 ** condor_starter (CONDOR_STARTER) STARTING UP
> 6/19 13:31:34 ** C:\condor\bin\condor_starter.exe
> 6/19 13:31:34 ** $CondorVersion: 7.0.2 Jun  9 2008 BuildID: 89891 $
> 6/19 13:31:34 ** $CondorPlatform: INTEL-WINNT50 $
> 6/19 13:31:34 ** PID = 1008
> 6/19 13:31:34 ** Log last touched 6/19 12:31:13
> 6/19 13:31:34 ******************************************************
> 6/19 13:31:34 Using config source: C:\condor\condor_config
> 6/19 13:31:34 Using local config sources:
> 6/19 13:31:34    C:\condor/condor_config.local
> 6/19 13:31:34 DaemonCore: Command Socket at <192.168.158.151:2214>
> 6/19 13:31:34 Setting resource limits not implemented!
> 6/19 13:31:34 Communicating with shadow <192.168.158.167:51589>
> 6/19 13:31:34 Submitting machine is "demo4.demo.inpho.de"
> 6/19 13:31:34 setting the orig job name in starter
> 6/19 13:31:34 setting the orig job iwd in starter
> 6/19 13:31:35 File transfer completed successfully.
> 6/19 13:31:36 Job 16.0 set to execute immediately
> 6/19 13:31:36 Starting a VANILLA universe job with ID: 16.0
> 6/19 13:31:36 Tracking process family by login "condor-reuse-slot1"
> 6/19 13:31:36 IWD: C:\condor\execute\dir_1008
> 6/19 13:31:36 Output file: C:\condor\execute\dir_1008\offingen_with_subblocks_complete_16_0.out
> 6/19 13:31:36 Error file: C:\condor\execute\dir_1008\offingen_with_subblocks_complete_16_0.out
> 6/19 13:31:36 Renice expr "10" evaluated to 10
> 6/19 13:31:36 About to exec C:\condor\execute\dir_1008\condor_exec.exe /C call \\DEMO4\I$\inpho_data\Offingen\Project\condor\job0\offingen_with_subblocks_complete_16.bat C:\Program' 'Files\Inpho\ApplicationsMaster' '5.1\bin
> 6/19 13:31:36 Create_Process: CreateProcess failed, errno=193

Error 193 is ERROR_BAD_EXE_FORMAT ("%1 is not a valid Win32 application").
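
For reference, a small Python sketch that pulls the error number out of a starter log line like the one above and gives it a symbolic name. The regular expression and the three-entry lookup table are assumptions made for illustration; they are not part of Condor.

```python
import re

# A few Win32 system error codes (as defined in winerror.h); deliberately incomplete.
WIN32_ERRORS = {
    2: "ERROR_FILE_NOT_FOUND",
    5: "ERROR_ACCESS_DENIED",
    193: "ERROR_BAD_EXE_FORMAT",
}

# Matches the "errno=<n>" suffix as it appears in the StarterLog excerpt.
ERRNO_RE = re.compile(r"CreateProcess failed, errno=(\d+)")


def describe_create_process_failure(line):
    """Return (code, name) for a CreateProcess failure line, or None."""
    match = ERRNO_RE.search(line)
    if match is None:
        return None
    code = int(match.group(1))
    return code, WIN32_ERRORS.get(code, "UNKNOWN")


log_line = "6/19 13:31:36 Create_Process: CreateProcess failed, errno=193"
print(describe_create_process_failure(log_line))
```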

> 6/19 13:31:36 ERROR "Create_Process(C:\condor\execute\dir_1008\condor_exec.exe,/C call \\DEMO4\I$\inpho_data\Offingen\Project\condor\job0\offingen_with_subblocks_complete_16.bat C:\Program' 'Files\Inpho\ApplicationsMaster' '5.1\bin, ...) failed" at line 495 in file ..\src\condor_starter.V6.1\os_proc.C

As to why it is happening, I have no clue.
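
One check that might narrow it down: error 193 (ERROR_BAD_EXE_FORMAT) generally means Windows did not accept the file as something it can run, so it could be worth looking at the header of the binary that actually gets started. Below is a rough Python sketch, assuming the file can be read from a script. The name inspect_pe_header and the machine table are made up for illustration, and the offsets follow the published PE/COFF layout.

```python
import struct

# COFF machine types from the PE/COFF specification (only two listed here).
MACHINE_NAMES = {0x014C: "x86 (32-bit)", 0x8664: "x64 (64-bit)"}


def inspect_pe_header(data):
    """Return machine type and minimum subsystem version, or None if not a PE image."""
    if len(data) < 0x40 or data[:2] != b"MZ":
        return None  # no DOS "MZ" signature
    # e_lfanew at offset 0x3C holds the file offset of the PE signature.
    (pe_offset,) = struct.unpack_from("<I", data, 0x3C)
    if len(data) < pe_offset + 76 or data[pe_offset:pe_offset + 4] != b"PE\0\0":
        return None  # header truncated or PE signature missing
    # The machine field is the first field of the COFF header after the signature.
    (machine,) = struct.unpack_from("<H", data, pe_offset + 4)
    # The optional header starts 24 bytes after the signature; the subsystem
    # version sits at offset 48 within it for both PE32 and PE32+.
    major, minor = struct.unpack_from("<HH", data, pe_offset + 72)
    return {
        "machine": MACHINE_NAMES.get(machine, hex(machine)),
        "subsystem_version": (major, minor),
    }


# Synthetic 256-byte header for demonstration; for a real check, pass the
# first few kilobytes of the file, e.g. open(path, "rb").read(4096).
image = bytearray(256)
image[0:2] = b"MZ"
struct.pack_into("<I", image, 0x3C, 0x80)
image[0x80:0x84] = b"PE\0\0"
struct.pack_into("<H", image, 0x84, 0x014C)
struct.pack_into("<HH", image, 0x80 + 72, 5, 1)
print(inspect_pe_header(bytes(image)))
```

Comparing the result for condor_exec.exe in the execute directory with the cmd.exe installed on the XP node would show whether the two differ in machine type or required subsystem version. Whether that is the actual cause here is only a guess at where to look.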

Could you include your submit file as well?

Matt