[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Condor-users] Windows:Problem with Schedd:Failed to read packet header



 

Dear kind people who help struggling Condor novices

 

 

I have installed Condor 6.6.11 on Windows XP machine. Able to submit jobs which run but then become idle and stay stuck in that state without ever finishing.

 

6/12 23:04:43 Using config file: C:\Condor\condor_config

6/12 23:04:43 Using local config files: C:\Condor/condor_config.local

6/12 23:04:43 DaemonCore: Command Socket at <123.123.123.123:1028>

6/12 23:04:43 Using name: student-19ae64e

6/12 23:04:43 No Accountant host specified in config file

6/12 23:04:43 Queue Management Super Users:

6/12 23:04:43     condor

6/12 23:04:43     SYSTEM

6/12 23:04:43     NT AUTHORITY/SYSTEM

6/12 23:04:43 my_popen: CreateProcess failed

6/12 23:04:43 Failed to execute C:\Condor/bin/condor_shadow.pvm, ignoring

6/12 23:04:43 my_popen: CreateProcess failed

6/12 23:04:43 Failed to execute C:\Condor/bin/condor_shadow.std, ignoring

6/12 23:04:43 About to truncate log C:\Condor/spool/job_queue.log

6/12 23:04:43 Version of gridmanager is

6/12 23:04:43 JobsRunning = 0

6/12 23:04:43 JobsIdle = 1

6/12 23:04:43 JobsHeld = 0

6/12 23:04:43 JobsRemoved = 0

6/12 23:04:43 SchedUniverseJobsRunning = 0

6/12 23:04:43 SchedUniverseJobsIdle = 0

6/12 23:04:43 N_Owners = 1

6/12 23:04:43 MaxJobsRunning = 200

6/12 23:04:43 Attempting to send update via UDP to collector STUDENT-19AE64E <123.123.123.123:1234>

6/12 23:04:43 Sent HEART BEAT ad to central mgr: Number of submittors=1

6/12 23:04:43 Changed attribute: RunningJobs = 0

6/12 23:04:43 Changed attribute: IdleJobs = 1

6/12 23:04:43 Changed attribute: HeldJobs = 0

6/12 23:04:43 Changed attribute: FlockedJobs = 0

6/12 23:04:43 Changed attribute: Name = "student@student-19ae64e"

6/12 23:04:43 Attempting to send update via UDP to collector STUDENT-19AE64E <123.123.123.123:1234>

6/12 23:04:43 SEC_DEBUG_PRINT_KEYS is undefined, using default value of False

6/12 23:04:43 Sent ad to central manager for student@student-19ae64e

6/12 23:04:43 ============ Begin clean_shadow_recs =============

6/12 23:04:43 ============ End clean_shadow_recs =============

6/12 23:04:44 DaemonCore: in SendAliveToParent()

6/12 23:04:44 DaemonCore: attempting to connect to '<123.123.123.123:1025>'

6/12 23:04:44 SCHEDD_TIMEOUT_MULTIPLIER is undefined, using default value of 0

6/12 23:04:53 -------- Begin starting jobs --------

6/12 23:04:53 -------- Done starting jobs --------

6/12 23:05:40 DaemonCore: Command received via TCP from host <123.123.123.123:1053>

6/12 23:05:40 DaemonCore: received command 1111 (QMGMT_CMD), calling handler (handle_q)

6/12 23:05:40 condor_read(): Socket closed when trying to read buffer

6/12 23:05:40 IO: Failed to read packet header

6/12 23:05:40 QMGR Connection closed

 

What is wrong?I have an inkling it may be a configuration problem

 

Bless you

 

Dan

 

 

 

 

 






 


From: AOUAD Lamine <Lamine.Aouad@xxxxxxx>
Reply-To: Condor-Users Mail List <condor-users@xxxxxxxxxxx>
To: "Condor-Users Mail List"<condor-users@xxxxxxxxxxx>
Subject: [Condor-users] DAGMan and VARS
Date: Wed, 31 May 2006 20:12:13 +0200
>Hi all,
>
>I start with my question :
>Why should all the jobs in a DAG file have the same VARS entries ?
>I have four basic tasks in my DAG (i.e. four submit description
>files), but condor failed to parse my dag file when I define various
>VARS entries for the different tasks.. Could anyone explain me the
>reason of this behaviour..
>
>Thank you
>
>Best Regards,
>Lamine
>_______________________________________________
>Condor-users mailing list
>To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a
>subject: Unsubscribe
>You can also unsubscribe by visiting
>https://lists.cs.wisc.edu/mailman/listinfo/condor-users
>
>The archives can be found at either
>https://lists.cs.wisc.edu/archive/condor-users/
>http://www.opencondor.org/spaces/viewmailarchive.action?key=CONDOR