[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[condor-users] Shadow exception! - Failed to connect to schedd!



I am new to Condor.  I recently created a Condor cluster of Unix computers.

The following messages regarding "Shadow exception!" are produced when a job
is evicted.  Is this failure causing the eviction, or is this a failure in
the process of eviction?

I don't know where to begin to diagnose this problem!

Any help / suggestions would be appreciated.

Mike Box                       Phone: (540)231-9506
Systems Administrator            Fax: (540)231-3863
Department of Statistics      E-mail: Mike.Box@xxxxxx
Virginia Tech

------------------------------------------------------------------------------

001 (005.000.000) 03/24 11:26:43 Job executing on host: <111.222.33.63:32793>
...
010 (005.000.000) 03/24 12:04:47 Job was suspended.
        Number of processes actually suspended: 1
...
011 (005.000.000) 03/24 12:09:48 Job was unsuspended.
...
010 (005.000.000) 03/24 12:12:29 Job was suspended.
        Number of processes actually suspended: 1
...
011 (005.000.000) 03/24 12:22:33 Job was unsuspended.
...
007 (005.000.000) 03/24 12:32:38 Shadow exception!
        Failed to connect to schedd!
        2872  -  Run Bytes Sent By Job
        5585175  -  Run Bytes Received By Job
...
001 (005.000.000) 03/24 12:41:46 Job executing on host: <111.222.33.73:32784>
...

Condor Support Information:
http://www.cs.wisc.edu/condor/condor-support/
To Unsubscribe, send mail to majordomo@xxxxxxxxxxx with
unsubscribe condor-users <your_email_address>