
[Condor-users] Problems on Mac OS X



1. I use Condor on a cluster of G5s running Mac OS X. After a few hours, condor_master crashes on the submit machine, which interrupts the running jobs. This kind of crash seems to occur with many applications on Mac OS X, but nobody on the Mac forums has had an idea for solving it.
The problem occurs with Condor 6.6.5, 6.6.7, and 6.7.2…
So, has anyone used Condor on Mac OS X and seen these crashes?
Thank you.
Here is the crash log…

Command: condor_master
Path: /Users/damien/Programmes/condor-6.7.2/sbin/condor_master
Version: ??? (???)
PID: 19786
Thread: 0

Exception: EXC_BAD_ACCESS (0x0001)
Codes: KERN_PROTECTION_FAILURE (0x0002) at 0x00000000

Thread 0 Crashed:
0 condor_master 0x0006c6e8 0x1000 + 0x6b6e8
1 condor_master 0x0005e698 0x1000 + 0x5d698
2 condor_master 0x0001ee90 0x1000 + 0x1de90
3 condor_master 0x0000f2e0 0x1000 + 0xe2e0
4 condor_master 0x0001d574 0x1000 + 0x1c574
5 condor_master 0x00001ce0 0x1000 + 0xce0
6 condor_master 0x00001b60 0x1000 + 0xb60

PPC Thread State:
srr0: 0x0006c6e8 srr1: 0x0000f030 vrsave: 0x00000000
cr: 0x44000222 xer: 0x00000004 lr: 0x0006c694 ctr: 0x90002d40
r0: 0x00000000 r1: 0xbfffd6f0 r2: 0xa0000fc8 r3: 0x0030cc60
r4: 0x00000038 r5: 0x00000020 r6: 0x000000a8 r7: 0x00000803
r8: 0x400028b0 r9: 0x00000000 r10: 0x00000000 r11: 0x0030cc60
r12: 0x00000000 r13: 0x00000000 r14: 0x00000000 r15: 0x00000000
r16: 0x00000000 r17: 0x00000000 r18: 0x00000000 r19: 0x00000000
r20: 0x00000000 r21: 0x00000000 r22: 0x00000000 r23: 0x00000000
r24: 0x00000000 r25: 0x00000000 r26: 0xbffffab4 r27: 0x00000008
r28: 0x008024c0 r29: 0x008024c0 r30: 0xbfffd6f0 r31: 0x0006c560

Binary Images Description:
0x1000 - 0x285fff condor_master /Users/damien/Programmes/condor-6.7.2/sbin/condor_master
0x8fe00000 - 0x8fe4ffff dyld /usr/lib/dyld
0x90000000 - 0x90122fff libSystem.B.dylib /usr/lib/libSystem.B.dylib
0x939d0000 - 0x939d4fff libmathCommon.A.dylib /usr/lib/system/libmathCommon.A.dylib

2. Sometimes only the manager machine crashes, not the submit machine. When that happens, Condor stops, but when I launch condor_master again on the manager machine, the jobs restart, even though I have a vanilla configuration!

Thank you

Damien