[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] ProcAPI sanity failure, age = -98161996



On Mon, Mar 13, 2006 at 11:21:43AM -0500, Matthew Galati wrote:
> Hi,
> 
> I am new to condor, trying a very simple experiment on a set of windows machines (note, condor works out of the box on my linux cluster - I am struggling with Windows 2003 Server).
> 
> I installed: condor-6.7.17-winnt50-x86.msi
> 
<...>
> 
> If I look at my hello.log file, some (not all) are dieing with the error "Create_Process" failed:
> 
> ============
<...>
> 007 (013.001.000) 03/13 10:57:07 Shadow exception!
> 	Error from starter on vm2@xxxxxxxxxxxxxxxxxxx: Create_Process(C:\WINDOWS\system32\cmd.exe,condor_exec.exe /Q /C condor_exec.bat, ...) failed
> 	0  -  Run Bytes Sent By Job
> 	473  -  Run Bytes Received By Job
> ...
<...>
> 
> Here's the shadow log on the submit machine - I am not sure if that helps... 
> 

What would be more useful would be StarterLog.vm2 on ORCLUS01.na.sas.com

> 
> In the MasterLog, I also keep seeing the following: "ProcAPI sanity failure, age = xxxx". This error seems serious.

I think we fixed this bug just this morning (the tyep we were using didn't have
enough precision, hence the bogus value) - it will be in 6.7.18.

-Erik


> 
> 
> ============
> 3/13 11:12:59 Time stamp of running C:\condor/bin/condor_master.exe: 1140256862
> 3/13 11:12:59 GetTimeStamp returned: 1140256862
> 3/13 11:12:59 Time stamp of running C:\condor/bin/condor_collector.exe: 1140256858
> 3/13 11:12:59 GetTimeStamp returned: 1140256858
> 3/13 11:12:59 Time stamp of running C:\condor/bin/condor_negotiator.exe: 1140256862
> 3/13 11:12:59 GetTimeStamp returned: 1140256862
> 3/13 11:12:59 Time stamp of running C:\condor/bin/condor_schedd.exe: 1140256866
> 3/13 11:12:59 GetTimeStamp returned: 1140256866
> 3/13 11:12:59 Time stamp of running C:\condor/bin/condor_startd.exe: 1140256866
> 3/13 11:12:59 GetTimeStamp returned: 1140256866
> 3/13 11:12:59 exit Daemons::CheckForNewExecutable
> 3/13 11:13:13 ProcAPI sanity failure, age = -98161996
> 3/13 11:13:13 ProcAPI sanity failure, age = -98161996
> 3/13 11:13:13 ProcAPI sanity failure, age = -98161996
> 3/13 11:13:13 ProcAPI sanity failure, age = -98161996
> 3/13 11:13:14 ProcAPI sanity failure, age = -98161995
> 3/13 11:13:14 ProcAPI sanity failure, age = -98161995
> ============
> 
> Any ideas? 
> 
> Thanks,
> Matthew 
> 
> 
> 
> 
> 
> _______________________________________________
> Condor-users mailing list
> Condor-users@xxxxxxxxxxx
> https://lists.cs.wisc.edu/mailman/listinfo/condor-users