[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] scheduler universe job exited with status -1073741502



On Thu, 26 Jan 2006, Horvatth Szabolcs wrote:

> >Does the job run far enough to generate a dagman.out file?  If so, can
> >you please send that?  Also, please send the SchedLog from that machine.
>
> The job does generate the dag.lib.out file but its always empty, no
> matter what -debug level I use.

Well, -debug doesn't affect dag.lib -- it affects what goes into the
dagman.out file.  (If your dag file is foo.dag, you should get a
foo.dag.dagman.out.  If you didn't get such a file, things are failing
at a *very* early stage of starting the DAGMan job.)

Can you verify that there's no dagman.out file?  The mere fact of that
file not existing would be a clue.

> This is the relevant part of the sched.log:
>
> 1/26 16:40:07 Sent ad to central manager for szabolcs@xxxxxxxxxxxxxxxxxxx
> 1/26 16:40:07 Sent ad to 1 collectors for szabolcs@xxxxxxxxxxxxxxxxxxx
> 1/26 16:40:07 Successfully created sched universe process
> 1/26 16:40:07 Starting add_shadow_birthdate(134530.0)
> 1/26 16:40:07 Successfully created sched universe process
> 1/26 16:40:07 Starting add_shadow_birthdate(134523.0)
> 1/26 16:40:07 scheduler universe job (134530.0) pid 2532 exited with status -1073741502
> 1/26 16:40:07 Starting add_shadow_birthdate(134612.0)
> 1/26 16:40:08 Started shadow for job 134612.0 on "<192.168.0.104:1039>", (shadow pid = 8884)
> 1/26 16:40:08 DaemonCore: Command received via UDP from host <192.168.0.71:4976>
> 1/26 16:40:08 DaemonCore: received command 60011 (DC_NOP), calling handler (handle_nop())
> 1/26 16:40:08 scheduler universe job (134523.0) pid 14784 exited with status -1073741502
> 1/26 16:40:08 Starting add_shadow_birthdate(134628.0)

Hmm, that doesn't really help much! :-)

I will consult with some of the schedd experts and get back to you.

Kent Wenger
Condor Team