[Condor-users] DC_AUTHENTICATE: attempt to open invalid session...



Hi All,

Our admin recently added some machines in an "old" condor pool to a
"new" condor pool, but I can't seem to get jobs submitted from machines in
the new condor pool to run in the old one...

Here are the symptoms...

NegotiatorLog from central manager shows a match.

SchedLog from submitting machine shows

4/8 15:52:01 condor_read(): recv() returned -1, errno = 104, assuming failure.
4/8 15:52:01 Response problem from startd.
4/8 15:52:01 Sent RELEASE_CLAIM to startd on <192.168.0.4:32772>
4/8 15:52:01 Match record (<192.168.0.4:32772>, 801, 5) deleted
4/8 15:52:01 condor_read(): recv() returned -1, errno = 104, assuming failure.
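
As an aside for reading the SchedLog excerpt above: errno values are
platform-specific, but on Linux 104 is ECONNRESET (connection reset by
peer), which would be consistent with the startd dropping the connection.
A minimal sketch for decoding the errno values in a log excerpt, assuming
Python on the same platform that produced the log; the helper name
describe_errnos is hypothetical and not part of Condor:

```python
import errno
import os
import re

# Matches the "errno = N" fragment in condor_read() failure lines.
ERRNO_RE = re.compile(r"errno = (\d+)")


def describe_errnos(log_text):
    """Return {errno number: (symbolic name, message)} for each errno in log_text."""
    found = {}
    for match in ERRNO_RE.finditer(log_text):
        code = int(match.group(1))
        # errno.errorcode maps numbers to names for the local platform only,
        # so run this on the same OS family that wrote the log.
        name = errno.errorcode.get(code, "UNKNOWN")
        found[code] = (name, os.strerror(code))
    return found


sample = (
    "4/8 15:52:01 condor_read(): recv() returned -1, "
    "errno = 104, assuming failure."
)
print(describe_errnos(sample))
```

The decoded name only says how the connection ended, not why the startd
closed it.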

StartLog on machine that was trying to start the job looks like...

4/8 15:52:01 DC_AUTHENTICATE: attempt to open invalid session fire4:801:1112972542:6, failing.


These machines are all on a private network, so we don't need fancy
authentication turned on, and I am fairly sure that the admin didn't
change anything from the defaults.

Condor version is 6.4.7 (yikes -- old!) on the machines trying to run
the job and 6.7.5 on the machines submitting the job.  Is it just a
matter of upgrading condor?  Is there somewhere special I can look in
the manual?

(Section 3.7.4 looks like a good place to start, but I don't see in there
how to fix our problem -- or rather how to have our SysAdmin try to fix
the problem).

Thanks for any pointers. Have a great weekend...

Cheers,
-Jeff

-- 
------------------------------------------------------------
Jeff Linderoth                               O: 610-758-4879
Asst. Professor                              
Industrial and Systems Engineering           jtl3@xxxxxxxxxx
Lehigh University                       www.lehigh.edu/~jtl3