Re: [HTCondor-users] Problem with condor_pid_ns_init

Hi Greg, Todd,

Thanks, that all makes sense. The SIGQUIT was due to a RANK-based eviction event, so it was appropriate.

I've been trying to check that cgroups and PID namespaces are doing the right thing on our cluster with OSG jobs and didn't realize that the error message was a red herring. I'll ignore it in this context.


> On Feb 9, 2017, at 5:11 PM, Greg Thain <gthain@xxxxxxxxxxx> wrote:
> Duncan:
> I can reproduce this error message by doing a fast shutdown on a startd running with PID namespaces on. But the job does go back to idle as it should, so really the problem is a pollution of the log message. We'll fix this for 8.6.1, but unless there are other problems, I don't think this would be an 8.6 showstopper for LIGO.
> -greg


Duncan Brown                         http://dbrown10.expressions.syr.edu
Charles Brightman Professor of Physics     Room 263-1 Physics Department
Director of the Graduate Program      Syracuse University, NY 13244, USA
Phone: 315 443 5993                                    Fax: 315 443 9103