[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [HTCondor-users] nodes without cgrouped jobs?
- Date: Thu, 7 Jun 2018 13:41:57 -0500
- From: Todd Tannenbaum <tannenba@xxxxxxxxxxx>
- Subject: Re: [HTCondor-users] nodes without cgrouped jobs?
On 6/7/2018 10:44 AM, Thomas Hartmann wrote:
> Hi all,
> I just noticed, that a few of our nodes have their jobs not confined in
> cgroups - i.e., no condor slice at all . These nodes are setup the
> same and on the same release  as the majority of the nodes where the
> jobs are properly cgrouped.
> We are going to drain and reboot these nodes, but maybe somebody has an
> idea, what might have gone wrong here?
Unlike some others on this list, I am not a cgroup expert, but what does "condor_config_val BASE_CGROUP" have to say on these two machines? The default value is "htcondor", so to poke around in /sys/fs/cgroup, I would not be going into system.slice subdirectory (systemd settings), but would do something like:
# ls /sys/fs/cgroup/cpu,cpuacct/htcondor/condor_var_lib_condor_execute_slot1_slot1_*
Hope the above helps
> [root@batch0202 ~]# ls
> ls: cannot access
> No such file or directory
> [root@batch0203 ~]# ls
> HTCondor-users mailing list
> To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
> subject: Unsubscribe
> You can also unsubscribe by visiting
> The archives can be found at:
Todd Tannenbaum <tannenba@xxxxxxxxxxx> University of Wisconsin-Madison
Center for High Throughput Computing Department of Computer Sciences
HTCondor Technical Lead 1210 W. Dayton St. Rm #4257
Phone: (608) 263-7132 Madison, WI 53706-1685