Mailing List Archives Public Access	UW Madison Computer Sciences Department Computer Systems Lab

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] cgroups: monitoring network io via net_cls?

Date: Tue, 23 Feb 2016 11:57:17 -0600
From: Todd Tannenbaum <tannenba@xxxxxxxxxxx>
Subject: Re: [HTCondor-users] cgroups: monitoring network io via net_cls?

On 2/23/2016 9:24 AM, Thomas Hartmann wrote:

Hi all,

I just noticed, that for jobs in cgroups not parameters are set for
network, i.e., no /cgroup/net_cls/htcondor/...

My idea was to see, if one could monitor the network I/O for each job
(tc?)? For example the overall send/received packages or after a job has
finished (Probably the same also for blkio could be interesting).

But afais condor uses cgroups only for cpu and mem, or?

HTCondor vanilla universe jobs just use CPU, Memory, and freezercontrollers. Agree that it could be interesting to add blkio and net_cls.

Regarding monitoring network activity of jobs, it is not clear that anet_cls cgroup is really what you want. Last I knew, net_cls will tagtraffic in the kernel so tc could do things like traffic prioritizationby cgroup. But even if you could get traffic totals per cgroup (not surehow), it seems problematic - for instance, you probably don't wanttraffic to the loopback interface to count. So likely what you reallywant is monitor traffic per network interface, and then for each job(slot) to have its own virtual network interface. By having the abilityto give each job its own network identity, you can alsoshape/monitor/control the traffic once it leaves your machine and goesonto the network. This is the approach we explored with the LarkProject, where we did work to add network awareness to HTCondor. See

  https://htcondor-wiki.cs.wisc.edu/index.cgi/wiki?p=LarkProject

It is also the approach for Docker; be aware that as of v8.5.2 ofHTCondor, Docker universe jobs have network input and output usagepublished into the job classad as attributes. See

 https://htcondor-wiki.cs.wisc.edu/index.cgi/tktview?tn=5456

Once we merge in the code we did for the Lark project into mainstreamHTCondor, vanilla universe jobs should also be able to have networkusage attributes. I cannot promise when this will happen for certain,but at least we've been thinking and working on mechanisms to handlenetwork traffic in HTCondor ....


regards
Todd

Follow-Ups:
- Re: [HTCondor-users] cgroups: monitoring network io via net_cls?
  - From: Todd Tannenbaum

References:
- [HTCondor-users] cgroups: monitoring network io via net_cls?
  - From: Thomas Hartmann

Prev by Date: Re: [HTCondor-users] GROUP_SORT_EXPR
Next by Date: Re: [HTCondor-users] cgroups: monitoring network io via net_cls?
Previous by thread: [HTCondor-users] cgroups: monitoring network io via net_cls?
Next by thread: Re: [HTCondor-users] cgroups: monitoring network io via net_cls?
Index(es):
- Date
- Thread

Mailing List Archives

Public Access

Re: [HTCondor-users] cgroups: monitoring network io via net_cls?