Re: [HTCondor-users] condor-ce: troubleshooting and jobRouter



Hello, Stefano.

Could you use osg-local-job-environment.conf?

I modified that file to add the environment variables:

## /var/lib/osg/osg-local-job-environment.conf
#!/bin/sh
VO_CMS_SW_DIR=/cvmfs/cms.cern.ch
PATH=/usr/local/sbin:/usr/local/bin:/sbin:/bin:/usr/sbin:/usr/bin:/root/bin
export VO_CMS_SW_DIR
export PATH
####################################

 

And here is my JobRouter configuration:

 

## /etc/condor-ce/config.d/61-job-routes.conf
##  61-job-routes.conf
#####################################################
# Example Job Route
#
# This is an extraordinarily simple job route.
# All it does is route local condor and set a
# simple Accounting Group and default RequestMemory.
#####################################################
# No custom functions for job router entries; these are causing crashes in 8.3.5.
# Can remove the eval_set_environment attribute below starting in 8.3.8.
JOB_ROUTER_ENTRIES = [ \
        name = "condor_pool_dteam"; \
        TargetUniverse = 5; \
        Requirements = target.x509UserProxyVOName =?= "dteam"; \
        set_requirements = (Arch == "X86_64") && (TARGET.OpSys == "LINUX"); \
        MaxJobs = 100; \
        MaxIdleJobs = 100; \
] \
[ \
        name = "condor_pool_ops"; \
        TargetUniverse = 5; \
        Requirements = target.x509UserProxyVOName =?= "ops"; \
        set_requirements = (Arch == "X86_64") && (TARGET.OpSys == "LINUX"); \
        MaxJobs = 100; \
        MaxIdleJobs = 100; \
] \
[ \
        name = "condor_pool_cms"; \
        TargetUniverse = 5; \
        Requirements = target.x509UserProxyVOName =?= "cms"; \
        set_requirements = (Arch == "X86_64") && (TARGET.OpSys == "LINUX"); \
        MaxJobs = 1280; \
        MaxIdleJobs = 1280; \
]

-----------------------Original Message-----------------------
From: "Stefano Dal Pra" <stefano.dalpra@xxxxxxxxxxxx>
To: htcondor-users <htcondor-users@xxxxxxxxxxx>
Date: 2018-09-18 22:12:29 GMT +0900 (ROK)
Subject: [HTCondor-users] condor-ce: troubleshooting and jobRouter

Hello,

I'm practicing with HTCondor-CE and need some help, as I'm not very fluent at troubleshooting or configuration.

Test pilot jobs submitted by a CMS factory are failing a validation shell script when running on the execute node. Apparently, the reason is that no environment variable is passed to the job:

Environment = ""

I verified that the shell script succeeds if I submit it from the condor-ce itself by adding

environment = "PATH=/bin:/usr/local/bin:/usr/bin:/usr/local/sbin:/usr/sbin"

to the submit file.

However, if I submit the same job from an external machine, again no environment is passed to the job on the execute node.

That seems to suggest that a few parameters are trimmed away. I think the JobRouter is where such submission parameters might be altered, but I'm not sure at all, and some simpler misconfiguration could explain this problem.

A couple of questions:

1) For jobs I submit, there are log files such as
/var/log/condor-ce/GridmanagerLog.dteam039
containing a line such as:

09/17/18 15:08:10 (D_ALWAYS:2) [4098033] GAHP[4098037] <- 'CONDOR_JOB_SUBMIT [SNIP] Environment\ =\ "PATH=/bin:/usr/local/bin:/usr/bin:/usr/local/sbin:/usr/sbin"; [SNIP]

where I can see the submit file content. However, there is no similar file for the cms user:
/var/log/condor-ce/GridmanagerLog.pilcms017
Is there a way to compare the job parameters "before" and "after" the routing?

2) Does someone have a few examples of job routing configuration for a WLCG-like HTCondor-CE? Currently I'm looking at
https://opensciencegrid.org/docs/compute-element/job-router-recipes/ .
If the examples there are mostly adequate for a non-OSG CE, I can go on and refer to those.
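
On question 1, one possible way to compare a job "before" and "after" routing is to dump both job ClassAds in long format ("Attr = value", one per line) and diff them attribute by attribute. The following is only a minimal Python sketch under assumptions: it supposes the two dumps were already saved as text (for instance from condor_ce_q -l on the CE side and condor_q -l on the batch side), and the function names and sample values are illustrative, not part of HTCondor.

```python
def parse_classad_long(text):
    """Parse long-format ClassAd text ('Attr = value' per line) into a dict."""
    ad = {}
    for line in text.splitlines():
        # Split on the first " = " only; values may contain further "=" signs.
        name, sep, value = line.partition(" = ")
        if sep and name.strip():
            ad[name.strip()] = value.strip()
    return ad


def diff_classads(before, after):
    """Return {attr: (before_value, after_value)} for attributes that differ.

    ClassAd attribute names are case-insensitive, so they are compared
    lowercased. An attribute missing on one side is reported as None.
    """
    b = {k.lower(): v for k, v in before.items()}
    a = {k.lower(): v for k, v in after.items()}
    return {
        k: (b.get(k), a.get(k))
        for k in sorted(set(b) | set(a))
        if b.get(k) != a.get(k)
    }


if __name__ == "__main__":
    # Hypothetical sample ads, not taken from a real job.
    ce_side = parse_classad_long(
        'Cmd = "/bin/validate.sh"\n'
        'Environment = "PATH=/bin:/usr/bin"\n'
    )
    routed = parse_classad_long(
        'Cmd = "/bin/validate.sh"\n'
        'Environment = ""\n'
    )
    for attr, (old, new) in diff_classads(ce_side, routed).items():
        print(f"{attr}: {old} -> {new}")
```

An attribute such as Environment that is set on the CE side but empty in the routed copy would show up in the diff, which would point at the routing step rather than at the submit file.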

 

 

Thanks for any help, bye

Stefano

_______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/htcondor-users/