[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] Setting Condor Job Owner in Windows



Hi Todd,

Your idea sounded great! However, I tried it without success (the job still
starts as SYSTEM even thought the condor.submit file say +Owner = "diane",
and I had reconfigured condor to disable Queue access checks).  The job
never gets into the queue and returns with condor.error:

ERROR: No credential stored for SYSTEM@NT AUTHORITY

	Correct this by running:
	condor_store_cred add

In hopes of figuring this out, I have included here the relevant parts of
the condor logs (in particular SchedLog showing queue access checks
disabled), and my condor.submit file. 

If you have any insights that would be great.
Thanks,
Diane
-------------------------------------------------------------------
Condor.submit file:
# Autogenerated condor submit file
# Generated: Oct 3, 2007 11:17:04 AM

+Owner = "diane"

universe = vanilla
executable = EnsembleAspect.bat

# Transfer files back & forth (no NFS mount)
should_transfer_files = Yes
when_to_transfer_output = ON_EXIT

# Only send email if there's a problem
notification = Error
rank = VirtualMachineID

arguments = \tmp\MatrixPD_95783\PerlIn_500_200_100_20_PD
transfer_input_files = PerlIn_500_200_100_20_PD,
AspectIn_500_200_100_20_PD.ses,  AspectIn_500_200_100_20_PD.src,
AspectIn_500_200_100_20_PD.rcv,  AspectIn_500_200_100_20_PD.mis
transfer_output_files = PerlOut_500_200_100_20_PD,  dd_500_200_100_20_PD.ses
output = \tmp\MatrixPD_95783\CO_500_200_100_20_PD.out
error = \tmp\MatrixPD_95783\CO_500_200_100_20_PD.err
log = \tmp\MatrixPD_95783\CO_500_200_100_20_PD.log

queue

##############################################

Here are the Condor logs:
SchedLog:
10/3 11:12:42 (pid:2152)
******************************************************
10/3 11:12:42 (pid:2152) ** condor_schedd.exe (CONDOR_SCHEDD) STARTING UP
10/3 11:12:42 (pid:2152) ** C:\condor\bin\condor_schedd.exe
10/3 11:12:42 (pid:2152) ** $CondorVersion: 6.8.5 May 17 2007 $
10/3 11:12:42 (pid:2152) ** $CondorPlatform: INTEL-WINNT50 $
10/3 11:12:42 (pid:2152) ** PID = 2152
10/3 11:12:42 (pid:2152) ** Log last touched 10/3 11:11:32
10/3 11:12:42 (pid:2152)
******************************************************
10/3 11:12:42 (pid:2152) Using config source: C:\condor\condor_config
10/3 11:12:42 (pid:2152) Using local config sources: 
10/3 11:12:42 (pid:2152)    C:\condor/condor_config.local
10/3 11:12:42 (pid:2152) DaemonCore: Command Socket at <192.168.2.105:1030>
10/3 11:12:42 (pid:2152) History file rotation is enabled.
10/3 11:12:42 (pid:2152)   Maximum history file size is: 20971520 bytes
10/3 11:12:42 (pid:2152)   Number of rotated history files is: 2
10/3 11:12:42 (pid:2152) NOTE: QUEUE_ALL_USERS_TRUSTED=TRUE - all queue
access checks disabled!
10/3 11:12:42 (pid:2152) my_popen: CreateProcess failed
10/3 11:12:42 (pid:2152) Failed to execute
C:\condor/bin/condor_shadow.pvm.exe, ignoring
10/3 11:12:42 (pid:2152) my_popen: CreateProcess failed
10/3 11:12:42 (pid:2152) Failed to execute
C:\condor/bin/condor_shadow.std.exe, ignoring

CollectorLog:
10/3 11:12:41 ******************************************************
10/3 11:12:41 ** condor_collector.exe (CONDOR_COLLECTOR) STARTING UP
10/3 11:12:41 ** C:\condor\bin\condor_collector.exe
10/3 11:12:41 ** $CondorVersion: 6.8.5 May 17 2007 $
10/3 11:12:41 ** $CondorPlatform: INTEL-WINNT50 $
10/3 11:12:41 ** PID = 2132
10/3 11:12:41 ** Log last touched 10/3 11:11:32
10/3 11:12:41 ******************************************************
10/3 11:12:41 Using config source: C:\condor\condor_config
10/3 11:12:41 Using local config sources: 
10/3 11:12:41    C:\condor/condor_config.local
10/3 11:12:41 DaemonCore: Command Socket at <192.168.2.105:9618>
10/3 11:12:41 In ViewServer::Init()
10/3 11:12:41 In CollectorDaemon::Init()
10/3 11:12:41 In ViewServer::Config()
10/3 11:12:41 In CollectorDaemon::Config()
10/3 11:12:42 enable: Creating stats hash table
10/3 11:12:42 (Sending 0 ads in response to query)
10/3 11:12:42 Got QUERY_STARTD_PVT_ADS
10/3 11:12:42 (Sending 0 ads in response to query)
10/3 11:12:42 NegotiatorAd  : Inserting ** "< winxp-dev-01 >"
10/3 11:12:42 stats: Inserting new hashent for
'Negotiator':'winxp-dev-01':'192.168.2.105'
10/3 11:12:42 WARNING:  No master ad for < winxp-dev-01 >
10/3 11:12:42 ScheddAd     : Inserting ** "< winxp-dev-01 , 192.168.2.105 >"
10/3 11:12:42 stats: Inserting new hashent for
'Schedd':'winxp-dev-01':'192.168.2.105'
10/3 11:12:46 ** Master < winxp-dev-01 > rejuvenated from recently down
10/3 11:12:46 stats: Inserting new hashent for
'Master':'winxp-dev-01':'192.168.2.105'
10/3 11:13:17 WARNING:  No master ad for < vm1@winxp-dev-01 >
10/3 11:13:17 StartdAd     : Inserting ** "< vm1@winxp-dev-01 ,
192.168.2.105 >"
10/3 11:13:17 stats: Inserting new hashent for
'Start':'vm1@winxp-dev-01':'192.168.2.105'
10/3 11:13:17 StartdPvtAd  : Inserting ** "< vm1@winxp-dev-01 ,
192.168.2.105 >"
10/3 11:13:17 stats: Inserting new hashent for
'StartdPvt':'vm1@winxp-dev-01':'192.168.2.105'
10/3 11:13:18 WARNING:  No master ad for < vm2@winxp-dev-01 >
10/3 11:13:18 StartdAd     : Inserting ** "< vm2@winxp-dev-01 ,
192.168.2.105 >"
10/3 11:13:18 stats: Inserting new hashent for
'Start':'vm2@winxp-dev-01':'192.168.2.105'
10/3 11:13:18 StartdPvtAd  : Inserting ** "< vm2@winxp-dev-01 ,
192.168.2.105 >"
10/3 11:13:18 stats: Inserting new hashent for
'StartdPvt':'vm2@winxp-dev-01':'192.168.2.105'
10/3 11:17:42 (Sending 5 ads in response to query)
10/3 11:17:42 Got QUERY_STARTD_PVT_ADS
10/3 11:17:42 (Sending 2 ads in response to query)
10/3 11:17:42 NegotiatorAd  : Inserting ** "< winxp-dev-01 >"
10/3 11:22:42 (Sending 5 ads in response to query)
10/3 11:22:42 Got QUERY_STARTD_PVT_ADS
10/3 11:22:42 (Sending 2 ads in response to query)
10/3 11:22:42 NegotiatorAd  : Inserting ** "< winxp-dev-01 >"

StartLog:
10/3 11:12:42 ******************************************************
10/3 11:12:42 ** condor_startd.exe (CONDOR_STARTD) STARTING UP
10/3 11:12:42 ** C:\condor\bin\condor_startd.exe
10/3 11:12:42 ** $CondorVersion: 6.8.5 May 17 2007 $
10/3 11:12:42 ** $CondorPlatform: INTEL-WINNT50 $
10/3 11:12:42 ** PID = 2160
10/3 11:12:42 ** Log last touched 10/3 11:11:32
10/3 11:12:42 ******************************************************
10/3 11:12:42 Using config source: C:\condor\condor_config
10/3 11:12:42 Using local config sources: 
10/3 11:12:42    C:\condor/condor_config.local
10/3 11:12:42 DaemonCore: Command Socket at <192.168.2.105:1031>
10/3 11:13:02 my_popen: CreateProcess failed
10/3 11:13:02 Failed to execute C:\condor/bin/condor_starter.pvm.exe,
ignoring
10/3 11:13:02 my_popen: CreateProcess failed
10/3 11:13:02 Failed to execute C:\condor/bin/condor_starter.std.exe,
ignoring
10/3 11:13:02 vm1: New machine resource allocated
10/3 11:13:02 vm2: New machine resource allocated
10/3 11:13:07 no loadavg samples this minute, maybe thread died???
10/3 11:13:07 About to run initial benchmarks.
10/3 11:13:13 Completed initial benchmarks.
10/3 11:13:13 vm2: State change: IS_OWNER is false
10/3 11:13:13 vm2: Changing state: Owner -> Unclaimed
10/3 11:13:13 vm1: State change: IS_OWNER is false
10/3 11:13:13 vm1: Changing state: Owner -> Unclaimed

MasterLog:
10/3 11:12:31 ******************************************************
10/3 11:12:31 ** Condor (CONDOR_MASTER) STARTING UP
10/3 11:12:31 ** C:\condor\bin\condor_master.exe
10/3 11:12:31 ** $CondorVersion: 6.8.5 May 17 2007 $
10/3 11:12:31 ** $CondorPlatform: INTEL-WINNT50 $
10/3 11:12:31 ** PID = 256
10/3 11:12:31 ** Log last touched 10/3 11:11:32
10/3 11:12:31 ******************************************************
10/3 11:12:31 Using config source: C:\condor\condor_config
10/3 11:12:31 Using local config sources: 
10/3 11:12:31    C:\condor/condor_config.local
10/3 11:12:31 DaemonCore: Command Socket at <192.168.2.105:1026>
10/3 11:12:41 Started DaemonCore process
"C:\condor/bin/condor_collector.exe", pid and pgroup = 2132
10/3 11:12:41 Started DaemonCore process
"C:\condor/bin/condor_negotiator.exe", pid and pgroup = 2144
10/3 11:12:41 Started DaemonCore process "C:\condor/bin/condor_schedd.exe",
pid and pgroup = 2152
10/3 11:12:41 Started DaemonCore process "C:\condor/bin/condor_startd.exe",
pid and pgroup = 2160

NegotiatorLog:
10/3 11:12:41 ******************************************************
10/3 11:12:41 ** condor_negotiator.exe (CONDOR_NEGOTIATOR) STARTING UP
10/3 11:12:41 ** C:\condor\bin\condor_negotiator.exe
10/3 11:12:41 ** $CondorVersion: 6.8.5 May 17 2007 $
10/3 11:12:41 ** $CondorPlatform: INTEL-WINNT50 $
10/3 11:12:41 ** PID = 2144
10/3 11:12:41 ** Log last touched 10/3 11:11:32
10/3 11:12:41 ******************************************************
10/3 11:12:41 Using config source: C:\condor\condor_config
10/3 11:12:42 Using local config sources: 
10/3 11:12:42    C:\condor/condor_config.local
10/3 11:12:42 DaemonCore: Command Socket at <192.168.2.105:1029>
10/3 11:12:42 ACCOUNTANT_HOST = None (local)
10/3 11:12:42 NEGOTIATOR_INTERVAL = 300 sec
10/3 11:12:42 NEGOTIATOR_TIMEOUT = 30 sec
10/3 11:12:42 MAX_TIME_PER_SUBMITTER = 31536000 sec
10/3 11:12:42 MAX_TIME_PER_PIESPIN = 31536000 sec
10/3 11:12:42 PREEMPTION_REQUIREMENTS = ( (CurrentTime -
EnteredCurrentState) > (1 * (60 * 60)) && RemoteUserPrio > SubmittorPrio *
1.2 ) || (MY.NiceUser == True)
10/3 11:12:42 PREEMPTION_RANK = (RemoteUserPrio * 1000000) -
TARGET.ImageSize
10/3 11:12:42 NEGOTIATOR_PRE_JOB_RANK = RemoteOwner =?= UNDEFINED
10/3 11:12:42 NEGOTIATOR_POST_JOB_RANK = None
10/3 11:12:42 Warning: attempting to compare null hostnames in same_host.
10/3 11:12:42 ---------- Started Negotiation Cycle ----------
10/3 11:12:42 Phase 1:  Obtaining ads from collector ...
10/3 11:12:42   Getting all public ads ...
10/3 11:12:42   Sorting 0 ads ...
10/3 11:12:42   Getting startd private ads ...
10/3 11:12:42 Got ads: 0 public and 0 private
10/3 11:12:42 Public ads include 0 submitter, 0 startd
10/3 11:12:42 Phase 2:  Performing accounting ...
10/3 11:12:42 Phase 3:  Sorting submitter ads by priority ...
10/3 11:12:42 Phase 4.1:  Negotiating with schedds ...
10/3 11:12:42 ---------- Finished Negotiation Cycle ----------
10/3 11:17:42 ---------- Started Negotiation Cycle ----------
10/3 11:17:42 Phase 1:  Obtaining ads from collector ...
10/3 11:17:42   Getting all public ads ...
10/3 11:17:42   Sorting 5 ads ...
10/3 11:17:42   Getting startd private ads ...
10/3 11:17:42 Got ads: 5 public and 2 private
10/3 11:17:42 Public ads include 0 submitter, 2 startd
10/3 11:17:42 Phase 2:  Performing accounting ...
10/3 11:17:42 Phase 3:  Sorting submitter ads by priority ...
10/3 11:17:42 Phase 4.1:  Negotiating with schedds ...
10/3 11:17:42 ---------- Finished Negotiation Cycle ----------


-----Original Message-----
From: condor-users-bounces@xxxxxxxxxxx
[mailto:condor-users-bounces@xxxxxxxxxxx] On Behalf Of Todd Tannenbaum
Sent: Wednesday, October 03, 2007 6:54 AM
To: Condor-Users Mail List
Subject: Re: [Condor-users] Setting Condor Job Owner in Windows


Here is another idea that is much easier:

In your condor_config file on your submit machine, set the following to 
tell Condor to not authenticate users who submit to the queue:
    QUEUE_ALL_USERS_TRUSTED = True

Then do a condor_reconfig.

Now, in your submit file, you can set Owner as follows:

   +Owner = "diane"
   executable = ....
   universe = vanilla
   queue

Note the double quotes and the + sign in front of Owner.

Voila  - you can now submit jobs as any owner you want.  BTW, this would 
work on Windows or Unix.  BUT --- realize the incredible security 
implications of setting QUEUE_ALL_USERS_TRUSTED to True - you had better 
carefully restrict who can access the schedd on your submit machine!!

-- 
Todd Tannenbaum                       University of Wisconsin-Madison
Condor Project Research               Department of Computer Sciences
_______________________________________________
Condor-users mailing list
To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/condor-users

The archives can be found at: 
https://lists.cs.wisc.edu/archive/condor-users/