[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] reason for suspended jobs



Hello,


----- Mensagem original -----
> De: "Ben Cotton" <bcotton@xxxxxxxxxxxxxxxxx>
> Para: "HTCondor-Users Mail List" <htcondor-users@xxxxxxxxxxx>
> Enviadas: Sexta-feira, 20 de abril de 2018 18:46:19
> Assunto: Re: [HTCondor-users] reason for suspended jobs
> 
> 
> 
> 
> HI Carlos,
> 
> 
> What are the values of SUSPEND and WANT_SUSPEND on your execute
> nodes?
> 


CAN_RUN_WHOLE_MACHINE = SlotID == $(WHOLE_MACHINE_SLOT)

SINGLE_CORE_SLOTS_CLAIMED = ($(WHOLE_MACHINE_SLOT_STATE) =?= 'Claimed') < (Slot1_State =?= 'Claimed' ) + (Slot2_State =?= 'Claimed' ) + (Slot3_State =?= 'Claimed' ) + (Slot4_State =?= 'Claimed' ) + (Slot5_State =?= 'Claimed' ) + (Slot6_State =?= 'Claimed' ) + (Slot7_State =?= 'Claimed' ) + (Slot8_State =?= 'Claimed' ) + (Slot9_State =?= 'Claimed' ) + (Slot10_State =?= 'Claimed' ) + (Slot11_State =?= 'Claimed' ) + (Slot12_State =?= 'Claimed' ) + (Slot13_State =?= 'Claimed' ) + (Slot14_State =?= 'Claimed' ) + (Slot15_State =?= 'Claimed' ) + (Slot16_State =?= 'Claimed' ) + (Slot17_State =?= 'Claimed' ) + (Slot18_State =?= 'Claimed' ) + (Slot19_State =?= 'Claimed' ) + (Slot20_State =?= 'Claimed' ) + (Slot21_State =?= 'Claimed' ) + (Slot22_State =?= 'Claimed' ) + (Slot23_State =?= 'Claimed' ) + (Slot24_State =?= 'Claimed' )

SUSPEND = ($(SUSPEND)) || ( MY.CAN_RUN_WHOLE_MACHINE && ($(SINGLE_CORE_SLOTS_CLAIMED)) )
WANT_SUSPEND = ($(WANT_SUSPEND)) || ($(SUSPEND))
WHOLE_MACHINE_SLOT = ($(DETECTED_CORES)+1)



How do I interpret the options above?

thanks,


--
Carlos Adean
IT Team
linea.gov.br
skype: carlosadean