[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] Status of nodes in a pool



I have no answer to the file lock problem, but the question about VMs is an easy one. Condor will create one virtual machine for each CPU detected on your machine. Hyperthreaded Pentium IV processors will alsow show up as two processors each. There is a setting in the condor_config to ignore hyperthreading, but it has never had any effect for me.

For the hard problem, you might check the system log to make sure it wasn't some general system problem, rather than a Condor-specific one. Good luck.

- dave


Fabiano Portella wrote:
Hi Condor folks!
I'm trying to set a pool of machines in a LAN. I've set up 3 machines (one manager and 2 submitter/executer). After all configuration steps done, I start manager first, then start others machines.
When I type 'condor_status' in manager, I get the following:
Name OpSys Arch State Activity LoadAv Mem ActvtyTime vm1@xxxxxxxxx <mailto:vm1@xxxxxxxxx> LINUX INTEL Unclaimed Idle 0.000 250 0+00:05:04 vm2@xxxxxxxxx <mailto:vm2@xxxxxxxxx> LINUX INTEL Unclaimed Idle 0.000 250 0+00:05:05 vivax.biowebd LINUX INTEL Unclaimed Idle 0.000 757 0+00:04:54 Total Owner Claimed Unclaimed Matched Preempting Backfill INTEL/LINUX 3 0 0 3 0 0 0 Total 3 0 0 3 0 0 0 Inspecting MasterLog file of the machine with problem (the machine that doesn't appear in status of manager), I get the following error: 4/4 18:09:13 ******************************************************
4/4 18:09:13 ** condor_master (CONDOR_MASTER) STARTING UP
4/4 18:09:13 ** /usr/local/condor/sbin/condor_master
4/4 18:09:13 ** $CondorVersion: 6.7.18 Mar 22 2006 $
4/4 18:09:13 ** $CondorPlatform: I386-LINUX_RH9 $
4/4 18:09:13 ** PID = 28069
4/4 18:09:13 ******************************************************
4/4 18:09:13 Using config file: /usr/local/condor/etc/condor_config
4/4 18:09:13 Using local config files: /usr/local/condor/local.genome/condor_config.local 4/4 18:09:13 FileLock::obtain(1) failed - errno 11 (Resource temporarily unavailable) 4/4 18:09:13 ERROR "Can't get lock on "/tmp/condor-lock.genome0.140998920499445/InstanceLock"" at line 973 in file master.C What can I do to solve that? This is a local file and it exists in filesystem. Another question is related do condor_status in manager machine. Is correct that this manager prints 2 vm (I suppose to be virtual machines)? I though that each node appear just once in condor_status command. Thanks in advance.
Regards,
Fabiano.

------------------------------------------------------------------------
Abra sua conta no Yahoo! Mail <http://us.rd.yahoo.com/mail/br/tagline/mail/*http://br.info.mail.yahoo.com/> - 1GB de espaço, alertas de e-mail no celular e anti-spam realmente eficaz.


------------------------------------------------------------------------

_______________________________________________
Condor-users mailing list
Condor-users@xxxxxxxxxxx
https://lists.cs.wisc.edu/mailman/listinfo/condor-users