[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] Does condor work on a eucalyptus cloud?



Steve,

I am just trying to acquire more VMs to use as condor pool resources. The
goal to figure out whether the concept of combining both open source
common platforms (condor and eucalyptus) will work or not.
Thanks for the pointers for more reading. If you get it to work, please
share how you did it. I will do the same.

Nadia P.


> Can you be more specific about what you are trying to do on a
> eucalyptus cloud?  I.e. are you trying to configure virtual machines
> in the cloud to act like a normal condor pool, or are you trying
> to use the condor VM universe to launch virtual machines on
> the cloud.
>
> I have not yet made it work on a eucalyptus cloud but I
> expect to try it shortly in the next couple of weeks.  It
> does work on the Nimbus cloud, workspace.globus.org can give
> you more details.
>
> Steve Timm
>
>
> On Wed, 16 Dec 2009, Nadia Papakonstantinou wrote:
>
>> Hi
>>
>> I have been familiar with condor for about a month. I installed release
>> 7.4.0 on a x86_64 CentOS 5.4 and am happily using it in a small cluster
>> of
>> real and virtual machines (VMware) here.
>>
>> We also have a small eucalyptus cloud and I was trying to get condor to
>> work there, without success.
>>
>> Our eucalyptus cloud is configured as "MANAGED" (not by me), and there
>> is
>> no domain name defined in the configuration.
>> I installed condor on two private instances I spawned on this private
>> cloud with owner myself. In only one of the two, condor is master. I
>> configured it to use
>>
>> UID_DOMAIN      = $(FULL_HOSTNAME)
>> FILESYSTEM_DOMAIN   = $(FULL_HOSTNAME)
>> CONDOR_IDS=myId.myGroupId
>>
>> ALLOW_READ = *
>> ALLOW_WRITE = XXX.XXX.XXX.* 192.168.X.Y1 192.168.X.Y2 127.0.0.1
>> *.localdomain
>>
>> XXX.XXX.XXX.* - is the IP address assigned to the instances by
>> eucalyptus
>> for external access
>> 192.168.X.Y* - internal addresses inside the eucalyptus cloud
>>
>>> From each of the instances I can ping the other, but only inside the
>> 192.168.*.* net
>>
>> I can get all condor deamons to run, but once I request condor_status, I
>> get a problem:
>> $condor_status
>> Error:  Could not fetch ads --- can't find collector
>>
>> $condor_q            (that works okay, but it seems to be oblivious of
>> the
>> ethernet address of the instance)
>>
>>
>> -- Submitter: localhost.localdomain : <127.0.0.1:54676> :
>> localhost.localdomain
>> ID      OWNER            SUBMITTED     RUN_TIME ST PRI SIZE CMD
>>
>> 0 jobs; 0 idle, 0 running, 0 held
>>
>> The deamons are running:
>>
>> nadiap@localhost :~
>> $ps -ef | grep condor
>> nadiap   17652     1  0 20:02 ?        00:00:03 condor_master
>> nadiap   17653 17652  0 20:02 ?        00:00:00 condor_collector -f
>> nadiap
>>  17654 17652  0 20:02 ?        00:00:00 condor_schedd -f
>> nadiap   17656 17654  0 20:02 ?        00:00:02 condor_procd -A
>> /tmp/condor-lock.localhost0.17840921886182/procd_pipe.SCHEDD -S 60 -C
>> 6358
>> nadiap   17749 17652  0 20:04 ?        00:00:04 condor_startd -f
>>
>> In the logs, it says the following:
>>
>> ==> /usr/local/condor/local/log/MasterLog <==
>> 12/15 22:11:42 DaemonCore: Command Socket at <127.0.0.1:57888>
>> 12/15 22:11:42 Warning: Collector information was not found in the
>> configuration file. ClassAds will not be sent to the collector and this
>> daemon will not join a larger Condor pool.
>> 12/15 22:11:42 passwd_cache::cache_uid(): getpwnam("condor") failed:
>> user
>> not found
>> 12/15 22:11:42 passwd_cache::cache_uid(): getpwnam("condor") failed:
>> user
>> not found
>> 12/15 22:11:42 Collector port not defined, will use default: 9618
>> 12/15 22:11:43 Started DaemonCore process
>> "/usr/local/condor/sbin/condor_collector", pid and pgroup = 18237
>> 12/15 22:11:46 Started DaemonCore process
>> "/usr/local/condor/sbin/condor_schedd", pid and pgroup = 18240
>> 12/15 22:11:47 Started DaemonCore process
>> "/usr/local/condor/sbin/condor_startd", pid and pgroup = 18241
>>
>>
>> Is there anything special I need to do in the config file to get the
>> collector to work properly?
>>
>> My question to the group is: Has anyone gotten this to work on a
>> eucalyptus cloud? It seems that it ought to, but what is the right
>> incantation?
>>
>> Thanks,
>> Nadia
>>
>>
>> _______________________________________________
>> Condor-users mailing list
>> To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with
>> a
>> subject: Unsubscribe
>> You can also unsubscribe by visiting
>> https://lists.cs.wisc.edu/mailman/listinfo/condor-users
>>
>> The archives can be found at:
>> https://lists.cs.wisc.edu/archive/condor-users/
>>
>
> --
> ------------------------------------------------------------------
> Steven C. Timm, Ph.D  (630) 840-8525
> timm@xxxxxxxx  http://home.fnal.gov/~timm/
> Fermilab Computing Division, Scientific Computing Facilities,
> Grid Facilities Department, FermiGrid Services Group, Assistant Group
> Leader.
>