[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] Condor and KVM: cannot connect to qemu:///session



What happens when you try to open via virsh?  

On Mon, 2010-12-13 at 13:55 -0500, Ryan Jansen wrote:
> Tim,
> 
> That's what I suspected at first, but it looks like the vm-gahp is
> running as root. Here's the vm-gahp log with D_FULLDEBUG on:
> 
> 12/13 13:41:37 Running as root.  Enabling specialized core dump
> routines
> 12/13 13:41:37 DaemonCore: Command Socket at <10.32.72.74:9077>
> 12/13 13:41:37 Will use UDP to update collector cclweb00.cse.nd.edu
> <129.74.152.166:9618>
> 12/13 13:41:37 VMGAHP[916]: VM-GAHP initialized with run-mode 3
> 12/13 13:41:37 VMGAHP[916]: Initial UID/GUID=0/0,
> EUID/EGUID=126019/1313, Condor UID/GID=108172,40
> 12/13 13:41:37 VMGAHP[916]: Initialize Uids: caller=root, job
> user=rjansen
> 12/13 13:41:37 VMGAHP[916]: Constructed VMGahp
> 12/13 13:41:37 VMGAHP[916]: Command: COMMANDS
> 12/13 13:41:38 VMGAHP[916]: Command: SUPPORT_VMS
> 12/13 13:41:38 VMGAHP[916]: Execute commands: S xen kvm vmware
> 12/13 13:41:39 VMGAHP[916]: Command: ASYNC_MODE_ON
> 12/13 13:41:40 VMGAHP[916]: Command: CLASSAD
> 12/13 13:41:43 VMGAHP[916]: Command: CONDOR_VM_START
> 12/13 13:41:43 VMGAHP[916]: Constructed VM_Type.
> 12/13 13:41:43 ERROR "Failed to create libvirt connection: could not
> connect to qemu:///session" at line 989 in file xen_type.cpp
> 
> Based on the log output, It appears to be running as root, and it
> knows that the job user is rjansen. Does that look normal, or do you
> still think it's most likely a permissions problem? Is there any way
> to get some more useful output from libvirt, maybe explaining why it
> couldn't connect?
> 
> Thanks,
> Ryan
> 
> 
> On Mon, Dec 13, 2010 at 1:11 PM, Timothy St. Clair
> <tstclair@xxxxxxxxxx> wrote:
>         If you can verify that your libvirtd is running & qemu+kvm are
>         installed
>         properly (check via virsh command prompt), then it is likely a
>         permissions issue.  Condor's vm-gahp requires it be started
>         with
>         elevated priv's(~root) in order to communicate with the
>         libvirtd.
>         
>         Cheers,
>         Tim
>         
>         
>         On Mon, 2010-12-13 at 12:01 -0500, Ryan Jansen wrote:
>         > Hi Tim,
>         >
>         > Thanks for the email and sorry for taking so long to get
>         back to you.
>         >
>         > I'm using libvirt version 0.6.3.
>         >
>         > Ryan
>         >
>         > On Wed, Dec 8, 2010 at 11:13 AM, Timothy St. Clair
>         > <tstclair@xxxxxxxxxx> wrote:
>         >         what version of libvirt are you using?
>         >
>         >         Cheers,
>         >         Tim
>         >
>         >
>         >         On Tue, 2010-12-07 at 16:36 -0500, Ryan Jansen
>         wrote:
>         >         > Hi everyone,
>         >         >
>         >         > I'm having a problem getting Condor to start up a
>         KVM
>         >         virtual machine
>         >         > in Condor. I posted an email before, and with
>         advice from a
>         >         few
>         >         > people, I was able to sort out my KVM problems.
>         But now,
>         >         whenever I
>         >         > run a vm universe job, the condor_vm-gahp fails
>         with the
>         >         following
>         >         > error:
>         >         >
>         >         > 12/07 16:18:12 ** condor_vm-gahp (CONDOR_VM_GAHP)
>         STARTING
>         >         UP
>         >         > 12/07 16:18:12
>         >         >
>         >
>         ** /afs/nd.edu/user37/condor/software/versions/amd64-redhat5/condor-7.4.2-dynamic/sbin/condor_vm-gahp
>         >         > 12/07 16:18:12 ** SubsystemInfo: name=VM_GAHP
>         type=GAHP(9)
>         >         > class=DAEMON(1)
>         >         > 12/07 16:18:12 ** Configuration: subsystem:VM_GAHP
>         >         local:<NONE>
>         >         > class:DAEMON
>         >         > 12/07 16:18:12 ** $CondorVersion: 7.4.2 Mar 29
>         2010 BuildID:
>         >         227044 $
>         >         > 12/07 16:18:12 ** $CondorPlatform:
>         X86_64-LINUX_RHEL5 $
>         >         > 12/07 16:18:12 ** PID = 13583
>         >         > 12/07 16:18:12 ** Log last touched 12/7 16:18:10
>         >         > 12/07 16:18:12
>         >
>         ******************************************************
>         >         > 12/07 16:18:12 Using config
>         >         > source: /afs/nd.edu/user37/condor/condor_config
>         >         > 12/07 16:18:12 Using local config sources:
>         >         > 12/07
>         >         > 16:18:12
>         >
>          /afs/nd.edu/user37/condor/software/config/machines/dqcneh100.local
>         >         > 12/07 16:18:12 DaemonCore: Command Socket at
>         >         <10.32.72.74:9118>
>         >         > 12/07 16:18:12 VMGAHP[13583]: VM-GAHP initialized
>         with
>         >         run-mode 3
>         >         > 12/07 16:18:12 VMGAHP[13583]: Initial
>         UID/GUID=0/0,
>         >         > EUID/EGUID=126019/1313, Condor UID/GID=108172,40
>         >         > 12/07 16:18:12 VMGAHP[13583]: Initialize Uids:
>         caller=root,
>         >         job
>         >         > user=rjansen
>         >         > 12/07 16:18:18 ERROR "Failed to create libvirt
>         connection:
>         >         could not
>         >         > connect to qemu:///session" at line 989 in file
>         xen_type.cpp
>         >         >
>         >         > Now, I have adjusted /etc/libvirt/libvirt.conf to
>         allow the
>         >         libvirt
>         >         > group to access the libvirt rw socket, and I added
>         the users
>         >         root,
>         >         > rjansen, and condor to that group.
>         >         >
>         >         > Additionally, I can connect just fine (as root and
>         rjansen)
>         >         to
>         >         > qemu:///session, through virsh, and through the
>         libvirt C
>         >         library
>         >         > using example code from the qemu website. In fact,
>         the code
>         >         I use to
>         >         > connect to the library in the example program is
>         essentially
>         >         the same
>         >         > as the code on line 989 in xen_type.cpp, which is
>         failing.
>         >         >
>         >         > I'm not sure if I'm doing something wrong with
>         Condor or
>         >         something
>         >         > wrong with KVM/libvirt, but I'd like to get this
>         working.
>         >         >
>         >         > Does anyone have any ideas on how to fix this
>         problem?
>         >         >
>         >         > Thanks,
>         >         > Ryan
>         >
>         >         > _______________________________________________
>         >         > Condor-users mailing list
>         >         > To unsubscribe, send a message to
>         >         condor-users-request@xxxxxxxxxxx with a
>         >         > subject: Unsubscribe
>         >         > You can also unsubscribe by visiting
>         >         >
>         https://lists.cs.wisc.edu/mailman/listinfo/condor-users
>         >         >
>         >         > The archives can be found at:
>         >         > https://lists.cs.wisc.edu/archive/condor-users/
>         >
>         >         _______________________________________________
>         >         Condor-users mailing list
>         >         To unsubscribe, send a message to
>         >         condor-users-request@xxxxxxxxxxx with a
>         >         subject: Unsubscribe
>         >         You can also unsubscribe by visiting
>         >
>         https://lists.cs.wisc.edu/mailman/listinfo/condor-users
>         >
>         >         The archives can be found at:
>         >         https://lists.cs.wisc.edu/archive/condor-users/
>         >
>         
>         
>