[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[HTCondor-users] EP: Can't create bridge interface



Hi Team,
here is a log showing that a VM has been started successfully, but then killed because the network interface cannot be created.


Valerio

12/08/23 17:35:15 ******************************************************
12/08/23 17:35:15 ** condor_vm-gahp (CONDOR_VM_GAHP) STARTING UP
12/08/23 17:35:15 ** /usr/sbin/condor_vm-gahp
12/08/23 17:35:15 ** SubsystemInfo: name=VM_GAHP type=GAHP(8) class=CLIENT(2)
12/08/23 17:35:15 ** Configuration: subsystem:VM_GAHP local:<NONE> class:CLIENT
12/08/23 17:35:15 ** $CondorVersion: 23.2.0 2023-11-29 BuildID: 692333 PackageID: 23.2.0-1.1 $
12/08/23 17:35:15 ** $CondorPlatform: X86_64-Debian_12 $
12/08/23 17:35:15 ** PID = 2063
12/08/23 17:35:15 ** Log last touched 12/8 17:34:15
12/08/23 17:35:15 ******************************************************
12/08/23 17:35:15 Using config source: /etc/condor/condor_config
12/08/23 17:35:15 Using local config sources: 
12/08/23 17:35:15    /etc/condor/config.d/01-execute.config
12/08/23 17:35:15    /etc/condor/config.d/50-debug.config
12/08/23 17:35:15    /etc/condor/condor_config.local
12/08/23 17:35:15 config Macros = 103, Sorted = 103, StringBytes = 2672, TablesBytes = 3772
12/08/23 17:35:15 CLASSAD_CACHING is ENABLED
12/08/23 17:35:15 Daemon Log is logging: D_ALWAYS:2 D_ERROR D_STATUS
12/08/23 17:35:15 Internal pipe for signals resized to 4096 from 65536
12/08/23 17:35:15 No shared_port cookie available; will fall back to using on-disk $(DAEMON_SOCKET_DIR)
12/08/23 17:35:15 SharedPortEndpoint: waiting for connections to named socket vm_gahp_2063_630e
12/08/23 17:35:15 DaemonCore: command socket at <10.10.0.40:9618?addrs=10.10.0.40-9618&alias=master11.sel&noUDP&sock=vm_gahp_2063_630e>
12/08/23 17:35:15 DaemonCore: private command socket at <10.10.0.40:9618?addrs=10.10.0.40-9618&alias=master11.sel&noUDP&sock=vm_gahp_2063_630e>
12/08/23 17:35:15 Setting maximum accepts per cycle 8.
12/08/23 17:35:15 Setting maximum UDP messages per cycle 100.
12/08/23 17:35:15 Will use TCP to update collector htcondor.sel <10.10.0.30:9618?alias=htcondor.sel>
12/08/23 17:35:15 No shared_port cookie available; will fall back to using on-disk $(DAEMON_SOCKET_DIR)
12/08/23 17:35:15 VM-GAHP initialized with run-mode 3
12/08/23 17:35:15 Initial UID/GUID=0/0, EUID/EGUID=65534/65534, Condor UID/GID=109,116
12/08/23 17:35:15 Initialize Uids: caller=root, job user=nobody
12/08/23 17:35:15 Constructed VMGahp
12/08/23 17:35:15 Command: COMMANDS
12/08/23 17:35:16 Command: SUPPORT_VMS
12/08/23 17:35:16 Execute commands: S xen kvm
12/08/23 17:35:17 Command: ASYNC_MODE_ON
12/08/23 17:35:18 Command: CLASSAD
12/08/23 17:35:21 Command: CONDOR_VM_START
12/08/23 17:35:21 Constructed VM_Type.
12/08/23 17:35:21 In KVMType::CreateConfigFile()
12/08/23 17:35:21 Memory: 4096
12/08/23 17:35:21 Looking up number of vcpus.
12/08/23 17:35:21 Setting up 1 CPUS
12/08/23 17:35:21 MAC address was not defined.
12/08/23 17:35:21 format = /var/lib/libvirt/images/osticket-1.qcow2:vda:w
12/08/23 17:35:21 CreateKvmVMConfigFile
12/08/23 17:35:21 In VirshType::CreateVirshConfigFile
12/08/23 17:35:21 LIBVIRT_XML_SCRIPT_ARGS input_strings= Arguments = ""
AuthTokenId = "f5c0b90e97cdee3fc316ce32bd3684e5"
AuthTokenIssuer = "htcondor.sel"
AuthTokenSubject = "condor@xxxxxxxxxxxx"
AutoClusterAttrs = "ImageSize,MachineLastMatchTime,Offline,RemoteOwner,RequestCpus,RequestDisk,RequestMemory,TotalJobRuntime,ConcurrencyLimits,FlockTo,Rank,Requirements,KFlops,DiskUsage,JobVMMemory,JobVMNetworkingType,JobVMType,Machine"
AutoClusterId = 38
ClusterId = 112
Cmd = "\"OSTICKET\""
CommittedSlotTime = 0
CommittedSuspensionTime = 0
CommittedTime = 0
CondorPlatform = "$CondorPlatform: X86_64-Debian_12 $"
CondorVersion = "$CondorVersion: 23.2.0 2023-11-29 BuildID: 692333 PackageID: 23.2.0-1.1 $"
CpusProvisioned = 1
CumulativeRemoteSysCpu = 0.0
CumulativeRemoteUserCpu = 0.0
CumulativeSlotTime = 0
CumulativeSuspensionTime = 0
CurrentHosts = 1
DiskProvisioned = 4525337
DiskUsage = 4250000
DiskUsage_RAW = 4194304
EnteredCurrentStatus = 1702053313
Environment = ""
Err = "/dev/null"
ExecutableSize = 4250000
ExecutableSize_RAW = 4194304
ExitBySignal = false
ExitStatus = 0
GlobalJobId = "t450.sel#112.0#1702053266"
ImageSize = 4250000
ImageSize_RAW = 4194304
In = "/dev/null"
Iwd = "/var/lib/condor/execute/dir_2061"
JobCurrentStartDate = 1702053313
JobLeaseDuration = 2400
JobNotification = 0
JobPrio = 0
JobRunCount = 1
JobStartDate = 1702053313
JobStatus = 2
JobSubmitMethod = 0
JobUniverse = 13
JobVMCheckpoint = false
JobVMMemory = 4096
JobVMNetworking = true
JobVMNetworkingType = "bridge"
JobVMType = "kvm"
JobVMVNCConsole = false
JobVM_VCPUS = 1
KillSig = "SIGTERM"
LastJobLeaseRenewal = 1702053313
LastJobStatus = 1
LastMatchTime = 1702053313
LastSuspensionTime = 0
LeaveJobInQueue = false
MachineAttrCpus0 = 1
MachineAttrSlotWeight0 = 1
MaxHosts = 1
MemoryProvisioned = 4096
MinHosts = 1
MyAddress = "<10.10.0.47:9618?addrs=10.10.0.47-9618&alias=t450.sel&noUDP&sock=shadow_52504_9150_40>"
MyType = "Job"
NumCkpts = 0
NumCkpts_RAW = 0
NumJobCompletions = 0
NumJobMatches = 1
NumJobStarts = 0
NumRestarts = 0
NumShadowStarts = 1
NumSystemHolds = 0
OrigCmd = "\"OSTICKET\""
OrigIwd = "/home/sel/HTCSS"
OrigMaxHosts = 1
Out = "/dev/null"
Owner = "condor"
ProcId = 0
ProvisionedResources = "Cpus Memory Disk Swap"
PublicClaimId = "<10.10.0.40:9618?addrs=10.10.0.40-9618&alias=master11.sel&noUDP&sock=startd_2005_d77a>#1702053254#1#..."
QDate = 1702053266
Rank = 0.0
RemoteHost = "slot1_1@xxxxxxxxxxxx"
RemoteSlotID = 1
RemoteSysCpu = 0.0
RemoteUserCpu = 0.0
RemoteWallClockTime = 0.0
Renice = 0
RequestCpus = 1
RequestDisk = 4250000
RequestMemory = 4096
Requirements = ((Machine == "master11.sel")) && (TARGET.Arch == "X86_64") && (TARGET.HasVM =?= true) && (TARGET.VM_Type == MY.JobVMType) && (TARGET.VM_AvailNum > 0) && (TARGET.Disk >= RequestDisk) && (TARGET.TotalMemory >= MY.JobVMMemory) && (TARGET.VM_Memory >= MY.JobVMMemory) && TARGET.VM_Networking && stringListIMember(JobVMNetworkingType,TARGET.VM_Networking_Types,",") && (TARGET.HasFileTransfer)
ShadowBday = 1702053313
ShadowIpAddr = "<10.10.0.47:9618?addrs=10.10.0.47-9618&alias=t450.sel&noUDP&sock=shadow_52504_9150_40>"
ShadowVersion = "$CondorVersion: 23.2.0 2023-11-29 BuildID: 692333 PackageID: 23.2.0-1.1 $"
ShouldTransferFiles = "YES"
StartdIpAddr = "<10.10.0.40:9618?addrs=10.10.0.40-9618&alias=master11.sel&noUDP&sock=startd_2005_d77a>"
StartdPrincipal = "execute-side@matchsession/10.10.0.40"
TargetType = "Machine"
TotalSubmitProcs = 1
TotalSuspensions = 0
TransferErr = false
TransferExecutable = false
TransferIn = false
TransferInputSizeMB = 4096
TransferOut = false
TransferSocket = "<10.10.0.47:9618?addrs=10.10.0.47-9618&alias=t450.sel&noUDP&sock=shadow_52504_9150_40>"
UidDomain = "t450.sel"
User = "condor@xxxxxxxx"
UserLog = "/var/log/condor/osticketvm.log"
VMPARAM_No_Output_VM = true
VMPARAM_VM_NAME = "condor_t450.sel_112.0"
VMPARAM_vm_Disk = "/var/lib/libvirt/images/osticket-1.qcow2:vda:w"
VM_WORKING_DIR = "/var/lib/condor/execute/dir_2061"
WhenToTransferOutput = "ON_EXIT"
VMPARAM_Xen_Bootloader = ""
VMPARAM_Xen_Initrd = ""
VMPARAM_Bridge_Interface = "eno1"

12/08/23 17:35:21 Helper stderr output: awk: /usr/libexec/condor/libvirt_simple_script.awk:26: warning: regexp escape sequence `\"' is not a known regexp operator
12/08/23 17:35:21 Inside VirshType::Start
12/08/23 17:35:21 Trying XML: <domain type='kvm'><name>condor_t450.sel_112.0</name><memory>4194304</memory><vcpu>1</vcpu><cpu mode='host-passthrough'/><os><type>hvm</type></os><features><acpi/><apic/><pae/></features><on_poweroff>destroy</on_poweroff><on_reboot>restart</on_reboot><on_crash>restart</on_crash><devices><console type='pty'><source path='/dev/ptmx'/></console><interface type='bridge'><source bridge='eno1'/></interface><disk type='file'><source file='/var/lib/libvirt/images/osticket-1.qcow2'/><target dev='vda'/></disk></devices></domain>
12/08/23 17:35:22 Failed to create libvirt domain: Unable to add bridge eno1 port vnet0: Operation not supported
12/08/23 17:35:22 Inside VirshType::Shutdown
12/08/23 17:35:22 executeStart fail!
12/08/23 17:35:23 Command: RESULTS
12/08/23 17:35:24 Command: QUIT
12/08/23 17:35:24 Inside KVMType::killVMFast
12/08/23 17:35:24 Inside VirshType::killVMFast
12/08/23 17:35:24 killVMFast is called
12/08/23 17:35:26 **** condor_vm-gahp (condor_VM_GAHP) pid 2063 EXITING WITH STATUS 0