[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[HTCondor-users] Problems with hibernation



Hi,

I've been trying to get hibernation working again (well, in actual fact to shutdown VMs), however after the startd says it's about to enter hibernation it seems to be cancelled due to problems sending its final message to the collector. In the startd log is this:

02/13/15 16:02:39 allHibernating: resource #1: 'S4' (0x8)
02/13/15 16:02:39 ResMgr: HIBERNATE: Machine cannot be woken by its public network adapter (eth0); hibernating anyway
02/13/15 16:02:39 ResMgr: This machine is about to enter hibernation
02/13/15 16:02:39 In ResMgr::disableResources ()
02/13/15 16:02:39 Publishing ClassAd 'mips' to slot 1 [InSlotList matches]
02/13/15 16:02:39 Publishing ClassAd 'kflops' to slot 1 [InSlotList matches]
02/13/15 16:02:39 update_with_ack: Failed to send query EOM to collector host <130.246.x.y:9618>.
02/13/15 16:02:39 All resources disabled: no.
02/13/15 16:02:39 All resources disabled: no.

Notice the "update_with_ack: Failed to send query EOM to collector host". In the collector log there is a message "Want private ads, but no socket given!":

02/13/15 16:02:40 IPVERIFY: checking vm21.a.b.c against 130.246.x.y
02/13/15 16:02:40 IPVERIFY: matched 130.246.x.y to 130.246.x.y
02/13/15 16:02:40 IPVERIFY: ip found is 1
02/13/15 16:02:40 StartdAd     : Inserting ** "< slot1@xxxxxxxxxx , 130.246.x.y >"
02/13/15 16:02:40 stats: Inserting new hashent for 'Start':'slot1@xxxxxxxxxx':'130.246.x.y'
02/13/15 16:02:40 Want private ads, but no socket given!
02/13/15 16:02:40 In OfflineCollectorPlugin::update ( 60 )

This happens in both 8.2.7 and 8.3.2. Has anyone seen this before or know what's going on? There are no other occurrences of these errors in the logs apart from when machines try to hibernate.

Many Thanks,
Andrew.