[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] rooster on linux



On Mon, Nov 21, 2011 at 02:27:24PM -0600, Dan Bradley wrote:
> 
> >11/21/11 14:07:03 UdpWakeOnLanWaker::doWake: Malformed subnet
> >'255.255.255.128k144.92.167.252'
> 
> What does the classad for slot1@xxxxxxxxxxxxxxxxxxx look like?  Does
> it contain the strange looking subnet string mentioned in the above
> error message?
> 
> --Dan

Subnet mask is ok, but in constructor UdpWakeOnLanWaker::UdpWakeOnLanWaker
SubnetMask attribute is wrong copied.

   found = ad->LookupString (
        ATTR_SUBNET_MASK,
        m_subnet,
        MAX_IP_ADDRESS_LENGTH-1 );

where MAX_IP_ADDRESS_LENGTH is equal 16. So only 15 characters are copied 
"255.255.255.128" without tailing zero '\0' character.

Regards,
Lukas

> 
> On 11/21/11 2:17 PM, Dimitri Maziuk wrote:
> >On 11/21/2011 01:57 PM, Dan Bradley wrote:
> >>Hi Dimitri,
> >>
> >>Do you see any activity in RoosterLog?  If the Unhibernate expression is
> >>true for the offline machine, I would expect rooster to attempt to wake
> >>it up.  If it does so, it will generate a log message.
> >I do actually -- now that I submitted a bunch of jobs to the pool.
> >
> >(Previously I tried setting unhibernate to true in the offline ad and/or
> >condor_config_val and saw only
> >Got 0 startd ads matching ROOSTER_UNHIBERNATE=Offline&&  Unhibernate
> >-- hence the question how to test it.)
> >
> >Now I get
> >
> >11/21/11 14:07:03 (fd:8) (pid:1958) Got 4 startd ads matching
> >ROOSTER_UNHIBERNATE=Offline&&  Unhibernate
> >11/21/11 14:07:03 (fd:8) (pid:1958) Sending wakeup call to
> >slot1@xxxxxxxxxxxxxxxxxxxx
> >11/21/11 14:07:03 (fd:8) (pid:1958) Entering Create_Pipe()
> >11/21/11 14:07:03 (fd:8) (pid:1958) Entering Create_Named_Pipe()
> >11/21/11 14:07:03 (fd:10) (pid:1958) Create_Pipe() success
> >read_handle=65536 write_handle=65537
> >11/21/11 14:07:03 (fd:10) (pid:1958) Entering Create_Pipe()
> >11/21/11 14:07:03 (fd:10) (pid:1958) Entering Create_Named_Pipe()
> >11/21/11 14:07:03 (fd:12) (pid:1958) Create_Pipe() success
> >read_handle=65538 write_handle=65539
> >11/21/11 14:07:03 (fd:12) (pid:1958) In
> >DaemonCore::Create_Process(/usr/bin/condor_power,...)
> >11/21/11 14:07:03 (fd:12) (pid:8021) Create_Process: Arg:
> >/usr/bin/condor_power -d -i
> >11/21/11 14:07:03 (fd:12) (pid:8021) Re-mapping std(in|out|err) in child.
> >11/21/11 14:07:03 (fd:12) (pid:8021) Printing fds to inherit:
> >11/21/11 14:07:03 (fd:12) (pid:8021) About to exec "/usr/bin/condor_power"
> >11/21/11 14:07:03 (fd:12) (pid:1958) Child Process: pid 8021 at
> >11/21/11 14:07:03 (fd:12) (pid:1958) SharedPortEndpoint: Inside destructor.
> >11/21/11 14:07:03 (fd:8) (pid:1958) Close_Pipe(pipe_end=65536) succeeded
> >11/21/11 14:07:03 (fd:8) (pid:1958) Close_Pipe(pipe_end=65539) succeeded
> >11/21/11 14:07:03 (fd:8) (pid:1958) Close_Pipe(pipe_end=65537) succeeded
> >11/21/11 14:07:03 (fd:8) (pid:1958) Close_Pipe(pipe_end=65538) succeeded
> >11/21/11 14:07:03 UdpWakeOnLanWaker::doWake: Malformed subnet
> >'255.255.255.128k144.92.167.252'
> >11/21/11 14:07:03 UdpWakeOnLanWaker::initialize: Failed to initialize
> >broadcast address
> >11/21/11 14:07:03 UdpWakeOnLanWaker: failed to initialize
> >condor_power: failed to send wake packet.
> >
> >
> >
> _______________________________________________
> Condor-users mailing list
> To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a
> subject: Unsubscribe
> You can also unsubscribe by visiting
> https://lists.cs.wisc.edu/mailman/listinfo/condor-users
> 
> The archives can be found at:
> https://lists.cs.wisc.edu/archive/condor-users/
>