
Re: [Condor-users] Erro in condor_chirp



Dan,

The extra junk isn't in this output (see the logs below). Assuming that the
response field shows the number of bytes transferred, the length is OK
(41) if you count the double quotes.
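
As a rough sketch of that length check: take the byte count reported in the response line, keep that many characters of the raw condor_chirp output, and treat anything left over as trailing junk. The helper name `split_reported` is hypothetical, and the host names are placeholders, since the real ones are masked in the archive.

```python
from typing import Tuple


def split_reported(raw: str, reported_len: int) -> Tuple[str, str]:
    """Split raw output into the first reported_len characters and the remainder.

    Assumes plain ASCII output, so characters and bytes count the same.
    """
    return raw[:reported_len], raw[reported_len:]


# Placeholder attribute value, including the surrounding double quotes.
value = '"slot4@node-a.example,slot8@node-b.example"'
raw_output = value + "nfig"  # simulated output with trailing junk

payload, junk = split_reported(raw_output, len(value))
print(repr(payload), repr(junk))
```

If the remainder is non-empty, the extra characters were added after the point where the response length was computed, which would point away from the shadow and starter and toward whatever prints the result.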

Thanks,

Gabriel

From the ShadowLog:

04/12 13:40:15 (167068.0) (10491): About to decode condor_sysnum
04/12 13:40:15 (167068.0) (10491): Got request for syscall get_job_attr (285)
04/12 13:40:15 (167068.0) (10491): pseudo_get_job_attr(AllRemoteHosts) = "slot2@xxxxxxxxxxxxx,slot3@xxxxxxxxxxxxx"
04/12 13:40:15 (167068.0) (10491):      rval = 0, errno = 0
04/12 13:40:15 (167068.0) (10491): About to decode condor_sysnum
04/12 13:40:15 (167068.0) (10491): Got request for syscall get_job_attr (285)
04/12 13:40:15 (167068.0) (10491): pseudo_get_job_attr(AllRemoteHosts) = "slot2@xxxxxxxxxxxxx,slot3@xxxxxxxxxxxxx"
04/12 13:40:15 (167068.0) (10491):      rval = 0, errno = 0
04/12 13:40:19 (167067.0) (10241): About to decode condor_sysnum
04/12 13:40:19 (167067.0) (10241): Got request for syscall register_job_info (-81)
04/12 13:40:19 (167067.0) (10241):      rval = 0, errno = 0


From StarterLog.slot2 (node255):

04/12 13:40:23 IOProxy: accepting connection from 10.200.8.31
04/12 13:40:23 IOProxyHandler: request: get_job_attr JobUniverse
04/12 13:40:23 IOProxyHandler: response: 2
04/12 13:40:23 IOProxyHandler: closing connection to 10.200.8.31
04/12 13:40:23 IOProxy: accepting connection from 10.200.8.31
04/12 13:40:23 IOProxyHandler: request: get_job_attr AllRemoteHosts
04/12 13:40:23 IOProxyHandler: response: 41
04/12 13:40:23 IOProxyHandler: closing connection to 10.200.8.31
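
To compare request and response lengths across a longer StarterLog, the IOProxyHandler lines can be paired up mechanically. This is only a sketch: the regular expression is inferred from the excerpt above and may not match other Condor versions, and `pair_requests` is a hypothetical helper name.

```python
import re

# Pattern inferred from the log excerpt; the format is an assumption.
LINE_RE = re.compile(r"IOProxyHandler: (request|response): (.*)$")


def pair_requests(lines):
    """Return (request, response) pairs in the order they appear."""
    pairs = []
    pending = None
    for line in lines:
        match = LINE_RE.search(line)
        if not match:
            continue  # skip connection open/close lines
        kind, text = match.groups()
        if kind == "request":
            pending = text.strip()
        elif pending is not None:
            pairs.append((pending, text.strip()))
            pending = None
    return pairs


sample = """\
04/12 13:40:23 IOProxy: accepting connection from 10.200.8.31
04/12 13:40:23 IOProxyHandler: request: get_job_attr JobUniverse
04/12 13:40:23 IOProxyHandler: response: 2
04/12 13:40:23 IOProxyHandler: closing connection to 10.200.8.31
04/12 13:40:23 IOProxy: accepting connection from 10.200.8.31
04/12 13:40:23 IOProxyHandler: request: get_job_attr AllRemoteHosts
04/12 13:40:23 IOProxyHandler: response: 41
04/12 13:40:23 IOProxyHandler: closing connection to 10.200.8.31
""".splitlines()

for request, response in pair_requests(sample):
    print(request, "->", response)
```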


On Mon, 2010-04-12 at 11:13 -0500, Dan Bradley wrote:
> Gabriel,
> 
> I would expect to see a line in the shadow log that looks like this:
> 
> pseudo_get_job_attr(AllRemoteHosts) = 
> slot4@xxxxxxxxxxxxx,slot8@xxxxxxxxxxxxx
> 
> What I am curious to know is whether the extra junk that you noticed is 
> showing up in this output or not.
> 
> --Dan
> 
> Gabriel A. von Winckler wrote:
> > Dan,
> >
> > I've increased the debug level in these daemons (and others), but I didn't
> > see anything unusual. Should this debug level print the content of the
> > chirp request?
> >
> > Thanks,
> >
> > Gabriel
> >
> > On Fri, 2010-04-09 at 10:03 -0500, Dan Bradley wrote:
> >   
> >> Adding D_SYSCALLS to SHADOW_DEBUG and STARTER_DEBUG may help.  Then look 
> >> in the shadow and starter logs to see where the extra junk first shows up.
> >>
> >> --Dan
> >>
> >> Gabriel A. von Winckler wrote:
> >>     
> >>> Hi,
> >>>
> >>> I found a very strange situation using condor_chirp in a job running in
> >>> the parallel universe.
> >>>
> >>> My MPI wrapper script executes the following command:
> >>>
> >>> $(condor_config_val libexec)/condor_chirp get_job_attr AllRemoteHosts
> >>>
> >>> The result is as expected (the same as from condor_q -l) except on a
> >>> run with 2 nodes.
> >>>
> >>> Here is the output:
> >>>
> >>> "slot4@xxxxxxxxxxxxx,slot8@xxxxxxxxxxxxx"nfig
> >>>
> >>> Note the extra "nfig".
> >>>
> >>> Is this expected somehow?
> >>> How should I debug this?
> >>>
> >>> I'm using condor 7.4.1.
> >>>
> >>> Thanks,
> >>> Gabriel
> >>>
> >>> _______________________________________________
> >>> Condor-users mailing list
> >>> To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a
> >>> subject: Unsubscribe
> >>> You can also unsubscribe by visiting
> >>> https://lists.cs.wisc.edu/mailman/listinfo/condor-users
> >>>
> >>> The archives can be found at:
> >>> https://lists.cs.wisc.edu/archive/condor-users/
> >>>   
> >>>       
> >>     
> >
> >
> >   