[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] Custom print formats with WHERE clause & condor_status seg fault



this bug has been fixed in the 8.2.2 release.

On 8/5/2014 4:17 PM, andrew.lahiff@xxxxxxxxxx wrote:
Yes, I'm using STATUS_DEFAULT_STARTD_PRINT_FORMAT_FILE.

Thanks,
Andrew.

________________________________________
From: John (TJ) Knoeller [johnkn@xxxxxxxxxxx]
Sent: Tuesday, August 05, 2014 10:00 PM
To: HTCondor-Users Mail List
Subject: Re: [HTCondor-users] Custom print formats with WHERE clause & condor_status seg fault

Thanks.  are you perhaps using a STATUS_DEFAULT_* parameter to set the
default condor status output?

I think I see a bug where those knobs don't work with with WHERE clause.

-tj


On 8/5/2014 5:09 AM, andrew.lahiff@xxxxxxxxxx wrote:
Here is the full output from gdb:

(gdb) r
Starting program: /scratch/alahiff/condor/crash/condor-8.2.1-x86_64_RedHat6-unstripped/bin/condor_status
[Thread debugging using libthread_db enabled]

Program received signal SIGSEGV, Segmentation fault.
List<char>::Append (this=0x68, obj=0x61c450 "PartitionableSlot")
      at /slots/04/dir_29130/userdir/src/condor_utils/list.h:310
310   /slots/04/dir_29130/userdir/src/condor_utils/list.h: No such file or directory.
       in /slots/04/dir_29130/userdir/src/condor_utils/list.h
Missing separate debuginfos, use: debuginfo-install expat-2.0.1-11.el6_2.x86_64 glibc-2.12-1.132.el6.x86_64 globus-callout-2.4-2.el6.x86_64 globus-common-14.10-2.el6.x86_64 globus-gsi-callback-4.6-2.el6.x86_64 globus-gsi-cert-utils-8.6-2.el6.x86_64 globus-gsi-credential-6.0-2.el6.x86_64 globus-gsi-openssl-error-2.1-10.el6.x86_64 globus-gsi-proxy-core-6.2-9.el6.x86_64 globus-gsi-proxy-ssl-4.1-10.el6.x86_64 globus-gsi-sysconfig-5.3-8.el6.x86_64 globus-gss-assist-9.0-1.el6.x86_64 globus-gssapi-gsi-10.10-2.el6.x86_64 globus-openssl-module-3.3-2.el6.x86_64 keyutils-libs-1.4-4.el6.x86_64 krb5-libs-1.10.3-10.el6_4.3.x86_64 libcom_err-1.41.12-14.el6.x86_64 libgcc-4.4.7-3.el6.x86_64 libselinux-2.0.94-5.3.el6.x86_64 libstdc++-4.4.7-3.el6.x86_64 libtool-ltdl-2.2.6-15.5.el6.x86_64 openssl-1.0.1e-16.el6_5.14.x86_64 pcre-7.8-6.el6.x86_64 voms-2.0.11-2.el6.x86_64 zlib-1.2.3-29.el6.x86_64
(gdb) bt full
#0  List<char>::Append (this=0x68, obj=0x61c450 "PartitionableSlot")
      at /slots/04/dir_29130/userdir/src/condor_utils/list.h:310
          item = <value optimized out>
#1  0x00007ffff75e6532 in GenericQuery::addCustomAND (this=0x8, value=<value optimized out>)
      at /slots/04/dir_29130/userdir/src/condor_utils/generic_query.cpp:168
          x = <value optimized out>
#2  0x00000000004087fc in set_status_print_mask_from_stream (streamid=<value optimized out>,
      is_filename=<value optimized out>) at /slots/04/dir_29130/userdir/src/condor_status.V6/status.cpp:585
          constraint = 0x62399d "PartitionableSlot"
          attr = <value optimized out>
          messages = ""
          file = <value optimized out>
          err = 0
          where_expr = "PartitionableSlot"
          attrs = {_vptr.StringList = 0x7ffff79a9d90, m_strings = {_vptr.List = 0x7ffff79a8a30,
              dummy = 0x61afd0, current = 0x621bf0, num_elem = 12}, m_delimiters = 0x61afb0 " ,"}
          pstream = 0x0
#3  0x000000000040d084 in setPPstyle (pps=PP_STARTD_NORMAL, i=0, argv=0x40dea7 "<default>")
      at /slots/04/dir_29130/userdir/src/condor_status.V6/setflags.cpp:106
          stat_buff = {st_dev = 2050, st_ino = 419071, st_nlink = 1, st_mode = 33188, st_uid = 0, st_gid = 0,
            __pad0 = 0, st_rdev = 0, st_size = 698, st_blksize = 4096, st_blocks = 8, st_atim = {
              tv_sec = 1407186730, tv_nsec = 0}, st_mtim = {tv_sec = 1407186726, tv_nsec = 0}, st_ctim = {
              tv_sec = 1407186726, tv_nsec = 0}, __unused = {0, 0, 0}}
          param_name = {Data = 0x61e4f0 "STATUS_DEFAULT_STARTD_PRINT_FORMAT_FILE", dummy = 0 '\000', Len = 39,
            capacity = 60, tokenBuf = 0x0, nextToken = 0x0}
          pf_file = 0x6238f0 "/etc/condor/status_startd_ral.cpf"
          setBy = 0
          setArg = 0x0
#4  0x000000000040d2b9 in setMode (mod=<value optimized out>, i=0, argv=0x40dea7 "<default>")
      at /slots/04/dir_29130/userdir/src/condor_status.V6/setflags.cpp:294
          setBy = 0
          setArg = 0x0
---Type <return> to continue, or q <return> to quit---
#5  0x000000000040af13 in main (argc=1, argv=0x7fffffffde88)
      at /slots/04/dir_29130/userdir/src/condor_status.V6/status.cpp:153
          totals = {ppo = PP_NOTSET, malformed = 0, allTotals = {tableSize = 0, numElems = 0, ht = 0x0,
              hashfcn = 0x3000000003, maxLoadFactor = 2.6312747812465374e-312, duplicateKeyBehavior = 119,
              currentBucket = 110, currentItem = 0x8,
              activeIterators = std::vector of length -17592168991829, capacity 17051357 = {0x48d889484e75c085,
                0x245c8b4820246c8b, 0x8b4c2824648b4c18, 0xc338c4834830246c, 0x841f0f2e66, 0x20728b493b8b0000,
                0x2454894c287a0349, 0xc0850000e12ce808, 0x62840f0824548b4c, 0xfb5ebdb31ffffff,
                0x75c0850c458b001f, 0x4566ed75e48545f2, 0x1f0fe5eba079ed85, 0x419574c085480040,
                0x415014b70f40528b, 0xf88302e0833c428b, 0xffe181c019d18901, 0x3c083d0f700007f,
                0xffffff6c8f0fc839, 0x428b41ae78d28566, 0x8941c08501508d38, 0x85a89499f753852,
                0x48ffffff4de9db31, 0xfec3850f00087f83, 0xffff3be9db31ffff, 0x41d78949574190ff,
                0x5441f58949554156, 0xc8ec81485355, 0x848b4808698b4100, 0x44894c0000010824, 0x100249c8b482824,
                0x11824b48b4c0000, 0x9024bc89480000, 0xa0248c89480000, 0x80248489480000, 0x11024848b0000,
                0xac24848900, 0x890000012024848b, 0x8b48000000b42484, 0x8948000001282484, 0x8949218b4d702444,
                0x245c8d4cf18941f0, 0xe1834106e8c14970, 0x2484c7dc048b493f, 0xa8, 0x782444c748,
                0xf0394c28408b4800, 0x8824848948, 0x84f6000000dc840f, 0xd7402000000b424, 0xf030000031480f6,
                0x1580f6000000c584, 0xb8850f20000003, 0x21673905f60000, 0x8b00000667850f08, 0xfc985000002ec88,
                0x508b480000009d84, 0xb48b4868408b4870, 0x528b480000008824, 0x8b48302454894808,
                0x98248489480840, 0x2f8868b480000, 0x580840fc0854800, 0x2f0968b0000, 0x2144000002f48e8b,
                0xe8894cd0148b48c2, 0xe183c18948e8d348, 0x44e8d348d089483f, 0xd02148ead348c989, 0x646850f01a8,
                0xb02484c7, 0xa824bc83000000, 0x848b484574010000, 0xc085480000008024, 0x854810788b481074,
                0xf000002d4850fff, 0x394801c38348001f, 0x31fffffeea870fdd, 0xc8c48148c0, 0x5e415d415c415d5b,
                0x801f0fc35f41, 0x487824448b480000, 0x450b60fb174c085, 0x2a7402fa8004eac0, 0x1fa8065740afa80,
                0xb024848b2e74, 0xc383489074c08500, 0xfe97870fdd394801, 0x441f0f66abebffff, 0x21665e158b0000,
                0x680850fd285, 0xc289483824448948, 0x4c8b482824448b48, 0x848b481089482824, 0x4189480000008824,
                0x6ee900000001b808, 0x801f0fffffff, 0x4838244489480000, 0x480000008824848b, 0x8d4c00216a2c358d,
                0x408b4800216a2d35, 0x448948c0248d4c30, 0x8d4a04e4c1494024, 0x8948e6014d202674,
                0x15fff78948502474, 0x406e8b49002178fe, 0x4e002169f3058d48, 0x8b49ed854820248d,
                0x3b5840f50245c, 0x8d48d231e8894c00, 0xe8894cf3f748fe4b, 0x6c894830244c8948,
                0xd48949dd89484824, 0x4a8d48f1f748d231, 0x49cb8948c8894901, 0x4923348d4e05e0c1,
                0x482464034c05e4c1, 0x4d08247c8b4d19eb, 0x49c4014d7674ff85, 0x5a731e048d49ee39,
---Type <return> to continue, or q <return> to quit---
                0x4c24048b41c68949, 0x247c8b4dde75e839, 0x9024b48b4808, 0x894c082444894c00, 0xc0850000dd84e8ff,
                0x49c0750824448b4c, 0x24748b481024448b, 0x894850247c8b4828, 0x89481824448b4906,
                0x21784215ff0846, 0xfe57e900000001b8, 0xec2949f4894dffff, 0x441f0fffffff76e9,
                0x4840244c8b480000, 0x89480021691c358d, 0x6c8b485b148d48eb, 0xc148c9048d484824,
                0x485006448b4804e0, 0x860fc2394802e0c1, 0x24448b4800000130, 0x8b4cd231ee894438,
                0x8b4800000098248c, 0x8b440000008824bc, 0x48f3f748f0894800, 0x48d231d18948f089,
                0x1c28348302474f7, 0x4805e6c148d68948, 0x4805e1c14811048d, 0x1f0f07eb000d4c8d,
                0x798348d0894c0040, 0x48f1014816740008, 0xea7210148d4cd839, 0xcfebd92948c18948,
                0x298944c089459066, 0xb42484f6c1014d, 0xf0849894c020000, 0x748b480000008d85,
                0x8948187989483824, 0x31487b60f1071, 0x840f023c03e08300, 0x24548b480000026a, 0x2168420d8d4840,
                0x4e0c148d2048d48, 0x8b48015001448348, 0x21772915ff50247c, 0xade93824548b4800,
                0x8824b48b48fffffd, 0x82444894c000000, 0x5c894c10244c8944, 0x850000566be82024,
                0x8b440824448b4cc0, 0x20245c8b4c10244c, 0xc883fffffcfc840f, 0x1f0ffffffd03e9ff,
                0xa024848b480040, 0x8b48104189480000, 0xe918418948702444, 0x17b8d48ffffff78, 0x20bf000055a5e8,
                0xc48949c689480000, 0xc08548ffff7275e8, 0x3cf840fc68949, 0x8548fe24448d4900,
                0x840f3024448948db, 0x24548d4900000101, 0x8b487024748b48fe, 0x894c000000a0248c,
                0x8948d2314568246c, 0x302454894860246c, 0x48000000b424948b, 0x244c894858247489,
                0x4102e283ee894848, 0x841f0fd589, 0x8548086e8b480000, 0x4400000093840fed, 0x184e8b4cd231068b,
                0x48c78944105e8b4c, 0xf88948f4f749f889, 0x74f748d231d18948, 0x894801c283483024...}},
            topLevelTotal = 0x30}


________________________________________
From: John (TJ) Knoeller [johnkn@xxxxxxxxxxx]
Sent: Monday, August 04, 2014 9:56 PM
To: HTCondor-Users Mail List
Subject: Re: [HTCondor-users] Custom print formats with WHERE clause & condor_status seg fault

If you can send me a stack trace I'll take a look.

On 8/4/2014 7:03 AM, andrew.lahiff@xxxxxxxxxx wrote:
Hi,

I've just been trying some of the examples of custom print formats here:

https://htcondor-wiki.cs.wisc.edu/index.cgi/wiki?p=ExperimentalCustomPrintFormats

with HTCondor 8.2.1. The condor_status example for partitionable slots (*) seems to give a segmentation fault:

# condor_status
Segmentation fault

When the WHERE clause is removed the seg fault doesn't occur. Has anyone else seen this problem?

Thanks,
Andrew.

(*)

# status summary output for partitionable slots (8.1.6 or later for some fields)
SELECT
      Machine              AS Machine   WIDTH -32 TRUNCATE
      strcat(OpSysAndVer,"_x",substr(Arch,-2))  AS Platform WIDTH -8
      split(condorversion)[1] AS Condor
      TotalCpus            AS Cpus      PRINTF %4d
      Cpus                 AS Free      PRINTF %4d
      TotalMemory/1024.0   AS " Mem(Gb)" PRINTF %8.2f
      max(childmemory)     AS MaxMem   WIDTH 6 TRUNCATE
      Memory*100.0/TotalMemory AS FreeMem%  PRINTF %8.1f
      NumDynamicSlots      AS Slots    WIDTH 5 TRUNCATE
      TotalLoadAvg*1.0/TotalCpus  AS CpuUtil   PRINTF %7.2f
      RecentJobStarts/20.0  AS Jobs/Min PRINTF %8.2f
WHERE PartitionableSlot
SUMMARY NONE


_______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/htcondor-users/
_______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/htcondor-users/