[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] 10.0.5 i/ 10.0.6 install submitterCeiling remaining is 2147483647



Turns out the 9618 port on the client nodes was closed â the job is working now.

 

From: HTCondor-users <htcondor-users-bounces@xxxxxxxxxxx> on behalf of Weatherby,Gerard <gweatherby@xxxxxxxx>
Date: Monday, July 3, 2023 at 2:41 PM
To: Todd Tannenbaum <tannenba@xxxxxxxxxxx>, HTCondor-Users Mail List <htcondor-users@xxxxxxxxxxx>
Subject: Re: [HTCondor-users] 10.0.5 i/ 10.0.6 install submitterCeiling remaining is 2147483647

Central manager
root@condorcentralmanager:~# condor_status

root@condorcentralmanager:~# condor_status -af name cpus slotweight

execute node
systemctl status condor

â condor.service - Condor Distributed High-Throughput-Computing

     Loaded: loaded (/lib/systemd/system/condor.service; enabled; vendor preset: enabled)

     Active: active (running) since Mon 2023-07-03 15:21:32 EDT; 13min ago

   Main PID: 12304 (condor_master)

     Status: "All daemons are responding"

         IP: 14.2K in, 127.4K out

      Tasks: 4 (limit: 4194303)

     Memory: 9.1M

     CGroup: /system.slice/condor.service

             ââ12304 /usr/sbin/condor_master -f

             ââ12357 condor_procd -A /var/run/condor/procd_pipe -L /var/log/condor/ProcLog -R 1000000 -S 60 -C 999

             ââ12358 condor_shared_port

             ââ12359 condor_startd

 

Jul 03 15:21:32 test-condor2.nmrbox.org systemd[1]: Started Condor Distributed High-Throughput-Computing.

Jul 03 15:21:32 test-condor2.nmrbox.org htcondor[12319]: Not changing GLOBAL_MAX_FDS (/proc/sys/fs/file-max): new value (32768) <= old value (9223372036854775807).

Jul 03 15:21:32 test-condor2.nmrbox.org htcondor[12338]: Not changing TCP_LISTEN_QUEUE (/proc/sys/net/core/somaxconn): new value (1024) <= old value (4096).

Jul 03 15:21:32 test-condor2.nmrbox.org htcondor[12344]: Not changing ROOT_MAXKEYS_BYTES (/proc/sys/kernel/keys/root_maxbytes): new value (25000000) <= old value (25000000).

Jul 03 15:21:32 test-condor2.nmrbox.org htcondor[12351]: Changing FS_CACHE_DIRTY_BYTES (/proc/sys/vm/dirty_bytes) from 100000000 to 100000000

root@test-condor2:/etc/condor/tokens.d# fc -s 27

condor_ping -type collector -table ALL

         Instruction Authentication Encryption Integrity Decision Identity

               ALLOW       IDTOKENS        AES       AES    ALLOW condor@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx

                READ       IDTOKENS        AES       AES    ALLOW condor@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx

               WRITE       IDTOKENS        AES       AES    ALLOW condor@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx

          NEGOTIATOR       IDTOKENS        AES       AES    ALLOW condor@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx

       ADMINISTRATOR       IDTOKENS        AES       AES    ALLOW condor@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx

               OWNER       IDTOKENS        AES       AES    ALLOW condor@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx

              CONFIG           FAIL       FAIL      FAIL     FAIL FAIL  (use -verbose for more info)

              DAEMON       IDTOKENS        AES       AES    ALLOW condor@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx

    ADVERTISE_STARTD       IDTOKENS        AES       AES    ALLOW condor@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx

    ADVERTISE_SCHEDD       IDTOKENS        AES       AES    ALLOW condor@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx

    ADVERTISE_MASTER       IDTOKENS        AES       AES    ALLOW condor@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx


 

Our DNS entries do not reverse due to our University policy. (The IPs we have been allocated are required to end in uchc.edu if we add reverse entries).
I could work around on the VMs with condor installed if it is necessary.




 

From: Todd Tannenbaum <tannenba@xxxxxxxxxxx>
Date: Friday, June 30, 2023 at 4:08 PM
To: Weatherby,Gerard <gweatherby@xxxxxxxx>, HTCondor-Users Mail List <htcondor-users@xxxxxxxxxxx>
Subject: Re: [HTCondor-users] 10.0.5 i/ 10.0.6 install submitterCeiling remaining is 2147483647

*** Attention: This is an external email. Use caution responding, opening attachments or clicking on links. ***

Thanks, and also my condor_status questions  below--

On your central manager node, what output do the following commands display:

   $ condor_status

   $ condor_status -af name cpus slotweight

If the above commands do not display anything, then on an execute point (i.e. a host running your condor_startd), what is the output from these two commands when run AS ROOT (or with sudo):

  #  condor_ping -type collector -table ALL

  #  ps auxw | grep condor

Finally, you show below forward dns entries with dig, but curious if there are reverse (inverse) entries... what is the output from:

   $ dig +short -x 155.37.253.160

Thanks,
regards,
Todd



On 6/30/2023 2:12 PM, Weatherby,Gerard wrote:

Oops, missed that email.

htcondor/focal,now 10.6.0-1 amd64 [installed]



On same network with DNS entries:

weatherby@condorcentralmanager:~$ dig +short condorcentralmanager.nmrbox.org

155.37.253.160

gweatherby@condorcentralmanager:~$ dig +short test-condor1.nmrbox.org

155.37.253.166

gweatherby@condorcentralmanager:~$ dig +short test-condor2.nmrbox.org

155.37.253.165

cat /etc/lsb-release

DISTRIB_ID=Ubuntu

DISTRIB_RELEASE=20.04

DISTRIB_CODENAME=focal

DISTRIB_DESCRIPTION="Ubuntu-Server 20.04.3 LTS amd64 (Cubic 2022-03-16 07:21)"

uname -a

Linux test-condor2.nmrbox.org 5.4.0-42-generic #46-Ubuntu SMP Fri Jul 10 00:24:02 UTC 2020 x86_64 x86_64 x86_64 GNU/Linux

 

From: Todd Tannenbaum <tannenba@xxxxxxxxxxx>
Date: Friday, June 30, 2023 at 1:52 PM
To: HTCondor-Users Mail List <htcondor-users@xxxxxxxxxxx>, Weatherby,Gerard <gweatherby@xxxxxxxx>
Subject: Re: [HTCondor-users] 10.0.5 i/ 10.0.6 install submitterCeiling remaining is 2147483647

*** Attention: This is an external email. Use caution responding, opening attachments or clicking on links. ***


Hi Gerald,

We can (hopefully!) help better if you send along answers to my earlier questions from Tuesday -

What distro/ version of Linux are you using?  Are all three nodes on the same network?  Do they all have hostname entries in either DNS or /etc/hosts ?  Your email subject line says "10.0.5 install", but the commands below would pull from the Feature channel and thus install HTCondor v10.5.0 currently (perhaps that is what you meant).  Does "condor_status" show available slots from your execute node?  Given the "skipped because submitterCeiling" message below, what is the output from  

    condor_status -af name cpus slotweight

?


On 6/30/2023 8:29 AM, Weatherby,Gerard wrote:

Iâve upgraded the test nodes and am still seeing the same message.




 

From: HTCondor-users <htcondor-users-bounces@xxxxxxxxxxx> on behalf of Weatherby,Gerard <gweatherby@xxxxxxxx>
Date: Thursday, June 29, 2023 at 8:27 AM
To: HTCondor-Users Mail List <htcondor-users@xxxxxxxxxxx>
Subject: Re: [HTCondor-users] 10.0.5 install

So â.

I followed (I think) the instructions at https://htcondor.readthedocs.io/en/latest/getting-htcondor/admin-quick-start.html â and it just doesnât seem to work

due to whatever

submitterCeiling remaining is 2147483647

is trying to tell me. Any suggestions?

 

 

From: HTCondor-users <htcondor-users-bounces@xxxxxxxxxxx> on behalf of Weatherby,Gerard <gweatherby@xxxxxxxx>
Date: Tuesday, June 27, 2023 at 5:07 PM
To: Todd Tannenbaum <tannenba@xxxxxxxxxxx>, HTCondor-Users Mail List <htcondor-users@xxxxxxxxxxx>
Subject: Re: [HTCondor-users] 10.0.5 install

None of your business! 😉

Yes, same password used for each.

 

From: Todd Tannenbaum <tannenba@xxxxxxxxxxx>
Date: Tuesday, June 27, 2023 at 4:38 PM
To: HTCondor-Users Mail List <htcondor-users@xxxxxxxxxxx>, Weatherby,Gerard <gweatherby@xxxxxxxx>
Subject: Re: [HTCondor-users] 10.0.5 install

*** Attention: This is an external email. Use caution responding, opening attachments or clicking on links. ***

On 6/27/2023 3:24 PM, Todd Tannenbaum via HTCondor-users wrote:


Was the contents of the ".pw" file


Hit send to quickly!  Following up on my question above, I certainly did not mean to ask that you post the contents of your .pw file :-O !  Just meant to ask you to confirm that the contents of the .pw is identical across your submit / execute / central manager hosts.










_______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users
 
The archives can be found at:
https://lists.cs.wisc.edu/archive/htcondor-users/






-- 
Todd Tannenbaum <tannenba@xxxxxxxxxxx>  University of Wisconsin-Madison
Center for High Throughput Computing    Department of Computer Sciences
Calendar: https://tinyurl.com/yd55mtgd  1210 W. Dayton St. Rm #4257
Phone: (608) 263-7132                   Madison, WI 53706-1685 





-- 
Todd Tannenbaum <tannenba@xxxxxxxxxxx>  University of Wisconsin-Madison
Center for High Throughput Computing    Department of Computer Sciences
Calendar: https://tinyurl.com/yd55mtgd  1210 W. Dayton St. Rm #4257
Phone: (608) 263-7132                   Madison, WI 53706-1685