[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [HTCondor-users] Cannot use multiple arguments to run MPI application in "parallel" universe
- Date: Fri, 16 Nov 2018 16:09:21 +0000
- From: Michael Pelletier <Michael.V.Pelletier@xxxxxxxxxxxx>
- Subject: Re: [HTCondor-users] Cannot use multiple arguments to run MPI application in "parallel" universe
Since HTCondor dates back to before the days of multi-processor systems, the old static-slot configuration abstracts each core in a multi-core system as a separate machine in keeping with the original one-core==one-machine paradigm. Using partitionable slots brings the resource allocation into the modern multi-core age.
Michael V. Pelletier
Digital Transformation & Innovation
Integrated Defense Systems
From: HTCondor-users [mailto:htcondor-users-bounces@xxxxxxxxxxx] On Behalf Of hufh
Sent: Friday, November 16, 2018 10:55 AM
To: HTCondor-Users Mail List <htcondor-users@xxxxxxxxxxx>
Subject: [External] Re: [HTCondor-users] Cannot use multiple arguments to run MPI application in "parallel" universe
Now i can run your MPI program with correct output. Thank you so much!
I am a little bit confused by the concept "machine". In this presentation,Âhttps://meetings.internet2.edu/media/medialibrary/2015/10/19/20151008-thain-htcondor-admin-tutorial.pdf
it says: "Machine â An individual computer, managed by one startd", this means "machine" is a physical machine.
but when I run condor_q on my 24-core server(I have only this server), i got result as follows:
Â Â Â Â Â Â Â Â Â Â ÂMachines Owner Claimed Unclaimed Matched PreemptingÂ Drain
Â Â Â Â X86_64/LINUXÂ Â Â Â24Â Â Â0Â Â Â Â4Â Â Â Â 20Â Â Â Â0Â Â Â Â Â 0Â Â Â 0
Â Â Â Â Â Â Â ÂTotalÂ Â Â Â24Â Â Â0Â Â Â Â4Â Â Â Â 20Â Â Â Â0Â Â Â Â Â 0Â Â Â 0
Here "machines" is 24, it means it's not a "physical" machine, but a core or a slot.
Could you please clarify for me?Â In addition, what does node mean? My condor version is 8.6.12 for CentOS.