[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] Condor to reboot a machine



Hey Rich,

I'll have a look at it. AFAIR we tried using EC2 with HVM instances a while back and the penalty was way more than 2-3%. I'll double check on that.

Thanks!


On Thu, Jan 9, 2014 at 6:20 AM, Rich Pieri <ratinox@xxxxxxx> wrote:
Tiago Macarios wrote:
> Yeah we know that. Problem is that we run intensive simulations that may
> take days/weeks to finish. The extra overhead of running a VM is really
> not desirable.

The overhead is about 2-3% in actual operation, maybe a little more,
maybe a little less depending on the nature of the jobs.

By comparison, a tiny error will render a node unbootable. That would
reduce its processing capability by 100% until an administrator fixes
the problem.

Those are the two worst case situations. Take your pick.

I know how I'd go about implementing this dual-boot strategy but, as I
wrote before, I strongly recommend using the VM universe instead. Linux
boot-time device enumeration can be inconsistent and this inconsistency
will bite you.

--
Rich Pieri <ratinox@xxxxxxx>
MIT Laboratory for Nuclear Science
_______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/htcondor-users/