[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Condor-users] VMware job: so many checkpoints kept on master! A bug?



Hi,

I have a VMware job running on my Condor pool.

This job has been swapped from one pool PC to the other
and meanwhile checkpoints have been made.

However, I thought that Condor's policy is to remove old checkpoints
as soon as a newer checkpoint has been created successfully; on the
other hand, if a checkpoint fails, its remains are deleted and condor
sticks to the old checkpoint. Is this right?

The problem on my condor pool is that I see a whole lot of different checkpoints
on my master (in the spool folder of the VMware job); see below. This list of
checkpoints seems to be growing as time goes on......

I wonder if this is a bug; and if so, is this a problem on the master,
the pool PCs, or both?

Thank you,
Rob.


  33495040 2010-08-13 16:57 isoysCAAv.iso
  33177600 2010-08-16 17:22 isoiEGAA8.iso
  33495040 2010-08-16 21:48 isoAHEAAI.iso
  33495040 2010-08-17 09:35 iso0XBAAm.iso
  33495040 2010-08-17 12:58 isonoAAAH.iso
  33177600 2010-08-17 16:30 isoXMAAAY.iso
  33495040 2010-08-17 20:01 iso-QAAA2.iso
  33495040 2010-08-18 10:55 isoplFAAE.iso
  33495040 2010-08-18 13:28 isoupEAAv.iso
  33495040 2010-08-18 16:15 isolSBAAV.iso
  33495040 2010-08-18 19:52 isomRAAAq.iso
  33495040 2010-08-19 09:52 isoNPHAAc.iso
  33495040 2010-08-19 13:31 isouJHAAN.iso
  33495040 2010-08-19 20:20 isoPeCAAe.iso
  33495040 2010-08-20 13:17 iso_OAAAh.iso
  33495040 2010-08-20 16:25 iso4oFAAA.iso
    983040 2010-08-20 17:04 1GB_1GBswap-000002.vmdk
    917504 2010-08-20 17:04 ttyLinux-000002.vmdk
      8967 2010-08-20 17:04 vm-WDAAw_condor-Snapshot1.vmsn
 524288000 2010-08-20 17:06 vm-wdaaw_condor.vmem
       524 2010-08-20 17:06 vm-WDAAw_condor.vmsd
       921 2010-08-20 17:06 vm-WDAAw_condor.vmx
      8967 2010-08-20 17:06 vm67BAAU_condor-Snapshot1.vmsn
 524288000 2010-08-20 17:07 vm67baau_condor.vmem
       524 2010-08-20 17:07 vm67BAAU_condor.vmsd
   1037045 2010-08-20 17:07 vm67baau_condor.vmss
       921 2010-08-20 17:07 vm67BAAU_condor.vmx
      8967 2010-08-20 17:07 vm6fEAAu_condor-Snapshot1.vmsn
 524288000 2010-08-20 17:08 vm6feaau_condor.vmem
       524 2010-08-20 17:08 vm6fEAAu_condor.vmsd
   1035530 2010-08-20 17:08 vm6feaau_condor.vmss
       921 2010-08-20 17:08 vm6fEAAu_condor.vmx
      8967 2010-08-20 17:08 vm6rBAAH_condor-Snapshot1.vmsn
 524288000 2010-08-20 17:09 vm6rbaah_condor.vmem
       525 2010-08-20 17:09 vm6rBAAH_condor.vmsd
   1035555 2010-08-20 17:09 vm6rbaah_condor.vmss
       921 2010-08-20 17:09 vm6rBAAH_condor.vmx
      8967 2010-08-20 17:09 vmdVCAAy_condor-Snapshot1.vmsn
 524288000 2010-08-20 17:10 vmdvcaay_condor.vmem
       524 2010-08-20 17:10 vmdVCAAy_condor.vmsd
   1035530 2010-08-20 17:10 vmdvcaay_condor.vmss
       921 2010-08-20 17:10 vmdVCAAy_condor.vmx
      8967 2010-08-20 17:10 vmdZCAAV_condor-Snapshot1.vmsn
 524288000 2010-08-20 17:11 vmdzcaav_condor.vmem
       524 2010-08-20 17:11 vmdZCAAV_condor.vmsd
   1035530 2010-08-20 17:11 vmdzcaav_condor.vmss
       921 2010-08-20 17:11 vmdZCAAV_condor.vmx
      8967 2010-08-20 17:11 vmfnBAA-_condor-Snapshot1.vmsn
 524288000 2010-08-20 17:12 vmfnbaa-_condor.vmem
       523 2010-08-20 17:12 vmfnBAA-_condor.vmsd
   1035555 2010-08-20 17:12 vmfnbaa-_condor.vmss
       921 2010-08-20 17:12 vmfnBAA-_condor.vmx
      8967 2010-08-20 17:12 vmh6GAAX_condor-Snapshot1.vmsn
 524288000 2010-08-20 17:13 vmh6gaax_condor.vmem
       525 2010-08-20 17:13 vmh6GAAX_condor.vmsd
   1035555 2010-08-20 17:13 vmh6gaax_condor.vmss
       921 2010-08-20 17:13 vmh6GAAX_condor.vmx
      8967 2010-08-20 17:13 vmhiBAAV_condor-Snapshot1.vmsn
 524288000 2010-08-20 17:14 vmhibaav_condor.vmem
       524 2010-08-20 17:14 vmhiBAAV_condor.vmsd
   1035530 2010-08-20 17:14 vmhibaav_condor.vmss
       921 2010-08-20 17:14 vmhiBAAV_condor.vmx
      8967 2010-08-20 17:14 vmIdEAAX_condor-Snapshot1.vmsn
 524288000 2010-08-20 17:15 vmideaax_condor.vmem
       525 2010-08-20 17:15 vmIdEAAX_condor.vmsd
   1035530 2010-08-20 17:15 vmideaax_condor.vmss
       921 2010-08-20 17:15 vmIdEAAX_condor.vmx
      8967 2010-08-20 17:15 vmJ3DAAp_condor-Snapshot1.vmsn
 524288000 2010-08-20 17:16 vmj3daap_condor.vmem
       523 2010-08-20 17:16 vmJ3DAAp_condor.vmsd
   1035530 2010-08-20 17:16 vmj3daap_condor.vmss
       921 2010-08-20 17:16 vmJ3DAAp_condor.vmx
      8967 2010-08-20 17:16 vmJ5DAAb_condor-Snapshot1.vmsn
 524288000 2010-08-20 17:17 vmj5daab_condor.vmem
       524 2010-08-20 17:17 vmJ5DAAb_condor.vmsd
   1035530 2010-08-20 17:17 vmj5daab_condor.vmss
       921 2010-08-20 17:17 vmJ5DAAb_condor.vmx
      8967 2010-08-20 17:17 vmJbBAAV_condor-Snapshot1.vmsn
 524288000 2010-08-20 17:18 vmjbbaav_condor.vmem
       524 2010-08-20 17:18 vmJbBAAV_condor.vmsd
   1035530 2010-08-20 17:18 vmjbbaav_condor.vmss
       921 2010-08-20 17:18 vmJbBAAV_condor.vmx
      8967 2010-08-20 17:18 vms8BAA6_condor-Snapshot1.vmsn
 524288000 2010-08-20 17:19 vms8baa6_condor.vmem
       523 2010-08-20 17:19 vms8BAA6_condor.vmsd
   1035530 2010-08-20 17:19 vms8baa6_condor.vmss
       921 2010-08-20 17:19 vms8BAA6_condor.vmx
      8967 2010-08-20 17:19 vmueGAAC_condor-Snapshot1.vmsn
 524288000 2010-08-20 17:20 vmuegaac_condor.vmem
       524 2010-08-20 17:20 vmueGAAC_condor.vmsd
   1035530 2010-08-20 17:20 vmuegaac_condor.vmss
       921 2010-08-20 17:20 vmueGAAC_condor.vmx
    259959 2010-08-20 17:20 vmware-0.log
      2957 2010-08-20 17:20 vmware-1.log
    247083 2010-08-20 17:20 vmware-2.log
      8967 2010-08-20 17:20 vmYYDAAc_condor-Snapshot1.vmsn
 524288000 2010-08-20 17:21 vmyydaac_condor.vmem
       523 2010-08-20 17:21 vmYYDAAc_condor.vmsd
   1035530 2010-08-20 17:21 vmyydaac_condor.vmss
       921 2010-08-20 17:21 vmYYDAAc_condor.vmx
  33495040 2010-08-20 20:26 isoaaDAAx.iso
      8967 2010-08-20 20:27 vmDAAAA4_condor-Snapshot1.vmsn
 524288000 2010-08-20 20:28 vmdaaaa4_condor.vmem
       525 2010-08-20 20:28 vmDAAAA4_condor.vmsd
   1035530 2010-08-20 20:28 vmdaaaa4_condor.vmss
       921 2010-08-20 20:28 vmDAAAA4_condor.vmx
  33495040 2010-08-21 10:25 isoGtGAAo.iso
      8967 2010-08-21 10:25 vmwdGAA7_condor-Snapshot1.vmsn
 524288000 2010-08-21 10:26 vmwdgaa7_condor.vmem
       525 2010-08-21 10:26 vmwdGAA7_condor.vmsd
   1035530 2010-08-21 10:26 vmwdgaa7_condor.vmss
       921 2010-08-21 10:26 vmwdGAA7_condor.vmx
 366280704 2010-08-21 20:41 1GB_1GBswap-000001.vmdk
  33495040 2010-08-21 20:41 isoXDFAAW.iso
      8664 2010-08-21 20:41 nvram
    917504 2010-08-21 20:41 ttyLinux-000001.vmdk
   1035555 2010-08-21 20:41 vm-wdaaw_condor.vmss
      8967 2010-08-21 20:41 vmIFHAAV_condor-Snapshot1.vmsn
 524288000 2010-08-21 20:43 vmifhaav_condor.vmem
       524 2010-08-21 20:43 vmIFHAAV_condor.vmsd
   1035555 2010-08-21 20:43 vmifhaav_condor.vmss
       921 2010-08-21 20:43 vmIFHAAV_condor.vmx
    317781 2010-08-21 20:43 vmware.log