[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[HTCondor-users] Some of my jobs went to held stage



Hello experts,

I see that the some of my jobs went to "held" stage with error:  The file parallel.sh is on NFS.


****************************************************************************

       0  -  Total Bytes Received By Job
...
000 (43754.000.000) 02/19 13:15:47 Job submitted from host: <>
...
001 (43754.000.000) 02/19 13:16:02 Job executing on host: <ProvNet=atlas.edu>
...
007 (43754.000.000) 02/19 13:16:02 Shadow exception!
        Error from slot2@xxxxxxxxxxxxxxxxx: Failed to execute '/xdata/bawa/batch/SmallD3PD/jobScripts/parallel_200.sh': (errno=116: 'Stale file handle')
        0  -  Run Bytes Sent By Job
        0  -  Run Bytes Received By Job
...
012 (43754.000.000) 02/19 13:16:02 Job was held.
        Error from slot2@xxxxxxxxxxxxxxxxx: Failed to execute '/xdata/bawa/batch/SmallD3PD/jobScripts/parallel_200.sh': (errno=116: 'Stale file handle')
        Code 6 Subcode 116
...



*************************

How can I get rid of this error?


--
Dr. Harinder Singh Bawa

                                          
[web][facebook][youtube][twitter]
California State University, Fresno Logo