[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] condor_qedit problem



Never mind.  I see what I was doing wrong now

On Sat, May 26, 2012 at 11:12:13AM -0500, Nathan Panike wrote:
> I keep getting tons of messages in my Userlog:
> 
> 022 (129.000.000) 05/25 17:16:20 Job disconnected, attempting to reconnect
>     Socket between submit and execute hosts closed unexpectedly
>     Trying to reconnect to slot4@xxxxxxxxxxxxxxxxxxx <128.105.109.35:43267>
> ...
> 024 (129.000.000) 05/25 17:16:20 Job reconnection failed
>     Job not found at execution machine
>     Can not reconnect to slot4@xxxxxxxxxxxxxxxxxxx, rescheduling job
> ...
> 
> So I believe there is a problem with durga.stat.wisc.edu:
> 
> $ condor_hold 129
> 
> $ condor_qedit 129 Requirements "( TARGET.Arch == \"X86_64\" ) && ( TARGET.OpSys == \"LINUX\" ) && ( TARGET.Disk >= DiskUsage ) && ( ( TARGET.Memory * 1024 ) >= ImageSize ) && ( ( RequestMemory * 1024 ) >= ImageSize ) && ( TARGET.HasFileTransfer ) && ( Machine != \"durga.stat.wisc.edu\" )"
> 
> The Userlog dutifully reports that the Requirements have changed:
> 
> 033 (129.000.000) 05/25 21:50:50 Changing job attribute Requirements from ( TARGET.Arch == "X86_64" ) && ( TARGET.OpSys == "LINUX" ) && ( TARGET.Disk >= DiskUsage ) && ( ( TARGET.Memory * 1024 ) >= ImageSize ) && ( ( RequestMemory * 1024 ) >= ImageSize ) && ( TARGET.HasFileTransfer ) to Requirements = ( TARGET.Arch == "X86_64" ) && ( TARGET.OpSys == "LINUX" ) && ( TARGET.Disk >= DiskUsage ) && ( ( TARGET.Memory * 1024 ) >= ImageSize ) && ( ( RequestMemory * 1024 ) >= ImageSize ) && ( TARGET.HasFileTransfer ) && ( Machine != "durga.stat.wisc.edu" )
> 
> But when I do: 
> 
> $ condor_q -l 129 | grep Requirements
> Requirements = ( TARGET.Arch == "X86_64" ) && ( TARGET.OpSys == "LINUX" ) && ( TARGET.Disk >= DiskUsage ) && ( ( TARGET.Memory * 1024 ) >= ImageSize ) && ( ( RequestMemory * 1024 ) >= ImageSize ) && ( TARGET.HasFileTransfer )
> 
> The requirements do not seem to have changed.
> 
> This is on the UWCS pool:
> 
> CondorVersion = "$CondorVersion: 7.6.7 Apr 24 2012 BuildID: 421363 $"
> ScheddIpAddr = "<128.105.14.28:37081>"
> 
> What am I doing wrong?
> 
> Nathan Panike
> _______________________________________________
> Condor-users mailing list
> To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a
> subject: Unsubscribe
> You can also unsubscribe by visiting
> https://lists.cs.wisc.edu/mailman/listinfo/condor-users
> 
> The archives can be found at:
> https://lists.cs.wisc.edu/archive/condor-users/