[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Condor-users] Java stdout not completeing



Hello,

Running $CondorVersion: 6.7.6 Mar 15 2005 on $CondorPlatform: I386-LINUX_RH9
$ with NFS on 10 nodes.

I have a java executable that, given some arguments, spits out CSV to stdout
(System.out.println). When I run this job in condor, I'm only getting a few
lines from the beginning, as if its running only for a few seconds before
quitting (it runs all the way outside of condor with 'java' on the
commandline). The StartLogs (part of one shown below) seem to show that it
might be interrupted for some reason? (I may be reading that wrong).

Any help would be apprecieated.

Matt Baker
University of Arkansas


Job Ad
-------

  ####################
  #
  # UAVSimulator ClassAd
  #
  ####################

  universe       = java
  executable     = UAVSimulator.jar
  arguments      = UAVSimulator 1000 20 20 100 50
  output         = UAVSimulator_$(Process).output
log = UAVSimulator.log
error          = UAVSimulator.error
input = uav.txt
 jar_files = UAVSimulator.jar

WhenToTransferOutput = ON_EXIT
  queue 10


StartLog from one of my nodes
-----------------------------
4/16 22:12:55 Remote owner is matt@[REMOVED]
4/16 22:12:55 State change: claiming protocol successful
4/16 22:12:55 Changing state: Matched -> Claimed
4/16 22:12:58 DaemonCore: Command received via TCP from host
<130.184.163.51:10357>
4/16 22:12:58 DaemonCore: received command 444 (ACTIVATE_CLAIM), calling
handler (command_activate_claim)
4/16 22:12:58 Got activate_claim request from shadow
(<130.184.163.51:10357>)
4/16 22:12:58 Remote job ID is 25.0
4/16 22:12:58 Got universe "JAVA" (10) from request classad
4/16 22:12:58 State change: claim-activation protocol successful
4/16 22:12:58 Changing activity: Idle -> Busy
4/16 22:12:59 DaemonCore: Command received via TCP from host
<130.184.163.51:11417>
4/16 22:12:59 DaemonCore: received command 404 (DEACTIVATE_CLAIM_FORCIBLY),
calling handler (command_handler)
4/16 22:12:59 Called deactivate_claim_forcibly()
4/16 22:12:59 DaemonCore: Command received via UDP from host
<130.184.163.51:11787>
4/16 22:12:59 DaemonCore: received command 443 (RELEASE_CLAIM), calling
handler (command_release_claim)
4/16 22:12:59 State change: received RELEASE_CLAIM command
4/16 22:12:59 Changing state and activity: Claimed/Busy ->
Preempting/Vacating
4/16 22:12:59 DaemonCore: Command received via UDP from host
<130.184.163.51:11622>
4/16 22:12:59 DaemonCore: received command 443 (RELEASE_CLAIM), calling
handler (command_release_claim)
4/16 22:12:59 Got RELEASE_CLAIM while in Preempting state, ignoring.
4/16 22:12:59 Starter pid 7705 exited with status 0
4/16 22:12:59 State change: starter exited
4/16 22:12:59 State change: No preempting claim, returning to owner
4/16 22:12:59 Changing state and activity: Preempting/Vacating -> Owner/Idle
4/16 22:12:59 State change: IS_OWNER is false
4/16 22:12:59 Changing state: Owner -> Unclaimed


2 of the 10 Jobs from UAVSimulator.log..
------------------------------------------

005 (025.000.000) 04/16 22:12:59 Job terminated.
        (1) Normal termination (return value 0)
                Usr 0 00:00:00, Sys 0 00:00:00  -  Run Remote Usage
                Usr 0 00:00:00, Sys 0 00:00:00  -  Run Local Usage
                Usr 0 00:00:00, Sys 0 00:00:00  -  Total Remote Usage
                Usr 0 00:00:00, Sys 0 00:00:00  -  Total Local Usage
        308  -  Run Bytes Sent By Job
        77471  -  Run Bytes Received By Job
        308  -  Total Bytes Sent By Job
        77471  -  Total Bytes Received By Job
...
001 (025.001.000) 04/16 22:13:01 Job executing on host:
<192.168.0.253:11998>
...
005 (025.001.000) 04/16 22:13:02 Job terminated.
        (1) Normal termination (return value 0)
                Usr 0 00:00:00, Sys 0 00:00:00  -  Run Remote Usage
                Usr 0 00:00:00, Sys 0 00:00:00  -  Run Local Usage
                Usr 0 00:00:00, Sys 0 00:00:00  -  Total Remote Usage
                Usr 0 00:00:00, Sys 0 00:00:00  -  Total Local Usage
        364  -  Run Bytes Sent By Job
        77471  -  Run Bytes Received By Job
        364  -  Total Bytes Sent By Job
        77471  -  Total Bytes Received By Job
...