
Re: [Condor-users] segmentation fault in vanilla universe



You might try configuring grid1 to capture core files, though if the bug is in MATLAB itself it might be tough to get fixed.
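
As a rough illustration of the core-file idea, here is a minimal sketch of a wrapper that could be added to run_matlab.sh. The function name run_with_diagnostics and the "diag:" log prefix are made up for this example and are not part of Condor or MATLAB. It tries to raise the core-file limit, logs a little context to stderr (which the submit file in this thread sends to err.$(Process)), and reports when the command dies from a signal.

```shell
#!/bin/bash
# Hypothetical helper, not part of the original run_matlab.sh.
# Runs a command with core dumps enabled (if permitted) and logs context
# to stderr so it ends up in the job's error file.
run_with_diagnostics() {
    # Try to lift the soft core-file limit; this fails harmlessly when the
    # hard limit imposed on the job is lower.
    ulimit -c unlimited 2>/dev/null || echo "diag: could not raise core limit" >&2

    echo "diag: host=$(hostname 2>/dev/null || uname -n) cwd=$(pwd)" >&2
    echo "diag: core limit=$(ulimit -c)" >&2
    echo "diag: HOME=${HOME:-<unset>} PATH=${PATH:-<unset>}" >&2

    # Run the real command; stdin is inherited, so '< file' still works.
    "$@"
    local status=$?

    # The shell reports death by signal N as exit status 128+N
    # (SIGSEGV is signal 11, so a segmentation fault shows up as 139).
    if [ "$status" -gt 128 ]; then
        echo "diag: command killed by signal $((status - 128))" >&2
        ls -l core* >&2 2>/dev/null || true
    fi
    return "$status"
}
```

In run_matlab.sh the MATLAB line would then become something like:

    run_with_diagnostics /usr/local/Matlab/bin/matlab -nojvm -nosplash -nodisplay < /usr/local/Matlab/bin/test_matlab.m

Whether a core file is actually written, and where, depends on the kernel's core settings and the limits on the execute node, so treat this as a starting point. The logged HOME and PATH values may also be worth comparing with an interactive login on grid1, since a Condor job does not run in a login shell.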

Best,


matt

On 12/30/2009 12:33 PM, satyanarayan rao wrote:
> Dear Hailong,
> 
> No, the script does not fail there: it runs fine when executed directly
> on the node grid1.iiitm.ac.in. The problem is something else, which I
> have not been able to identify.
> 
> Thanks and regards,
> Satyanarayan Rao
> 
> On Wed, Dec 30, 2009 at 5:19 PM, hailong.yang1115
> <hailong.yang1115@xxxxxxxxx> wrote:
> 
>     Hi Satyanarayan,
>      
>     Have you tried running the script file run_matlab.sh directly on the
>     execute machine grid1.iiitm.ac.in? Did it run correctly? It seems the
>     commands in your script file are what hit the segmentation fault.
>      
>     -Hailong
>      
>     2009-12-30
>     ------------------------------------------------------------------------
>     ***********************************************
>     * Hailong Yang, PhD. Candidate
>     * Sino-German Joint Software Institute,
>     * School of Computer Science&Engineering, Beihang University
>     * Phone: (86-010)82315908
>     * Email: hailong.yang1115@xxxxxxxxx
>     * Address: G413, New Main Building in Beihang University,
>     *              No.37 XueYuan Road,HaiDian District,
>     *              Beijing,P.R.China,100191
>     ***********************************************
>     ------------------------------------------------------------------------
>     *From:* satyanarayan rao
>     *Sent:* 2009-12-30  19:13:23
>     *To:* Condor-Users Mail List
>     *Cc:*
>     *Subject:* [Condor-users] segmentation fault in vanilla universe
>     Hello All,
> 
>     I am having a problem running a MATLAB command in the vanilla
>     universe.
>     MATLAB is installed on two nodes; I am using only one
>     (grid1.iiitm.ac.in) by naming it in the Requirements expression of
>     the submit file.
>     The command runs successfully when I run it directly on the node,
>     but it gives an error when I submit it through the vanilla universe.
>     The steps I followed are below.
> 
>     [grid-node@grid-server ~]$ cat run_matlab.sh
>     #!/bin/bash
>     /usr/local/Matlab/bin/matlab -nojvm -nosplash -nodisplay < /usr/local/Matlab/bin/test_matlab.m
>     hostname
>     [grid-node@grid-server ~]$
> 
>     [grid-node@grid-server ~]$ cat vanilla.submit
>               
>     Executable              = /home/grid-node/run_matlab.sh
>     Universe                = vanilla
>     Requirements            = (machine=="grid1.iiitm.ac.in")
>     Error                   = err.$(Process)
>     Output                  = out.$(Process)
>     Log                     = foo.log
>     transfer_executable     = true
>     should_transfer_files   = YES
>     when_to_transfer_output = ON_EXIT
>     Queue 10
> 
>     [grid-node@grid-server ~]$
> 
> 
>     [grid-node@grid-server ~]$ condor_status
> 
>     Name               OpSys      Arch   State     Activity LoadAv Mem   ActvtyTime
> 
>     slot1@anlab-node8. LINUX      INTEL  Unclaimed Idle     0.000   497  0+03:10:05
>     slot2@anlab-node8. LINUX      INTEL  Unclaimed Idle     0.000   497  0+15:03:00
>     slot1@grid-server. LINUX      INTEL  Unclaimed Idle     0.840  1006  0+03:15:07
>     slot2@grid-server. LINUX      INTEL  Unclaimed Idle     0.000  1006  0+15:15:31
>     grid1.iiitm.ac.in  LINUX      INTEL  Unclaimed Idle     0.000   242  0+02:15:04
> 
>                          Total Owner Claimed Unclaimed Matched Preempting Backfill
> 
>              INTEL/LINUX     5     0       0         5       0          0        0
> 
>                    Total     5     0       0         5       0          0        0
>     [grid-node@grid-server ~]$
> 
> 
>     [grid-node@grid-server ~]$ condor_submit vanilla.submit
>     Submitting job(s)..........
>     Logging submit event(s)..........
>     10 job(s) submitted to cluster 1042.
>     [grid-node@grid-server ~]$
> 
>     [grid-node@grid-server ~]$ cat out.1
>     grid1.iiitm.ac.in
>     [grid-node@grid-server ~]$ cat err.1
>     /opt/condor-7.2.4/local.grid1/execute/dir_15382/condor_exec.exe: line 2: 15385 Segmentation fault      /usr/local/Matlab/bin/matlab -nojvm -nosplash -nodisplay </usr/local/Matlab/bin/test_matlab.m
>     [grid-node@grid-server ~]$
> 
>     Any help will be greatly appreciated.
> 
>     Thanks and Regards,
>     Satyanarayan Rao
> 
> 
>     -- 
>     Satyanarayan Rao
>     ABV-IIITM Gwalior
>     http://www.iiitm.ac.in
>     Ph: +91-9179346657
> 
> 
>     _______________________________________________
>     Condor-users mailing list
>     To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a
>     subject: Unsubscribe
>     You can also unsubscribe by visiting
>     https://lists.cs.wisc.edu/mailman/listinfo/condor-users
> 
>     The archives can be found at:
>     https://lists.cs.wisc.edu/archive/condor-users/
> 
> 
> 
> 
> -- 
> Satyanarayan Rao
> ABV-IIITM Gwalior
> http://www.iiitm.ac.in
> Ph: +91-9179346657
> 