64bit: segfault on program exit

marco atzeri marco.atzeri@gmail.com
Mon Apr 1 15:11:00 GMT 2013


I am building and testing openmpi-1.7.0rc9 on
   CYGWIN_NT-6.1 1.7.18(0.263/5/3) 2013-03-28 22:07 x86_64 Cygwin
every looks fine except when all the processes on several cores
end and should return to lunching program, something go wrong
(of course on 32bit everyhing is OK)

Attached stackdump.


$ mpirun -np 4 ./hello_c.exe
Hello, world, I am 1 of 4, (Open MPI v1.7rc9, package: Open MPI 
marco@MARCOATZERI Distribution, ident: 1.7rc9, Mar 28, 2013, 96)
Hello, world, I am 3 of 4, (Open MPI v1.7rc9, package: Open MPI 
marco@MARCOATZERI Distribution, ident: 1.7rc9, Mar 28, 2013, 96)
Hello, world, I am 2 of 4, (Open MPI v1.7rc9, package: Open MPI 
marco@MARCOATZERI Distribution, ident: 1.7rc9, Mar 28, 2013, 96)
Hello, world, I am 0 of 4, (Open MPI v1.7rc9, package: Open MPI 
marco@MARCOATZERI Distribution, ident: 1.7rc9, Mar 28, 2013, 96)
[MARCOATZERI:05260] *** Process received signal ***
[MARCOATZERI:05260] Signal: Segmentation fault (11)
[MARCOATZERI:05260] Signal code:  (23)
[MARCOATZERI:05260] Failing at address: 0x488f23380
[MARCOATZERI:05260] *** End of error message ***
Segmentation fault

debugging with gdb is not very useful as

(gdb) run -np 4 ./hello_c.exe
Starting program: /usr/bin/orterun.exe -np 4 ./hello_c.exe
[New Thread 1964.0x23a8]
[New Thread 1964.0x1554]
[New Thread 1964.0x27dc]
[New Thread 1964.0x1388]
[New Thread 1964.0x5a4]
[New Thread 1964.0x15bc]
[New Thread 1964.0x23e4]
[New Thread 1964.0x2244]
[New Thread 1964.0x163c]
[New Thread 1964.0x16f8]
[New Thread 1964.0x2718]
[New Thread 1964.0x2018]
[New Thread 1964.0x19c4]
Hello, world, I am 0 of 4, (Open MPI v1.7rc9, package: Open MPI 
marco@MARCOATZERI Distribution, ident: 1.7rc9, Mar 28, 2013, 96)
Hello, world, I am 2 of 4, (Open MPI v1.7rc9, package: Open MPI 
marco@MARCOATZERI Distribution, ident: 1.7rc9, Mar 28, 2013, 96)
Hello, world, I am 3 of 4, (Open MPI v1.7rc9, package: Open MPI 
marco@MARCOATZERI Distribution, ident: 1.7rc9, Mar 28, 2013, 96)
Hello, world, I am 1 of 4, (Open MPI v1.7rc9, package: Open MPI 
marco@MARCOATZERI Distribution, ident: 1.7rc9, Mar 28, 2013, 96)

and gdb freezes.


Marco
-------------- next part --------------
Exception: STATUS_ACCESS_VIOLATION at rip=00488F23380
rax=0000000000000000 rbx=0000000000000004 rcx=000000018022C984
rdx=0000000000000000 rsi=0000000600012270 rdi=0000000488F6C060
r8 =00000001801C9DA0 r9 =0000000488F7E000 r10=0000000588F6C05F
r11=0000000488F23367 r12=0000000000000000 r13=0000000000000001
r14=0000000000000000 r15=0000000000000006
rbp=000000000022A630 rsp=000000000022A610
program=E:\cygwin64\bin\orterun.exe, pid 5260, thread main
cs=0033 ds=002B es=002B fs=0053 gs=002B ss=002B
Stack trace:
Frame        Function    Args
0000022A630  00488F23380 (004CCD00004, 0060008C820, 00600082970, 0000022CCF0)
00600012500  00488F1C48C (00000000000, 0000022AB10, 00180134344, 00000000000)
0000022A900  0010040234B (0000022AB10, 00000000000, 00000000000, 0000022AB80)
0000022AAC0  001004010F3 (0000022AB10, 00000000000, 00000000030, 30001000000FF00)
0000022AB80  001800478A7 (00000000000, 00000000000, 00000000000, 00000000000)
00000000000  0018004576B (00000000000, 00000000000, 00000000000, 00000000000)
00000000000  0018004592F (00000000000, 00000000000, 00000000000, 00000000000)
00000000000  00100407CB1 (00000000000, 00000000000, 00000000000, 00000000000)
00000000000  00100401010 (00000000000, 00000000000, 00000000000, 00000000000)
00000000000  0007710652D (00000000000, 00000000000, 00000000000, 00077189300)
00000000000  0007733C521 (00000000000, 00000000000, 00000000000, 00077189300)
End of stack trace


More information about the Cygwin-apps mailing list