parallel calc crash

Submitted by zamir on Fri, 07/20/2012 - 06:40

Hello all.

I run the attached input script using the standard liggghts-stable < in.triaxial_remix, and it runs just fine. When I run it with mpi using the command line: mpirun -np 4 liggghts-stable < in.triaxial_remix, it gives me a segmentation fault.

What is extremely puzzling is that I ran it on a 32 bit laptop with mpi just fine, and now on my workstation, it crashes.

I tested several other scripts with mpi to make sure that my mpi installation i was OK. It seems to be fine. Also, when I type "which mpirun" in a terminal, it tells me /usr/bin/mpirun, so my symbolic links are ok I think. Even so, I think something is wrong with my mpi installation, but I do not know enough to find out what it is.

I have attached a tar.gz file, if anyone can give it a whirl and let me know what you find, I would appreciate it.

Thank you,
Zamir

PS. I think you might have to rename the file tar.gz.
PPS. I have tried on liggghts 1.5.2 and 1.5.3.

ADDITION:
The message in the terminal is as follows:
[Zanzibar:03005] *** Process received signal ***
[Zanzibar:03005] Signal: Segmentation fault (11)
[Zanzibar:03005] Signal code: Address not mapped (1)
[Zanzibar:03005] Failing at address: 0x18b5b4410
[Zanzibar:03005] [ 0] /lib/x86_64-linux-gnu/libpthread.so.0(+0xfc60) [0x7fb47cc9ac60]
[Zanzibar:03005] [ 1] liggghts-stable(_ZN9LAMMPS_NS20PairGranHookeHistory7computeEiii+0x565) [0x5d3b21]
[Zanzibar:03005] [ 2] liggghts-stable(_ZN9LAMMPS_NS8PairGran7computeEii+0x12) [0x5d0e2c]
[Zanzibar:03005] [ 3] liggghts-stable(_ZN9LAMMPS_NS6Verlet5setupEv+0x15e) [0x63acb8]
[Zanzibar:03005] [ 4] liggghts-stable(_ZN9LAMMPS_NS3Run7commandEiPPc+0x5bd) [0x61d527]
[Zanzibar:03005] [ 5] liggghts-stable(_ZN9LAMMPS_NS5Input15execute_commandEv+0xc08) [0x580922]
[Zanzibar:03005] [ 6] liggghts-stable(_ZN9LAMMPS_NS5Input4fileEv+0x27a) [0x5812cc]
[Zanzibar:03005] [ 7] liggghts-stable(main+0x46) [0x58691a]
[Zanzibar:03005] [ 8] /lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xff) [0x7fb47c914eff]
[Zanzibar:03005] [ 9] liggghts-stable() [0x4743f9]
[Zanzibar:03005] *** End of error message ***

AttachmentSize
Binary Data share.tar_.gz204.32 KB
ckloss's picture

ckloss | Fri, 07/20/2012 - 11:50

Could you please re-run the case with the version 2.0.4?
Bug fixes are made for the latest and greatest version only

Cheers, Christoph