Multicontact model crashes without error

Submitted by amgroen on Thu, 07/04/2019 - 18:27

When using the "gran surface multicontact" my simulation crashes for no apparent reason and Liggghts does not output an error.
I'm using the latest Liggghts that I built myself on Windows with MSMPI and vtk

When running with a single processor, the visual C++ error popup says: "cannot dereference end list iterator"
An image of the error is attached.

I've also attached my script as a reference

The code crashes after inserting 800 particles which takes around 10 minutes to run on my computer. I can produce the crash much more quickly with multiple-processors but then I don't get the nice error popup shown in the attached image.

I'm not sure if this issue can be resolved with changing my script or if there is something wrong with the C++ code.

Is the multicontact model not compatible with inserting particles? Or defining meshes after the fix multicontact/halfspace is used?
The documentation says that it is not stored in binary restart files, can the multicontact model still be applied to particles that are inserted from a binary restart file?

Also, trying to run the example code from github with 4 processors immediately crashes with:
"Fatal error in MPI_Wait: Message truncated, error stack:
MPI_Wait(request=0x054FF290, status0x054FF274) failed
Message from rank 1 and tag 0 truncated; 209176 bytes received but buffer size is 117120"

Hopefully, the multicontact model is just out-dated and requires a simple update

Thanks,
Alex

AttachmentSize
Plain text icon code3.55 KB
Image icon Windows error13.48 KB

amgroen | Wed, 07/10/2019 - 03:22

A follow-up is that I tried a variety of different things but I'm not knowledgeable enough to try to fix the code.

This error in the public example occurs with:
My version, the Windows binary, on Ubuntu (OpenMPI) with the newest version as well as 3.6 (the tutorial was released)

The next thing that I could try is MPICH.
I'm not convinced that the multicontact model ever worked with MPI and multiple cores

I also found another thread on the forum which was not resolved: https://www.cfdem.com/forums/liggghts-cluster-singularity

The error in Ubuntu displays as:
Import and parallelization of mesh piston_m containing 2 triangle(s) successful
*** An error occurred in MPI_Wait
*** reported by process [1659305985,0]
*** on communicator MPI_COMM_WORLD
*** MPI_ERR_TRUNCATE: message truncated
*** MPI_ERRORS_ARE_FATAL (processes in this communicator will now abort,
*** and potentially your MPI job)
2 more processes have sent help message help-mpi-errors.txt / mpi_errors_are_fatal
Set MCA parameter "orte_base_help_aggregate" to 0 to see all help / error messages

Since I have many simulations to run I'm just going to run several simulations on single cores.
Still would be nice to have an update about this issue