Opened 2 years ago

Last modified 2 years ago

#118 new defect

Sequential run of the model does not work

Reported by: abierjon
Owned by:
Priority: major
Milestone:
Component: MARS GCM
Version:
Keywords:
Cc:

Description

It seems (at least on Ciclad and Spirit) that the model (r2828) currently cannot run sequentially, i.e. when it is compiled without the -parallel option.

Indeed, the model stops just after dynredem0 and outputs this error in the log:

 dynredem0: rad,omeg,g,cpp,kappa   3397200.00000000       7.077651389408857E-005
   3.72000002861023        744.499000000000       0.256792992353439     
[spirit64-08:452017:0:452017] Caught signal 11 (Segmentation fault: invalid permissions for mapped object at address 0x996798)
==== backtrace (tid: 452017) ====
 0  /net/nfs/tools/u20/22.3/PrgEnv/intel/linux-ubuntu20.04-x86_64/ucx/1.11.2-intel-2021.4.0-galjorcjt7gfloq54gvzbfmsvmb2k4ac/lib/libucs.so.0(ucs_handle_error+0x352) [0x147c0ef743d2]
 1  /net/nfs/tools/u20/22.3/PrgEnv/intel/linux-ubuntu20.04-x86_64/ucx/1.11.2-intel-2021.4.0-galjorcjt7gfloq54gvzbfmsvmb2k4ac/lib/libucs.so.0(+0x2efe7) [0x147c0ef74fe7]
 2  /net/nfs/tools/u20/22.3/PrgEnv/intel/linux-ubuntu20.04-x86_64/ucx/1.11.2-intel-2021.4.0-galjorcjt7gfloq54gvzbfmsvmb2k4ac/lib/libucs.so.0(+0x2eadb) [0x147c0ef74adb]
 3  /lib/x86_64-linux-gnu/libpthread.so.0(+0x14420) [0x147c1374f420]
 4  ./gcm_64x48x54_phymars_seq.e() [0x43a755]
 5  ./gcm_64x48x54_phymars_seq.e() [0x411403]
 6  ./gcm_64x48x54_phymars_seq.e() [0x40ffa2]
 7  /lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xf3) [0x147c12be6083]
 8  ./gcm_64x48x54_phymars_seq.e() [0x40feae]
=================================

There is no error if we compile the model with -parallel mpi and run it on 1 core.
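The frames inside the executable in the backtrace above are raw addresses only, so the failing routine is not visible. As a possible next step, here is a minimal sketch (not part of the original report) that extracts those addresses from the Spirit log and builds an addr2line command for them. It assumes binutils' addr2line is available on the machine; addr2line generally reports source files and line numbers only if the model was compiled with debug information. The helper names executable_frames and addr2line_command are hypothetical.

```python
import re

# Frames copied verbatim from the Spirit backtrace in this ticket.
BACKTRACE = """\
 3  /lib/x86_64-linux-gnu/libpthread.so.0(+0x14420) [0x147c1374f420]
 4  ./gcm_64x48x54_phymars_seq.e() [0x43a755]
 5  ./gcm_64x48x54_phymars_seq.e() [0x411403]
 6  ./gcm_64x48x54_phymars_seq.e() [0x40ffa2]
 7  /lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xf3) [0x147c12be6083]
 8  ./gcm_64x48x54_phymars_seq.e() [0x40feae]
"""

EXECUTABLE = "./gcm_64x48x54_phymars_seq.e"


def executable_frames(backtrace, executable):
    """Return the addresses of the frames that belong to `executable` (hypothetical helper)."""
    pattern = re.compile(
        r"^\s*\d+\s+" + re.escape(executable) + r"\(\)\s+\[(0x[0-9a-fA-F]+)\]\s*$"
    )
    addresses = []
    for line in backtrace.splitlines():
        match = pattern.match(line)
        if match:
            addresses.append(match.group(1))
    return addresses


def addr2line_command(executable, addresses):
    """Build the addr2line argument list (hypothetical helper).

    -f asks for function names, -e selects the binary to inspect.
    """
    return ["addr2line", "-f", "-e", executable, *addresses]


if __name__ == "__main__":
    addrs = executable_frames(BACKTRACE, EXECUTABLE)
    # Print the command so it can be pasted into a shell next to the executable.
    print(" ".join(addr2line_command(EXECUTABLE, addrs)))
```

Running the printed command in the directory that contains the sequential executable should name the routines behind frames 4 to 6 and 8, provided the binary carries symbols.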

Change History (1)

comment:1 Changed 2 years ago by abierjon

This is also the case for runs on LMD machines:

 dynredem0: rad,omeg,g,cpp,kappa   3397200.00000000       7.077651389408857E-005
   3.72000002861023        744.499000000000       0.256792992353439     
forrtl: severe (174): SIGSEGV, segmentation fault occurred
Image              PC                Routine            Line        Source             
libintlc.so.5      00002B2902916E01  Unknown               Unknown  Unknown
libintlc.so.5      00002B29029152B7  Unknown               Unknown  Unknown
libifcore.so.5     00002B29017D2782  Unknown               Unknown  Unknown
libifcore.so.5     00002B29017D25D6  Unknown               Unknown  Unknown
libifcore.so.5     00002B290173E7A9  Unknown               Unknown  Unknown
libifcore.so.5     00002B290174FC48  Unknown               Unknown  Unknown
libpthread.so.0    00002B2902B77630  Unknown               Unknown  Unknown
gcm_64x48x54_phym  00000000004310A6  Unknown               Unknown  Unknown
gcm_64x48x54_phym  000000000040616C  Unknown               Unknown  Unknown
gcm_64x48x54_phym  0000000000404CBE  Unknown               Unknown  Unknown
libc.so.6          00002B2902DA6555  Unknown               Unknown  Unknown
gcm_64x48x54_phym  0000000000404BC9  Unknown               Unknown  Unknown