
forrtl: severe (174): SIGSEGV, segmentation fault occurred, caused by the latest ifort

CosmoCoffee Forum Index -> Computers and software

Yutong Wang
Joined: 06 May 2014
Posts: 9
Affiliation: UCAS

Posted: October 14 2017

Hi everyone,
I am using the latest ifort (2018.0.128) to run CosmoMC. I set action=2 in test.ini and started the program; after it had run overnight, I got this error:

forrtl: severe (174): SIGSEGV, segmentation fault occurred
Image PC Routine Line Source
cosmomc 00000000006DE5ED for__signal_handl Unknown Unknown
libpthread-2.21.s 00007FF63292CD10 Unknown Unknown Unknown
libiomp5.so 00007FF6338D2E05 Unknown Unknown Unknown
cosmomc 0000000000710B0A for_dealloc_alloc Unknown Unknown
cosmomc 00000000005DEC68 Unknown Unknown Unknown
cosmomc 00000000006730C9 Unknown Unknown Unknown
cosmomc 000000000050F5E4 Unknown Unknown Unknown
cosmomc 000000000049CAA3 Unknown Unknown Unknown
cosmomc 00000000004DB8B2 Unknown Unknown Unknown
cosmomc 00000000004DA81A Unknown Unknown Unknown
cosmomc 00000000004E6552 Unknown Unknown Unknown
cosmomc 00000000005BB345 Unknown Unknown Unknown
cosmomc 0000000000410F2E Unknown Unknown Unknown
libc-2.21.so 00007FF632572A40 __libc_start_main Unknown Unknown
cosmomc 0000000000410E29 Unknown Unknown Unknown
-------------------------------------------------------
Primary job terminated normally, but 1 process returned
a non-zero exit code. Per user-direction, the job has been aborted.
-------------------------------------------------------
forrtl: error (78): process killed (SIGTERM)
Image PC Routine Line Source
cosmomc 00000000006DE61E for__signal_handl Unknown Unknown
libpthread-2.21.s 00007F8762B85D10 Unknown Unknown Unknown
libpthread-2.21.s 00007F8762B84BFA Unknown Unknown Unknown
libpthread-2.21.s 00007F8762B803F0 __pthread_mutex_u Unknown Unknown
libiomp5.so 00007F8763AEEEED Unknown Unknown Unknown
libiomp5.so 00007F8763A8B8D5 Unknown Unknown Unknown
libiomp5.so 00007F8763A8D22D Unknown Unknown Unknown
libiomp5.so 00007F8763AB8458 __kmp_fork_call Unknown Unknown
libiomp5.so 00007F8763A8F7DE __kmpc_fork_call Unknown Unknown
cosmomc 00000000005F6F21 Unknown Unknown Unknown
cosmomc 0000000000670D87 Unknown Unknown Unknown
cosmomc 000000000064F0F2 Unknown Unknown Unknown
cosmomc 0000000000672AD1 Unknown Unknown Unknown
cosmomc 000000000050586E Unknown Unknown Unknown
cosmomc 00000000005031B2 Unknown Unknown Unknown
cosmomc 00000000005B35E4 Unknown Unknown Unknown
cosmomc 00000000005B0FB9 Unknown Unknown Unknown
cosmomc 00000000004DB5E0 Unknown Unknown Unknown
cosmomc 00000000004DA81A Unknown Unknown Unknown
cosmomc 00000000004E6552 Unknown Unknown Unknown
cosmomc 00000000005BB345 Unknown Unknown Unknown
cosmomc 0000000000410F2E Unknown Unknown Unknown
libc-2.21.so 00007F87627CBA40 __libc_start_main Unknown Unknown
cosmomc 0000000000410E29 Unknown Unknown Unknown
forrtl: error (78): process killed (SIGTERM)
Image PC Routine Line Source
cosmomc 00000000006DE61E for__signal_handl Unknown Unknown
libpthread-2.21.s 00007F8E5E10CD10 Unknown Unknown Unknown
cosmomc 0000000000607794 Unknown Unknown Unknown
cosmomc 00000000005D54AF Unknown Unknown Unknown
cosmomc 00000000005FFABF Unknown Unknown Unknown
cosmomc 00000000005FF14C Unknown Unknown Unknown
cosmomc 00000000005FF14C Unknown Unknown Unknown
cosmomc 0000000000651D1C Unknown Unknown Unknown
cosmomc 000000000065017E Unknown Unknown Unknown
libiomp5.so 00007F8E5F06FAC3 __kmp_invoke_micr Unknown Unknown
libiomp5.so 00007F8E5F03E257 Unknown Unknown Unknown
libiomp5.so 00007F8E5F03F498 __kmp_fork_call Unknown Unknown
libiomp5.so 00007F8E5F0167DE __kmpc_fork_call Unknown Unknown
cosmomc 000000000064F1FB Unknown Unknown Unknown
cosmomc 0000000000672AD1 Unknown Unknown Unknown
cosmomc 000000000050586E Unknown Unknown Unknown
cosmomc 00000000005031B2 Unknown Unknown Unknown
cosmomc 00000000005B35E4 Unknown Unknown Unknown
cosmomc 00000000005B0FB9 Unknown Unknown Unknown
cosmomc 00000000004DB5E0 Unknown Unknown Unknown
cosmomc 00000000004DA81A Unknown Unknown Unknown
cosmomc 00000000004E6552 Unknown Unknown Unknown
cosmomc 00000000005BB345 Unknown Unknown Unknown
cosmomc 0000000000410F2E Unknown Unknown Unknown
libc-2.21.so 00007F8E5DD52A40 __libc_start_main Unknown Unknown
cosmomc 0000000000410E29 Unknown Unknown Unknown
forrtl: error (78): process killed (SIGTERM)
Image PC Routine Line Source
cosmomc 00000000006DE61E for__signal_handl Unknown Unknown
libpthread-2.21.s 00007FCE58FB2D10 Unknown Unknown Unknown
libclik_mkl.so 00007FCE54532B59 mkl_blas_avx_dsyr Unknown Unknown
libclik_mkl.so 00007FCE54AB29AF mkl_blas_avx_dsyr Unknown Unknown
libclik_mkl.so 00007FCE544EDEAF mkl_blas_avx_dsyr Unknown Unknown
libmkl_avx.so 00007FCE410EDE7A mkl_blas_avx_xdsy Unknown Unknown
libmkl_intel_thre 00007FCE5C215643 mkl_blas_dsyrk_om Unknown Unknown
libmkl_intel_thre 00007FCE5C1EA885 mkl_blas_dsyrk Unknown Unknown
libmkl_core.so 00007FCE5A98E852 mkl_lapack_dpotrf Unknown Unknown
libmkl_core.so 00007FCE5ADBE757 mkl_lapack_xdpotr Unknown Unknown
libmkl_intel_thre 00007FCE5CC28368 mkl_lapack_dpotrf Unknown Unknown
libmkl_core.so 00007FCE5AFDB839 mkl_lapack_ao_dpo Unknown Unknown
libmkl_core.so 00007FCE5A98CE8E mkl_lapack_dposv Unknown Unknown
libmkl_intel_lp64 00007FCE5DFC89D2 dposv Unknown Unknown
libclik.so 00007FCE58513525 bflike_smw_mp_get Unknown Unknown
libclik.so 00007FCE584D886A bflike_smw_extra_ Unknown Unknown
libclik.so 00007FCE584B5478 bflike_smw_lkl Unknown Unknown
libclik.so 00007FCE584784E5 lklbs_lkl Unknown Unknown
libclik.so 00007FCE5848BAD1 distribution_lkl Unknown Unknown
libclik.so 00007FCE58476FD4 clik_compute Unknown Unknown
libclik_f90.so 00007FCE5EB0670F fortran_clik_comp Unknown Unknown
libclik_f90.so 00007FCE5EB07037 clik_mp_clik_comp Unknown Unknown
cosmomc 00000000005158DE Unknown Unknown Unknown
cosmomc 0000000000503D70 Unknown Unknown Unknown
cosmomc 00000000005B450D Unknown Unknown Unknown
cosmomc 00000000005B35F8 Unknown Unknown Unknown
cosmomc 00000000005B0FB9 Unknown Unknown Unknown
cosmomc 00000000004DB5E0 Unknown Unknown Unknown
cosmomc 00000000004DA81A Unknown Unknown Unknown
cosmomc 00000000004E6552 Unknown Unknown Unknown
cosmomc 00000000005BB345 Unknown Unknown Unknown
cosmomc 0000000000410F2E Unknown Unknown Unknown
libc-2.21.so 00007FCE58BF8A40 __libc_start_main Unknown Unknown
cosmomc 0000000000410E29 Unknown Unknown Unknown
BK14_dust lnlike = 322.869937285896
--------------------------------------------------------------------------
mpirun detected that one or more processes exited with non-zero status, thus causing
the job to be terminated. The first process to do so was:

Process name: [[27820,1],3]
Exit code: 174
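
The output above interleaves tracebacks from several MPI ranks, so it can help to separate the one rank that actually segfaulted from the ranks that were only killed with SIGTERM afterwards. The Python below is a minimal sketch for doing that. It is not part of CosmoMC or the Intel tools: the function names (parse_tracebacks, crashing_blocks, named_frames) are made up for illustration, and it assumes every frame line has the five whitespace-separated columns of the header (Image, PC, Routine, Line, Source) with a 16-digit hexadecimal PC. The sample text is a shortened copy of the first lines of the log.

```python
import re

# One frame line: image, 16-hex-digit PC, routine, line, source.
FRAME = re.compile(r"^(\S+)\s+([0-9A-Fa-f]{16})\s+(\S+)\s+(\S+)\s+(\S+)$")


def parse_tracebacks(text):
    """Return one record per 'forrtl:' header, each with its frame list."""
    blocks = []
    for raw in text.splitlines():
        line = raw.strip()
        if line.startswith("forrtl:"):
            blocks.append({"header": line, "frames": []})
            continue
        match = FRAME.match(line)
        if match and blocks:
            # Keep only image, PC and routine; Line/Source are 'Unknown' here.
            image, pc, routine = match.group(1, 2, 3)
            blocks[-1]["frames"].append((image, pc, routine))
    return blocks


def crashing_blocks(blocks):
    """Keep the tracebacks whose header reports a segmentation fault."""
    return [b for b in blocks if "SIGSEGV" in b["header"]]


def named_frames(block):
    """Frames whose routine column was resolved (not 'Unknown')."""
    return [(img, rtn) for img, _pc, rtn in block["frames"] if rtn != "Unknown"]


# Shortened excerpt of the log above (assumption: plain ASCII hyphens).
SAMPLE = """\
forrtl: severe (174): SIGSEGV, segmentation fault occurred
Image PC Routine Line Source
cosmomc 00000000006DE5ED for__signal_handl Unknown Unknown
libpthread-2.21.s 00007FF63292CD10 Unknown Unknown Unknown
libiomp5.so 00007FF6338D2E05 Unknown Unknown Unknown
cosmomc 0000000000710B0A for_dealloc_alloc Unknown Unknown
forrtl: error (78): process killed (SIGTERM)
Image PC Routine Line Source
cosmomc 00000000006DE61E for__signal_handl Unknown Unknown
"""

if __name__ == "__main__":
    for block in crashing_blocks(parse_tracebacks(SAMPLE)):
        print(block["header"])
        for image, routine in named_frames(block):
            print(" ", image, routine)
```

On the full log, only the first traceback carries the SIGSEGV header; the other three are SIGTERM kills. In that first traceback the resolved routine names are for__signal_handl, for_dealloc_alloc (both cut off by the column width) and __libc_start_main, with a libiomp5.so frame sitting between the first two.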

I am using the CosmoMC master version on Ubuntu 15.04 with Open MPI 3.0.0. I tried the command "ulimit -s unlimited" to lift the stack size limit, but the error persists.
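
For reference on the stack-size attempt: the shell's ulimit -s governs the stack of each process's main thread, whereas OpenMP worker threads (the traceback passes through libiomp5.so) normally take their stack size from the standard OMP_STACKSIZE environment variable. The Python below is a minimal sketch of launching a command with both raised. The helper names and the 512M value are assumptions chosen for illustration, and nothing here is confirmed to cure this particular crash.

```python
import os
import resource
import subprocess


def raise_stack_limit():
    """Raise the soft stack limit to the hard limit in the child process.

    This matches 'ulimit -s unlimited' only when the hard limit is unlimited.
    """
    soft, hard = resource.getrlimit(resource.RLIMIT_STACK)
    try:
        resource.setrlimit(resource.RLIMIT_STACK, (hard, hard))
    except (ValueError, OSError):
        # Keep the current limit if the system refuses the change.
        pass


def run_with_big_stacks(cmd, omp_stacksize="512M"):
    """Run cmd with a raised main-thread stack and a larger OpenMP thread stack.

    omp_stacksize is a hypothetical value; tune it for the actual job.
    """
    env = dict(os.environ, OMP_STACKSIZE=omp_stacksize)
    # preexec_fn runs in the child just before exec, so the new limit is
    # inherited by the launched program and the processes it starts.
    return subprocess.run(cmd, env=env, preexec_fn=raise_stack_limit)
```

For the run described in this thread, the call would be run_with_big_stacks(["mpirun", "-np", "4", "./cosmomc", "test.ini"]).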

Any clues?

PS: In fact, with ifort 2018.0.128 and Open MPI 3.0.0 on Ubuntu 16.04, after compiling CosmoMC (master version) and running it with the command "mpirun -np 4 ./cosmomc test.ini" (with action=0 set in test.ini), I get the same error. I find that if I include batch2/BK14.ini, batch2/plik_dx11dr2_HM_v18_TTTEEE.ini and batch2/lowTEB.ini in test.ini, the error occurs; if I include only batch2/plik_dx11dr2_HM_v18_TT.ini, there is no error and everything runs fine. On Ubuntu 15.04, however, I do not encounter this error and the program runs correctly.