Informix - Problem description
Problem IT27523 | Status: Closed
HDR PRIMARY DR_GETTYPE THREAD AND SECONDARY DR_ACCEPT THREADS CAN HANG ATTEMPTING TO RECONNECT AFTER DR:TURNED OFF ON PRIMARY SE
Product:
INFORMIX SERVER / 5725A3900 / C10 - IDS 12.10
Problem description:
After network issues caused HDR to be shut down, reconnecting the primary to HDR left threads on both the primary and the secondary permanently hung. The threads were never released, which prevented HDR from resynchronizing.

On the primary, the dr_prsend thread was observed waiting for a dr_gettype thread that was stuck:

Thread CPU Info:
 tid      name        vp    Last Run        CPU Time  #scheds  status
 147      dr_prsend   8cpu  10/12 10:03:51  65.1068   1977253  join wait 3679913
 3679913  dr_gettype  9cpu  10/12 19:11:09  0.2838    33084    cond wait smx pipe1

The stack of the dr_gettype thread showed it stuck in smx_connect:

Stack for thread: 3679913 dr_gettype
 base: 0x000000015098d000
 len: 69632
 pc: 0x00000001012c63c0
 tos: 0x000000015099cdd1
 state: cond wait
 vp: 10
oninit :: yield_processor_mvp
oninit :: mt_wait
oninit :: smx_connect
oninit :: SC_smx_sporadic_connect
oninit :: SC_maxmsg_ping
oninit :: GetServerVersionInfo
oninit :: verify_server_version
oninit :: dr_whattype
oninit :: startup

At the same time, on the HDR secondary, a dr_accept thread was also hung. Its stack was:

Stack for thread: 47176 dr_accept
oninit :: yield_processor_mvp
oninit :: mt_wait
oninit :: net_buf_get
oninit :: recvtli
oninit :: slSQIrecv
oninit :: pfRecv
oninit :: asfRecv
oninit :: ASF_Call
oninit :: rsasf_recv_buf
oninit :: rsasf_recv_with_timeout
oninit :: dr_asf_recv_with_timeout
oninit :: dr_session_recv_with_timeout
oninit :: dr_acceptInt
oninit :: dr_accept
oninit :: listen_verify
oninit :: spawn_thread
oninit :: th_init_initgls

In this scenario, both the dr_accept thread on the HDR secondary and the dr_gettype thread on the primary had been hung for over 9 hours. Restarting the HDR secondary did not release the dr_gettype thread on the primary; the primary had to be restarted before HDR could resynchronize.
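The symptom described above can be spotted mechanically in a thread listing. The following is a minimal sketch, not an IBM tool: it parses the two rows quoted in this report (assumed here to be in the "Thread CPU Info" layout, such as that printed by onstat -g cpu) and flags dr_* threads that sit in a wait state. The function names parse_threads, waiting_dr_threads, and join_target are hypothetical, and the column layout is an assumption taken from the sample only.

```python
import re

# Sample rows copied from the thread listing quoted in this report.
SAMPLE = """\
 tid      name        vp    Last Run        CPU Time  #scheds  status
 147      dr_prsend   8cpu  10/12 10:03:51  65.1068   1977253  join wait 3679913
 3679913  dr_gettype  9cpu  10/12 19:11:09  0.2838    33084    cond wait smx pipe1
"""

# Assumed column layout: tid, name, vp, last run (date + time),
# CPU time, number of schedules, and a free-text status.
ROW = re.compile(
    r"^\s*(?P<tid>\d+)\s+(?P<name>\S+)\s+(?P<vp>\S+)\s+"
    r"(?P<last_run>\d+/\d+\s+\d+:\d+:\d+)\s+(?P<cpu>[\d.]+)\s+"
    r"(?P<scheds>\d+)\s+(?P<status>.+?)\s*$"
)


def parse_threads(text):
    """Return one dict per data row; header and blank lines are skipped."""
    rows = []
    for line in text.splitlines():
        match = ROW.match(line)
        if match:
            rows.append(match.groupdict())
    return rows


def waiting_dr_threads(rows):
    """Select HDR (dr_*) threads whose status contains a wait state."""
    return [r for r in rows
            if r["name"].startswith("dr_") and "wait" in r["status"]]


def join_target(row):
    """For a 'join wait <tid>' status, return the tid being waited on."""
    match = re.match(r"join wait (\d+)", row["status"])
    return match.group(1) if match else None


if __name__ == "__main__":
    for row in waiting_dr_threads(parse_threads(SAMPLE)):
        print(row["tid"], row["name"], row["status"], join_target(row))
```

A wait state alone does not prove a hang. In this report the telling detail was that dr_prsend was in "join wait" on a dr_gettype thread whose last run time had not advanced for hours, so a real check would compare two listings taken some time apart.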
Problem Summary:
****************************************************************
* USERS AFFECTED:                                              *
* Users of IDS 12.10.xC10 and earlier versions.                *
****************************************************************
* PROBLEM DESCRIPTION:                                         *
* HDR primary dr_gettype thread and secondary dr_accept        *
* threads can hang attempting to reconnect after DR:Turned off *
* on primary server.                                           *
****************************************************************
* RECOMMENDATION:                                              *
****************************************************************
Local Fix:
Solution
Workaround
not known / see Local fix
Timestamps
Date - problem reported : 24.12.2018
Date - problem closed : 07.10.2019
Date - last modified : 07.10.2019
Problem solved at the following versions (IBM BugInfos)
12.10.xC11
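To check whether a given server level falls in the affected range (12.10.xC10 and earlier, fixed in 12.10.xC11), a version string can be compared against the fix level. This is a rough sketch under stated assumptions: version strings are taken to have the form major.minor.<letter>C<level> (for example 12.10.xC10 or 12.10.FC9), interim suffixes are not handled, and releases later than the fix level are assumed to carry the fix. The names FIXED_IN, parse_version, and is_affected are hypothetical.

```python
import re

# Fix level taken from this report: 12.10.xC11.
FIXED_IN = (12, 10, 11)


def parse_version(version):
    """Turn a string such as '12.10.xC10' into the tuple (12, 10, 10).

    Assumption: the platform letter before 'C' (x, F, U, ...) does not
    matter for the comparison and is ignored.
    """
    match = re.fullmatch(r"(\d+)\.(\d+)\.[A-Za-z]C(\d+)", version.strip())
    if not match:
        raise ValueError(f"unrecognized version string: {version!r}")
    return tuple(int(part) for part in match.groups())


def is_affected(version):
    """True if the version is older than the level that contains the fix."""
    return parse_version(version) < FIXED_IN


if __name__ == "__main__":
    for candidate in ("12.10.xC10", "12.10.FC9", "12.10.xC11"):
        print(candidate, is_affected(candidate))
```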
Problem solved according to the fixlist(s) of the following version(s)