DB2 - Problem description
| Problem IC77518 | Status: Closed |
Instance hung at function sqmFastWriterQueueMgr::allocFastWriterRecords when activity event monitors are enabled | |
| product: | |
DB2 FOR LUW / DB2FORLUW / 970 - DB2 | |
| Problem description: | |
When an activity event monitor is active and collection is enabled, agents generating activity records may block waiting for records while holding activity list latches (SQLO_LT_sqlrr_curr_activity_cb__sqlrr_curr_activity_cb_latch). This can result in a hang, because the fast writers (which release records for reuse) end up waiting on those latches. | |
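The circular wait described above can be sketched as a small wait-for graph. This is an illustrative model only, not DB2 code: the node labels (`agent`, `record_pool`, `fast_writer`, `activity_latch`) and the helper `find_cycle` are hypothetical names chosen for this sketch, and the edges simply restate the description.

```python
# Illustrative model only; node names are hypothetical labels, not DB2 internals.
# An edge A -> B means "A cannot make progress until B does".
WAITS_FOR = {
    "agent": ["record_pool"],           # agent blocks waiting for fast writer records
    "record_pool": ["fast_writer"],     # records are released for reuse by fast writers
    "fast_writer": ["activity_latch"],  # fast writer waits on the activity list latch
    "activity_latch": ["agent"],        # latch is held by the blocked agent
}


def find_cycle(graph, start):
    """Return the first wait cycle reachable from start, or None if there is none."""
    path = []
    on_path = set()

    def visit(node):
        if node in on_path:
            # Reached a node already on the current path: that closes a cycle.
            return path[path.index(node):] + [node]
        path.append(node)
        on_path.add(node)
        for nxt in graph.get(node, []):
            cycle = visit(nxt)
            if cycle:
                return cycle
        path.pop()
        on_path.remove(node)
        return None

    return visit(start)


cycle = find_cycle(WAITS_FOR, "agent")
print(" -> ".join(cycle))
```

In this model every participant waits on the next, so none of them can release what the others need, which matches the hang reported here.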
| Problem Summary: | |
****************************************************************
* USERS AFFECTED: *
* All users that use activity event monitors *
****************************************************************
* PROBLEM DESCRIPTION: *
* Two fast writers are waiting on two different *
* SQLP_LHSH__xhshlatches. One (22345.98.000.stack.txt) is *
* performing an insert and is waiting on one of them. The *
* other (22345.97.000.stack.txt) is committing some work and *
* is waiting on the other latch so it can release some *
* locks: *
* *
* *
* <StackTrace> *
* *
* (0) 0xc000000027da3120 ossDumpStackTrace *
* (1) 0xc000000027d9ba80 OSSTrapFile 0x100 ossstack.C:1971 *
* *
* (2) 0xc000000051ff02d0 sqlo_trce *
* (3) 0xc0000000520c6370 sqloDumpDiagInfoHandler *
* (4) 0xe0000001205c6f80 ---- Signal 16 (SIGUSR1) delivered *
* ---- *
* (5) 0xc0000000012f21d0 sched_yield *
* (6) 0xc00000004c8b8ec0 sqloSpinLockConflict *
* (7) 0xc00000004cd50df0 sqlplrm *
* (8) 0xc00000004cdad030 sqlpxcm1 sqlpxcm1.C:502 *
* (9) 0xc00000004c9f2010 sqlrrcom_dps *
* (10) 0xc00000004c90e060 sqlrrcom *
* (11) 0xc00000004fbe54a0 sqmRecordTypeArray::commitInserts *
* (12) 0xc00000004fb58ac0 sqmRecordTypeArray::processRecord *
* engn/include/sqlm_fast_writer_target_inlines.h:3096 *
* *
* (13) 0xc00000004fbccc50 sqmFastWriter::processWriterRecord *
* sqlm_fast_writer.C:365 *
* *
* (14) 0xc00000004c5e8100 sqmFastWriter::executeFastWriter *
* (15) 0xc00000004c5ed0e0 sqleIndCoordProcessRequest *
* (16) 0xc00000004c6fbef0 sqeAgent::RunEDU sqle_agent.C:1237 *
* (17) 0xc00000004d11d170 sqlzRunEDU sqlz_edu_obj.C:371 *
* *
* (18) 0xc00000004c7545c0 sqloEDUEntry *
* (19) 0xc0000000000fb220 __pthread_bound_body *
* /ux/core/libs/threadslibs/src/common/pthreads/pthread.c:4851 *
* *
* </StackTrace> *
* *
* *
* *
* Sadly, both of those are waiting on two different *
* SQLO_LT_sqlrr_curr_activity_cb__sqlrr_curr_activity_cb_latch *
* latches. Those latches are held by 22345.461.000.stack.txt & *
* 22345.461.000.stack.txt, both of which are in *
* allocFastWriterRecords waiting for fast writer records. *
* *
* *
* *
* *
* *
* The other 4 fast writers (22345.95.000.stack.txt, *
* 22345.96.000.stack.txt, 22345.99.000.stack.txt, *
* 22345.100.000.stack.txt) are performing an INSERT and are *
* stuck in sqlbVerifyAndLatchPage. I believe they're stuck *
* because the bufferpool pages from the above fast writers *
* haven't been committed and so cannot be ejected from the *
* bufferpool. *
* *
* *
* *
* <StackTrace> *
* *
* (0) 0xc000000027da3120 ossDumpStackTrace *
* (1) 0xc000000027d9ba80 OSSTrapFile ossstack.C:1971 *
* *
* (2) 0xc000000051ff02d0 sqlo_trce sqlodump.C:1595 *
* (3) 0xc0000000520c6370 sqloDumpDiagInfoHandler *
* sqloedu.C:5085 *
* (4) 0xe0000001205c6f80 ---- Signal 16 (SIGUSR1) delivered *
* ---- *
* (5) 0xc00000000041abf0 _semop_sys *
* (6) 0xc00000000042c560 _semop *
* ../../../../../core/libs/libc/shared_em_64_perf/../core/syscalls/t_semop.c:19 *
* *
* (7) 0xc00000004c8c4390 SQLO_SLATCH_CAS::getConflictComplex *
* (8) 0xc00000004c8267a0 SQLO_SLATCH_CAS::getConflictEm *
* (9) 0xc00000004cb17580 sqloltch_notrack::sqlo_latch *
* /view/db2_v97fp4_hpipf64_s110330/vbs/engn/include/sqloLatch.h:234 *
* (10) 0xc00000004cecde80 sqlbVerifyAndLatchPage *
* /view/db2_v97fp4_hpipf64_s110330/vbs/engn/sqb/inc/sqlbslat.h:417 *
* (11) 0xc00000004cb7dfa0 sqlbfix *
* (12) 0xc00000004c862c60 sqldGetPageForAppend *
* (13) 0xc00000004ce01e70 sqldInsertRow *
* (14) 0xc00000004cd9e9a0 sqldRowInsert sqldmisr.C:825 *
* *
* (15) 0xc00000004ce4a750 sqlrinsr sqlri_aux.C:729 *
* (16) 0xc00000004fb59360 sqmRecordTypeArray::processRecord *
* /view/db2_v97fp4_hpipf64_s110330/vbs/engn/include/sqlm_fast_writer_target_inlines.h:481 *
* (17) 0xc00000004fbccc50 sqmFastWriter::processWriterRecord *
* sqlm_fast_writer.C:365 *
* (18) 0xc00000004c5e8100 sqmFastWriter::executeFastWriter *
* (19) 0xc00000004c5ed0e0 sqleIndCoordProcessRequest *
* (20) 0xc00000004c6fbef0 sqeAgent::RunEDU sqle_agent.C:1237 *
* (21) 0xc00000004d11d170 sqlzRunEDU sqlz_edu_obj.C:371 *
* *
* (22) 0xc00000004c7545c0 sqloEDUEntry *
* (23) 0xc0000000000fb220 __pthread_bound_body *
* /ux/core/libs/threadslibs/src/common/pthreads/pthread.c:4851 *
* *
* </StackTrace> *
****************************************************************
* RECOMMENDATION: *
* Upgrade to Version 9.7 Fix Pack 5 *
**************************************************************** | |
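When triaging a hang like this across many stack files, it can help to skip the dump-handler frames and look only at what each EDU was doing when SIGUSR1 arrived. The following is a minimal sketch, assuming the stack files follow the frame layout excerpted in this report; the helper name `frames_below_signal` and the regular expression are hypothetical and not part of any DB2 tool.

```python
import re

# Matches frame lines in the layout shown above, e.g.
# "(5) 0xc0000000012f21d0 sched_yield"; surrounding box characters ("*") are tolerated.
FRAME_RE = re.compile(r"\((\d+)\)\s+(0x[0-9a-fA-F]+)\s+(.*?)\s*\*?\s*$")


def frames_below_signal(trace_text):
    """Return the symbols of the frames that were active when the dump signal arrived."""
    symbols = []
    seen_signal = False
    for line in trace_text.splitlines():
        match = FRAME_RE.search(line)
        if not match:
            continue  # continuation lines (file paths) and box borders
        rest = match.group(3)
        if "Signal" in rest and "delivered" in rest:
            seen_signal = True  # frames before this line belong to the dump handler
            continue
        if seen_signal:
            parts = rest.split()
            if parts:
                symbols.append(parts[0])  # keep the symbol, drop any trailing file:line
    return symbols


sample = """\
(3) 0xc0000000520c6370 sqloDumpDiagInfoHandler
(4) 0xe0000001205c6f80 ---- Signal 16 (SIGUSR1) delivered ----
(5) 0xc0000000012f21d0 sched_yield
(6) 0xc00000004c8b8ec0 sqloSpinLockConflict
(7) 0xc00000004cd50df0 sqlplrm
"""
print(frames_below_signal(sample))
```

Grouping stack files by the first few symbols returned this way would separate, for example, EDUs spinning in sqloSpinLockConflict from those blocked in sqlbVerifyAndLatchPage.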
| Local Fix: | |
| available fix packs: | |
DB2 Version 9.7 Fix Pack 5 for Linux, UNIX, and Windows | |
| Solution | |
Problem was first fixed in Version 9.7 Fix Pack 5 | |
| Workaround | |
not known / see Local fix | |
| Timestamps | |
Date - problem reported : | 13.07.2011 |
Date - problem closed : | 21.12.2011 |
Date - last modified : | 21.12.2011 |
| Problem solved at the following versions (IBM BugInfos) | |
9.7.FP5 | |
| Problem solved according to the fixlist(s) of the following version(s) | |
| 9.7.0.5 |