DB2 - Problem description
| Problem IT03829 | Status: Closed |
DB2STOP FORCE MAY HANG ON PAGE LATCHES DUE TO AN INSTANCE TRAP WHICH DID NOT TERMINATE THE INSTANCE | |
| product: | |
DB2 FOR LUW / DB2FORLUW / A10 - DB2 | |
| Problem description: | |
This problem is specific to DB2 on the Windows operating system
platforms.
A "db2stop force" command might hang and while the instance is
hung, performing a "db2fodc -db <database name> -hang full"
would reveal pool read latches being held indefinitely :
Stack:
======================
0000000180012E26 <SQLO_SLATCH_CAS32::getConflictComplex>
<sqloLatchCAS32.C:718>
00000001800128D7 <SQLO_SLATCH_CAS32::getConflict>
<sqloLatchCAS32.C:1005>
0000000001D66504 <sqlbFindPageInBPOrSim>
<E:\db2_v97fp6\ntx64\s120629\engn\sqb\inc\sqlbslat.h:720>
0000000001D58FE4 <sqlbPurgeOrFlushAllPagesInSmallRange>
<sqlbbuffers.C:4071>
0000000001D57C7E <sqlbPurgeObject> <sqlbbuffers.C:5514>
0000000001D430FE <sqlbSMSDeleteObject> <sqlbfiles.C:2063>
0000000001EF12CA <sqldDropObj> <sqldmdrp.C:1262>
0000000001EF0975 <sqldDropTable> <sqldmdrp.C:946>
0000000001DD24D7 <sqlbPFPrefetcherEntryPoint>
<sqlbpfchr.C:1961>
0000000001DD1AFD <sqbPrefetcherEdu::RunEDU> <sqlbpfchr.C:7699>
0000000003AA32C5 <sqlzRunEDU> <sqlz_edu_obj.C:35>
00000001800E69FE <sqloEDUEntry> <sqloedu.C:3454>
Summary:
Found in 3 stacks of a total of 46 stacks ( 6.52% ) in 1 files
Found in:
92644.019.866.stack1.txt -- Pid:3660 Tid:7032 --
2013-12-09-09.41.30.813000
92644.019.866.stack1.txt -- Pid:3660 Tid:6072 --
2013-12-09-09.41.30.860000
92644.019.866.stack1.txt -- Pid:3660 Tid:3788 --
2013-12-09-09.41.30.860000
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
~~~~~~~~
and latches held
Holding latch type: (SQLO_LT_SQLB_POOL_CB__readLatch) - Address:
(000000001DD5F868), Line: 2455, File: sqlbpacc.C HoldCount: 9
Holding latch type: (SQLO_LT_SQLB_POOL_CB__readLatch) - Address:
(000000001DD5F868), Line: 2455, File: sqlbpacc.C HoldCount: 18
Holding latch type: (SQLO_LT_SQLB_POOL_CB__readLatch) - Address:
(000000001DD5F868), Line: 2455, File: sqlbpacc.C HoldCount: 19
You will also see that page cleaners share the same stack and be
held on sqloLioAIOCollect function:
Stack:
======================
00000001800E992A <sqloWaitInterrupt> <sqloedu.C:634>
000000018001C74F <sqloWaitIPCWaitPost> <sqlowaitpost.C:1979>
00000001800F3A9D <SQLO_LIO_HANDLE_DATA::sqloLioAIOCollect>
<sqlolio.C:3578>
00000001800F3678 <sqloLioCollectNBlocks> <sqlolio.C:4631>
000000000415F73E <sqlbClnrFindWork> <sqlbclnr_core.C:2633>
000000000415E981 <sqlbClnrEntryPoint> <sqlbclnr_core.C:3427>
000000000415E8BF <sqbPgClnrEdu::RunEDU> <sqlbclnr_core.C:4358>
0000000003AA32C5 <sqlzRunEDU> <sqlz_edu_obj.C:35>
00000001800E69FE <sqloEDUEntry> <sqloedu.C:3454>
This hang situation could happen if a severe error that would
require to bring down the instance to avoid further issues was
previously encountered in the AIO code. In this case DB2 might
go on processing instead of performing a "panic" emergency stop
of the instance and generate a trap. This could be identified by
the following message in db2diag.log :
2013-12-06-14.09.28.492000+000 I1723226F516 LEVEL: Severe
(OS)
PID : 3660 TID : 2052 PROC :
db2syscs.exe
INSTANCE: DB2 NODE : 000 EDUID : 2052
EDUNAME: db2aiothr
FUNCTION: DB2 UDB, oper system services,
sqloAIOCollectorEDUEntry, probe:100
MESSAGE : ZRC=0x83000070=-2097151888
CALLED : OS, -, GetQueuedCompletionStatus
OSERR : 112 "There is not enough space on the disk."
DATA #1 : String, 37 bytes
Failed in getting completion status.
This message should trigger an instance "panic" but in this case
it does not. This APAR is to have the instance "panic" in this
specific case as it should. | |
| Problem Summary: | |
**************************************************************** * USERS AFFECTED: * * all * **************************************************************** * PROBLEM DESCRIPTION: * * See Error Description * **************************************************************** * RECOMMENDATION: * * Upgrade to version 10.1 Fix Pack 5 * **************************************************************** | |
| Local Fix: | |
| Solution | |
First fixed in version 10.1 Fix Pack 5 | |
| Workaround | |
not known / see Local fix | |
| Timestamps | |
Date - problem reported : Date - problem closed : Date - last modified : | 18.08.2014 13.07.2015 13.07.2015 |
| Problem solved at the following versions (IBM BugInfos) | |
| Problem solved according to the fixlist(s) of the following version(s) | |
| 10.1.0.5 |
|