suche 36x36
Latest versionsfixlist
11.1.0.7 FixList
10.5.0.9 FixList
10.1.0.6 FixList
9.8.0.5 FixList
9.7.0.11 FixList
9.5.0.10 FixList
9.1.0.12 FixList
Have problems? - contact us.
Register for free anmeldung-x26
Contact form kontakt-x26

DB2 - Problem description

Problem IT27157 Status: Closed

NOT ABLE TO FORM HADR CONNECTION AFTER TAKEOVER WHEN PURESCALE HADR
STANDBY MEMBERS ON SAME HOST HAVE THE SAME HADR_LOCAL_SVC

product:
DB2 FOR LUW / DB2FORLUW / A50 - DB2
Problem description:
When using HADR feature in pureScale, it is possible to define
the standby cluster with multiple members running on the same
host. In such configuration, the HADR_LOCAL_HOST database
configuration parameter is the same for these members. Their
HADR_LOCAL_SVC database configuration parameter must be set to a
different port value in order to avoid conflict.

When HADR_LOCAL_SVC is set to the same value, this incorrect
configuration is not detected when the database is activated as
standby, because only one member of the standby cluster is
activated. A wrongly such configured standby can still function,
e.g. reach peer state. However, after HADR takeover, only one
member of the new primary can listen on the configured port on
that host. It would reject the connection request sent by the
new standby when it detects it is for a different member. This
unexpected rejection will cause the new standby database to
terminate. Additionally, when attempting to activate the
database on all members of the new primary cluster, some
members will fail to be activated due to the port conflict.

It is more desirable to detect the incorrect configuration and
prevent the database to be activated as the standby.

The following message in the db2diag.log of the new primary
confirms that the new primary is rejecting the standby
because it is for a different member than what the standby
wanted to be connected with.
2018-12-03-03.11.32.289579-300 I92473E622            LEVEL:
Error
PID     : 15302                TID : 140287603107584 PROC :
db2sysc 0
INSTANCE: hsjiang              NODE : 000            DB   :
HADRDB
HOSTNAME: hotellnx113
EDUID   : 112                  EDUNAME: db2hadrp.0.1 (HADRDB) 0
FUNCTION: DB2 UDB, High Availability Disaster Recovery,
hdrVerifyMembersMatch, probe:15595
MESSAGE : ZRC=0x87800140=-2021654208=HDR_ZRC_CONFIGURATION_ERROR
          "One or both databases of the HADR pair is configured
incorrectly"
DATA #1 : 
The local HADR log stream id 0 does not match the remote log
stream id 2

The following message in the db2diag.log of the new standby
confirms that the rejection cause the standby database to
terminate.
2018-12-03-03.11.32.281133-300 I175790E491           LEVEL: Info
PID     : 5264                 TID : 140135265986304 PROC :
db2sysc 0
INSTANCE: hsjiang              NODE : 000            DB   :
HADRDB
HOSTNAME: hotellnx112
EDUID   : 483                  EDUNAME: db2hadrs.2.0 (HADRDB) 0
FUNCTION: DB2 UDB, High Availability Disaster Recovery,
hdrHandleHsAck, probe:43900
DATA #1 : 
Handshake HDR_MSG_HDRREJECT message is received from
hotellnx113:32601 (9.26.121.209:32601)

2018-12-03-03.11.32.282571-300 I176812E609           LEVEL:
Error
PID     : 5264                 TID : 140135265986304 PROC :
db2sysc 0
INSTANCE: hsjiang              NODE : 000            DB   :
HADRDB
HOSTNAME: hotellnx112
EDUID   : 483                  EDUNAME: db2hadrs.2.0 (HADRDB) 0
FUNCTION: DB2 UDB, High Availability Disaster Recovery,
hdrHandleHsAck, probe:43901
MESSAGE : ZRC=0x87800140=-2021654208=HDR_ZRC_CONFIGURATION_ERROR
          "One or both databases of the HADR pair is configured
incorrectly"
DATA #1 : 
HADR handshake with hotellnx113:32601 (9.26.121.209:32601)
failed.

2018-12-03-03.11.32.298613-300 E182259E1365          LEVEL:
Severe
PID     : 5264                 TID : 140135333095168 PROC :
db2sysc 0
INSTANCE: hsjiang              NODE : 000            DB   :
HADRDB
APPHDL  : 0-115                APPID: *N0.DB2.181203081132
HOSTNAME: hotellnx112
EDUID   : 489                  EDUNAME: db2agent (HADRDB) 0
FUNCTION: DB2 UDB, data protection services,
SQLP_DBCB::setLogState, probe:5000
DATA #1 : 
Database error has been detected.  As a result, for
precautionary reasons
all logging services have been stopped.

2018-12-03-03.11.32.307417-300 I184040E572           LEVEL:
Severe
PID     : 5264                 TID : 140135333095168 PROC :
db2sysc 0
INSTANCE: hsjiang              NODE : 000            DB   :
HADRDB
APPHDL  : 0-115                APPID: *N0.DB2.181203081132
HOSTNAME: hotellnx112
EDUID   : 489                  EDUNAME: db2agent (HADRDB) 0
FUNCTION: DB2 UDB, base sys utilities,
sqeApplication::AppStopUsing, probe:7876
MESSAGE : ZRC=0xFFFFFBF6=-1034
          SQL1034C  The database was damaged, so all
applications processing the database were stopped.
Problem Summary:
****************************************************************
* USERS AFFECTED:                                              *
* pureScale HADR users                                         *
****************************************************************
* PROBLEM DESCRIPTION:                                         *
* See Error Description                                        *
****************************************************************
* RECOMMENDATION:                                              *
* See local fix                                                *
****************************************************************
Local Fix:
Configure different port for the members on the same host.
Solution
Workaround
not known / see Local fix
BUG-Tracking
forerunner  : 
follow-up : IT27252 
Timestamps
Date  - problem reported    :
Date  - problem closed      :
Date  - last modified       :
02.12.2018
09.12.2018
09.12.2018
Problem solved at the following versions (IBM BugInfos)
Problem solved according to the fixlist(s) of the following version(s)