suche 36x36
  • Admin-Scout-small-Banner
           

    CURSOR Admin-Scout

    das ultimative Tool für Informix

    pfeil  
Neueste VersionenFixList
14.10.xC10 FixList
12.10.xC16.X5 FixList
11.70.xC9.XB FixList
11.50.xC9.X2 FixList
11.10.xC3.W5 FixList
Haben Sie Probleme? - Kontaktieren Sie uns.
Kostenlos registrieren anmeldung-x26
Kontaktformular kontakt-x26

Informix - Problembeschreibung

Problem IT27476 Status: Geschlossen

SWITCH PRIMARY ROLE TO SECONDARY CAUSES SERVER CRASH

Produkt:
INFORMIX SERVER / 5725A3900 / C10 - IDS 12.10
Problembeschreibung:
In a cluster with a Primary, HDR & RSS running on IDS 12.10.FC9,
these are the steps to reproduce issue:

-force a checkpoint on the Primary
-shutdown the RSS and upgrade from FC9 to FC10
-next shutdown/upgrade the HDR Secondary
-with cluster online and HDR/RSS running FC10, run the onmode -d
make primary command on the HDR Secondary
-upgrade the old Primary to FC10
-after successful upgrade, attempt to connect the old Primary as
an HDR Secondary to the newly promoted Primary and this will end
up crashing in the process

The log of the old Primary will look like this when trying to
connect to the cluster as an HDR Secondary:

12:11:46  DR: Reservation of the last logical log for log backup
turned on
12:11:46  Data replication type and state information reset. To
start DR, use
          the 'onmode -d' command and wait for the pair to be
operational,
          before shutting down the database server

12:11:46  Warning: Invalid (non-existent/blobspace/disabled)
dbspace listed
                   in DBSPACETEMP: 'tmpdbspace'
12:11:46  Physical Recovery Started at Page (2:4781).
12:11:46  Physical Recovery Complete: 0 Pages Examined, 0 Pages
Restored.
12:11:46  Dataskip is now OFF for all dbspaces
12:11:46  Restartable Restore has been ENABLED
12:11:46  Recovery Mode
12:13:04  smx creates 1 transports to server pri_root
12:13:18  DR: Reservation of the last logical log for log backup
turned off
12:13:18  DR: new type = secondary, primary server name =
sec_root
12:13:18  DR: Trying to connect to primary server = sec_root
12:13:20  DR: Secondary server connected
12:13:20  DR: Secondary server needs failure recovery

12:13:20  DR: Failure recovery from disk in progress ...
12:13:20  Logical Recovery Started.
12:13:20  11 recovery worker threads will be started.
12:13:21  Warning: Invalid (non-existent/blobspace/disabled)
dbspace listed
                   in DBSPACETEMP: 'tmpdbspace'
12:13:21  Start Logical Recovery - Start Log 89, End Log ?
12:13:21  Starting Log Position - 89 0xaf0018
12:13:21  DR: Cleared 542 KB of logical log in 0 seconds.
12:13:21  Warning: Some logical logs are not backed up. Do not
         proceed. The secondary server might hang or the logs
might be
         lost.

12:13:21  Defragmenter cleaner thread now running
12:13:21  Defragmenter cleaner thread cleaned:0 partitions
12:13:23  DR_ERR set to -25
12:13:23  Assert Failed: DR: Log Record Apply Thread Exited
Abnormally. Internal Error.
              A restart of the database server shall be required
to correct
              this problem.

12:13:23  IBM Informix Dynamic Server Version 12.10.FC10
12:13:23   Who: Session(34, informix@, 0, 0x45142bc8)
                Thread(42, dr_secapply, 451024c8, 1)
                File: hdr/hdrApply.c Line: 580
12:13:23   Results: Dynamic Server must abort
12:13:23   Action: Reinitialize shared memory
12:13:23  stack trace for pid 25248 written to
/opt/informix/12.10.FC10/tmp/af.412bf43
12:13:24   See Also: /opt/informix/12.10.FC10/tmp/af.412bf43
12:13:27  Thread ID 42 will NOT be suspended because
          it is a daemon.
12:13:27   See Also: /opt/informix/12.10.FC10/tmp/af.412bf43,
shmem.412bf43.0
12:13:28  Starting crash time check of:
12:13:28  1. memory block headers
12:13:28  2. stacks
12:13:28  Crash time checking found no problems
12:13:28  hdr/hdrApply.c, line 580, thread 42, proc id 25248,
DR: Log Record Apply Thread Exited Abnormally. Internal Error.
              A restart of the database server shall be required
to correct
              this problem.
.
12:13:28  The Master Daemon Died
12:13:28  PANIC: Attempting to bring system down
12:16:06  Parameter's user-configured value was adjusted.
(DS_MAX_SCANS)
12:16:06  Parameter's user-configured value was adjusted.
(ONLIDX_MAXMEM)
12:16:06  IBM Informix Dynamic Server Started.
12:16:06  Requested shared memory segment size rounded from
8308KB to 8840KB


Subsequent restarts will allow the new HDR to connect to the
cluster for a short time, but will crash again with a log replay
error.
Initial crash will have this stack:
12:13:23  shmcb =           0x000000004401a418
12:13:23  rhead =           0x0000000044086800
12:13:23  pool list =       0x000000004401a4f0
12:13:23  block pool list = 0x00000000440810f8
12:13:23  TRANSP =          0x0000000045142bc8
12:13:23  PARTP =           0x0000000000000000
12:13:23  PARTNP =          0x0000000000000000
12:13:23  OPENP =           0x0000000000000000
12:13:23  FILEP =           0x0000000000000000
12:13:23  Raw hex dump of stack located in
/opt/informix/12.10.FC10/tmp/af.412bf43.rawstk
12:13:23  Stack for thread: 42 dr_secapply

 base: 0x000000004677e000
  len:   69632
   pc: 0x00000000013f01a9
  tos: 0x000000004678de80
state: running
   vp: 1

(oninit) afstack
(oninit) afhandler
(oninit) afcrash_interface
(oninit) dr_secondary_apply
(oninit) startup



  siginfo:
Problem-Zusammenfassung:
****************************************************************
* USERS AFFECTED:                                              *
* Users of Informix Server prior to 12.10.xC11.                *
****************************************************************
* PROBLEM DESCRIPTION:                                         *
* SWITCH PRIMARY ROLE TO SECONDARY CAUSES SERVER CRASH         *
****************************************************************
* RECOMMENDATION:                                              *
* Upgrade to Informix Server 12.10.xC11+.                      *
****************************************************************
The fix for this APAR corrects the error message in server
message log file and assertion file to show more details on the
problem and solution details.


Log message before this fix:
14:32:21  Start Logical Recovery - Start Log 67, End Log ?
14:32:21  Starting Log Position - 67 0xa2b018
14:32:21  DR: Cleared 4792 KB of logical log in 0 seconds.
14:32:21  Warning: Some logical logs are not backed up. Do not
         proceed. The secondary server might hang or the logs
might be
         lost.

14:32:22  Defragmenter cleaner thread now running
14:32:22  Defragmenter cleaner thread cleaned:0 partitions
14:32:22  DR_ERR set to -25
14:32:22  Assert Failed: DR: Log Record Apply Thread Exited
Abnormally. Internal Error.
              A restart of the database server shall be required
to correct
              this problem.

14:32:22  IBM Informix Dynamic Server Version 12.10.F
14:32:22   Who: Session(31, informix@, 0, 0x44d1ba98)
                Thread(42, dr_secapply, 44cd7bd0, 1)
                File: hdr/hdrApply.c Line: 580
14:32:22   Results: Dynamic Server must abort
14:32:22   Action: Reinitialize shared memory
14:32:22  stack trace for pid 12064 written to
/work/.../af.412ed56
14:32:22   See Also: /work/.../af.412ed56
14:32:24  Thread ID 42 will NOT be suspended because
          it is a daemon.
14:32:24   See Also: /work/.../af.412ed56
14:32:24  Starting crash time check of:
14:32:24  1. memory block headers
14:32:24  2. stacks
14:32:24  Crash time checking found no problems
14:32:24  hdr/hdrApply.c, line 580, thread 42, proc id 12064,
DR: Log Record Apply Thread Exited Abnormally. Internal Error.
              A restart of the database server shall be required
to correct
              this problem.
.
14:32:24  The Master Daemon Died
14:32:24  PANIC: Attempting to bring system down



After this:
=========

11:12:12  DR_ERR set to -25
11:12:12  Assert Failed: Some logical logs are not backed up.
The secondary server might hang or the
 logs might be lost.
Primary server was started with the 'oninit -PHY' command and
then it was
 attempted to be converted into a secondary server.
If you want to back up the logs, restart the primary server as a
standard
 server and then back up the logs.
If you want to ignore the logs, restart the server with the
'oninit -PHY
 --forget_logs' command.

11:12:12  HCL Informix Dynamic Server Version 12.10.F

11:12:12   Who: Session(52, informix@, 0, 0x44d19e08)
                Thread(63, dr_secapply, 44cd64f8, 1)
                File: hdr/hdrApply.c Line: 586
11:12:12   Results: Dynamic Server must abort
11:12:12   Action: Reinitialize shared memory
11:12:12  stack trace for pid 11150 written to
/work/.../af.427616c
11:12:12   See Also: /work/.../af.427616c

*** WARNING: HCL Informix Dynamic Server is no longer running.
***


This provides a better explanation in the online.log and can be
worked around using the command oninit -PHY --forget_logs
Local-Fix:
Lösung
Workaround
keiner bekannt / siehe Local-Fix
Weitere Daten
Datum - Problem gemeldet    :
Datum - Problem geschlossen :
Datum - der letzten Änderung:
21.12.2018
04.12.2019
04.12.2019
Problem behoben ab folgender Versionen (IBM BugInfos)
12.10.F0,
12.10.xC11
Problem behoben lt. FixList in der Version
Informix EditionenInformix Editionen
Informix Editionen
DokumentationDokumentation
Dokumentation
IBM NewsletterIBM Newsletter
IBM Newsletter
Current BugsCurrent Bugs
Current Bugs
Bug ResearchBug Research
Bug Research
Bug FixlistsBug Fixlists
Bug Fixlists
Release NotesRelease Notes
Release Notes
Machine NotesMachine Notes
Machine Notes
Release NewsRelease News
Release News
Product LifecycleProduct Lifecycle
Lifecycle
Media DownloadMedia Download
Media Download