Support Article

Unable to restart node

SA-26045

Summary



On attempting to restart a multinode cluster environment, some of the nodes did not restart.

Error Messages



[**Date, time, and time zone**] 000000bb ThreadMonitor W WSVR0605W: Thread "<your_hostname>" (0000008d) has been active
for 696797 milliseconds and may be hung. There is/are 1 thread(s) in total in the server that may be hung.
at sun.misc.Unsafe.park(Native Method)
at java.util.concurrent.locks.LockSupport.park(LockSupport.java:198)
at java.util.concurrent.locks.AbstractQueuedSynchronizer.parkAndCheckInterrupt(AbstractQueuedSynchronizer.java:846)
at java.util.concurrent.locks.AbstractQueuedSynchronizer.doAcquireSharedInterruptibly(AbstractQueuedSynchronizer.java:1006)
.
.
.

Steps to Reproduce

  1. Set up a multinode-Dnode cluster.
  2. Set up a Visual Business Director (VBD) cluster with two nodes.
  3. Restart all nodes.

Root Cause



A defect in Pegasystems’ code or rules. A deadlock can occur when starting multiple VBD nodes concurrently. Two threads compete for a Hazelcast distributed lock when persistence is initialized.

Resolution



Apply HFix-28668.

Published July 26, 2016 - Updated August 5, 2016

Have a question? Get answers now.

Visit the Collaboration Center to ask questions, engage in discussions, share ideas, and help others.