Support Article
Unable to restart node
SA-26045
Summary
When all nodes in a multinode cluster environment are restarted, some of the nodes fail to restart.
Error Messages
[**Date, time, and time zone**] 000000bb ThreadMonitor W WSVR0605W: Thread "<your_hostname>" (0000008d) has been active
for 696797 milliseconds and may be hung. There is/are 1 thread(s) in total in the server that may be hung.
at sun.misc.Unsafe.park(Native Method)
at java.util.concurrent.locks.LockSupport.park(LockSupport.java:198)
at java.util.concurrent.locks.AbstractQueuedSynchronizer.parkAndCheckInterrupt(AbstractQueuedSynchronizer.java:846)
at java.util.concurrent.locks.AbstractQueuedSynchronizer.doAcquireSharedInterruptibly(AbstractQueuedSynchronizer.java:1006)
.
.
.
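When triaging several nodes, it can help to pull the thread name, thread ID, and active time out of each WSVR0605W message. The following is a minimal sketch, assuming the message text has the shape shown above. The class name `HungThreadLogParser` and its methods are hypothetical helpers written for illustration and are not part of any product API.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class HungThreadLogParser {
    // Matches the WSVR0605W text shown in the error message above.
    // \s+ is used between words so that a message wrapped over two lines still matches.
    private static final Pattern HUNG = Pattern.compile(
        "WSVR0605W: Thread \"([^\"]*)\" \\(([0-9a-fA-F]+)\\)"
        + "\\s+has been active\\s+for\\s+(\\d+)\\s+milliseconds");

    /** Returns the thread name, or null if the text is not a WSVR0605W message. */
    static String threadName(String logText) {
        Matcher m = HUNG.matcher(logText);
        return m.find() ? m.group(1) : null;
    }

    /** Returns the hexadecimal thread ID, or null if the text does not match. */
    static String threadId(String logText) {
        Matcher m = HUNG.matcher(logText);
        return m.find() ? m.group(2) : null;
    }

    /** Returns the reported active time in milliseconds, or -1 if the text does not match. */
    static long activeMillis(String logText) {
        Matcher m = HUNG.matcher(logText);
        return m.find() ? Long.parseLong(m.group(3)) : -1L;
    }

    public static void main(String[] args) {
        String sample =
            "000000bb ThreadMonitor W WSVR0605W: Thread \"<your_hostname>\" (0000008d) has been active\n"
            + "for 696797 milliseconds and may be hung.";
        System.out.println(threadName(sample) + " " + threadId(sample) + " " + activeMillis(sample));
    }
}
```

For the message above, the reported active time of 696797 milliseconds is roughly 11.6 minutes.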
Steps to Reproduce
- Set up a multinode DNode cluster.
- Set up a Visual Business Director (VBD) cluster with two nodes.
- Restart all nodes.
Root Cause
The issue is caused by a defect in Pegasystems’ code or rules. A deadlock can occur when multiple VBD nodes start concurrently, because two threads compete for a Hazelcast distributed lock while persistence is being initialized.
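The stack trace shows a thread parked in an untimed acquire, which is why the thread stays blocked until the monitor flags it. The general alternative is a timed acquire, which gives up after a bound instead of parking indefinitely. The following is a minimal sketch of that pattern only. It uses a local `java.util.concurrent.locks.ReentrantLock` as a stand-in for the distributed lock, does not use the Hazelcast API, and is not the content of the hotfix. The class name `TimedLockSketch` and its methods are hypothetical.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.locks.ReentrantLock;

class TimedLockSketch {
    // Local stand-in for a cluster-wide lock (assumption for illustration).
    private final ReentrantLock lock = new ReentrantLock();

    /** Runs init while holding the lock; returns false if the lock is not obtained in time. */
    boolean initWithTimeout(Runnable init, long timeoutMillis) {
        try {
            if (!lock.tryLock(timeoutMillis, TimeUnit.MILLISECONDS)) {
                return false; // caller can log and retry instead of waiting forever
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        }
        try {
            init.run();
            return true;
        } finally {
            lock.unlock();
        }
    }

    private static void awaitQuietly(CountDownLatch latch) {
        try {
            latch.await();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
    }

    /** Returns {result while another thread holds the lock, result after it is released}. */
    static boolean[] demo() {
        TimedLockSketch sketch = new TimedLockSketch();
        CountDownLatch held = new CountDownLatch(1);
        CountDownLatch release = new CountDownLatch(1);
        // The holder thread takes the lock and keeps it until told to release.
        Thread holder = new Thread(() -> sketch.initWithTimeout(() -> {
            held.countDown();
            awaitQuietly(release);
        }, 1000));
        holder.start();
        awaitQuietly(held);
        boolean whileHeld = sketch.initWithTimeout(() -> { }, 100);
        release.countDown();
        try {
            holder.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        boolean afterRelease = sketch.initWithTimeout(() -> { }, 100);
        return new boolean[] { whileHeld, afterRelease };
    }

    public static void main(String[] args) {
        boolean[] r = demo();
        System.out.println(r[0] + " " + r[1]); // prints "false true"
    }
}
```

In the demo, the attempt made while the second thread holds the lock times out and returns `false`, and the attempt made after the lock is released succeeds.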
Resolution
Apply HFix-28668.
Published August 5, 2016 - Updated October 8, 2020