7 Replies Latest reply on Mar 2, 2016 5:47 AM by Saifuddin Rangwala

    AppServer instance goes down as it fails to send Heartbeat

    Santhosh Kurimilla

      Bill/Folks,

       

      We have multiple job servers, each configured with 3 instances. One or another of these instances goes down frequently because it is unable to update its heartbeat on the DB server. From the logs, we noticed that an instance seems to go down 5 minutes after the [Large Object Cleanup Task] starts.

       

      Can you please confirm what this task is, and how it is causing the heartbeat failure?

       

      [29 Feb 2016 06:30:09,870] [Scheduled-System-Tasks-Thread-7] [INFO] [System:System:] [Large Object Cleanup Task] Starting large object cleanup task

      ........

      ........

      [29 Feb 2016 06:37:43,080] [Scheduled-System-Tasks-Thread-11] [ERROR] [System:System:] [App Server Heartbeat] java.sql.SQLTimeoutException: ORA-01013: user requested cancel of current operation

       

      com.bladelogic.om.infra.mfw.util.BlException: java.sql.SQLTimeoutException: ORA-01013: user requested cancel of current operation
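
To check whether the heartbeat failures consistently follow the cleanup task, the gap between the two events can be measured across the appserver logs. Below is a minimal Python sketch, assuming each log entry sits on a single line in the layout shown above ([timestamp] [thread] [level] [user] [task] message). The names `parse_line` and `gaps_after_cleanup` are illustrative only and are not part of BladeLogic.

```python
import re
from datetime import datetime

# Assumed entry layout: [timestamp] [thread] [level] [user] [task] message
LOG_LINE = re.compile(
    r"\[(?P<ts>[^\]]+)\] \[(?P<thread>[^\]]+)\] \[(?P<level>[^\]]+)\] "
    r"\[(?P<user>[^\]]*)\] \[(?P<task>[^\]]+)\] (?P<msg>.*)"
)


def parse_line(line):
    """Return the fields of one log entry as a dict, or None if it does not match."""
    m = LOG_LINE.match(line.strip())
    if m is None:
        return None
    rec = m.groupdict()
    # Timestamps look like "29 Feb 2016 06:30:09,870" (milliseconds after the comma).
    rec["ts"] = datetime.strptime(rec["ts"], "%d %b %Y %H:%M:%S,%f")
    return rec


def gaps_after_cleanup(lines):
    """Seconds from the most recent cleanup start to each heartbeat ERROR entry."""
    gaps = []
    last_cleanup = None
    for line in lines:
        rec = parse_line(line)
        if rec is None:
            continue  # stack-trace lines, elided lines, etc.
        if rec["task"] == "Large Object Cleanup Task" and "Starting" in rec["msg"]:
            last_cleanup = rec["ts"]
        elif (rec["task"] == "App Server Heartbeat"
              and rec["level"] == "ERROR"
              and last_cleanup is not None):
            gaps.append((rec["ts"] - last_cleanup).total_seconds())
    return gaps


# The two entries quoted in this post, joined onto single lines.
sample = [
    "[29 Feb 2016 06:30:09,870] [Scheduled-System-Tasks-Thread-7] [INFO] "
    "[System:System:] [Large Object Cleanup Task] Starting large object cleanup task",
    "[29 Feb 2016 06:37:43,080] [Scheduled-System-Tasks-Thread-11] [ERROR] "
    "[System:System:] [App Server Heartbeat] java.sql.SQLTimeoutException: "
    "ORA-01013: user requested cancel of current operation",
]

if __name__ == "__main__":
    for gap in gaps_after_cleanup(sample):
        print(f"heartbeat error {gap:.2f} s after cleanup start")
```

Running this over the full log of each instance would show whether the gap is roughly constant, which would support the suspicion that the cleanup task is involved rather than the two events coinciding by chance.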