Support Questions

Find answers, ask questions, and share your expertise

Hive LLAP cant start

avatar
Explorer

when i try to restart hive llap get this error

 

```

Traceback (most recent call last):
  File "/usr/lib/ambari-agent/lib/resource_management/libraries/script/script.py", line 992, in restart
    self.status(env)
  File "/var/lib/ambari-agent/cache/common-services/HIVE/3.1.0/package/scripts/hive_server_interactive.py", line 159, in status
    check_process_status(status_params.hive_interactive_pid)
  File "/usr/lib/ambari-agent/lib/resource_management/libraries/functions/check_process_status.py", line 43, in check_process_status
    raise ComponentIsNotRunning()
ComponentIsNotRunning

The above exception was the cause of the following exception:

Traceback (most recent call last):
  File "/var/lib/ambari-agent/cache/common-services/HIVE/3.1.0/package/scripts/hive_server_interactive.py", line 550, in <module>
    HiveServerInteractive().execute()
  File "/usr/lib/ambari-agent/lib/resource_management/libraries/script/script.py", line 351, in execute
    method(env)
  File "/usr/lib/ambari-agent/lib/resource_management/libraries/script/script.py", line 1003, in restart
    self.start(env, upgrade_type=upgrade_type)
  File "/var/lib/ambari-agent/cache/common-services/HIVE/3.1.0/package/scripts/hive_server_interactive.py", line 99, in start
    raise Fail("Skipping START of Hive Server Interactive since LLAP app couldn't be STARTED.")
resource_management.core.exceptions.Fail: Skipping START of Hive Server Interactive since LLAP app couldn't be STARTED.

```

anyone can help me to solve this ? thank you

1 REPLY 1

avatar
Master Collaborator

The error indicates that the Hive Server Interactive (HSI) component is failing to start because the LLAP (Live Long and Process) app associated with it couldn't be started. To troubleshoot and resolve this issue, you can follow these general steps:

  • Check LLAP Log Files:

    • Look into the LLAP log files for more detailed error messages. These log files are typically located in a directory like /var/log/hive or a custom location configured in your environment.
    • Examine the LLAP logs to identify any specific errors or issues that are preventing LLAP from starting.
  1. Verify LLAP Configuration:

    • Check the LLAP configuration settings, including memory configurations, queue configurations, and other LLAP-specific parameters. Ensure that the configuration is correct and appropriate for your cluster resources.
    • Verify that there are no typos or errors in the LLAP configuration files .
  2. Check Resource Availability:

    • Ensure that there are sufficient resources (memory, CPU, etc.) available on the nodes where LLAP is supposed to run.
    • Verify that LLAP is not competing for resources with other applications or services on the cluster.
  3. Check Hive Server Interactive Configuration:

    • Review the configuration settings for Hive Server Interactive. Verify that the LLAP configuration is correctly specified in the Hive Server Interactive configurations.
    • Ensure that the LLAP application name, number of instances, and other LLAP-related settings are accurate.
  4. Examine System Logs:

    • Check the system logs on the nodes where LLAP is supposed to run. Look for any system-level issues or errors that might be affecting LLAP startup.
  5. Restart LLAP Manually:

    • If LLAP fails to start during Hive Server Interactive startup, consider manually starting LLAP to see if you can get more detailed error messages. You can use commands like hive --service llap --start or the Ambari UI to start LLAP separately.
  6. Check for LLAP Process:

    • After trying to start LLAP manually, check if the LLAP process is running. You can use tools like ps or jps to see if the LLAP daemon process is running on the expected nodes.
  7. Review Ambari Alerts:

    • Check the Ambari Alerts for any warnings or errors related to Hive Server Interactive or LLAP. Ambari often provides helpful alerts and diagnostics.

If the LLAP process is still not starting, the detailed logs and error messages should provide more insights into the root cause of the issue. Addressing the specific error or misconfiguration mentioned in the logs will be crucial in resolving the problem