2025-09-14 21:52:36 +00:00
|
|
|
|
# 【E21-OLAP】E现场定位BOL-IGW Hbase down OLAP HOS Services Down 告警原因
|
|
|
|
|
|
|
|
|
|
|
|
| ID | Creation Date | Assignee | Status |
|
|
|
|
|
|
|----|----------------|----------|--------|
|
|
|
|
|
|
| OMPUB-351 | 2022-02-10T03:05:46.000+0800 | 戚岱杰 | 已关闭 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
---
|
|
|
|
|
|
|
|
|
|
|
|
定位排查nezha告警信息,找出最终处理方案。
|
|
|
|
|
|
BOL-IGW Hbase down
|
|
|
|
|
|
BOL-IGW OLAP HOS Services Down
|
|
|
|
|
|
**qidaijie** commented on *2022-02-10T10:36:48.307+0800*:
|
|
|
|
|
|
|
|
|
|
|
|
上传排查时截图。
|
|
|
|
|
|
# zookeeper处理请求存在一定延迟,但未超过HBase配置的最长时间。
|
|
|
|
|
|
# OLAP HOS Services down是由HBase宕引发的,不需要处理;HBase恢复后会自动恢复正常。
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
---
|
|
|
|
|
|
|
|
|
|
|
|
**qidaijie** commented on *2022-02-23T11:41:36.481+0800*:
|
|
|
|
|
|
|
|
|
|
|
|
出现HBase down有两个原因:
|
|
|
|
|
|
# HBase与Zookeeper会话连接超时,HBase自身机制导致重启;由于单机环境资源不足,Zookeeper处理延迟偏高,因此暂时没有较好的优化方案。
|
|
|
|
|
|
# 容器出现退出未自动拉起的情况;因修复过程中会更换容器,导致无法查询到旧容器行为日志,同时信息港暂未复现。若现场后续再出现该问题,需及时联系进行记录排查。
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
2025-09-14 22:26:17 +00:00
|
|
|
|
# Attachments
|
2025-09-14 21:52:36 +00:00
|
|
|
|
|
2025-09-14 22:26:17 +00:00
|
|
|
|
Attachment: hbase-conf.png
|
2025-09-14 22:27:11 +00:00
|
|
|
|
|
2025-09-14 22:26:17 +00:00
|
|
|
|

|
2025-09-14 21:52:36 +00:00
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
2025-09-14 22:26:17 +00:00
|
|
|
|
Attachment: hbase-log.png
|
2025-09-14 22:27:11 +00:00
|
|
|
|
|
2025-09-14 22:26:17 +00:00
|
|
|
|

|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Attachment: zk延迟.png
|
2025-09-14 22:27:11 +00:00
|
|
|
|
|
2025-09-14 22:26:17 +00:00
|
|
|
|

|
2025-09-14 21:52:36 +00:00
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|