# VM 192.168.44.3 froze around 18:20 on November 30
| ID | Creation Date | Assignee | Status |
|----|----------------|----------|--------|
| OMPUB-1072 | 2023-12-04T11:19:56.000+0800 | 雷军 | Resolved |
---
The VM recovered after it was restarted manually.
Please help investigate the cause.
---
**leijun** commented on *2023-12-04T17:06:47.358+0800*:
Around 18:20 on November 30, Xshell could not connect to the 192.168.44.3 VM. The 44.3 VM was stopped and then started again through Proxmox VE.
Below is the memory usage observed before 16:00:
!image-2023-12-04-17-01-07-825.png|width=694,height=263!
 
 
---
**xubotao** commented on *2023-12-05T09:32:07.423+0800*:
The system logs, security logs, boot logs, kernel logs, and service logs for that day were reviewed, and no cause for the hang was found.
---
**xubotao** commented on *2023-12-05T11:42:26.298+0800*:
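The following checks found conditions that may have caused 44.3 to hang or crash:
1. Memory usage on host 141 is too high. Excluding buff/cache, memory is essentially exhausted. The total memory allocated to the VMs running on this host already exceeds the host's physical memory, so fluctuating resource usage while the VMs are under test could cause a VM to crash.
2. I/O utilization on the 44.3 VM is essentially saturated. The resulting high disk I/O load may cause latency or a crash.
3. The number of network connections on the 44.3 VM is growing and the network node load is rising, which may cause degraded performance, latency, or a crash.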
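
The overcommit concern in point 1 can be expressed as a simple ratio of allocated VM memory to host physical memory. The following is a minimal sketch, not something run in this ticket: the function name `overcommit_ratio` and the VM names `vm-a`, `vm-b`, `vm-c` with their sizes are hypothetical placeholders. Only the 512G host size and the 128G allocation for 44.3 are taken from later comments in this ticket.

```python
def overcommit_ratio(host_physical_g, vm_allocated_g):
    """Return total memory allocated to VMs divided by host physical memory.

    A result above 1.0 means the host is overcommitted: if every VM used
    its full allocation at once, the host could not back all of it with RAM.
    """
    if host_physical_g <= 0:
        raise ValueError("host_physical_g must be positive")
    return sum(vm_allocated_g.values()) / host_physical_g


# 512 (host) and 128 (44.3) come from the ticket comments.
# The other three entries are hypothetical, for illustration only.
allocations_g = {"44.3": 128, "vm-a": 128, "vm-b": 256, "vm-c": 128}

ratio = overcommit_ratio(512, allocations_g)
print(f"allocated/physical = {ratio:.2f}")
if ratio > 1.0:
    print("host memory is overcommitted")
```

With real data, the allocation map would be filled from the configured memory of each VM on the host rather than from placeholder values.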
 
Memory usage and allocation on host 141
!image-2023-12-05-11-39-11-323.png!
!image-2023-12-05-11-42-17-560.png!
44.3 I/O utilization
!image-2023-12-05-11-39-32-258.png!
 
44.3 network connection count
!image-2023-12-05-11-40-26-255.png!
 
44.3 network node load
!image-2023-12-05-11-40-44-908.png!
 
 
---
**niuxiang** commented on *2023-12-15T16:52:42.246+0800*:
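At 15:48 on December 15, 2023, a crash occurred, as shown in the screenshot. Viewed from the PVE host, memory usage for the whole machine was close to 100%. The per-process memory usage, sorted in descending order, is shown in the screenshot. The VMs using the most memory are 44.3 and 44.17, each at about 25%. !image-2023-12-15-16-51-09-078.png!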
!image-2023-12-15-16-51-22-664.png!

|VID|VNAME|VM process memory usage as seen on the PVE host (%)|
|---|---|---|
|102|zhangwei-192.168.44.3-long|25.5|
|118|dongxiaoyan-192.168.44.17-long|25.4|
|104|zhangwei-192.168.44.5-long|12.7|
|261|doufenghu-192.168.44.14-long|9.5|
|134|duandongmei-192.168.44.136|6.3|
|133|yangwei-dpi|5.6|
|106|dongxiaoyan-192.168.40.6-win-long|3.2|
|103|zhangwei-192.168.44.4-long|3.1|
|109|dongxiaoyan-192.168.44.9-long|2.8|
|149|luwenpeng-192.168.44.128-long|2.6|
|153|duandongmei-192.168.44.137|2.4|
!image-2023-12-15-16-52-40-908.png!
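
As a quick cross-check of the statement that host memory was close to 100% used, the per-VM shares from the table above can be summed. This is an illustrative sketch only. The values are copied from the table, and the variable names are made up for the example.

```python
# Memory share (%) of each VM process on the PVE host, keyed by VID,
# copied from the table in this comment.
vm_mem_percent = {
    102: 25.5,
    118: 25.4,
    104: 12.7,
    261: 9.5,
    134: 6.3,
    133: 5.6,
    106: 3.2,
    103: 3.1,
    109: 2.8,
    149: 2.6,
    153: 2.4,
}

# Total share of host memory held by the listed VM processes.
total_percent = sum(vm_mem_percent.values())

# The two VIDs with the largest share.
top_two = sorted(vm_mem_percent, key=vm_mem_percent.get, reverse=True)[:2]

print(f"listed VMs account for {total_percent:.1f}% of host memory")
print("largest consumers (VID):", top_two)
```

The listed VMs add up to about 99.1%, which agrees with the near-100% usage reported for the host, and VIDs 102 (44.3) and 118 (44.17) are the two largest consumers.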
---
**liuyang** commented on *2023-12-18T09:44:07.168+0800*:
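* The host has 512G of memory. Viewed from the host, 44.3 uses 25% of it, i.e. 128G.
* The 192.168.44.3 VM is allocated 128G of memory. According to NZ monitoring, 44.3 was using about 40% of it before the freeze, i.e. less than 60G.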
!image-2023-12-18-09-46-58-427.png|thumbnail!
[~niuxiang] The two memory figures above for 44.3 are inconsistent. Could you help confirm 44.3's memory usage again, using system logs or other information?
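
The size of the discrepancy can be worked out from the figures quoted above. The sketch below only restates that arithmetic. It does not determine which figure is correct, and the ticket does not settle that question. The variable names are illustrative.

```python
HOST_MEM_G = 512      # host physical memory, from the first bullet
VM_ALLOC_G = 128      # memory allocated to 44.3, from the second bullet

host_share = 0.25     # 44.3's share of host memory as seen on the PVE host
guest_share = 0.40    # approximate in-guest usage reported by NZ monitoring

# Memory attributed to 44.3 from each vantage point.
host_view_g = host_share * HOST_MEM_G      # measured against the host total
guest_view_g = guest_share * VM_ALLOC_G    # measured against the VM allocation

gap_g = host_view_g - guest_view_g

print(f"host view:  {host_view_g:.1f}G")
print(f"guest view: {guest_view_g:.1f}G")
print(f"difference: {gap_g:.1f}G")
```

The host-side figure equals the VM's full 128G allocation, while the in-guest figure is roughly 51G, so the two views differ by about 77G.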
---
**leijun** commented on *2024-01-03T19:10:52.548+0800*:
Around 18:50 on 2024/01/03, the 192.168.44.3 VM froze.
* According to NZ monitoring, before the freeze 44.3 was using 36% of its memory and about 51% of its CPU.
!image-2024-01-03-18-55-19-559.png|width=625,height=321!  
44.3 VM console output
!image-2024-01-03-18-58-42-358.png|width=608,height=456!
---
**leijun** commented on *2024-07-08T09:36:55.104+0800*:
After the 44.3 VM was migrated to a new physical host, the problem has not recurred.
---
## Attachments
**47616/image-2023-12-04-17-01-07-825.png**
---
**47626/image-2023-12-05-11-39-11-323.png**
---
**47627/image-2023-12-05-11-39-32-258.png**
---
**47628/image-2023-12-05-11-40-26-255.png**
---
**47629/image-2023-12-05-11-40-44-908.png**
---
**47630/image-2023-12-05-11-42-17-560.png**
---
**47704/image-2023-12-15-16-51-09-078.png**
---
**47705/image-2023-12-15-16-51-22-664.png**
---
**47706/image-2023-12-15-16-52-40-908.png**
---
**47711/image-2023-12-18-09-46-58-427.png**
---
**49380/image-2024-01-03-18-55-19-559.png**
---
**49382/image-2024-01-03-18-58-42-358.png**
---