Files
geedge-jira/md/OSS-242.md
2025-09-14 21:52:36 +00:00

77 lines
1.6 KiB
Markdown
Raw Blame History

This file contains invisible Unicode characters

This file contains invisible Unicode characters that are indistinguishable to humans but may be processed differently by a computer. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

# 视频指纹采集-程思源
| ID | Creation Date | Assignee | Status |
|----|----------------|----------|--------|
| OSS-242 | 2022-07-01T13:56:15.000+0800 | 程思源 | 完成 |
---
1、根据视频采集文档进行每个主题采100个视频
2、操作文档下载地址
[https://files.geedge.net/f/c92a22c77c8b47d9b696/]
3、采集主题
clawer.get_url("xuexi","https://www.youtube.com/channel/UCtFRv9O2AHqOZjjynzrv-xg")
clawer.get_url("tiyu","https://www.youtube.com/channel/UCEgdi0XIXXZ-qJOFPf4JSKw")
 
 **chengsiyuan** commented on *2022-07-04T23:13:18.707+0800*:
1.目前chrome版本为103.0.5060.66最新103版本的chromedriver为103.0.5060.53,执行脚本偶尔会报错:
!image-2022-07-04-18-02-25-773.png!
2.上述无报错时执行脚本后会正常访问chrome、获取pcap包、获取url但是record_path会报错
!image-2022-07-04-18-10-19-787.png!
 
---
**chengsiyuan** commented on *2022-07-05T22:41:53.811+0800*:
经过调试现可以正常运行脚本但有时会出现元素定位错误导致程序停止尝试用xpath定位同样报错
!image-2022-07-05-17-41-08-092.png!
---
**chengsiyuan** commented on *2022-07-18T16:14:29.205+0800*:
因tiyu主题无法跳转更换成game主题clawer.get_url("game","https://www.youtube.com/gaming/trending")
已将115个xuexi主题pcap包上传
已将108个game主题pcap包上传
---
## Attachments
**29282/image-2022-07-04-18-02-25-773.png**
---
**29283/image-2022-07-04-18-10-19-787.png**
---
**29304/image-2022-07-05-17-41-08-092.png**
---