geedge-jira/md/OSS-242.md
2025-09-14 21:52:36 +00:00


Video fingerprint collection - 程思源

| ID | Creation Date | Assignee | Status |
| --- | --- | --- | --- |
| OSS-242 | 2022-07-01T13:56:15.000+0800 | 程思源 | Done |

1. Following the video collection document, collect 100 videos for each topic.

2. Download link for the operation document:

[https://files.geedge.net/f/c92a22c77c8b47d9b696/]

3. Collection topics:

clawer.get_url("xuexi","https://www.youtube.com/channel/UCtFRv9O2AHqOZjjynzrv-xg")

clawer.get_url("tiyu","https://www.youtube.com/channel/UCEgdi0XIXXZ-qJOFPf4JSKw")

   chengsiyuan commented on 2022-07-04T23:13:18.707+0800:

1. The current Chrome version is 103.0.5060.66, while the latest 103-series chromedriver is 103.0.5060.53. Running the script occasionally raises an error:

!image-2022-07-04-18-02-25-773.png!
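To rule out the version difference noted above as a cause, the two version strings can be compared programmatically. The sketch below assumes the commonly documented ChromeDriver rule that a driver supports a Chrome build when the first three version components (major, minor, build) match and only the final patch number differs. The function names are hypothetical and are not part of the collection script.

```python
from typing import Tuple


def parse_version(version: str) -> Tuple[int, ...]:
    """Split a dotted version string such as '103.0.5060.66' into integers."""
    return tuple(int(part) for part in version.strip().split("."))


def same_build(browser_version: str, driver_version: str) -> bool:
    """Return True when major, minor, and build numbers match.

    Assumption: only the last (patch) component is allowed to differ.
    """
    return parse_version(browser_version)[:3] == parse_version(driver_version)[:3]


# Versions reported in this ticket.
chrome = "103.0.5060.66"
chromedriver = "103.0.5060.53"
compatible = same_build(chrome, chromedriver)
print(f"Chrome {chrome} / chromedriver {chromedriver} compatible: {compatible}")
```

Under that assumption the two versions in this ticket share the build 103.0.5060, so the intermittent error may have a different cause, such as page timing.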

2. When the error above does not occur, the script opens Chrome, captures the pcap, and obtains the URL as expected, but record_path raises an error:

!image-2022-07-04-18-10-19-787.png!

 


chengsiyuan commented on 2022-07-05T22:41:53.811+0800:

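After debugging, the script now runs normally, but an element-location error sometimes occurs and stops the program. Locating the element by XPath instead raises the same error.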

!image-2022-07-05-17-41-08-092.png!
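Intermittent element-location failures on dynamically loaded pages are often timing-related, so one possible mitigation is to retry the lookup a few times before giving up. The following is a generic sketch, not the script's actual code: `retry` and `flaky_lookup` are hypothetical names, and in a Selenium script the callable passed to `retry` would wrap the real element lookup. Selenium's own `WebDriverWait` serves a similar purpose.

```python
import time
from typing import Callable, Tuple, Type, TypeVar

T = TypeVar("T")


def retry(
    action: Callable[[], T],
    attempts: int = 3,
    delay: float = 1.0,
    exceptions: Tuple[Type[BaseException], ...] = (Exception,),
) -> T:
    """Call `action` until it succeeds or `attempts` is exhausted."""
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    for attempt in range(1, attempts + 1):
        try:
            return action()
        except exceptions:
            if attempt == attempts:
                raise  # out of attempts: surface the last error
            time.sleep(delay)  # give the page time to finish loading
    raise AssertionError("unreachable")


# Stand-in for an element lookup that fails twice before succeeding.
calls = {"count": 0}


def flaky_lookup() -> str:
    calls["count"] += 1
    if calls["count"] < 3:
        raise LookupError("element not found yet")
    return "element"


found = retry(flaky_lookup, attempts=5, delay=0.0, exceptions=(LookupError,))
print(found, "after", calls["count"], "attempts")
```

If the lookup still fails after the final attempt, the original exception propagates, so a genuinely missing element is not hidden.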


chengsiyuan commented on 2022-07-18T16:14:29.205+0800:

Because the tiyu topic page could not be navigated to, it was replaced with the game topic: clawer.get_url("game","https://www.youtube.com/gaming/trending")

Uploaded 115 pcap files for the xuexi topic.

Uploaded 108 pcap files for the game topic.
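The per-topic counts can be checked against the target of 100 before uploading. This is a minimal sketch that assumes a hypothetical layout of one subdirectory per topic containing `.pcap` files. The actual storage layout is not described in this ticket, and both function names are illustrative.

```python
from pathlib import Path
from typing import Dict, List

TARGET_PER_TOPIC = 100  # from the task description: 100 videos per topic


def count_pcaps(root: Path) -> Dict[str, int]:
    """Count .pcap files in each topic subdirectory of `root` (assumed layout)."""
    counts: Dict[str, int] = {}
    for topic_dir in sorted(p for p in root.iterdir() if p.is_dir()):
        counts[topic_dir.name] = sum(
            1 for f in topic_dir.glob("*.pcap") if f.is_file()
        )
    return counts


def topics_below_target(
    counts: Dict[str, int], target: int = TARGET_PER_TOPIC
) -> List[str]:
    """Return the topics that have fewer captures than the target."""
    return [topic for topic, n in counts.items() if n < target]


# Counts reported in this ticket.
reported = {"xuexi": 115, "game": 108}
print(topics_below_target(reported))
```

With the reported counts, both topics meet the target, so the list of topics below target is empty.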


Attachments

29282/image-2022-07-04-18-02-25-773.png


29283/image-2022-07-04-18-10-19-787.png


29304/image-2022-07-05-17-41-08-092.png