python selenium chrome: scrolling the browser slowly down to the bottom of the page - python,selenium - 星河码客

Reposted
Category: python, selenium | 713 views | IT小君 | 2021-09-03 22:59

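When scraping a site with selenium, some pages load their data dynamically via AJAX, and the data only appears if you scroll down slowly, one segment at a time. Repeatedly executing the JS statement 'window.scrollTo(0,document.body.scrollHeight)' makes the browser jump straight to the bottom, so the data in between is never loaded.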
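Approach:
First, get the total height of the current page.
Then scroll down 100 pixels at a time until the bottom is reached.
If new content has loaded in the meantime, the page height will have grown, so compare the previous height with the current one. If they differ, keep scrolling down 100 pixels at a time until no new content loads.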
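
The offsets visited in one pass of the steps above can be sketched as a small pure function. The name `scroll_offsets` and the choice to append the exact bottom offset as a final stop are assumptions made for illustration, not part of the original code.

```python
def scroll_offsets(start, end, step=100):
    """Yield scroll positions from start towards end in increments of step.

    Hypothetical helper: the last value yielded is always `end`, so the
    final scroll request targets the exact bottom of the page.
    """
    if step <= 0:
        raise ValueError("step must be positive")
    pos = start
    while pos < end:
        yield pos
        pos += step
    yield end


# One pass over a page that is 250 px tall
print(list(scroll_offsets(0, 250)))  # [0, 100, 200, 250]
```

When the page grows, the next pass would start from the previous height, for example `scroll_offsets(250, 600)`.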

Take Mogujie (蘑菇街) as an example: if you jump straight to the bottom, it loads no data at all.

import time
from selenium import webdriver

driver = webdriver.Chrome()
driver.get("https://www.mogu.com/")
time.sleep(1)

# This script returns the total height of the page
js = "return document.body.scrollHeight"
# The scrollbar starts at height 0
height = 0
# Total height of the page right now
new_height = driver.execute_script(js)

while height < new_height:
    # Scroll down to the current bottom, 100 px at a time
    for i in range(height, new_height, 100):
        driver.execute_script('window.scrollTo(0, {})'.format(i))
        time.sleep(0.5)
    height = new_height
    # Wait for newly requested content, then measure the height again
    time.sleep(2)
    new_height = driver.execute_script(js)
driver.quit()

Wrapping the code in a function for reuse:

import time

def scroll_to_bottom(driver):
    # This script returns the total height of the page
    js = "return document.body.scrollHeight"
    # The scrollbar starts at height 0
    height = 0
    # Total height of the page right now
    new_height = driver.execute_script(js)

    while height < new_height:
        # Scroll down to the current bottom, 100 px at a time
        for i in range(height, new_height, 100):
            driver.execute_script('window.scrollTo(0, {})'.format(i))
            time.sleep(0.5)
        height = new_height
        # Wait for newly requested content, then measure the height again
        time.sleep(2)
        new_height = driver.execute_script(js)
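
On pages with an endless feed the height may never stop growing, so the loop above might not terminate. Below is a minimal sketch of a bounded variant. The name `scroll_to_bottom_bounded`, the parameters `step`, `pause`, `settle` and `max_rounds`, and the `FakePage` stand-in are all hypothetical; they only exist to make the example runnable without a browser. The only thing assumed of the driver is an `execute_script` method.

```python
import time


def scroll_to_bottom_bounded(driver, step=100, pause=0.5, settle=2, max_rounds=20):
    """Scroll down in small steps; stop when the height stops growing
    or after max_rounds passes. Returns the number of passes made."""
    js = "return document.body.scrollHeight"
    height = 0
    new_height = driver.execute_script(js)
    rounds = 0
    while height < new_height and rounds < max_rounds:
        for y in range(height, new_height, step):
            driver.execute_script("window.scrollTo(0, {})".format(y))
            time.sleep(pause)
        # range() excludes new_height itself, so also request the exact bottom
        driver.execute_script("window.scrollTo(0, {})".format(new_height))
        height = new_height
        time.sleep(settle)  # give pending AJAX requests time to finish
        new_height = driver.execute_script(js)
        rounds += 1
    return rounds


class FakePage:
    """Hypothetical stand-in for a WebDriver: the page grows by 300 px
    each time the bottom is reached, up to `loads` times."""

    def __init__(self, height=500, loads=2):
        self.height = height
        self.loads = loads
        self.scrolls = []

    def execute_script(self, script):
        if script.startswith("return"):
            return self.height
        # Parse the y offset out of "window.scrollTo(0, <y>)"
        y = int(script.split(",")[1].strip(" )"))
        self.scrolls.append(y)
        if y >= self.height and self.loads > 0:
            self.height += 300
            self.loads -= 1
        return None


page = FakePage()
rounds = scroll_to_bottom_bounded(page, pause=0, settle=0)
print(rounds, page.height)  # 3 1100
```

With the stand-in above, the function makes three passes and ends at a height of 1100 px. Against a real page, pass the `webdriver.Chrome()` instance instead and keep the default pauses.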
