How the DuckDB excel extension and the rusty_sheet extension behave differently in Python
Create a virtual environment and install the Python packages duckdb, polars, and pandas:

    python -m venv pwin313
    pwin313\scripts\activate.bat
    (pwin313) C:\d\pwin313>pip install duckdb polars pandas

Start Python, import the package, then install and load the DuckDB extensions:

    (pwin313) C:\d\pwin313>python
    Python 3.13.2 (tags/v3.13.2:4f8bb39, Feb  4 2025, 15:23:48) [MSC v.1942 64 bit (AMD64)] on win32
    Type "help", "copyright", "credits" or "license" for more information.
    >>> import duckdb
    >>> duckdb.sql("install excel")
    >>> duckdb.sql("install rusty_sheet from community")
    >>> import time
    >>> duckdb.sql("load excel")
    >>> duckdb.sql("load rusty_sheet")

rusty_sheet is version v0.4.2.

Test each extension three ways: without converting the result to a DataFrame, converting it to a polars DataFrame with .pl(), and converting it to a pandas DataFrame with .fetchdf():

    >>> t=time.time();duckdb.sql("select * from read_xlsx('/d/lineitem.xlsx')");t1=time.time();print(t1-t)
    0.3100736141204834
    >>> t=time.time();duckdb.sql("select * from read_xlsx('/d/lineitem.xlsx')").pl();t1=time.time();print(t1-t)
    4.7497313022613525
    >>> t=time.time();duckdb.sql("select * from read_xlsx('/d/lineitem.xlsx')").fetchdf();t1=time.time();print(t1-t)
    4.8916003704071045
    >>> t=time.time();duckdb.sql("select * from read_sheet('/d/lineitem.xlsx',range='2:')");t1=time.time();print(t1-t)
    4.251304626464844
    >>> t=time.time();duckdb.sql("select * from read_sheet('/d/lineitem.xlsx',range='2:')").pl();t1=time.time();print(t1-t)
    9.083962440490723
    >>> t=time.time();duckdb.sql("select * from read_sheet('/d/lineitem.xlsx',range='2:')").fetchdf();t1=time.time();print(t1-t)
    9.565066576004028

The timings show that read_xlsx does not actually read the data unless the result is converted to a DataFrame (0.31 s without conversion versus about 4.7 to 4.9 s with it), whereas read_sheet does read it even without conversion (4.25 s, rising to about 9.1 to 9.6 s with conversion).
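The one-line timings above can be wrapped in a small helper so that every variant is measured the same way. The following is only a sketch under a few assumptions: the helper names `timed` and `benchmark` are hypothetical, the file path and the `range='2:'` argument are taken from the session above, and `fetchall()` is added as a way to force full materialization without building a DataFrame. Since `duckdb.sql()` returns a lazily evaluated relation, forcing materialization should put both extensions on the same footing.

```python
import time


def timed(fn, repeat=1):
    """Call fn() `repeat` times; return (best elapsed seconds, last result)."""
    best = float("inf")
    result = None
    for _ in range(repeat):
        start = time.perf_counter()
        result = fn()
        best = min(best, time.perf_counter() - start)
    return best, result


def benchmark(path="/d/lineitem.xlsx"):
    """Time both extensions with the result fully materialized each time.

    Assumes the excel and rusty_sheet extensions are already installed.
    """
    import duckdb  # imported here so timed() stays usable without DuckDB

    duckdb.sql("load excel")
    duckdb.sql("load rusty_sheet")
    queries = {
        "read_xlsx": f"select * from read_xlsx('{path}')",
        "read_sheet": f"select * from read_sheet('{path}', range='2:')",
    }
    for name, query in queries.items():
        # fetchall() pulls every row into Python tuples (no DataFrame involved).
        secs, rows = timed(lambda q=query: duckdb.sql(q).fetchall())
        print(f"{name} fetchall: {secs:.3f}s ({len(rows)} rows)")
        # .pl() converts to a polars DataFrame, .fetchdf() to a pandas one.
        secs, _ = timed(lambda q=query: duckdb.sql(q).pl())
        print(f"{name} polars:   {secs:.3f}s")
        secs, _ = timed(lambda q=query: duckdb.sql(q).fetchdf())
        print(f"{name} pandas:   {secs:.3f}s")
```

Calling `benchmark()` inside the virtual environment set up above would print one line per extension and output form. `time.perf_counter()` is used instead of `time.time()` because it is a monotonic, higher-resolution clock meant for measuring intervals.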