Rusty-Sheet: An Excel/WPS/OpenDocument Spreadsheets File Reader for DuckDB

2 points | redraiment | 4 months ago | github.com

3 comments

redraiment|4 months ago

[rusty-sheet](https://github.com/redraiment/rusty-sheet) is a DuckDB extension written in Rust:

* File Formats: Excel, WPS, OpenDocument.
* Remote Access: HTTP(S), S3, GCS, Hugging Face.
* Batch Reading: Multiple files & sheets.
* Schema Merging: By name or by position.
* Type Inference: Automatic + manual override.
* Excel Range: `range='C3:E10'` syntax.
* Provenance: File & sheet tracking.
* Performance: Optimized Rust core.
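
To illustrate what an Excel-style range such as `C3:E10` selects, here is a minimal Python sketch that converts it into zero-based, inclusive row and column bounds. The function names `column_index` and `parse_range` are hypothetical and used only for illustration. This is not the extension's actual implementation, which is written in Rust.

```python
import re


def column_index(letters: str) -> int:
    """Convert column letters (A, B, ..., Z, AA, ...) to a zero-based index."""
    index = 0
    for ch in letters.upper():
        # Column letters behave like a base-26 number with digits 1..26.
        index = index * 26 + (ord(ch) - ord("A") + 1)
    return index - 1


def parse_range(a1_range: str) -> tuple[int, int, int, int]:
    """Return (first_row, first_col, last_row, last_col), zero-based and inclusive.

    Hypothetical helper: only handles the simple 'C3:E10' form.
    """
    match = re.fullmatch(r"([A-Za-z]+)(\d+):([A-Za-z]+)(\d+)", a1_range)
    if match is None:
        raise ValueError(f"not a simple A1-style range: {a1_range!r}")
    first_col, first_row, last_col, last_row = match.groups()
    return (
        int(first_row) - 1,
        column_index(first_col),
        int(last_row) - 1,
        column_index(last_col),
    )


print(parse_range("C3:E10"))  # (2, 2, 9, 4)
```

So `C3:E10` covers rows 3 through 10 and columns C through E, i.e. 8 rows by 3 columns.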

tobwen|4 months ago

“When using wildcard patterns, this function analyzes the column structure and data types from the first matching worksheet only.”

Meh… I think I should work on a PR to fix this…

redraiment|4 months ago

Oh, my bad! I forgot to update the documentation. In union-by-name mode, all matching worksheets are now analyzed; in every other mode, analysis is still limited to the first matching worksheet.
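
The difference between the two behaviors described here can be sketched in Python. This is only an illustration under stated assumptions: the function names are hypothetical, schemas are modeled as plain dicts from column name to type name, and the fallback to `VARCHAR` on a type conflict is an assumption, not a documented rule of the extension.

```python
def schema_from_first(sheets: list[dict[str, str]]) -> dict[str, str]:
    """Use only the first matching worksheet to decide columns and types."""
    return dict(sheets[0])


def schema_union_by_name(sheets: list[dict[str, str]]) -> dict[str, str]:
    """Analyze every worksheet and merge columns by name, in first-seen order."""
    merged: dict[str, str] = {}
    for schema in sheets:
        for name, type_name in schema.items():
            if name not in merged:
                merged[name] = type_name
            elif merged[name] != type_name:
                # Assumption for this sketch: conflicting types widen to text.
                merged[name] = "VARCHAR"
    return merged


# Two hypothetical worksheets with partially overlapping columns.
sheets = [
    {"id": "BIGINT", "name": "VARCHAR"},
    {"id": "BIGINT", "amount": "DOUBLE"},
]

print(schema_from_first(sheets))     # {'id': 'BIGINT', 'name': 'VARCHAR'}
print(schema_union_by_name(sheets))  # {'id': 'BIGINT', 'name': 'VARCHAR', 'amount': 'DOUBLE'}
```

With first-sheet-only analysis the `amount` column of the second worksheet never enters the schema, which is the limitation the quoted documentation sentence describes.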