It really depends on what you need to do with the data, but in most cases Python could do this pretty easily with csv.reader (with a \t delimiter for TSV) or xml.etree.ElementTree.iterparse (for XML) in streaming fashion such that you're not loading the whole file at once.
FlyingAvatar|1 year ago
pradeepchhetri|1 year ago
[0] https://clickhouse.com/docs/en/interfaces/formats#tabseparat...
[1] https://clickhouse.com/docs/en/interfaces/formats#xml
mobilio|1 year ago
datadrivenangel|1 year ago
Also how is it structured? If it's actually a tab separated value file, consider using something like polars or DuckDB?
anindha|1 year ago
abdusco|1 year ago
https://stackoverflow.com/a/35454070/5298150
You can also use datasette & sqlite utils for it
https://sqlite-utils.datasette.io/en/stable/cli.html#inserti...