(no title)
klinch | 5 months ago
I used to work on payment infrastructure and whenever a vendor offered us the choice between CSV and some other format we always opted for that other format (often xlsx). This sounds a bit weird, but when using xlsx and a good library for handling it, you never have to worry about encoding, escaping and number formatting.
This is one of these things that sound absolutely wrong from an engineering standpoint (xlsx is abhorrently complex on the inside), but works robustly in practice.
Slightly related: This was a German company, with EU and US payment providers. Also note that Microsoft Excel (and therefore a lot of other tools) produces "semicolon-separated values" files when started on a computer with the locale set to German...
n4r9|5 months ago
chungy|5 months ago
olive-n|5 months ago
I work a lot with time series data, and excel does not support datetimes with timezones, so I have to figure out the timezone every time to align with other sources of data.
Reading and writing them is much slower than csv, which is annoying when datasets get larger.
And most importantly, xlsx are way more often fucked up in some way than any other format. Usually, because somebody manually did something to them and sometimes because the library used to write them had some hiccup.
So I guess a hot take indeed.
imtringued|5 months ago
porker|5 months ago
Which good libraries did you find? That's been my pain point when dealing with xlsx.
klinch|5 months ago
personalityson|5 months ago
IanCal|5 months ago
I tried using them once after what felt like an aeon of quoting issues, and the first customer file I had had them randomly appearing in their fields.