I think its really interesting such gems offering a ruby layer on top of rust libs. One issue that I have with that is - and maybe it's my ignorance - but is that necessary to bundle the original lib as https://github.com/ankane/polars-ruby/tree/master/ext/polars ? I can imagine that makes easier to avoid breakage, or with C extensions, because you dont have some dependency manager around, but couldn't we sort it with Cargo? For instance, having the version locked and it could download (and cache) the dependency when necessary?
Blazer is a particular favourite of mine. A former colleague taught herself SQL from zero knowledge by looking at and piecing together bits of other reports, experimenting with familiar (interesting) data and going on to build dashboards her team loved.
If we'd have just given the team a locked down Data Studio/PowerBI report none of that would have happened. Encouraging people to peek under the hood can be a huge benefit.
Having done financial modelling and data analytics in Ruby: Because I like Ruby, all the other backend code in those projects was in Ruby, and most projects don't rely on data volumes where the lack of something like Polars is an issue to begin with.
Most people don't have large datasets (even many people who think they have large datasets). Some do, or require more complex supporting libraries, and I get that Ruby then often isn't practical for them, and that's fine.
But it's nice to know I now have one more option reducing my need to consider another language.
- Because some like it better than Python/Julia/<INSERT NEXT language>.
- Because they want data analytics in a ruby application
- Because Ruby is awesome
Honestly I used Ruby about 6 years ago, but have been a python guy ever since. That said I believe Ruby's main advantage is its metaprogramming capabilities. You can build powerful DSLs in Ruby pretty quickly. Adding analytics to that could be useful in certain cases.
I'd like to see spark bindings first though before I would seriously consider it.
dang|3 years ago
Modern Polars: A comparison of the Polars and Pandas dataframe libraries - https://news.ycombinator.com/item?id=34275818 - Jan 2023 (62 comments)
Also:
Polars: Fast DataFrame library for Rust and Python - https://news.ycombinator.com/item?id=29584698 - Dec 2021 (124 comments)
Polars: Rust DataFrames Based on Apache Arrow - https://news.ycombinator.com/item?id=23768227 - July 2020 (1 comment)
pelasaco|3 years ago
ssaunier_|3 years ago
petepete|3 years ago
If we'd have just given the team a locked down Data Studio/PowerBI report none of that would have happened. Encouraging people to peek under the hood can be a huge benefit.
BilalBudhani|3 years ago
pantsforbirds|3 years ago
tsaoyu|3 years ago
tecleandor|3 years ago
oofbey|3 years ago
vidarh|3 years ago
Most people don't have large datasets (even many people who think they have large datasets). Some do, or require more complex supporting libraries, and I get that Ruby then often isn't practical for them, and that's fine.
But it's nice to know I now have one more option reducing my need to consider another language.
looopTools|3 years ago
dajonker|3 years ago
jay-barronville|3 years ago
unknown|3 years ago
[deleted]
swalsh|3 years ago
I'd like to see spark bindings first though before I would seriously consider it.
unknown|3 years ago
[deleted]
peoplefromibiza|3 years ago
nurettin|3 years ago
claudiug|3 years ago