rdorgueil | 9 years ago | on: Bonobo – A data processing toolkit for Python 3.5+
rdorgueil's comments
rdorgueil | 9 years ago | on: Bonobo – A data processing toolkit for Python 3.5+
rdorgueil | 9 years ago | on: Bonobo – A data processing toolkit for Python 3.5+
The TLDR would then be "Write some generators or functions, link them in a graph, and call them in order on each line of data as soon as the previous transformation node output is ready.". For example if you have a database cursor that yields each line of a query as its output, it starts to run the next step(s) in the graph as soon as the first result is ready (yet not stop yielding from database until the graph is done for the current row). I did not find it easy to do with the libraries I tried.
The docs clearly lacks completion to say the least, and would need an example with a big dataset, one with long individual operations and one with a non linear graph, so it's more obvious that, of course, it's not made to process strings to uppercase twice in a row.
Stay tuned, I'm very happy HN brought it to homepage, did not really think it could happen at this stage though and I understand you. But that's a good thing for the project to move forward.
rdorgueil | 9 years ago | on: Bonobo – A data processing toolkit for Python 3.5+
This being said, if any of you have a good picture of bonobos that I can use instead of the current one, I'd be really glad to replace it! It needs to be released under a free license, though.
Thanks HN
rdorgueil | 9 years ago | on: Bonobo – A data processing toolkit for Python 3.5+
rdorgueil | 9 years ago | on: Bonobo – A data processing toolkit for Python 3.5+
rdorgueil | 9 years ago | on: Bonobo – A data processing toolkit for Python 3.5+
rdorgueil | 9 years ago | on: Bonobo – A data processing toolkit for Python 3.5+
Mostly, when I want a quasi-mathematical look over a dataset, pandas is my tool of choice. For all those data pipeline things that reasonably fit on one computer, I do use bonobo.
rdorgueil | 9 years ago | on: Bonobo – A data processing toolkit for Python 3.5+
rdorgueil | 9 years ago | on: Bonobo – A data processing toolkit for Python 3.5+
rdorgueil | 9 years ago | on: Bonobo – A data processing toolkit for Python 3.5+
rdorgueil | 9 years ago | on: Bonobo – A data processing toolkit for Python 3.5+
I'm preparing explanation pages for a lot of the questions I got, including comparisons, volumes of data, where it is good and where it is not ...
All that will be well ready before 1.0, but for now, we're at 0.2 ...
Thanks for all the hackerlove, though!
rdorgueil | 9 years ago | on: Bonobo – A data processing toolkit for Python 3.5+
I note that I still have a lot of work explaining in simple terms what is actually bonobo, without falling in the trap of "overgeneral description".