asksol | 7 years ago | on: What Is Idempotence?
asksol's comments
asksol | 7 years ago | on: Questions
asksol | 7 years ago | on: Faust: Stream Processing for Python
If you have a Faust app that depends on a particular feature we strongly suggest you submit it as an integration test for us to run.
Hopefully some day Celery will be able to take the same approach, but running cloud servers cost money that the project does not have.
asksol | 7 years ago | on: Faust: Stream Processing for Python
It's important to remember that users had difficulty understanding the concepts behind Celery as well, perhaps it's more approachable now that you're used to it.
Using an asynchronous iterator for processing events enables us to maintain state. It's no longer just a callback that handles a single event, you could do things like "read 10 events at a time", or "wait for events on two separate topics and join them".
asksol | 7 years ago | on: Faust: Stream Processing for Python
asksol | 7 years ago | on: Faust: Stream Processing for Python
asksol | 7 years ago | on: Faust: Stream Processing for Python
asksol | 7 years ago | on: Faust: Stream Processing for Python
- A Stream iterates over a channel
- A Channel implements: `channel.send` and `channel.__aiter__`.
- A topic is a "named" channel backed by a Kafka topic
- Further the topic is backed by a Transport
- Transport is very Kafka specific
To implement support for AMQP the idea is you only need to implement a custom channel.
If you open an issue we can consider how to best implement it.
asksol | 7 years ago | on: Faust: Stream Processing for Python
asksol | 7 years ago | on: Faust: Stream Processing for Python
Simplicity is of course a goal, but this may mean we have to sacrifice some features when Redis Streams is used as a backend.
I'm glad you like Celery, this project in many ways realize what I wanted it to be.
asksol | 7 years ago | on: Faust: Stream Processing for Python
Thanks for pointing this out. Fixed the links. You can also find the docs here: http://faust.readthedocs.io/en/latest/
Faust uses Kafka for message passing. The new messages you create can be pushed to a new topic and you could have another agent consuming from this new topic. Check out the word count example here: https://github.com/robinhood/faust/blob/9fc9af9f213b75159a54...
Also note that the Table is persisted in a log compacted Kafka topic. This means, we are able to recover the state of the table in the case of a failure. However, you can always write to any other datastore while processing a stream within an agent. We do have some services that process streams and storing state in Redis and Postgres.
asksol | 7 years ago | on: Faust: Stream Processing for Python
Yes, we're definitely interested in supporting Redis Streams!
Faust is designed to support different brokers, but Kafka is the only implementation as of now.
asksol | 7 years ago | on: Faust: Stream Processing for Python
This answer sums up why we wanted to use Python for this project: it's the most popular language for data science and people can learn it quickly.
Performance is not really a problem either, with Python 3 and asyncio we can process tens of thousands of events/s. I have seen 50k events/s, and there are still many optimizations that can be made.
That said, Faust is not just for data science. We use it to write backend services that serve WebSockets and HTTP from the same Faust worker instances that process the stream.
asksol | 8 years ago | on: Equifax terms of service may include binding arbitration clause
asksol | 9 years ago | on: Celery 4.0
Disclaimer: I'm a contributor
asksol | 9 years ago | on: Celery 4.0
I have merged many features, like broker transports, result backends, etc, and while the initial contribution was great, it ends up being unmaintained with issues that nobody fixes.
If there's any feature that you really want back, chances are the problems with that feature are not super difficult to fix, so please reach out!
asksol | 9 years ago | on: Association Between Cesarean Birth and Risk of Obesity
asksol | 9 years ago | on: Writing an editor in less than 1000 lines of code, just for fun
asksol | 9 years ago | on: San Francisco man fights eviction after rent increase from $1,800 to $8,000
asksol | 9 years ago | on: Senate rejects FBI bid for warrantless access to internet browsing histories
It doesn't make much sense to me, unless they are carefully wording this into something that seems reasonable to the public ("I'll just use private mode, no deal") when it really means monitoring all our internet traffic. But then why is the media repeating it?