(no title)
dxxvi | 4 months ago
Have you ever thought of using a postgresql db (also on aws) to store those files and use CDC to publish messages about those files to a kafka topic? In your original way, we need 3 aws services: s3, lambda and sqs. With this way, we need 2: postgresql and kafka. I'm not sure how well this method works though :-)
ravedave5|4 months ago
jrochkind1|4 months ago
dxxvi|4 months ago
1GB with the bytea data type (https://www.postgresql.org/docs/current/datatype-binary.html) and 4TB with the BLOB data type (https://wiki.postgresql.org/wiki/BinaryFilesInDB).
dxxvi|4 months ago
Why hesitant? Just ask AI. It'll tell you how to do it and then you can experiment it yourself.