isaacvando | 3 months ago

At work we have anonymized dumps of our production data that we use for local development. This is extremely helpful because it lets us test things locally that would be completely unrealistic to set up otherwise. Your whole dataset might not fit on a dev machine, but I'd guess that in most cases you could create a subset large enough to be useful for local performance work.

jitl|3 months ago

Do you use an off the shelf tool for anonymization or did you write something yourself?

isaacvando|3 months ago

We use Tonic.ai, although I am interested in writing something myself. I suspect there are a lot of ways it could be improved, such as generating the anonymized version more quickly.
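
For anyone curious what a homegrown version might look like, here is a minimal sketch in Python. It is not how Tonic.ai works and not what we run; the key, the field names, and the helpers `pseudonymize`, `anonymize_row`, and `keep_in_subset` are all hypothetical. It assumes rows arrive as dicts and that keyed hashing is acceptable for your data, which it may not be for low-entropy fields.

```python
import hashlib
import hmac

# Hypothetical key for illustration only; a real one would live outside the repo.
SECRET_KEY = b"dev-only-secret"


def pseudonymize(value: str, field: str, key: bytes = SECRET_KEY) -> str:
    """Map a value to a stable token, so equal inputs stay equal (joins still work)."""
    message = f"{field}:{value}".encode("utf-8")
    return hmac.new(key, message, hashlib.sha256).hexdigest()[:12]


def anonymize_row(row: dict, sensitive_fields: set) -> dict:
    """Return a copy of the row with the sensitive fields replaced by tokens."""
    out = {}
    for name, value in row.items():
        if name in sensitive_fields and value is not None:
            token = pseudonymize(str(value), name)
            # Keep emails shaped like emails so validation code still passes.
            out[name] = f"{token}@example.invalid" if name == "email" else token
        else:
            out[name] = value
    return out


def keep_in_subset(row_id: int, keep_one_in: int) -> bool:
    """Deterministically keep roughly 1 in N rows, based on a hash of the id."""
    digest = hashlib.sha256(str(row_id).encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % keep_one_in == 0


if __name__ == "__main__":
    rows = [
        {"id": i, "email": f"user{i}@corp.test", "name": f"User {i}", "plan": "pro"}
        for i in range(1000)
    ]
    subset = [
        anonymize_row(r, {"email", "name"})
        for r in rows
        if keep_in_subset(r["id"], keep_one_in=10)
    ]
    print(len(subset), subset[:1])
```

Hashing the id for the subset keeps the same rows across runs, so related tables can be filtered consistently by the same key. The obvious gap is that this does nothing about free-text columns or rare values that identify someone on their own.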