top | item 42964598

(no title)

panta | 1 year ago

How do you handle the privacy of the scanned documents?

discuss

order

bitdribble|1 year ago

With the docrouter.ai, it can be installed on prem. If using the SAAS version, users can collaborate in separate workspaces, modeled on how Databricks supports workspaces. Back end DB is Mongo, which keeps things simple.

One level of privacy is the workspace level separation in Mongo. But, if there is customer interest, other setups are possible. E.g. the way Databricks handles privacy is by actually giving each account its own back end services - and scoping workspaces within an account.

That is a good possible model.

kbyatnal|1 year ago

We work with fortune 500s in sensitive industries (healthcare, fintech, etc). Our policies are:

- data is never shared between customers

- data never gets used for training

- we also configure data retention policies to auto-purge after a time period

panta|1 year ago

But how to get these guarantees from the upstream vendors? Or do you run the LLMs on premises?