(no title)
Bootstrapper909 | 2 years ago
We actually use real user sessions to train our model, so when I use the term coverage our main metric is covering as many user behaviors as possible.
We collect data in a privacy-focused way essentially anonymizing all sensitive information, as we don't need to know the user specific context. Only the main flow.
johnhenning|2 years ago
Bootstrapper909|2 years ago
Then, during test generation, we bias the model to explore edge cases (in a few ways), and the model is still able to complete those even with low sample.
In other words, we direct the model toward certain goals, and flows and also add chaos to the process which result in the model executing unexpected flows.
afro88|2 years ago
Thinking of apps that might fall under HIPAA etc
Bootstrapper909|2 years ago
1. We hash all inner text and then backfill static strings on the server side. So every text that is specific to the user remains hashed
2. We detect special cases like passwords, SSNs, credit cards, and completely block it (even not hashed)
3. We provide full privacy controls to our customers to easily mask any sensitive elements
4. We discard the user IP and don't require any PII to be sent. So we can connect a session together, but don't really know who the user is