Appreciate the feedback! Completely agree - authentication should be handled at the system level, not just in prompts. This demo is meant to showcase how teams can build test cases from real failures and ensure fixes work before deployment. We’ll consider using a better example.
oxidant|1 year ago
> For each replay that we run, Roark checks if the agent follows key flows (e.g. verifying identity before sharing account details)
I don't know if AI will be more susceptible or less susceptible to phishing than humans, but this feels like a bad practice.
zammitjames|1 year ago
That said, totally fair point that this example could be clearer—we’ll keep that in mind for future demos. Thanks for calling it out!