If that is the case, could you trick it into giving you one word at a time? I.e., ask for the first word of its response to the inappropriate query, then ask the same question but only for the second word, and so on. Each word would then pass through the moderation API, but the whole response would never get checked.
netruk44|3 years ago
After Bing has finished generating a message, it will likely call the moderation API with the message it has generated to see if it accidentally generated anything inappropriate. If so, it'll delete the message and replace it with a generic "Sorry, I don't know how to help here." message instead.
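To make the guess above concrete, here is a rough sketch of a generate-then-moderate flow. It is not Bing's actual implementation: `ModerationResult`, `keyword_moderator`, and `finalize_message` are hypothetical names, and the keyword check is only a toy stand-in for a real moderation endpoint. The `flagged` and `categories` fields simply mirror the response fragment quoted in this thread.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict

# Generic replacement text, as quoted in the comment above.
FALLBACK = "Sorry, I don't know how to help here."


@dataclass
class ModerationResult:
    """Hypothetical result shape: a flag plus per-category booleans."""
    flagged: bool
    categories: Dict[str, bool] = field(default_factory=dict)


def keyword_moderator(text: str) -> ModerationResult:
    """Toy stand-in for a moderation service (assumption, not a real API)."""
    violent = any(word in text.lower() for word in ("kill", "stab"))
    return ModerationResult(flagged=violent, categories={"violence": violent})


def finalize_message(
    generated: str,
    moderate: Callable[[str], ModerationResult] = keyword_moderator,
) -> str:
    """Check the finished message once; swap in the fallback if it is flagged."""
    result = moderate(generated)
    return FALLBACK if result.flagged else generated
```

The check runs only once the whole message exists, which would explain why a reply can appear and then be replaced. Any real service would be passed in through the `moderate` parameter instead of the toy checker.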
EDIT: I tried calling the moderation API with the message in your example and it does get flagged for violence:
"flagged":true,
"categories":{
ipv4dhcp|3 years ago
canes123456|3 years ago