(no title)
hmage | 2 years ago
First document is an forum thread full of "go fuck yourself fucking do it", and in this kind of scenario, people are not cooperative.
Second document is a forum thread full of "Please, take a look at X", and in this kind of scenario, people are more cooperative.
By adding "Please" and other politness, you are sampling from dataset containing second document style, while avoiding latent space of first document style - this leads to a model response that is more accurate and cooperative.
Hope that explains.
No comments yet.