(no title)
lelag | 10 months ago
What I will add is that constrained generation is supported by the major inference engine like llama.cpp, vllm and the likes, so what you are describing is actually trivial on locally hosted models, you just have to provide a regex that prevent them to use the letter 'e' in the output.
Der_Einzige|10 months ago
https://github.com/sam-paech/antislop-sampler
https://arxiv.org/abs/2306.15926