(no title)
gixco
|
1 year ago
I didn't just look at the implementation, I tried it as well. I was hoping it would work, but the aggregating model mostly either failed to properly synthesize a new response (merely dumping out the previous responses as separate functions) or erratically took bits from each without properly gluing them together. In every case, simply picking the best out of the four responses myself yielded better results.
ianbutler|1 year ago
I am however working in a domain where verification isn't subjective so I know a good response from a bad response fairly easily. Things like this depend quite heavily on the model being used too in my experience.
ericjmorey|1 year ago