(no title)
AlfeG | 1 year ago
Grok3, Claude, Deepseek, Qwen all failed to solve this problem. Resulting in some very very wrong solutions. While Grok3 were admit it fail and don't provide answers all other AI's are provided just plain wrong answers, like `12 * 5 = 80`
ChatGPT were able to solve for 40, but not able to 80. YandexGPT solved those correctly (maybe it were trained on same Math books)
Just checked Grok3 few more times. It were able to solve correctly for 80.
CamperBob2|1 year ago
Interestingly, the R1 1.58-bit dynamic quant model was able to sort of solve it. I changed the problem statement a bit to request only the solution for 40 and to tell it what operations it can use, both needed to keep from blowing out the limited context available on my machine (128MB RAM + 24MB GPU).
Took almost 3 hours and it wigged out a bit at the end, rambling about Lisp in Chinese, but it got an almost-valid answer: 1 * (2 + 3) * (4 + 5) - 5 (https://pastebin.com/ggL85RWJ) I didn't think it would get that far.
Pannoniae|1 year ago
https://claude.ai/share/dfb37c1a-f6a8-45a1-b987-e6d28e205080
ducktin|1 year ago
12 * 3 + 4 = 40
1 * 2 * 3 * 4 * 5 / 3 = 40
bfm|1 year ago
coffeeaddict1|1 year ago
```
We can “stick‐to the order” of the digits and allow concatenation. For example, one acceptable answer is
and another is In both cases the digits 1,2,3,4,5 appear in order without rearrangement.```
However, it took 8 minutes to produce that.
sizzle|1 year ago
AlfeG|1 year ago