top | item 43377989

(no title)

integralof6y | 11 months ago

I just tried the chat and asked the LLM to compute the double integral of 6*y over the interior of a triangle, given its vertices. It made many attempts, all incorrect. Then I asked it to write a Python program to solve the problem, and that was incorrect too. I know math computation is a weak point for LLMs, especially in a chat. One of the programs branched on a hardcoded number 10, which suggests the generated program was fitted to produce the expected result for the test (I had given it the correct result beforehand). So be careful when testing generated programs: they could be fitted to pass your simple tests.

Edited: I also asked it to compute the integral of 6*y over the triangle with vertices A(8, 8), B(15, 29), C(10, 12), and it yielded a wrong result of 2341. Then I suggested using the formula based on the barycenter of the triangle, that is, 6*Area*(mean of the y-coordinates), and it returned the correct value of 686.
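For reference, the barycenter formula can be checked by hand or with a few lines of Python. This is only a minimal sketch: the function names `triangle_area` and `integral_6y` are made up for illustration, and it assumes integer vertex coordinates so that `Fraction` keeps the arithmetic exact.

```python
from fractions import Fraction


def triangle_area(a, b, c):
    """Unsigned area of triangle abc via the cross-product (shoelace) formula."""
    (ax, ay), (bx, by), (cx, cy) = a, b, c
    cross = (bx - ax) * (cy - ay) - (cx - ax) * (by - ay)
    return Fraction(abs(cross), 2)  # assumes integer coordinates


def integral_6y(a, b, c):
    """Integral of 6*y over the triangle: 6 * area * (mean of the y-coordinates).

    This works because y is linear, so its average over the triangle
    equals its value at the barycenter.
    """
    mean_y = Fraction(a[1] + b[1] + c[1], 3)
    return 6 * triangle_area(a, b, c) * mean_y


print(integral_6y((8, 8), (15, 29), (10, 12)))  # 686
```

Here the area is 7 and the mean of the y-coordinates is 49/3, so the result is 6 * 7 * 49/3 = 686.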

To summarize: LLMs do not seem able to give correct results for simple math problems (here, a double integral over a triangle). Students should not rely on them, since at present they cannot perform simple tasks without many errors.

vmg12|11 months ago

Here is an even easier one: ask LLMs to take the integral from 0 to 3 of 1/(x-1)^3. They fail to notice that it's an improper integral and just give an answer.
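To see why this integral is improper, note that the integrand blows up at x = 1, inside [0, 3]. The sketch below is illustrative only (the helper names `antiderivative` and `left_piece` are made up). It contrasts the value obtained by blindly evaluating the antiderivative at the endpoints with the integral over [0, 1 - eps], which grows without bound in magnitude as eps shrinks.

```python
def antiderivative(x):
    """F(x) = -1 / (2 (x - 1)^2), an antiderivative of 1/(x - 1)^3 for x != 1."""
    return -1.0 / (2.0 * (x - 1.0) ** 2)


def left_piece(eps):
    """Integral of 1/(x - 1)^3 from 0 to 1 - eps, i.e. F(1 - eps) - F(0)."""
    return -1.0 / (2.0 * eps ** 2) - antiderivative(0.0)


# Blind endpoint evaluation, which ignores the singularity at x = 1.
naive = antiderivative(3.0) - antiderivative(0.0)
print("naive F(3) - F(0):", naive)

# The piece to the left of the singularity diverges as eps -> 0.
for eps in (1e-1, 1e-2, 1e-3):
    print("eps =", eps, "integral over [0, 1 - eps]:", left_piece(eps))
```

The blind evaluation gives a finite number (3/8), but since the left piece already diverges, the integral over [0, 3] does not converge.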

floam|11 months ago

ChatGPT definitely noticed: o1, o3-mini, o3-mini-high.

Maybe 4o will get it wrong? I wouldn’t try it for math.

HeatrayEnjoyer|11 months ago

>compute the integral of 6*y on the triangle with vertices A(8, 8), B(15, 29), C(10, 12)

o3-mini returned 686 on the first try, without executing any code.