(no title)

If you sampled N random people on the street and asked them to solve this problem, what would the outcome be? Would it be better than asking ChatGPT N times? I wonder.

jiiam|1 year ago

I am deeply interested in this point of view of yours so I will be hijacking your reply to ask another question: is "better than asking a few random people on the street" the bar we should be setting?

As far as mathematical thinking goes this doesn't seem an interesting metric at all. Do you believe that optimizing for this metric will indeed lead to reliable mathematical thinking?

I am of the opinion that LLMs are not suited to maths, but since I'm not an expert in the field I'm always looking for counterarguments. Of course we can always wait another couple of years and the question will be resolved.

jiggawatts|1 year ago

People compare a general intelligence against the yardstick of their own specialist skills.

I’ve seen some truly absurd examples, like people complaining that it didn’t have the latest updates to some obscure research functional logic proof language that has maybe a hundred users globally!

GPT 4 already has markedly better English comprehension and basic logic than most people I interact with on a daily basis. It’s only outperformed by a handful of people, all of whom are “high achievers” such as entrepreneurs, professors, or consultants.

I actively simplify my speech when talking to ordinary people to avoid overwhelming them. I don’t need to when instructing GPT.

grob-gambit|1 year ago

I don't have a counterargument. Not to be ironic, but ChatGPT4o gives a better response to the question at hand than anything I have read in this thread:

https://chatgpt.com/share/c10c540f-b9c2-4714-ae6b-77460b900b...

hervature|1 year ago

HN: "Tesla needs to be 100x safer than the best human drivers!!!"

Also HN: "ChatGPT just needs to spell its name."

fragmede|1 year ago

While words have power too, I'm not driving next to ChatGPT on the freeway where it's going to immediately kill or maim me if it hallucinates.

Besides, only half of HN insists that self-driving has to be 100x safer. The other half keeps bringing up the fact that Waymo is here and working, just not everywhere yet.