top | item 47086872

(no title)

djb_hackernews | 10 days ago

You have a misunderstanding of what LLMs are good at.

discuss

order

cap11235|10 days ago

Poster wants it to play Jeopardy, not process text.

paganel|10 days ago

Not sure if you're correct, as the market is betting trillions of dollars on these LLMs, hoping that they'll be close to what the OP had expected to happen in this case.

raincole|10 days ago

The market didn't throw trillions of dollars to develop Llama 3 8B.

What GP is expected to happen has happened around late 2024 ~ early 2025 when LLM frontends got web search feature. It's old tech now.

IshKebab|10 days ago

I don't think he does. Larger models are definitely better at not hallucinating. Enough that they are good at answering questions on popular topics.

Smaller models, not so much.

kleiba|10 days ago

Care to enlighten me?

vntok|10 days ago

Don't ask a small LLM about precise minutiae factual information.

Alternatively, ask yourself how plausible it sounds that all the facts in the world could be compressed into 8k parameters while remaining intact and fine-grained. If your answer is that it sounds pretty impossible... well it is.