(no title)
j-b | 2 years ago
Q: Sally (a girl) has 3 brothers. Each brother has 2 sisters. How many sisters does Sally have?
A: Okay, let's organize the information we have:
Sally is a girl. Sally has 3 brothers. Each of Sally's brothers has 2 sisters. Now, let's think about this step by step:
We know that Sally is a girl, so she is a sister to her brothers. Each of Sally's brothers has 2 sisters. One of these sisters is Sally, but who is the other sister? Since Sally has only brothers and no other sisters are mentioned, we can conclude that the other sister for each brother is also Sally.
Therefore, Sally has no sisters other than herself. So, the answer is that Sally has 0 sisters.
yaj54|2 years ago
GPT4 w/ CoT system prompting gets it right:
SYS: "You are a helpful assistant. Think through your work step by step before providing your answer."
USER: "Sally (a girl) has 3 brothers. Each brother has 2 sisters. How many sisters does Sally have?"
ASSIST: "Let's break down the information given:
1. Sally has 3 brothers. 2. Each brother has 2 sisters.
Given that Sally is a girl, she is one of the sisters. Since each brother has 2 sisters and we know Sally is one of them, there must be one more sister to make the statement true for each brother.
Therefore, Sally has 1 sister."
The importance of prompting makes it quite difficult to compare model peak performance. Especially since different models have different styles of prompts that generate peak performance.
sigmoid10|2 years ago
"Sally has 3 brothers, and each of them has 2 sisters. The description implies that Sally's brothers are her only siblings. Therefore, the two sisters each brother has must be Sally and one other sister. This means Sally has just one sister."
fragmede|2 years ago
COAGULOPATH|2 years ago
compumetrika|2 years ago
I have one that describes a lot of statistical work I want GPT to help me with.
I got this result the first try:
> Sally has 2 sisters. Since each brother has 2 sisters, and Sally is one of them, there must be one other sister making it two sisters in total. >
auggierose|2 years ago
daviding|2 years ago
> There are two ways to approach this problem. First, we can count each person by gender and then add them up. Second, we can use the given information about the siblings and their genders to find a quicker solution.
> Let's go with the second method:
> 1. Sally has one sister (herself).
> 2. Each brother has two sisters. So, there are 3 sisters for each of her brothers.
> 3. Since there are three brothers, there are a total of 3 x 3 = 9 sisters for them all.
> 4. Now that we know Sally has one sister and the rest belong to her brothers, we can subtract her share from the total: 9 - 1 = 8 sisters.
>
> So, Sally has 8 sisters.
:|
TrueSlacker0|1 year ago
A: "If Sally has 3 brothers, and each brother has 2 sisters, then the total number of sisters is: 3 brothers x 2 sisters per brother = 6 sisters So, Sally has 6 sisters in total."
Closer than 9 but no better.
sexy_seedbox|2 years ago
giantrobot|2 years ago
oreilles|2 years ago
llmzero|2 years ago
FS = father of Sally = 7
MS = mother of Sally = 10
FB = father of Bob = 12
MB = mother of Bod = 10
FM = father of Mary = 12
MM = mother of Mary = 24
Now MS=MB=10 (S and B are brothers), FB=FM=12 (Bob and Mary are brothers), (FS=7)#(FB=12), and (MB=10)#(MM=24). Now S and M are not sisters because their parents {7,10} and {12,24} are disjoint sets.
Edited several times to make the example trivial and fix grammar.
phkahler|2 years ago
In an adjacent area - autonomous driving - I know that lane following is f**ing easy, but lane identification and other object identification is hard. Having real understanding of a situation and acting accordingly is very complex. I wonder if people look at these cars doing the basics and assume they "understand" a lot more than they actually do. I ask the same about LLMs.
Workaccount2|2 years ago
xanderlewis|2 years ago
That’s why a lot (though not all!) of the over-the-top LLM hype you see online is coming from people with very little experience and no serious expertise in a technical domain.
If it walks like a duck, and quacks like a duck…
…possibly it’s just an LLM trained on the output of real ducks, and you’re not a duck so you can’t tell the difference.
I think LLMs are simply a less general technology than we (myself included) might have predicted at first interaction. They’re incredibly good at what they do — fluidly manipulating and interpreting natural language. But humans are prone to believing that anything that can speak their language to a high degree of fluency (in the case of GPT-3+, beyond almost all native speakers) must also be hugely intelligent and therefore capable of general reasoning. And in LLMs, we finally have the perfect counterexample.
smokel|2 years ago
Whoever told you that?
bbor|2 years ago
Luckily, we figured out how to write programs to mimic that part of the brain in the 70s ;)
strangescript|2 years ago
That is not to say this is fine, but more that we tend to get hung up on what these models do wrong rather than all the amazing stuff they do correctly.
torginus|2 years ago
raydev|2 years ago
I'm not just going to ask some rando on the street to give me factual information, there are people who get paid to do that and are incentivized to find citations/get it right.
geuis|2 years ago
Jackson__|2 years ago
This causes it to get the question wrong for me, when testing, and only if I manually prompt normal CoT does it get it right.
Is there any papers showing a merit to this approach? It seems extremely counter-intuitive.
lulznews|2 years ago
walkieer|2 years ago
campbel|2 years ago
Okay, let's think through this step-by-step:
We know that Sally is a girl and she has 3 brothers. Each of Sally's brothers has 2 sisters. Now, who are these sisters? They must be Sally and one other sister, because if Sally's brothers had any other sisters, they would also be Sally's sisters, and the problem doesn't mention that. So, if each of Sally's brothers has 2 sisters, and one of these sisters is Sally herself, then Sally must have 1 other sister. Therefore, Sally has 1 sister.
maxnevermind|2 years ago
SirMaster|2 years ago
>Sally (a girl) has 3 brothers. Each brother has 2 sisters. How many sisters does Sally have?
Sally has 1 sister.
Here's the logic behind this:
1. We know that Sally is a girl and she has 3 brothers.
2. Then we are told that each of her brothers has 2 sisters.
3. Since all of Sally's brothers share the same siblings, they would both count Sally as one of their two sisters.
4. Therefore, Sally only has 1 sister because if each brother counts her once, there is no need for another sister to fulfill the "two sisters" condition.
scrollop|2 years ago
unknown|2 years ago
[deleted]
heliostatic|2 years ago
User: Sally (a girl) has 3 brothers. Each brother has 2 sisters. How many sisters does Sally have?
Claude: To solve this, let's break it down step-by-step:
We know that Sally has 3 brothers. Each of Sally's brothers has 2 sisters. This might make you think that there are 6 sisters in total (3 brothers × 2 sisters each). However, this is not the case. The sisters that each brother has are the same - they are Sally and one other sister. The question is asking how many sisters Sally herself has, not the total number of sisters in the family. Since Sally is a girl, she cannot be her own sister. Therefore, the number of sisters Sally has is equal to the number of sisters each of her brothers has minus herself. So, if each brother has 2 sisters, and one of those sisters is Sally, then Sally must have 2 - 1 = 1 sister. Therefore, Sally has 1 sister.
https://0x0.st/H7Sc.02.png
jedilord|2 years ago
[deleted]
woadwarrior01|2 years ago
[1]: https://imgur.com/a/zRI8wKZ
[2]: https://imgur.com/a/0On1I52
hmottestad|2 years ago
I have a niche question about modelling using some called SHACL that most models except GPT4 got right. Bard, Gemini, Llama all got it wrong. Gemini Ultra gets it right. And Mixtral also gets it right!
One weakness of Mixtral for me is its support for Norwegian. GPT4 is fluent, but Mixtral mixes it up with Danish and is generally poor at performing tasks on Norwegian text. Even summarising Norwegian text is pretty bad. This is obviously just an issue for a few million people in Norway, it’s not that I’m expecting a general model that I can run locally to be good in Norwegian.
Spivak|2 years ago
This has been one of my favorite things to play around with when it comes to real life applications. Sometimes a smaller "worse" model will vastly outperform a larger model. This seems to happen when the larger model overthinks the problem. Trying to do something simple like "extract all the names of people in this block of text" Llama 7B will have significantly fewer false positives than LLama 70B or GPT4.
ijustlovemath|2 years ago
Based on the search results, we can break this down step-by-step:
In conclusion, Sally has 1 sister. The key thing to understand here is that when it says "each brother has 2 sisters", it is referring to Sally and one other sister that all the brothers share.elorant|2 years ago
https://groq.com/
Sally has 1 sister. This may seem counterintuitive at first, but let's reason through it:
It's a bit of a trick question, but it highlights the importance of understanding the phrasing and context in logical reasoning.moffkalast|2 years ago
In the same fashion lots of Mistral 7B fine tunes can solve the plate-on-banana prompt but most larger models can't, for the same reason.
https://arxiv.org/abs/2309.08632
bbor|2 years ago
m00nsome|2 years ago
Given information: - Sally (a girl) has 3 brothers. - Each brother has 2 sisters.
Since Sally is a girl, and she has 3 brothers, it means that each of her brothers considers her as one of their sisters.
Therefore, if each brother has 2 sisters, and Sally is one of those sisters for each brother, then Sally has 1 other sister besides herself.
So, the number of sisters Sally has is 1.
- from Sonnet
campbel|2 years ago
j-b|2 years ago
To solve this problem, we need to find the number of sisters Sally has.
Given information:
Sally has 3 brothers. Each brother has 2 sisters. Since Sally is a girl, she is not counted as a sister to her brothers.
Step 1: Find the total number of sisters for all 3 brothers. Number of sisters for each brother = 2 Total number of sisters for all 3 brothers = 3 × 2 = 6
Step 2: Since Sally is not counted as a sister to her brothers, the number of sisters Sally has is the total number of sisters for all 3 brothers minus Sally herself. Number of sisters Sally has = Total number of sisters for all 3 brothers - 1 Number of sisters Sally has = 6 - 1 = 5
Therefore, Sally has 5 sisters.
jasondclinton|2 years ago
Did you perhaps run this on Sonnet?
unknown|2 years ago
[deleted]
j-b|2 years ago
uptownfunk|2 years ago
lossolo|2 years ago
imtringued|2 years ago
The "smart" models can figure it out if you give them enough rope, the dumb models are still hilariously wrong.
scrollop|2 years ago
https://i.imgur.com/7gI1Vc9.png
Temperature 0 - it answered 0 sisters:
https://i.imgur.com/iPD8Wfp.png
throwaway63820|2 years ago
kkukshtel|2 years ago
evantbyrne|2 years ago
vjerancrnjak|2 years ago
I was quite amazed that during 2014-2016, what was being done with dependency parsers, part-of-speech taggers, named entity recognizers, with very sophisticated methods (graphical models, regret minimizing policy learners, etc.) became fully obsolete for natural language processing. There was this period of sprinkling some hidden-markov-model/conditional-random-field on top of neural networks but even that disappeared very quickly.
There's no language modeling. Pure gradient descent into language comprehension.
Terretta|2 years ago
Given a string of text, what's the most likely text to come next.
You /could/ rewrite input text to be more logical, but what you'd actually want to do is rewrite input text to be the text most likely to come immediately before a right answer if the right answer were in print.
* Unless you mean inside the model itself. For that, we're still learning what they're doing.
bbor|2 years ago
Phrasing it like that, it sounds like the stack has become analog -> digital -> analog, in a way…
vineyardmike|2 years ago
I think this is a perfect example of why these things are confusing for people. People assume there's some level of "intelligence" in them, but they're just extremely advanced "forecasting" tools.
That said, newer models get some smarts where they can output "hidden" python code which will get run, and the result will get injecting into the response (eg. for graphs, math, web lookups, etc).
brookman64k|2 years ago
peterisza|2 years ago
auggierose|2 years ago
pritambarhate|2 years ago
youssefabdelm|2 years ago
It's strange that these tests are frequent. Why would people think this is a good use of this model or even a good proxy for other more sophisticated "soft" tasks?
Like to me, a better test is one that tests for memorization of long-tailed information that's scarce on the internet. Reasoning tests like this are so stupid they could be programmed, or you could hook up tools to these LLMs to process them.
Much more interesting use cases for these models exist in the "soft" areas than 'hard', 'digital', 'exact', 'simple' reasoning.
I'd take an analogical over a logical model any day. Write a program for Sally.
unknown|2 years ago
[deleted]
gait2392|2 years ago
bbor|2 years ago