top | item 41740269 PixelVerse t1 – CoT prompting outperforms flagship LLMs 9 points| hayden_k | 1 year ago |ai.pixelverse.tech 11 comments order hn newest unknown|1 year ago [deleted] hayden_k|1 year ago OpenAI o1-like CoT and logical thinking prompting strategy significantly enhances llm responses.PixelVerse t1 is powered by Llama 3.1 70b and 3.2 90b. However, with a detailed CoT prompt, it answers complex questions correctly, much better than it's base model and even sometimes beating flagship models like GPT 4o, Claude 3 and Gemini.Try it at: https://ai.pixelverse.tech/app/cortexchat growt|1 year ago First try with llama 70b found two R's in strawberry :) Gemma did better hayden_k|1 year ago yeah - its not perfect yet and responses always vary dcastm|1 year ago I asked the 9.8 vs. 9.11 question from the examples and it got it wrong :/ hayden_k|1 year ago maybe try again - this is still in beta and isn't perfect. however - its much better than the base model, llama 3.1 70b. satisfice|1 year ago I guess there really are two r’s in strawbery. hayden_k|1 year ago its not perfect yet - still a lot of tuning needed. it should get the correct answer in a few tries. red2awn|1 year ago prompt: how many "r"s in the word "raspberry"?response: There are 2 "r"s in the word "raspberry". Lienetic|1 year ago Can we see the detailed CoT prompt? hayden_k|1 year ago I can email it to you if share your email here or email contact@pixelverse.tech brianjking|1 year ago lol, this feels like Reflection 70b all over again. unknown|1 year ago [deleted]
hayden_k|1 year ago OpenAI o1-like CoT and logical thinking prompting strategy significantly enhances llm responses.PixelVerse t1 is powered by Llama 3.1 70b and 3.2 90b. However, with a detailed CoT prompt, it answers complex questions correctly, much better than it's base model and even sometimes beating flagship models like GPT 4o, Claude 3 and Gemini.Try it at: https://ai.pixelverse.tech/app/cortexchat
growt|1 year ago First try with llama 70b found two R's in strawberry :) Gemma did better hayden_k|1 year ago yeah - its not perfect yet and responses always vary
dcastm|1 year ago I asked the 9.8 vs. 9.11 question from the examples and it got it wrong :/ hayden_k|1 year ago maybe try again - this is still in beta and isn't perfect. however - its much better than the base model, llama 3.1 70b.
hayden_k|1 year ago maybe try again - this is still in beta and isn't perfect. however - its much better than the base model, llama 3.1 70b.
satisfice|1 year ago I guess there really are two r’s in strawbery. hayden_k|1 year ago its not perfect yet - still a lot of tuning needed. it should get the correct answer in a few tries.
hayden_k|1 year ago its not perfect yet - still a lot of tuning needed. it should get the correct answer in a few tries.
red2awn|1 year ago prompt: how many "r"s in the word "raspberry"?response: There are 2 "r"s in the word "raspberry".
Lienetic|1 year ago Can we see the detailed CoT prompt? hayden_k|1 year ago I can email it to you if share your email here or email contact@pixelverse.tech
unknown|1 year ago
[deleted]
hayden_k|1 year ago
PixelVerse t1 is powered by Llama 3.1 70b and 3.2 90b. However, with a detailed CoT prompt, it answers complex questions correctly, much better than it's base model and even sometimes beating flagship models like GPT 4o, Claude 3 and Gemini.
Try it at: https://ai.pixelverse.tech/app/cortexchat
growt|1 year ago
hayden_k|1 year ago
dcastm|1 year ago
hayden_k|1 year ago
satisfice|1 year ago
hayden_k|1 year ago
red2awn|1 year ago
response: There are 2 "r"s in the word "raspberry".
Lienetic|1 year ago
hayden_k|1 year ago
brianjking|1 year ago
unknown|1 year ago
[deleted]