top | item 38588127

Ask HN: Where to learn the cutting edge of prompt engineering?

33 points| michaericalribo | 2 years ago

I’m doing experiments with LLMs and I’m trying to research techniques for grounding. Example prompt templates, for instance. There’s lots of generic articles about grounding, but details and specific examples are thin on the ground. I’ve read the source for langchain to find the prompt template for agent based reasoning, but that was just one perspective…are there better ways?

18 comments

order

VoodooJuJu|2 years ago

Please stop trying to academize and intellectualize and nerdify what is simply questions/conversation. Prompt engineering is a forced meme, that's all it is.

muzani|2 years ago

Do customer support for OpenAI, lol. https://community.openai.com/

Answer enough questions, stay active enough, and you'll see the same patterns emerge. You'll probably make a lot of mistakes. You'll be corrected by other regulars and people you try to help will send you angry messages saying your prompt didn't work when utilised in the industry. It's a good way to learn. As a little bonus, if you do it constantly enough, OpenAI will give you this little "Regular" rank with a secret forum and such.

Langchain feels a little outdated IMO. I feel like OpenAI's in built tools might be a little ahead of it. It was originally designed to handle memory on the old completion API, but since OpenAI's chat API was released, it's not as useful. There's still good reason to use their completion models though - it performs higher quality responses for some creative uses. Agents built on them don't seem very impressive and OpenAI has their own "assistants" for agent-like stuff: https://platform.openai.com/docs/assistants/how-it-works

catlover76|2 years ago

> Langchain feels a little outdated IMO.

That's being too generous lol

jzombie|2 years ago

My opinion is if you want to find out what works best is to come up with a bunch of different variations in a context-free environment to not influence prior results, determine some metrics you are targeting, and start prompting away.

Then you will find the answer that works for you, and probably well more thought out than 3/4 of the articles you will find regarding this sort of thing.

arthurcolle|2 years ago

Prompt Engineering is clearly "a thing" irrespective of whether or not one trains or build models. LLMs clearly have a wide range of possible outputs given a particular prompt (even with just tuning temperature, top_p, top_k) but then, modification of a prompt can lead to significant improvements in the output. it's not a science. It's not really an art either. Certain prompts lead to better outputs than other prompts, and having a systematic way to characterize these differences is going to be important going forward.

I personally stay abreast of new models coming out and run an evals set against new models to assess their performance vs other models (say, gpt-2, gpt-3.5-turbo, etc, gpt-4.)

In terms of grounding, there is RAG, which can be built in any number of ways (PG+pg_vector, vector store, graph db). I would look at arxiv.org publicatons to stay on top of SOTA prompting stuff, as well as adjacent publications (LLMs, scaling, other things)

_andrei_|2 years ago

  - https://arxiv.org/
  - https://www.microsoft.com/en-us/research/group/dynamics-insights-apps-artificial-intelligence-machine-learning/articles/prompt-engineering-improving-our-ability-to-communicate-with-an-llm/
  - https://cloud.google.com/blog/products/ai-machine-learning/how-to-use-grounding-for-your-llms-with-text-embeddings
  - https://amatriain.net/blog/hallucinations

  and general resources:
  - https://learnprompting.org
  - https://www.promptingguide.ai
  - https://github.com/dair-ai/Prompt-Engineering-Guide

catlover76|2 years ago

Is there a "cutting edge"? The space seems pretty pseudo-sciency

I'm reading some papers on arxiv right now, and trying to implement them in our codebase at work. Those papers usually involve doing some common sense thing and measuring the results. Anyone could have come up with it, but they did the data science and showed some evidence it worked.

If there is a better way, I would love to know lol

danielmarkbruce|2 years ago

two cents: any situation involving billions/trillions of variables looks pretty pseudo-sciency because you can't reduce it down or isolate components very well. People can do studies, add things and take things out, and sort of hint at things and explain things sort of. It is what it is.

1tushr|2 years ago

[deleted]