(no title)
robbiemitchell | 1 year ago
- Generate targeted LLM micro summaries of every record (ticket, call, etc.) continually
- Use layers of regex, semantic embeddings, and scoring enrichments to identify report rows (pivots on aggregates) worth attention, running on a schedule
- Proactively explain each report row by identifying what’s unusual about it and LLM summarizing a subset of the microsummaries.
- Push the result to webhook
Lack of JSON schema restriction is a significant barrier to entry on hooking LLMs up to a multi step process.
Another is preventing LLMs from adding intro or conclusion text.
adamsbriscoe|1 year ago
(Plug) I shipped a dedicated OpenAI-compatible API for this, jsonmode.com a couple weeks ago and just integrated Groq (they were nice enough to bump up the rate limits) so it's crazy fast. It's a WIP but so far very comparable to JSON output from frontier models, with some bonus features (web crawling etc).
tarasglek|1 year ago
joatmon-snoo|1 year ago
You can check it out over at https://github.com/BoundaryML/baml. Would love to talk if this is something that seems interesting!
BoorishBears|1 year ago
How are you struggling with this, let alone as a significant barrier? JSON adherence with a well thought out schema hasn't been a worry between improved model performance and various grammar based constraint systems in a while.
> Another is preventing LLMs from adding intro or conclusion text.
Also trivial to work around by pre-filling and stop tokens, or just extremely basic text parsing.
Also would recommend writing out Stream-Triggered Augmented Generation since the term is so barely used it might as well be made up from the POV of someone trying to understand the comment
robbiemitchell|1 year ago
You work around it with post-processing and retries. But it’s still a bit brittle given how much stuff happens downstream without supervision.
benreesman|1 year ago
Might be worth checking out.
lastdong|1 year ago
This is really interesting, is there any architecture documentation/articles that you can recommend?
evrydayhustling|1 year ago
https://www.linkedin.com/pulse/ai-2024-more-answers-fewer-qu...