trq_ | 2 days ago | on: OpenCode – Open source AI coding agent
trq_'s comments
trq_ | 1 month ago | on: Claude Code daily benchmarks for degradation tracking
Thanks for reporting this. We fixed a Claude Code harness issue that was introduced on 1/26. This was rolled back on 1/28 as soon as we found it.
Run `claude update` to make sure you're on the latest version.
trq_ | 3 months ago | on: Claude Code gets native LSP support
trq_ | 4 months ago | on: Claude Is Down
trq_ | 1 year ago | on: Show HN: Llama 3.3 70B Sparse Autoencoders with API access
We have a notebook about that here: https://docs.goodfire.ai/notebooks/dynamicprompts
trq_ | 1 year ago | on: Detecting when LLMs are uncertain
trq_ | 1 year ago | on: OmniParser for Pure Vision Based GUI Agent
trq_ | 1 year ago | on: Detecting when LLMs are uncertain
- sample multiple tokens from the logits and branch (we maybe could have with the old text completion API, but that no longer exists)
- add in a reasoning token on the fly
- stop execution, ask the user, etc.
But a visualization of logprobs in a query seems like it might be useful.
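A minimal sketch of what such a visualization could look like in plain text, assuming you already have per-token logprobs as `(token, logprob)` pairs. The function name `render_logprobs`, the `[[token|prob]]` markup, the 0.5 threshold, and the sample values are all made up for illustration; nothing here is real model output or a specific API's response format.

```python
import math

def render_logprobs(tokens, threshold=0.5):
    """Join tokens into text, wrapping low-probability ones as [[token|prob]]."""
    parts = []
    for text, logprob in tokens:
        prob = math.exp(logprob)  # convert log-probability back to probability
        if prob < threshold:
            parts.append(f"[[{text}|{prob:.2f}]]")  # flag the uncertain token
        else:
            parts.append(text)
    return "".join(parts)

# Hypothetical (token, logprob) pairs, invented for this example.
sample = [
    ("9.9", math.log(0.55)),
    (" is", math.log(0.98)),
    (" larger", math.log(0.40)),
]
print(render_logprobs(sample))
```

Flagged tokens are the places where you might want to branch, add reasoning, or stop and ask the user; a real UI would probably use color instead of brackets.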
trq_ | 1 year ago | on: Detecting when LLMs are uncertain
Here's an example of someone doing that for 9.9 > 9.11: https://x.com/mengk20/status/1849213929924513905
trq_ | 1 year ago | on: Detecting when LLMs are uncertain
But in this case, it means that the underlying point in embedding space doesn't map clearly to only one specific token. That's not too different from when you have an idea in your head but can't think of the word.
trq_ | 1 year ago | on: Detecting when LLMs are uncertain
So far this has mostly been done using Reinforcement Learning, but catching it and handling it at inference time seems like it could be interesting to explore. It's also much more approachable for open source, since only the big ML labs can do this sort of RL.
trq_ | 1 year ago | on: Detecting when LLMs are uncertain
There are definitely times when entropy can be high without the model actually being uncertain (again, synonyms are the best example), but it seems promising. I want to build a visualizer using the OpenAI endpoints.
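To make the synonym caveat concrete, here is a rough sketch of computing entropy from a top-k list of logprobs for a single token position. It assumes the list is truncated, so it renormalizes the probabilities before computing Shannon entropy in bits. The function name `entropy_bits` and both example distributions are hypothetical.

```python
import math

def entropy_bits(logprobs):
    """Shannon entropy in bits of a top-k logprob list, renormalized to sum to 1."""
    probs = [math.exp(lp) for lp in logprobs]
    total = sum(probs)  # top-k lists are truncated, so this is usually below 1
    return -sum((p / total) * math.log2(p / total) for p in probs if p > 0)

# Hypothetical distributions, invented for this example.
confident = [math.log(0.97), math.log(0.02), math.log(0.01)]
# Three near-equal candidates, e.g. "big", "large", "huge".
synonyms = [math.log(0.34), math.log(0.33), math.log(0.33)]

print(f"confident: {entropy_bits(confident):.2f} bits")
print(f"synonyms:  {entropy_bits(synonyms):.2f} bits")
```

The second case scores close to the maximum for three options even though any of the candidates would be fine, which is why entropy alone probably isn't enough to decide when to intervene.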
trq_ | 1 year ago | on: Detecting when LLMs are uncertain