imaurer | 1 year ago | on: Prelude – a tiny CLI tool building context prompts from your code
imaurer's comments
imaurer | 1 year ago | on: The one about the web developer job market
Who needs documentation with Claude and pbcopy?
imaurer | 1 year ago | on: Tech Debt: My Rust Library Is Now a CDO
imaurer | 2 years ago | on: OpenAI GPT-4 vs. Groq Mistral-8x7B
I am collecting these approaches and tools here: https://github.com/imaurer/awesome-llm-json
imaurer | 2 years ago | on: Inversion: Fast, Reliable Structured LLMs
However, they are way easier to get started with using in context learning. Soon, they will be cheaper and probably faster enough too that training your own model will be a waste of time for 95% of use cases (probably higher because it will unlock use cases that wouldn’t break even with the old NLP approaches from a value perspective).
This is why I am tracking LLM structured outputs here:
https://github.com/imaurer/awesome-llm-json
And created an autocorrecting pydantic library that could be used for Named entity linking:
imaurer | 2 years ago | on: Show HN: We built the fastest spreadsheet
imaurer | 2 years ago | on: IAC sold 17 apps to Bending Spoons. $100M deal, all 330 employees fired
imaurer | 2 years ago | on: LuaX: A Lua Dialect with JSX
imaurer | 2 years ago | on: Weaviate – Open-Source AI Native Vector Database
Posting here to see if someone sees it by happenstance and writes an awesome article about it someday so I can read it.
imaurer | 2 years ago | on: Donut: OCR-Free Document Understanding Transformer
imaurer | 2 years ago | on: Which vector similarity metric should I use?
imaurer | 2 years ago | on: Which vector similarity metric should I use?
imaurer | 2 years ago | on: What is a Vector Database? (2021)
https://weaviate.io/developers/weaviate/api/graphql/filters
https://weaviate.io/blog/hybrid-search-explained
I have a ChatGPT session where I have asked it to do a hybrid search using filtering, pg fts and vector search. Looks reasonable just need to test it and write it up somewhere.
imaurer | 2 years ago | on: What is a Vector Database? (2021)
imaurer | 2 years ago | on: What is a Vector Database? (2021)
Current concerns are the scaling and recall performance.
The author is looking at product quantization along with other ideas: https://github.com/pgvector/pgvector/issues/27
More details on product quantization: https://mccormickml.com/2017/10/13/product-quantizer-tutoria...
A nice repo that tracks the ANN relative performance of different indexes: https://mccormickml.com/2017/10/13/product-quantizer-tutoria...
Also shoutout to Weaviate because they have great docs, are open source and have very informative YouTube channel.
imaurer | 2 years ago | on: In PostgreSQL, powerful Full Text Search is available out of the box
imaurer | 2 years ago | on: GitHub Accelerator: our first cohort and what’s next
imaurer | 2 years ago | on: Collection of LLM resources that can be used to build products you can “own”
For “merging”, I would learn about fine tuning to see if that’s what you are looking to learn more about.
imaurer | 2 years ago | on: Collection of LLM resources that can be used to build products you can “own”
That's the inspiration behind the name, open for something better. Considered "Edge" as well, but was concerned that would seem IoT/mobile specific.
imaurer | 2 years ago | on: The Coming of Local LLMs
Kitchen sink command: pbcopy-all: git ls-files | xargs tail -n +1 | pbcopy
Works like a charm in Q2 2024.
I’m sure this will be a very solved problem by 2025.