tmostak | 1 month ago | on: Waymo robotaxi hits a child near an elementary school in Santa Monica
tmostak's comments
tmostak | 1 month ago | on: Waymo robotaxi hits a child near an elementary school in Santa Monica
This certainly may have been true of older Teslas with HW3 and older FSD builds (I had one, and yes you couldn't trust it).
tmostak | 1 month ago | on: Waymo robotaxi hits a child near an elementary school in Santa Monica
tmostak | 4 months ago | on: Prefix sum: 20 GB/s (2.6x baseline)
Of course prefix sums are often used within a series of other operators, so if these are already computed on GPU, you come out further ahead still.
tmostak | 9 months ago | on: Modern Minimal Perfect Hashing: A Survey
You can use perfect hashes not only the usual suspects of contiguous integer and dictionary-encoded string ranges, but also use cases like binned numeric and date ranges (epoch seconds binned per year can use a perfect hash range of one bin per year for a very wide range of timestamps), and can even handle arbitrary expressions if you propagate the ranges correctly.
Obviously you need a good "baseline" hash path to fall back to you, but it's surprising how many real-world use cases you can profitably cover with perfect hashing.
tmostak | 1 year ago | on: Show HN: TabPFN v2 – A SOTA foundation model for small tabular data
Just looking through the code a bit, it seems that the model both supports a (custom) attention mechanism between features and between rows (code uses the term items)? If so, does the attention between rows help improve accuracy significantly?
Generally, for standard regression and classification use cases, rows (observations) are seen to be independent, but I'm guessing cross-row attention might help the model see the gestalt of the data in some way that improves accuracy even when the independence assumption holds?
tmostak | 1 year ago | on: All You Need Is 4x 4090 GPUs to Train Your Own Model
tmostak | 1 year ago | on: How Meta trains large language models at scale
tmostak | 1 year ago | on: GPT-4.5 or GPT-5 being tested on LMSYS?
The difference matters as generally in my experience, Llama 3, by virtue of its giant vocabulary, generally tokenizes text with 20-25% less tokens than something like Mistral. So even if its 18% slower in terms of tokens/second, it may, depending on the text content, actually output a given body of text faster.
tmostak | 1 year ago | on: Ollama v0.1.33 with Llama 3, Phi 3, and Qwen 110B
tmostak | 2 years ago | on: Show HN: Use natural language to query and visualize 400M tweets
tmostak | 2 years ago | on: Explore 400M tweets with LLM-powered conversational analytics
tmostak | 2 years ago | on: Ask HN: Who is hiring? (February 2024)
HEAVY.AI builds a GPU-accelerated analytics platform that allows users to interactively query and visualize billions of records of data in milliseconds.
We’re looking for someone who really knows SQL. If you can decipher schemas, figure out what’s wrong with SQL statements and correct them, as well as generate queries in response to user questions, we'd love to talk to you.
The work would initially be on contract, but could lead to full-time employment. Geospatial analytics, data science background, and Python programming skills would be very useful to have as well, but are not absolute requirements.
If interested please reach out to pey.silvester@heavy[dot]ai.
tmostak | 2 years ago | on: Drawing.garden
tmostak | 2 years ago | on: OpenAI investors keep pushing for Sam Altman’s return
tmostak | 2 years ago | on: Fine-tuning GPT-3.5-turbo for natural language to SQL
See the following for more info:
https://yale-lily.github.io/spider https://github.com/taoyds/spider/tree/master/evaluation_exam...
tmostak | 2 years ago | on: Fine-tuning GPT-3.5-turbo for natural language to SQL
tmostak | 4 years ago | on: Getting to the bottom of web map performance
tmostak | 5 years ago | on: OmniSci launches free edition of platform for interactive visual analytics
tmostak | 5 years ago | on: International Space Station 437.800 MHz cross band FM repeater activated