justinpombrio | 1 year ago | on: LLMs know more than they show: On the intrinsic representation of hallucinations
justinpombrio's comments
justinpombrio | 1 year ago | on: LLMs know more than they show: On the intrinsic representation of hallucinations
This isn't true.
You're conflating whether a model (that hasn't been fine tuned) would complete "the capital of Connecticut is ___" with "Moscow", and whether that model contains a bit labeling that fact as "false". (It's not actually stored as a bit, but you get the idea.)
Some sentences that a model learns could be classified as "trivia", and the model learns this category by sentences like "Who needs to know that octopuses have three hearts, that's just trivia". Other sentences a model learns could be classified as "false", and the model learns this category by sentences like "2 + 2 isn't 5". Whether a sentence is "false" isn't particularly important to the model, any more than whether it's "trivia", but it will learn those categories.
There's a pattern to "false" sentences. For example, even if there's no training data directly saying that "the capital of Connecticut is Moscow" is false, there are a lot of other sentences like "Moscow is in Russia" and "Moscow is really far from CT" and "people in Moscow speak Russian", that all together follow the statistical pattern of "false" sentences, so a model could categorize "Moscow is the capital of Connecticut" as "false" even if it's never directly told so.
justinpombrio | 1 year ago | on: When are two proofs essentially the same? (2007)
Sum(i=1..n, i)
= Sum(i=1..n/2, i) + Sum(i=1..n/2, n+1-i)
= Sum(i=1..n/2, n+1)
I'm still interested in the general question, of whether some proofs have big gaps between them. The more complex the proofs, the more obvious this would be; my examples are unfortunately simple. Something like proving the fundamental theorem of algebra using Rouche's Theorem (complex analysis) vs. field theory. But I don't know enough math to compare those.
justinpombrio | 1 year ago | on: When are two proofs essentially the same? (2007)
Can you do that with these two proofs? What's a proof that's halfway in between the two?
If you can get from one proof to the other with small "refactorings", then I agree that they're fundamentally the same. If you can't---if there's an insurmountable gap that you need to leap across to transform one into the other---then I'd call them fundamentally different. If you insist that two proofs are "essentially the same thing" despite having this uncrossable gap between them, then I suspect you're defining "essentially the same" to mean "proves the same thing", which is a stupid definition because it makes all proofs the same by fiat, and avoids the interesting question.
justinpombrio | 1 year ago | on: When are two proofs essentially the same? (2007)
1. Prove that the interior angles of a triangle sum to 180 degrees.
First proof: draw a line parallel to one of the triangle's sides passing through its opposite vertex. There are three angles on one side of this line, and they obviously add to 180 degrees because it's a line. One of the three angles is directly one of the triangle's interior angles; the other two can be shown to be equal to the triangle's other two interior angles. (Try drawing it out.)
Second proof: start at one side of the triangle and walk around it. By the time you return to where you started, you must have turned 360 degrees. Thus the sum of the exterior angles is 360 degrees. Each interior angle is 180 minus the corresponding exterior angle, and there are three of them, so calling the interior angles A, B, C and the exterior angles A', B', C' we have A'+B'+C' = 360 implies (180-A) + (180-B) + (180-C) = 360 implies 540 - A - B - C = 360 implies 180 = A + B + C.
2. Prove that the sum of the first N numbers is N(N+1)/2.
First proof: sum the first and last number to get 1 + N, then the second and second-to-last to get 2 + (N-1) = 1 + N, repeating until you get to the middle. There are N/2 such pairs, giving a total of (1 + N)N/2. (This assumed that there were an even number of terms; consider the odd case too.)
Second proof: proceed by induction. For the base case, it's true for N=1 because 1*2/2 = 1. For the inductive case, suppose it's true for N-1. Then 1 + 2 + ... + N-1 + N = (1 + 2 + ... + N-1) + N = N(N-1)/2 + N = N(N-1)/2 + 2N/2 = N(N+1)/2.
justinpombrio | 1 year ago | on: Battleships Logic Puzzle
justinpombrio | 1 year ago | on: FTC announces "click-to-cancel" rule making it easier to cancel subscriptions
justinpombrio | 1 year ago | on: FTC announces "click-to-cancel" rule making it easier to cancel subscriptions
The trouble is they're endlessly creative about the lists they put you on. I'd get one email from "Alumni Connections" and then another from "Faculty Spotlight" and then another from "Global Outreach" and then another from "Event Invitations, 2023 series". I'm making those names up because I forget exactly what they were called, but you get the idea. I hope this was in violation of the regulation: surely you can't invent a new mailing list that didn't used to exist, add me to it, and require me to unsubscribe from it individually.
They finally stopped after I sent them an angry email.
justinpombrio | 1 year ago | on: Exploring Typst, a new typesetting system similar to LaTeX
1. In the content passed that the user passes to the template
2. In the template itself
3. By the user, outside the template
They take priority in that order.
OTOH, if the template really wants control, it can take optional styling arguments with defaults, and do as it likes with them. And if it wants content from the user that the user doesn't get to style, it can take that content as a string.
It's a fantastic system, so far as I've seen.
justinpombrio | 1 year ago | on: Exploring Typst, a new typesetting system similar to LaTeX
I don't think Typst has a lot of global state to get corrupted. Like, if one package defines a variable `foo` and another package defines a variable `foo`, and you use both of them (and don't try to import `foo` from both), it's not like those `foo`s are going to conflict with each other. Is that the sort of issue that LaTeX packages run into?
Likewise, you don't modify typesetting in Typst by modifying global state like you do in Latex. You use `set` and `show`, which are locally scoped. You never need to, like, set the font size, then write some stuff, then remember to set it back. You just put `set font(size)` around precisely the stuff you want to be bigger.
justinpombrio | 1 year ago | on: Nobel Peace Prize for 2024 awarded to Nihon Hidankyo
justinpombrio | 1 year ago | on: What is the history of the use of "foo" and "bar" in source code examples? (2012)
justinpombrio | 1 year ago | on: Blood puddles, mold, tainted meat, bugs: Boar's Head inspections are horrifying
justinpombrio | 1 year ago | on: US judge throws out FTC's ban on non-compete agreements
justinpombrio | 1 year ago | on: Enum class improvements for C++17, C++20 and C++23
justinpombrio | 1 year ago | on: Where does the name "algebraic data type" come from?
https://justinpombrio.net/2021/03/11/algebra-and-data-types....
And someone else wrote a related blog post:
https://codewords.recurse.com/issues/three/algebra-and-calcu...
justinpombrio | 1 year ago | on: My Favorite Algorithm: Linear Time Median Finding (2018)
That's a good point, though, that the 49th percentile and 51st percentile can be arbitrarily far from the median.
justinpombrio | 1 year ago | on: My Favorite Algorithm: Linear Time Median Finding (2018)
(I made the number 10,000 up, but you could do some statistics to figure out how many samples would be needed for a given level of confidence, and I don't think it would be prohibitively large.)
justinpombrio | 1 year ago | on: Schrödinger's cat among biology's pigeons: 75 years of What Is Life?
justinpombrio | 1 year ago | on: No reasonable expectation of privacy in one's Google location data
Well my apartment building is about to replace the doorbells with a cell phone app (or phone calls as an alternative). No cell phone -> no door bell. I'm just a renter; I get no say in the matter.