A million years ago, when you could still find video poker games with 100%+ theoretical return or poorly thought-out promotions offering enough cash-back to get you over 100%, we'd calculate the Kelly number for a given opportunity -- the bankroll necessary to ride out hills and valleys in favorable situations.
Spoiler: It's almost always 3-4x the value of a royal flush. So you needed $12-16k if you were playing a $1-per-coin game with a 1% edge at a pretty good clip.
And what do you earn with perfect play in that situation? The princely sum of around $30 an hour.
I had never paid much attention to Effective Altruism or SBF before FTX blew up, but when that happened I spent some time reading old EA forum posts and SBF tweets and interviews. One of the things that absolutely shocked me was the dismissal of the Kelly criterion by SBF and other EAs. The argument was that the Kelly criterion was only rationalized by a logistic utility function, and if you were going to use your money for altruistic purposes a linear utility function is more appropriate (at least up into the trillions of dollars) because you can help twice as many people with $200 billion as you can with $100 billion.
This argument was used--by SBF and others--to justify truly absurd risk taking. I don't think it's an exaggeration to suggest that this misunderstanding may have been one of the primary drivers of Alameda's (and hence FTX's) downfall. For a group with as many smart people as EA and as many people obsessed with existential risks as EA not to have started screaming en masse when SBF suggested he would take a 51-49 bet on doubling utility or deleting all known life out of existence[1] is insane.
The mathematical misunderstanding is one part of it. Kelly betting dominates any other betting strategy in the sense that as the number of bets increases the probability that the Kelly better will have more money than someone following any other strategy approaches 1. You don't need a logarithmic utility function. If I bet Kelly and you follow some other strategy, eventually I will almost surely end up with more money and more utility than you.
I suspect another part of it is a misunderstanding by SBF (and perhaps others) of Jane Street's trading strategy. Jane Street encouraged their traders to be "risk neutral", which can be expressed as maximizing expected utility with a linear utility function. They wanted their traders to be willing to take big risks. But any individual trader is only working with a tiny fraction of Jane Street's capital, so even if they're risking all the money they've been given to work with on a bet that's still a small bet relative to the entire company. SBF seems to have taken that same risk neutral idea and applied it to the entirety of Alameda/FTX's available capital (and indeed expressed a willingness to apply it to the combined utility of the entire world), with predictably disastrous results.
I don't want to denigrate the whole community or anything as quite a lot of thought-provoking and interesting reading has come out of it over the years, but I can't help but recall very serious defense of the notion of quantum immortality on LessWrong after Eliezer's fairly convincing rants that any serious scientist has to conclude multiple worlds is the only sensible interpretation of quantum mechanics. If you honest to God take this to its logical conclusion, then wiping out all life in your particular branch of the multiverse may very well be the right move if it doubles utility in 51% of all possible universes.
I don't actually buy that argument and think it's insane, but it would not remotely surprise me if SBF believed it, and if you do, then you don't really observe the Kelly criterion. You take the ruin for the larger team of other yous that collectively wins. If the density of quantum branches in which he funded colonization of the galaxy is greater than the density in which he is serving life in prison, it was worth it.
What a great example of Dunning-Kruger as applied to elites. I remember the spike of interest in DK bias during the pandemic [1], largely as a way of explaining how uneducated folks could be so confidently incorrect about vaccination strategies. In reality it can strike in any social strata -- like a bunch of professional traders wielding billions of dollars, smugly misunderstanding Kelly.
A word that is good to know here is ergodic [0]. Which I must admit to not really understanding although it is something like the average system behaviour being equivalent to a typical point's behaviour. If a process is non-ergodic then E[X] is usually not as helpful as it seems in formulating a strategy.
Ergodicity in the mean refers to the ensemble mean being the same as the temporal mean, i.e. measuring one process 1000 times will give the same average as a single measurement of 1000 different processes.
One way for a process to not be ergodic in the mean is when there's some sort of barrier, as sibling comments allude to.
Another is if the overall mean value is picked randomly each time the process starts, but is different each time the process runs. So for example personal monthly expenditures are not ergodic in the mean, because some people are born into circumstances that make them wealthy, and they will on average spend more each month than people not born into such good circumstances.
The ensemble average will tend towards people's average spending, while the temporal average will tend towards each individual's spending.
An illustrative example to explain ergodicity. Consider the following game. Players start with $100. At every turn, a fair coin is flipped. If tails, the amount of player's money is increased by 50%. If heads, the amount of player's money is decreased by 40%. To play or not to play, that is the question.
TL;DR: while a single investment may be ergodic, portfolio management (the math behind weighting successive and concurrent investments/bets) is not, as it has a strong dependence on all prior states.
The Kelly criterion is almost never used as-is because it is very sensitive to probability of success, which is hard to know accurately and in many cases, dynamically changing. This is easy to see in an Excel spreadsheet. Changing the probability by even 0.01 percent can vastly shift the results. The article calls this out in the last paragraph.
The article mentions fractional Kelly is a hedge. But what fraction is optimal to use? That is also unknowable.
Finance folks, correct me if I’m wrong, but the Kelly Criterion is rarely used in financial models but is more a rule of thumb that says roughly if you have x $ and probability p, in a perfect world you should only bet y amount. But in reality y cannot be determined accurately because p is always changing or hard to measure.
I am not sure what you mean by "never used as is."
The Kelly criterion is an optimization of capital growth (its logarithm) method/guide. Not using it doesn't change its correctness.
But yes you need to know the advantage/the edge you have. Like with pricing methods eg for European options for Black Scholes you need to know the volatility and there is no way to know it, you estimate. This is where all the adjusting for bias and ML comes in.
> Changing the probability by even 0.01 percent can vastly shift the results.
No, not generally. Since it's a quadratic function we're optimising, it's surprisingly flat at the top. Sure, there are some bets where the edge is tiny and 0.01 percent is a large proportion of that, but that doesn't invalidate the Kelly criterion – by what other criterion would you determine the appropriate bet size?
> is more a rule of thumb that says roughly if you have x $ and probability p, in a perfect world you should only bet y amount.
It applies far more broadly than to binary bets. It tells you how to allocate your spending optimally across any number of opportunities, based on joint probability of outcomes.
Yeah, but I think this misses the point a bit. The fact that your true edge isn’t knowable wouldn’t be so bad except that if you’re betting full-Kelly and overestimate your edge even a little bit, your probability of ruin in the long run goes to 1. Whereas if you underbet, you’ll compound wealth at a little lower rate but won’t risk ruin.
I think it shows that Blackjack is not even theoretically winnable over time if you have to pay for information on the count in the form on minimum bets. The ideal case it that you bet $0.49 for every $1,000 in your investment pool when the count is extraordinarily high.
Even if you hack the casino's cameras so you know the count without having to be at the table, your reward is a growth rate that is very low per hand.
Thanks for linking the image. You're right in my simulation there is almost no growth, even at a high count and if you're forced to bet every round you would certainly lose money.
But it's a simplistic simulation and a real casino offers slightly better odds if the rules are right.
The game is a bit weird. It often offers very bad deals, without an option not to invest. For example for the Gibraltar strait, the game gives the information that the probablity of success is between 90% and 100% and that it’s been traversed 31 times with a 90 % success rate. Then it offers me the choice between an investment of different sizes, where I cannot win back more than my investment, so I have to risk money for no possible gain (invest 167 ducats in shipment worth 167 ducats).
For the coin flipping scenario, what happens to the casino? Shouldn't they lose money in the long run as well? Or is it that they're under the kelly threshold with all the house cash?
The casino will break even, but for the gamblers there will be a small number that win big, and a much larger number that lose out.
Consider two rounds, there's a 25% chance you 4x your money, a 50% chance you 0.75x your money and a 25% chance you 0.25x your money
All these probabilities and so on are basically unknowable so the real utility of this kind of thing for a hedge fund is converting a traders view into a suggested size so they take the amount of risk that they're being paid to take.
gwbrooks|1 year ago
Spoiler: It's almost always 3-4x the value of a royal flush. So you needed $12-16k if you were playing a $1-per-coin game with a 1% edge at a pretty good clip.
And what do you earn with perfect play in that situation? The princely sum of around $30 an hour.
bdjsiqoocwk|1 year ago
"$1 per coin game" is this a game where you put in $1 to play and get paid either $2 or $0 with 50-50 probability (0 expected).
And the what does it mean %1 edge? Does it mean the probabilities are such that the expected payout is 1c per coin flip?
jiggawatts|1 year ago
CrazyStat|1 year ago
This argument was used--by SBF and others--to justify truly absurd risk taking. I don't think it's an exaggeration to suggest that this misunderstanding may have been one of the primary drivers of Alameda's (and hence FTX's) downfall. For a group with as many smart people as EA and as many people obsessed with existential risks as EA not to have started screaming en masse when SBF suggested he would take a 51-49 bet on doubling utility or deleting all known life out of existence[1] is insane.
The mathematical misunderstanding is one part of it. Kelly betting dominates any other betting strategy in the sense that as the number of bets increases the probability that the Kelly better will have more money than someone following any other strategy approaches 1. You don't need a logarithmic utility function. If I bet Kelly and you follow some other strategy, eventually I will almost surely end up with more money and more utility than you.
I suspect another part of it is a misunderstanding by SBF (and perhaps others) of Jane Street's trading strategy. Jane Street encouraged their traders to be "risk neutral", which can be expressed as maximizing expected utility with a linear utility function. They wanted their traders to be willing to take big risks. But any individual trader is only working with a tiny fraction of Jane Street's capital, so even if they're risking all the money they've been given to work with on a bet that's still a small bet relative to the entire company. SBF seems to have taken that same risk neutral idea and applied it to the entirety of Alameda/FTX's available capital (and indeed expressed a willingness to apply it to the combined utility of the entire world), with predictably disastrous results.
[1] https://elmwealth.com/a-missing-piece-of-the-sbf-puzzle/
nonameiguess|1 year ago
I don't actually buy that argument and think it's insane, but it would not remotely surprise me if SBF believed it, and if you do, then you don't really observe the Kelly criterion. You take the ruin for the larger team of other yous that collectively wins. If the density of quantum branches in which he funded colonization of the galaxy is greater than the density in which he is serving life in prison, it was worth it.
whatshisface|1 year ago
evrydayhustling|1 year ago
[1] https://trends.google.com/trends/explore?date=today%205-y&ge...
roenxi|1 year ago
[0] https://en.wikipedia.org/wiki/Ergodic_process
kqr|1 year ago
One way for a process to not be ergodic in the mean is when there's some sort of barrier, as sibling comments allude to.
Another is if the overall mean value is picked randomly each time the process starts, but is different each time the process runs. So for example personal monthly expenditures are not ergodic in the mean, because some people are born into circumstances that make them wealthy, and they will on average spend more each month than people not born into such good circumstances.
The ensemble average will tend towards people's average spending, while the temporal average will tend towards each individual's spending.
diab0lic|1 year ago
A million players each placing a single bet will have an expectation of losing the house edge.
A single player placing a million bets has an expectation of $0.
The fact that the aggregate and the single entity Experience different expectations despite both placing a million bets is what makes this ergodic.
sobriquet9|1 year ago
KK7NIL|1 year ago
Nassim Taleb also talks about this quite a lot: https://youtu.be/91IOwS0gf3g
TL;DR: while a single investment may be ergodic, portfolio management (the math behind weighting successive and concurrent investments/bets) is not, as it has a strong dependence on all prior states.
wenc|1 year ago
The article mentions fractional Kelly is a hedge. But what fraction is optimal to use? That is also unknowable.
Finance folks, correct me if I’m wrong, but the Kelly Criterion is rarely used in financial models but is more a rule of thumb that says roughly if you have x $ and probability p, in a perfect world you should only bet y amount. But in reality y cannot be determined accurately because p is always changing or hard to measure.
eftychis|1 year ago
The Kelly criterion is an optimization of capital growth (its logarithm) method/guide. Not using it doesn't change its correctness.
But yes you need to know the advantage/the edge you have. Like with pricing methods eg for European options for Black Scholes you need to know the volatility and there is no way to know it, you estimate. This is where all the adjusting for bias and ML comes in.
kqr|1 year ago
No, not generally. Since it's a quadratic function we're optimising, it's surprisingly flat at the top. Sure, there are some bets where the edge is tiny and 0.01 percent is a large proportion of that, but that doesn't invalidate the Kelly criterion – by what other criterion would you determine the appropriate bet size?
> is more a rule of thumb that says roughly if you have x $ and probability p, in a perfect world you should only bet y amount.
It applies far more broadly than to binary bets. It tells you how to allocate your spending optimally across any number of opportunities, based on joint probability of outcomes.
Both of your misconceptions are common, and they are addressed in the article linked in the submission: https://entropicthoughts.com/the-misunderstood-kelly-criteri...
intuitionist|1 year ago
avidiax|1 year ago
https://github.com/obrhubr/kelly-criterion-blackjack/blob/ma...
I think it shows that Blackjack is not even theoretically winnable over time if you have to pay for information on the count in the form on minimum bets. The ideal case it that you bet $0.49 for every $1,000 in your investment pool when the count is extraordinarily high.
Even if you hack the casino's cameras so you know the count without having to be at the table, your reward is a growth rate that is very low per hand.
obrhubr|1 year ago
nighthawk454|1 year ago
mudita|1 year ago
quickquest|1 year ago
headPoet|1 year ago
mhh__|1 year ago