eiz | 2 years ago | on: RedPajama: Reproduction of LLaMA with friendly license
eiz's comments
eiz | 2 years ago | on: What are transformer models and how do they work?
This is an active area of study ("mechanistic interpretability") and it's very early days. For instance here's a paper I read recently that tries to explain how a very simple transformer learns how to do modular arithmetic: https://arxiv.org/abs/2301.05217
Curious what interesting results people are aware of in this area.
eiz | 2 years ago | on: What are transformer models and how do they work?
Worth noting that rotary position embeddings, used in many recent architectures (LLaMA, GPT-NeoX, ...), are very similar to the original sin/cos position embedding in the transformer paper, but they apply it with complex multiplication instead of addition.
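To make the "complex multiplication" point concrete, here's a rough sketch (not taken from any particular codebase) that treats each pair of features as one complex number and rotates it by a position-dependent angle. The names `rope_rotate` and `attention_score` are made up for illustration, and the frequency schedule with `base=10000.0` is an assumption borrowed from the usual sin/cos formulation.

```python
import cmath
import math


def rope_rotate(pairs, position, base=10000.0):
    """Rotate each feature pair (stored as a complex number) by position * frequency.

    `pairs` is a list of complex numbers; pair i uses frequency base ** (-i / n),
    where n is the number of pairs. This schedule is an assumption for the sketch.
    """
    n = len(pairs)
    rotated = []
    for i, z in enumerate(pairs):
        theta = position * base ** (-i / n)
        # Multiplying by e^{i*theta} rotates z; the sin/cos values are applied
        # multiplicatively here rather than being added to the embedding.
        rotated.append(z * cmath.exp(1j * theta))
    return rotated


def attention_score(q, k, q_pos, k_pos):
    """Real inner product of a rotated query and a rotated key."""
    rq = rope_rotate(q, q_pos)
    rk = rope_rotate(k, k_pos)
    return sum((a * b.conjugate()).real for a, b in zip(rq, rk))


q = [1.0 + 2.0j, -0.5 + 0.3j]
k = [0.7 - 1.1j, 2.0 + 0.4j]

# Both pairs of positions are 2 apart, so the two scores should agree
# up to floating point error.
print(attention_score(q, k, 3, 1))
print(attention_score(q, k, 10, 8))
```

The reason this is attractive: the rotation angles cancel in the query/key product, so the score only depends on the relative offset between the two positions.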
eiz | 3 years ago | on: Anthropic’s $5B, 4-year plan to take on OpenAI
eiz | 3 years ago | on: Anthropic’s $5B, 4-year plan to take on OpenAI
What? Not having a display output is not the same as not having graphics rendering circuitry. Here's vulkaninfo from an A100 box: https://gist.github.com/eiz/c1c3e1bd99341e11e8a4acdee7ae4cb4
eiz | 3 years ago | on: The LLama Effect: Leak Sparked a Series of Open Source Alternatives to ChatGPT
https://twitter.com/ylecun/status/1643945742850031616
https://twitter.com/ylecun/status/1644503237699969026
pretty clear the non-release and extremely half-assed response to the leak are just CYA
eiz | 3 years ago | on: Building a Virtual Machine Inside ChatGPT
> Solve the following word problem step by step:
> "John is in a train moving west at ten miles per hour. Jill is in a train moving north at twenty miles per hour. How fast are John and Jill moving apart from each other?"
To solve this problem, we first need to understand the concept of speed. Speed is a measure of distance traveled over time. In this problem, we are told that John and Jill are traveling at different speeds in different directions.
Since the two trains are moving in different directions, we cannot simply add or subtract their speeds to find the combined speed at which John and Jill are moving apart from each other. Instead, we need to use the Pythagorean theorem to find the combined speed.
The Pythagorean theorem states that in a right triangle, the square of the length of the hypotenuse (the longest side of the triangle) is equal to the sum of the squares of the lengths of the other two sides. In this problem, we can consider the trains to be moving along the two sides of a right triangle, with the distance between them as the hypotenuse.
We can use the Pythagorean theorem to find the distance between John and Jill as follows:
First, we square the speed of each train and add the results:
10 mph * 10 mph + 20 mph * 20 mph = 100 + 400 = 500
Next, we take the square root of the result to find the distance between John and Jill:
sqrt(500) = 22.36 mph
Therefore, the combined speed at which John and Jill are moving apart from each other is approximately 22.36 mph.
---
Results seem pretty inconsistent though. Other attempts have given complete nonsense.
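For reference, the arithmetic in that answer is easy to check. A minimal sketch, assuming both trains start from the same point so that the separation rate equals the magnitude of the relative velocity (the function name `separation_speed` is just illustrative):

```python
import math


def separation_speed(west_mph, north_mph):
    """Magnitude of the relative velocity of two perpendicular motions.

    Assumes both travelers start at the same point; otherwise the rate at
    which the distance grows also depends on their starting positions.
    """
    return math.hypot(west_mph, north_mph)


print(round(separation_speed(10, 20), 2))  # 22.36
```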
eiz | 3 years ago | on: Pet Airways
> There’s no way to pre-register the pet.
At least on Delta, you can. You need to call the airline and arrange it.
> Oh, and to go through security, you have to take a nervous and scared cat out of their carry case in the middle of the security line, walk through the metal detector, and then they have to test both of your hands for residue while still carrying the cat.
I always refuse the normal line and get a private screening. So far I haven't had any issues doing that.
Still, it's a huge pain in the ass.
eiz | 4 years ago | on: The K Programming Language
having not touched K in about 15 years, when did this change? in k3:
K 3.2 2004-09-23 Copyright (C) 1993-2004 Kx Systems
LIN32 16CPU 15985MB ubuntu 0 EVAL
f:{a:x+1;{a+x}}
g:f 1
g
{a+x}
g 2
4
a
value error
a
^
parse error
eiz | 5 years ago | on: Git is too hard
https://www.amazon.com/gp/help/customer/display.html?nodeId=...
eiz | 5 years ago | on: Ask HN: Who is hiring? (October 2020)
eiz | 7 years ago | on: Audiophiles in Japan Are Installing Their Own Power Poles
I don't disagree with your main point, but this actually isn't quite true. The HDMI signal is split into 3 distinct interleaved periods: video data, data island and control. Video data is not packetized and the only possible error detection it has is from TMDS signaling, but no such error handling is required by the TMDS spec. You can absolutely get imperfect transmission of an HDMI video signal due to cable or other electrical problems. Auxiliary packets in the data island, including audio data, do have an error correction scheme (BCH + TERC4).
Feel free to check out the spec: https://glenwing.github.io/docs/HDMI-1.4b.pdf
eiz | 8 years ago | on: Apple’s Guidelines Now Allow Executable Code in Educational Apps and Dev Tools
eiz | 9 years ago | on: Google Fiber Was Doomed from the Start
eiz | 10 years ago | on: David Bowie Has Died
eiz | 10 years ago | on: libimobiledevice – A cross-platform library to communicate with iOS devices
Backup is done via a completely separate service.
iOS releases do tend to arbitrarily change the security policy of these services, but it's not clear that "Rootless" is anything more than business as usual. Guess we'll find out Monday.