stellographer's comments

stellographer | 10 years ago | on: Jobstart – Personalized, expert guidance to land your next software job

Haha, that mission statement though!

"empower all people," "help they deserve," "status quo is unacceptable," "they have had decades to find a solution to help the masses. They have failed," "facilitate this _movement_," "increase the rate of innovation across the world"...

Who do you think you are? MLK? You're a fancy-pants version of "Goodwill" that literally caters to just the 1%, get over it.

stellographer | 10 years ago | on: James Bond cars over the years

I hate this website. Trackpad scrolling is a workout. Spacebar is glitchy.

How this design is any better than those clickfarming slideshows, I'll never understand.

stellographer | 10 years ago | on: My Favorite Algorithm: Metropolis-Hastings

just downvoting every one of my comments because you disagree with them? That's cool.

1) Just one example? Here: http://adslabs.org/adsabs/abs/2009PhDT.........4C/

2) You are trying to find the global maximum. How do you not understand the value of communication when searching for a maxima in the likelihood distribution? You're just being intentionally obtuse.

3) Yes, really.

A story:

You have a landscape with mountains and hills and you have one person trying to find the tallest mountain. That person is blind, they can't see shit. That person is also mute and deaf. Their only sense is a vibrating altimeter. They get drunk, and climb mountains for 100000 days, trying to find the tallest mountain.

You are advocating the idea that you should send 1000 of these blind deaf mutes out there one at a time (running these in parallel is just faster serial) and then they should vote on which mountain is the tallest at the end.

I'm saying you should send a bunch of not-deaf-mutes out there (ie implement mutation, breeding, cross-contamination, gravitation, whatever) so they can tell each other where the stupid mountains are (this requires _parallel_) and they don't waste their whole time stumbling around (burning in).

You're just being intentionally argumentative.

stellographer | 10 years ago | on: My Favorite Algorithm: Metropolis-Hastings

I see you feel strongly about this.

"The point of Metropolis-Hastings is to sample from a distribution when you do not know the partition function." That's one point, yes. The other is optimization. In which case I prefer the others I mentioned.

"they do not seem to have either theoretical justification or empirical success." That's just patently false. They _do_ have empirical success. And besides, a lot of these variants have MH implemented inside of them to some extent o.o

"MH is embarrassingly parallel since you run multiple chains at the same time." I can make six single grilled cheese sandwiches in 2 minutes, but it takes me 8 minutes to make a 6-decker grilled cheese. Is that a parallel process?

"Being 100 years old is also largely irrelevant." In my opinion it is relevant since computation can now be done in completely new ways than 100 years ago.

stellographer | 10 years ago | on: My Favorite Algorithm: Metropolis-Hastings

Yeah, that's what I meant, the one particle bit.

Frequently you'll see things along the lines of "we ran 10,000 MCMC iterations to find the solution" and my first thought is "that must be a lot of wasted cycles."

I think I see what you mean now about the parallelization by decomposing the variables (splitting the problem into i separate problems which can be chained independently?)-- I didn't know that was a possibility. I'll have to look at that.

stellographer | 10 years ago | on: My Favorite Algorithm: Metropolis-Hastings

Are you sure that Gibbs sampling isn't just the multivariate version of MH?

What I'm saying is that convergence speed for MH is limited by the fact that guesses cannot communicate with each other... which doesn't matter when you have a pencil and a 4 function calculator like when it was designed.

A genetic algorithm or a particle swarm algorithm is capable of much swifter convergence because the guesses _can_ communicate and influence the direction of the drunken walk.

stellographer | 10 years ago | on: My Favorite Algorithm: Metropolis-Hastings

Interesting choice...

Seems like genetic algorithms, particle swarms etc. would be more attractive choices since they solve the same problem and are inherently parallelizable while Metropolis Hastings is almost 100 years old and designed for a 4 function calculator.

Although I guess some people have meta-parallelized it... but still seems like a patch job compared to modern likelihood navigation algos.

stellographer | 11 years ago | on: Is the doctoral thesis obsolete?

This article seems to adequately describe the definition and obvious shortcomings of the word _thesis_; which is what a _masters_ student should be doing. A _doctoral_ student should be doing a _dissertation_.

I'm not sure why so many people want a doctoral degree, as opposed to a masters. It doesn't offer significant financial return on the investment over the masters and it's only really required for entrance to the ivory tower... and once you're there you're going to be dealing with way more B.S. than the horrors of writing an introduction...

The entire point of the dissertation is to formulate a complete picture of where things belong and how they interact. It should start from nothing and progress to your contribution. _It should outline pitfalls, mistakes, musings, ideas, future works, etc._ all things which are included in paper publications only minimally (and often requested removed by referees for being off topic!).

Yes, it takes 6 months of headbashing LaTeX editing and it comes out with stupid errors on the front page and no one will ever read it besides yourself and your advisors; but that's honestly one of the best lessons you can learn about life in the ivory tower... You _will_ waste months of your life going nowhere, you _will_ have headbashingly mundane paperwork to do more than you like to think, you _will_ screw it up (and need to understand that every paper has some dirty little secret), you _will_ babble on about things no one cares about... That's what being an academic is!

If you just care about progress for publications and get a pained feeling when your time appears (appears! every fuck up is a valuable lesson) to have been wasted, then you are not an academic and, yes, the experience will be pointless for you.

The truly disgusting thing is these advisors who are streamlining and mechanizing the process. Go ahead and check out which schools pump out the most PhDs per faculty and per time invested (China and India? I'm fairly certain?). Which journals will accept the most articles per time invested (again the same). Are these places onto something no one else is onto? They are capable of producing far more quickly and efficiently than the "archaic" systems...

Just wait 5 years and see why that's a mistake.

stellographer | 11 years ago | on: Industrial Design vs. Mechanical Engineering

I don't really understand this article.

The only example of industrial design and mechanical engineering being fused given in the article is the heat vent one where you took the vents off the top of the sweat-lodge shaped computer and put them on the bottom because it looked "like a large salt shaker"...

But doesn't heat rise? And now the vents are in the thermodynamically opposite place they should be, and simultaneously sheltered from crosswind by the gumdrop shape? That doesn't sound like much of a field-mesh rather than an intentional crippling of the engineering portion...

stellographer | 11 years ago | on: Top data mining algorithms in plain English

I found this article useful.

What I was really hoping for was a layman's translation of the maths on wikipedia [ie. how to implement]. But this is a good jumping off point for figuring out which black box to use.

stellographer | 11 years ago | on: Retire This Idea: Scientific Knowledge Structured as “Literature”

This solution is like trying to stir a pot of chili with an absinthe spoon.

I'm not sure why there's this holy grail of "the unified master version of" whatever.

Let me give an example. Say I write a paper on the shape of the non-dark-matter (stellar) density in the milky way by looking at y-type stars and I get an answer x. Now Bob comes along and looks at y2-type stars and gets answer x2. People have the idea that you just go to the first paper, add a footnote to y and x showing alternate values for y2 and x2...

But what that doesn't take into account is the fact that I used telescope a (a 10 meter hawaiian behemoth) and pointed it in one beam of the sky for 8 hours to get an ultra-deep pencil; but bob used telescope a2 (a modest 1.8 meter in la palma) that took an all sky survey and only goes very shallow. Now we add this in a footnote.

Next, there's a critical difference in the stars we studied. My y-type stars take 8 Gyr just to form, but Bob was using y2-type stars which live anywhere from 100 Myr to 15 Gyr. So I'm looking at the old stars and he's looking at all the stars. Since we know that different age stars live in different parts of the galaxy (old in the halo, young in the disk and bulge), our results are starting to look not as comparable as we thought... but it's minor, we'll add a footnote.

But then we realize that, since my old stars are giants and his all age stars are dwarfs, my stars are way brighter than his. Since my telescope is monstrous, and his is a small surveyor, my stars actually end up being observed to a distance 10 times that of his sample. In fact at these distances, the original model is a bad fit and we need to change from a power law to an Einasto profile. Bob can do that too, so our answers are easily comparable, but the Einasto law has more parameters so it would give a worse fit per parameter value than the power law he wanted to use originally... We add an appendix to the paper to explain this bit.

Then I notice that Bob's been using infrared data, and in the infrared there's a well known problem separating stars and galaxies in the data on telescope a2. In fact, Bob has to write a whole new section on some probabilistic tests and models he uses to adequately remove these galaxies from his y2 star sample. My telescope, observing in the optical at high resolution, has no such problem, so that section doesn't exist in my paper. Bob looks around awkwardly and stickies a hyperlink to his meta-analysis somewhere in my data section.

Then Jill comes along and says she doesn't agree at all with us; she got value x3 using the distribution of dwarf galaxies and if you believe in theory z, then _hers_ is the most accurate answer.

And we tell Jill to go write her own fucking paper.

page 1