donovanr | 6 months ago | on: Minesweeper thermodynamics
donovanr's comments
donovanr | 8 months ago | on: Virtual cells
It's nice to see the idea of virtual cells make a comeback now, though the meaning seems to have shifted to transciptomics-based transformer / gpu-powered models (which have issues[0]), it's a fun field / problem, but I think it will make better progress if we take advantage of all the varied computational work that has come before.
[0] Benchmarking Transcriptomics Foundation Models for Perturbation Analysis : one PCA still rules them all https://arxiv.org/abs/2410.13956
donovanr | 1 year ago | on: More Agents Is All You Need: LLMs performance scales with the number of agents
donovanr | 5 years ago | on: We quit our jobs to build a cabin
While I was in grad school I got bored sitting behind a computer all day, and my wife and I decided to build a tiny house on a trailer as a way of venting our pent-up DIY urges. We'd just build it in our spare time. LOL.
We started in the late summer of 2013, with a trailer and no plans and a stack of construction books from the library.
Cut to spring 2016, having spent every single weekend and most evenings since (in zero degree winters and brutal Pittsburgh summers) sweating and swearing, really pushing the "divorce cabin" line, and having legitimate discussions late at night about the benefits of burning it all to the ground, where my wife, eight month pregnant, is trying to finish the trim work before I submit my dissertation and we tow it across the country.
The way the article captures the "not knowing what we were getting into" / tiny things that delay you to death / stressed out / losing friends / doing absolutely nothing else with your life / so over budget it hurts / final elation at success is absolutely perfect.
We only made it across the finish line because living in Pittsburgh on a grad student stipend is actually, well, livable, and I could do that while my wife worked pretty much full time on our housing boondoggle.
The main learning experience coming out of it was that you should absolutely pay how ever many thousands of dollars it costs for a good set of plans from someone who's done this before. Learning smaller tasks like framing and roofing etc is easy. Stitching it all together into a plan that you're arguing about because neither of you have any idea what you're doing, all while you're wasting precious daylight is _hard_. We would have finished at least a year sooner if we just had plans to follow.
All that said, building a place to live in was super rewarding (as others have said) type II fun.
We still have it, it's beautiful, and I have not yet burned it down.
donovanr | 6 years ago | on: Ask HN: Who is hiring? (April 2020)
The modeling team at the Allen institute for Cell Science is hiring for two software engineering positions -- a data generalist and a ML / computer vision specialist:
https://alleninstitute.org/what-we-do/cell-science/careers/j...
The Allen Institute for Cell Science aims to impact the entire cell science community. Our goal is to advance understanding of cell behavior in its normal, pathological, and regenerative contexts. Our multidisciplinary team will generate novel cellular reagents, data, models and databases that are informed by and open to scientists around the world. We will produce unique dynamic, visual databases and cellular models that integrate information and data across cellular and molecular sciences.
donovanr | 6 years ago | on: Ask HN: Who is hiring right now?
The modeling team at the Allen institute for Cell Science is hiring for two software engineering positions -- a data generalist and a ML / computer vision specialist:
https://alleninstitutecellscience.hrmdirect.com/employment/j... https://alleninstitutecellscience.hrmdirect.com/employment/j...
ONSITE, Seattle
The Allen Institute for Cell Science aims to impact the entire cell science community. Our goal is to advance understanding of cell behavior in its normal, pathological, and regenerative contexts. Our multidisciplinary team will generate novel cellular reagents, data, models and databases that are informed by and open to scientists around the world. We will produce unique dynamic, visual databases and cellular models that integrate information and data across cellular and molecular sciences.
donovanr | 7 years ago | on: How well does population density predict U.S. voting outcomes?
donovanr | 7 years ago | on: Allen Integrated Cell is a powerful tool for visualizing biology in 3D
2. We are very interested in how cells change as they divide, differentiate, age, are perturbed by their environment, etc. We study cells in culture right now -- getting good images of in vitro cells from multicellular organisms is way harder. So yes it would absolutely be useful. I don't know if we're going to tackle it ourselves, but one of our core missions is to lay the groundwork for the community to take our tools and run with them -- it's a big win for us if we can bring previously unfeasible research within the realm of the possible.
3. I am a Bayesian at heart, so modeling uncertainty is something that I'm always thinking about. It's high on my list of priorities to do something along these lines.
4. Image similarity is a hard problem. At the end of the day, metrics only get you so far and the proof is in the pudding. Unfortunately there is no ground-truth data to test against -- the probabilistic model was constructed exactly because we can't measure where everything is all at the same time. Some things we do to convince ourselves that we're on track is to see that the variation in the imputed predictions and the actual data are statistically similar, and to see if experts are confounded in differentiating the outputs of our models from actual data. You can read more here https://www.biorxiv.org/content/early/2017/12/21/238378
donovanr | 7 years ago | on: Allen Integrated Cell is a powerful tool for visualizing biology in 3D
There are some alternate visualizations here http://www.allencell.org/3d-cell-viewer.html of data that came off of our microscopes that we also use to visualize our models in house but wasn't;t included in the video. It's hard to visualize varying density 3D data in 2D -- there's no one good way to do it, especially on the fly over the web -- but if you have any feedback about what would be more informative / easier to understand, let us know.
donovanr | 7 years ago | on: Allen Integrated Cell is a powerful tool for visualizing biology in 3D
donovanr | 7 years ago | on: Allen Integrated Cell is a powerful tool for visualizing biology in 3D
donovanr | 7 years ago | on: Allen Integrated Cell is a powerful tool for visualizing biology in 3D
donovanr | 8 years ago | on: Launch HN: SharpestMinds (YC W18) – Online Community for AI Devs
donovanr | 8 years ago | on: Launch HN: SharpestMinds (YC W18) – Online Community for AI Devs
- a few of the questions were very good, and either spoke to key high level concepts, or were specific while being language agnostic. (e.g which one of these layers wouldn't you need, why wouldn't this type of classifier work on this data).
- too many of the questions were hyper-focused on the minutiae of word embeddings, tensor flow syntax, SQL queries, and recommender schemes.
- many of the questions were constructed vaguely enough that "I don't know" would be the technically correct answer even though I don't think that was what you were going for.
metadata: recent PhD with serious grad courses in ML and working in DL/CV for the past year using a non-tensorflow framework (PyTorch).
donovanr | 8 years ago | on: Path-breaking Papers About Image Classification
donovanr | 9 years ago | on: Beating the Compiler
[0] http://penguin.ewu.edu/cscd300/Topic/AdvSorting/Sedgewick.pd...
donovanr | 9 years ago | on: The Deconstructed Standard Model Equation
donovanr | 9 years ago | on: Cellphone-Cancer Link Found in Government Study
donovanr | 9 years ago | on: Cellphone-Cancer Link Found in Government Study
[edited because using an asterisk to denote multiplication was a bad idea]
donovanr | 10 years ago | on: My favorite interview question