top | item 734267

Should biologists study computer science?

29 points| terpua | 16 years ago |arstechnica.com | reply

41 comments

order
[+] buckwild|16 years ago|reply
In case you guys are wondering about current college curriculum, I just graduated with a BS in biotechnology-bioinformatics so I can provide a little insight.

My core classes consisted mainly of biology, chemistry, organic chemistry and physics. Other classes that my degree required was advanced mathematics (with biological application), advanced statistics, and computer science.

About the computer science portion of my degree: I was required to learn C, C++, discrete math, Perl, and data structure/algorithm design. I chose to take machine language as an elective.

What I've learned in industry: The CS foundation I built in college was critical. Although Perl is widely used where I work, languages like R and C are used more often (for my particular projects). I've also learned that my job is to bridge the gap between biologists and computer scientists.

Biologists say what they want to get-> Statisticians/Mathematicians think up a procedure -> I make sure the formulas make sense with the subject at hand, program it in Perl or whatever -> CS people optimize it and do their magic to make it run super fast -> checked by everyone to make sure its okay. -> stats analyze and feedback to the biologists.

My point being, I always think everyone should learn more math, but the industry has found a way to get around everyone needing to learn everything (jack of all trades master of none) to having experts work together towards a common goal (an oceans 11 type set-up). Everyone has something special to offer. Personally, I think the current set-up is working fine. Although everyone should learn more advanced math (or biologists should learn more CS), not everyone is willing and/or capable.

I hope this was helpful.

[+] biohacker42|16 years ago|reply
Upvoted for comparing science work to oceans 11.
[+] biohacker42|16 years ago|reply
I'm conflicted about this. A little knowledge can be far more dangerous then no knowledge. I have seen things... things I can't unsee. Things done to software by biology and chemistry Ph.Ds that still give me nightmares.

But make no mistake, modern science is neck deep in serious computering. Not being computer literate is almost as bad as being just plain illiterate.

So here's what I think about this. Every scientists who's not a physicist, mathematician, or computer scientists needs to study more math, more stat, and more cs.

In fact I would go so far as to say everyone needs the equivalent of an associate degree in cs to get a Ph.D. in anything. For mathematicians and physicists this would happen almost without any extra effort, for biologists it might be quite a bit of extra effort, but well worth it.

[+] tokenadult|16 years ago|reply
Just a day ago I asked how HN participants learned biology

http://news.ycombinator.com/item?id=731362

in their higher education. I would think that with judicious course selection and choosing fitting internships it would be possible for a CS major to learn quite a lot of biology, or the other way around, but perhaps I am mistaken. A book Mathematics Unlimited--2001 and Beyond

http://www.amazon.com/Mathematics-Unlimited-Bj%C3%B6rn-Engqu...

included an article by a European author suggesting that all math majors should study a lot more science than they did as of a decade ago, and all science majors (in all sciences) ought to study a lot more math. That makes sense to me.

[+] davi|16 years ago|reply
"A little knowledge can be far more dangerous then no knowledge. ... Not being computer literate is almost as bad as being just plain illiterate."

Need examples before I can know whether I agree or strenuously disagree.

[+] Retric|16 years ago|reply
Excluding the humanities, I agree with you. On a more personal level, learning CS skills is often a fallback for people who get a PHD and can't find work in their field.
[+] tom_b|16 years ago|reply
This article is timely for me, having recently started a new job in bioinformatics. Specifically, building a centralized database (warehouse) for a variety of cancer research study data.

I'm coming from the opposite direction - a computer science background to the biology. A huge challenge for me is rapidly learning enough of the biostats and process to understand how to allow researchers to leverage having all this data in one place, easily accessible, and with a front-end that makes "sense" to the those MD/PhD types. A starting point is understanding what type of questions researchers can ask now that they have all the different data in one spot.

Fred Brooks said something like "computer scientists are toolsmiths." We build tools for user needs that simplify and strengthen the user's work. This requires the ability to somehow understand the user's needs, communicate with them effectively, and implement usable tools for them.

I sometimes feel like it is a failure on our part as builders to make it necessary for people who need software tools to build their own. I'm more than happy for other fields to add more CS type education to their required courses, but I'd rather be able to give researchers tools so that they stay on their critical path, rather than having to learn enough to hack together their own full solution.

[+] jacquesm|16 years ago|reply
This is a bit like the welder and the diver question, is it easier to teach a welder how to dive or a diver how to weld ?

For divers and welders the answer appears to be that it is easier to teach welders to dive than the reverse, even if both are far from trivial activities.

For biologists and computer scientists the answer is probably that it is easier to teach programmers to do biology than the reverse.

(good) Programmers have something universal about the way they apply themselves to problems and that way generalizes to problems in a different domain.

[+] queensnake|16 years ago|reply
I admit I don't have much to go on, but it seems like biology is a full load of study, whereas at least undergraduate 'computer science' ie programming can be picked up by a smart person almost incidentally. I've seen that done, and I think it /is/ done more often than the other way around. As for writing /good/ code, that can come from practice and aesthetics. But knowing biology (or another science) takes real study.
[+] Estragon|16 years ago|reply
That's pure ignorant bigotry, although it might be well-received in a crowd of programmers who also know nothing of what biological research is like. You've got to keep in mind that the main task in biology is inference of biological structures and processes. This is complicated by a number of factors: (1) Biological systems are typically way more complex than computer systems. (2) They are comprised of components which are way too small to see in action or manipulate, so all inferences are from second-order effects presumed (often inaccurately) to arise from the subsystem under study. Designing a biology experiment to cope with these difficulties is an exceptionally difficult skill to master. Compare that to programming, where the central task is the construction of a procedure using well-understood components, the states of which can easily be queried.

Claiming that computer scientists would have an easier time learning biology than the reverse is like claiming that chess is a more sophisticated game than go because the lines of tactical analysis are often longer. It ignores the relative complexities of the systems, and the fact that you can usually say more about the simpler system, and usually say it more elegantly.

Incidentally, I am not a biologist. I am a mathematician who has moved into statistical genetics. However, my last postdoc advisor was a formally trained biologist who taught himself statistics and computer science, and has made his career in bioinformatics. But he could hold his own in both those fields.

[+] stevenbedrick|16 years ago|reply
Well, as a reformed biologist and current informatician, I certainly think that biologists should study CS. However, even more important than studying CS, they absolutely NEED to learn how to program. I've seen lab scientists use extremely convoluted and error-prone workflows to conduct their analyses and experiments- workflows that, if they knew just a little bit of Python, would have been much simpler. I'm actually teaching a class in the fall on "utility scripting" to a mix of molecular biology PhD students and informatics master's students for just this reason.

Regarding the age-old question of "should biologists learn CS or should CS people learn biology", I'm firmly in the camp of biologists learning to do their own CS, or at least learning enough CS to productively work with CS people. A little bit of CS really goes a long way towards improving a biologist's workflow. A little bit of biology, however, is almost completely useless for a CS person who wants to get involved in lab science. It really takes a surprising amount of domain knowledge to be productive in a laboratory, or even to understand the nitty-gritty details of an experiment at a deep enough level to write or modify an existing bioinformatics tool.

[+] boblol123|16 years ago|reply
Haha, you have the opposite opinion of me, but you also pretty-much have the exact opposite experiences too. Maybe we both know a lot in our "main" subjects and think of all the underlying knowledge/related material is required to be useful, but in reality you can just pick up the knowledge you need on its own without any background knowledge on how it all works, you'll be confused when that stuff in mentioned or brought up, but if you stay in your niche you'll be fine.
[+] stevenbedrick|16 years ago|reply
One aspect of the article that I haven't seen much discussion of is the second part- about representing biological processes using an algebraic notation. While this might be really helpful for computational biology, it strikes me as a lousy idea for general work, because it presents an overly reductionist view of what's going on. Biomolecular pathways are almost never as simple as they seem at first, and they always interact in weird and complex ways. Presenting them as a big, gnarly, nasty diagram communicates this to readers... explaining them using nice neat equations makes the whole thing seem both simpler and better-understood than it probably really is.
[+] jganetsk|16 years ago|reply
EVERYONE should study Computer Science. The questions are, how much and what parts?
[+] ken|16 years ago|reply
The article is about computers as "part of biological research". I wish it had been about the real place I want biologists: designing the computer systems themselves.

Biological systems have scaling and reliability that we computer scientists only dream about. (Can you name a self-repairing computer system that runs for 80 years?) I want computers with the kind of systems thinking that biological systems have, not just more x86 cores on a single chip.

The only biologist I know who switched to designing computer systems is Alan Kay. I think we could do with a few more like him.

[+] carterschonwald|16 years ago|reply
short answer, yes. Otherwise how can anything useful or meaning be effectively done with the huge volumes data that biologists now have quite frequently. They should also work on their statistics background to so that they can do more sophisticated model / hypothesis testing, but that's a whole separate issue that gets into the matter of education and community incentives and this is not the appropriate forum for that latter topic.
[+] boblol123|16 years ago|reply
A computer scientist can effectively analyse large volumes of biological data, I'm not convinced a biologist could do the same because there is just so much computer science related to visualisation, graphics programming and data modelling and their prerequisites.

A person who did 1/2 and 1/2 would likely not have enough knowledge or experience to do either the biology or cs side particularly well.

Not to mention, there are very few people I know who are good at both biology and cs.

[+] baxter|16 years ago|reply
I agree, but only up to a point. Astronomers need to understand telescopes, but astronomy is about studying distant objects. The same applies to biology -- biologists are not bioinformatics specialists.
[+] tome|16 years ago|reply
How about having biologists work side-by-side with experts in data analysis and statistics, rather than requiring the scientists be be experts in all fields?
[+] Agathos|16 years ago|reply
Often leads to the statisticians giving the biologists tutorials in statistics and the biologists giving the statisticians tutorials in biology. Which is okay, but then you have to wonder if more formal study up front would have been more efficient. Which brings us back to this topic.
[+] TrevorJ|16 years ago|reply
Corollary: should Computer scientists study biology? Yes
[+] jacquesm|16 years ago|reply
Agreed! If you can get your hands on a copy of a university level textbook in genetics that will be some of the best time you could invest in learning something about another field unrelated to the one you are currently in.
[+] tybris|16 years ago|reply
Probably both could do with more math.