Perversely, this is almost the exact opposite of how to solve the problem.
Speaking from experience, the real problem with scientific data (genomics, in particular) is when it is exchanged in plain-text format and then loaded IN to Excel. Automatic type inference is what fails.
Data already in Excel format can be created/modified/exchanged without unintentional conversion if the column type is correctly specified as "TEXT".
Look, I get the impulse, and might have agreed with you 10 years ago. At the end of the day, though, we have to work with non-computationally savvy people who (reasonably) want to look at their data sometimes. Not every lab tech or PI can or should learn Python/R/your favorite scripting language, and frameworks like Galaxy take time to set up and maintain, etc etc. Our job is to meet the users where they are and push the biology forward. Any time that I can push them onto a better path, I'll do that, but sometimes the right move is to tell them: "Yeah, go ahead and play with the data in excel, then tell me what you find", and I'll code something proper up afterwards to verify it, get solid stats and a make a pretty visualization.
chromatin|2 years ago
Speaking from experience, the real problem with scientific data (genomics, in particular) is when it is exchanged in plain-text format and then loaded IN to Excel. Automatic type inference is what fails.
Data already in Excel format can be created/modified/exchanged without unintentional conversion if the column type is correctly specified as "TEXT".
rbanffy|2 years ago
pjmlp|2 years ago
chrisamiller|2 years ago
rbanffy|2 years ago
The amount of learning required is minimal, even more when compared to the contortionist required to process data in Excel.
scrapcode|2 years ago
rvba|2 years ago
Businesses use Excel for a reason. Nice that MS finally gave that setting, but renaming genes to be able to use a popular tool also works.