Launch HN: Probably Genetic (YC W19): At-home DNA testing for every gene
Why rare genetic conditions? It may seem like a niche problem but there are ~400 million people worldwide with a rare genetic condition, half of which are currently undiagnosed. To put this into perspective, it’s estimated that 1 in 10 Americans has a rare condition and while each of the individual ~10,000 conditions is rare, the population that suffers such conditions is larger than cancer and HIV combined. Furthermore, for the patients that have been diagnosed, it takes on average 8 years for doctors to identify their conditions. You’ve already heard of rare genetic conditions, you might just not be aware of it. Remember the Ice Bucket Challenge for ALS? Most of these diseases initially look like more common conditions, such as autism, chronic pain, ADHD, or even the flu, before patients get worse. This diagnostic odyssey can be extremely costly for patients and in our experience, some are spending more than $30,000 and seeing more than 10 doctors before they get access to the right specialists and testing.
We have seen this problem first-hand. Lukas is a rare-disease expert and worked on the world’s largest rare disease project (the 100,000 genomes project) as a PhD candidate at Oxford University in the UK. I am trained as a theoretical computational astrophysicist and during my PhD and fellowship at Cambridge University and Oxford University, I spent my spare time working with National Healthcare Service doctors developing and publishing medical diagnostics with machine learning. Our original idea was actually slightly different from what we have now. We spent a lot of effort on developing a symptom checker specifically for rare conditions with the idea to comb through existing medical records and flag patients with potential rare genetic conditions because, unsurprisingly, WebMD and others aren't really great for this purpose. As we were building this, we realized that for the patients we worked with, even if their symptoms were suggestive of a genetic condition, access to clinical-grade genetic testing was extremely difficult for many as it was either too expensive because insurance wouldn’t cover the cost, or they couldn’t find a doctor that would order it. Thus, we decided to use our expertise to both find these patients more efficiently and built up a service to drastically reduce the time and cost to access clinical-grade genetic testing.
About the test:
Just like most DNA tests, you can do this from home and it’s noninvasive, all we need is a little saliva. Unlike most DNA tests, ours is physician-ordered, sequenced in a CLIA-accredited and CAP-certified lab, the results are signed out by a licensed clinical lab director, all users have access to genetic counseling, and we try to incorporate as much phenotypic data as possible into the analysis. Our product is a whole-exome sequencing test with 100x coverage and covers all of the more than 20,000 genes, where 85% of known disease-causing variants occur.
People always ask, how are you different from 23andme? Looking for a rare genetic condition is kind of like trying to find a typo in a novel. Using a 23andme (or similar) test to look for such a condition is like trying to find a typo in the first Harry Potter novel and stopping after 75 words. Those tests are just not meant for this purpose. Most are based on genotyping arrays that look for very specific variants at predetermined locations in the genome. However, the variants that cause rare diseases can occur anywhere. For example, there are over 1,700 different mutations in the CFTR gene that can cause cystic fibrosis. Approximately 85% of the known pathogenic mutations occur in the protein-coding regions of DNA called the exome. Our test is a whole exome sequencing test rather than a genotyping array, which allows us to cover all of the genes in a person’s DNA. We often get the question, why not do whole genome rather than whole exome? Right now it simply comes down to accessibility. For most consumers, whole genome sequencing is still too expensive and the additional gain in terms of coverage of pathogenic mutations doesn’t necessarily warrant the significant price increase. That being said, if you are interested in clinical-grade whole genome sequencing, we can soon offer this as well.
Patient privacy is a huge concern for us and something we think about all the time. Quite simply, we will never share any of our users' data without explicit consent and we are more than happy to both delete our users' data and destroy their sample if requested.
Interestingly, we often have the opposite problem where we receive inbound requests of people trying to share their data with us to see if we can help them. Out of the more than 10,000 rare diseases, over 95% do not have an FDA-approved treatment, which is why the rare disease community is so motivated to leverage their personal insights. We have started a waitlist to provide such services and are actively seeking ways to help these people and integrate them into our community, even if they have not had sequencing through us.
Finally, how much does it cost? A single test right now costs $899 on our website, but we try to offer the test in trios where we sequence both the patient and two family members as this often gives a higher diagnostic yield. The latter option is $1,799. We expect that the test price will decrease significantly with time as the cost of sequencing drops and more of the analysis can be automated. We don’t currently accept insurance; however, in our experience, using more traditional channels to access this kind of testing can result in bills of >$10,000, not all of which may be covered. Many insurance providers don’t even cover this kind of testing, except for very specific purposes, despite more and more of the medical literature recommending exome sequencing as a first-tier diagnostic for specific indications. Ideally, we would make the product so affordable that it simply makes sense to use us rather than billing insurance for the test plus the numerous doctor and specialist visits needed before and after it’s ordered. We are currently offering the test at cost, as we aim not to profit off of the patients that need it most, and this is sustainable because, with the consent of the patients, we can leverage our data asset for drug discovery, clinical trial recruitment, and drug repurposing.
Consumer genetic testing is growing rapidly as an industry and nearly doubling every single year. What is missing from this market is accessible physician-ordered testing that can genuinely help those with complex symptoms and undiagnosed genetic conditions. This is what we hope to provide. If you have any questions or feedback, we would love to hear it and please check us out at https://www.probablygenetic.com.
[+] [-] mattparmett|6 years ago|reply
Another very important part of the testing process, particularly in the rare disease space, is the pre- and post-test genetic counseling. I see that all users have access to genetic counseling, which is great - just curious about how you are handling that part of the process?
[+] [-] astroH|6 years ago|reply
[+] [-] all2|6 years ago|reply
This should be the default rather than a user-requested option. I would consider auto-deleting user data (ensuring privacy for anyone's data outside some window of time) to be akin to auto-opt-in organ donation: most people don't know or care enough to change the default and will go with whatever is on the form (which is why organ donation is 99ish% of some populations in Europe and drastically lower than that in the United States - we should ignore the moral implications of choosing for people and focus on the outcomes of choosing a 'sane' default action). If you want an iron-clad guarantee sequences will never fall into the hands of bad actors (anyone who has delusions of genocide, for example) deletion/destruction is the way to go; perhaps even two or three passes of bit-by-bit overwrite.
Consider a future where such sequences will allow bad actors to engineer viruses against family units, regions of peoples, and so on.
[+] [-] rkagerer|6 years ago|reply
I stay far away from services like 23andMe because of their user-hostile terms (at least that's the impression I got when I last scrutinized them a few years ago).
To me the ideal DNA test would be one I process myself at home, without sending anything to the cloud. I recognize that's not viable since most of us lack home laboratories.
Next best option would be one where the sample is submitted anonymously and results accessible under a system-generated ID code. That would alleviate some of the risk if you or your partners screw up in terms of privacy and security (it's naive to ignore that possibility - look at how many Fortune 500's have been breached or suffered human error).
A user could "opt in" to share identifying information (ideally only to the extent they're comfortable - e.g. I might be willing to submit coarse data like racial background, age, etc. while withholding my name and address).
In all cases I would still expect to see very strong, binding language in both your agreements with me, and those with the labs you subcontract out to, assuring that my data will be kept confidential and all traces deleted (including from all backups!) when I request.
Regarding re-sequencing and long-term storage for privacy concerned users: give me the option to download the raw data (compressed and encrypted?) and delete it from your platform. I can re-submit to you in a few years for an updated analysis. That avoids the cost of re-sequencing while, for privacy-conscious users like me, minimizes the window in which you storing my data could cause me harm.
[+] [-] astroH|6 years ago|reply
[+] [-] est31|6 years ago|reply
Anyways, wishing them lots of luck in building a company that actually respects privacy and doesn't just claim it.
[+] [-] selimthegrim|6 years ago|reply
https://stanfordcehg.wordpress.com/2017/06/22/feature-interv...
[+] [-] lukasl|6 years ago|reply
[+] [-] text70|6 years ago|reply
Whole genome, not just the protein coding regions, which can catch more clandestine promoter, and non-coding SNPs, etc.
[+] [-] zmmmmm|6 years ago|reply
I feel like you may be underestimating both the maturity of the existing market and the complexity of doing what you are proposing at scale. Are you issuing clinical reports for your results? If so, who is doing that as neither of you sounds like you would meet the normal requirements for that.
[1] https://www.invitae.com/en/physician/category/CAT000168/#exo...
[+] [-] astroH|6 years ago|reply
[+] [-] gavinray|6 years ago|reply
I am working with a team of people at the moment building a product around genome analysis and insight for people who have existing sequences - sort of like Promethease but more accessible to the average person.
If you are open to it, I would love to get in contact with your team and learn more about what you're doing. Is the email address [email protected] the best place to reach out?
[+] [-] astroH|6 years ago|reply
[+] [-] blue_eyes1978|6 years ago|reply
[+] [-] lukasl|6 years ago|reply
(1) The FASTQ, which contains the read data from the sequencer (see https://support.illumina.com/bulletins/2016/04/fastq-files-e...) (2) The BAM, which contains sequence alignment data (you can learn more here: https://software.broadinstitute.org/software/igv/BAM) (3) The VCF, which contains the variants present in the patient as compared to the human reference genome
We map all >20,000 genes in the human genome, which covers 85% of all known disease-causing mutations. Genes make up about 2% of your DNA.
Regarding integrating this into ongoing healthcare for patients: continuous re-analysis of a patient’s genetic data is important, for two reasons: (1) we discover 100-150 new disease-causing genes every year, which means we can return results today that we couldn’t a year ago, and (2) a patient’s symptoms change over time, which matters because we analyze the patient’s DNA for what’s causing their symptoms. Plus, a lot of diseases have a late-onset, which means you might be perfectly fine until your thirties, and all of a sudden you start developing a condition that you didn’t know about. Adult-onset ALD is an example of such a condition (https://www.mayoclinic.org/diseases-conditions/adrenoleukody...).
[+] [-] kian|6 years ago|reply
[+] [-] cflyingdutchman|6 years ago|reply
[+] [-] ablekh|6 years ago|reply
[+] [-] astroH|6 years ago|reply
[+] [-] ampdepolymerase|6 years ago|reply
[+] [-] astroH|6 years ago|reply
[+] [-] quantguy11959|6 years ago|reply
[+] [-] astroH|6 years ago|reply
[+] [-] aloukissas|6 years ago|reply
[+] [-] aroch|6 years ago|reply
It just so happens that when they do share your data it may be with a Pharmaceutical company
[+] [-] astroH|6 years ago|reply
[+] [-] the_watcher|6 years ago|reply
[+] [-] astroH|6 years ago|reply
[+] [-] the_watcher|6 years ago|reply
[+] [-] neil_s|6 years ago|reply
Cheering for you to succeed!
[+] [-] astroH|6 years ago|reply
[+] [-] aaauaucuggaa|6 years ago|reply
[+] [-] colsandurz|6 years ago|reply
[+] [-] astroH|6 years ago|reply
[+] [-] elektor|6 years ago|reply
My question to you: You've made a comparison of your service to 23andMe in your post, can you provide some similar insight as to why your service would be preferable to Dante Labs? They claim to provide a "Whole Genome Sequencing Full DNA Test (30X)" [1]
Thank you!
[1]https://us.dantelabs.com/collections/advanced-dna-tests/prod...
[+] [-] astroH|6 years ago|reply
[+] [-] rootusrootus|6 years ago|reply
I have given up on ever getting results from Dante Labs, and I'm unsure if I would even trust the sequencing results either way. I recommend anybody else considering them do their own due diligence before making a purchase.
[+] [-] CodiePetersen|6 years ago|reply
[+] [-] lukasl|6 years ago|reply
[+] [-] cflyingdutchman|6 years ago|reply
[+] [-] astroH|6 years ago|reply
[+] [-] p9s|6 years ago|reply
[+] [-] lukasl|6 years ago|reply
[+] [-] spease|6 years ago|reply
[+] [-] astroH|6 years ago|reply