rovr beat me to it below. Here are more links: https://jacobsgill.es/phdobtained (fun fact: because my thesis contains published papers, I am in breach of a few journal's copyright by uploading my own thesis pdf, but fuck'em).
LLM approaches were evaluated on my own time and but published (I left research after obtaining my PhD).
> because my thesis contains published papers, ..., but f 'em
Excluding the part in the middle because I don't wanna repost potential issues for you. I just wanted to comment that that is terrible. People often talk about the siloed nature of research in industry, without considering that academia supports the draconian publishing system. I understand IP protection, but IP protection doesn't have to mean no access. This is such a huge issue in the bio- world (biostats, genetics, etc).
I don't know your circumstances but often you retain the right to distribute a "post print", ie the final text as published but absent journal formatting. A dissertation should fit that definition.
Thank you for the link! And congratulations on obtaining your PhD
I have skimmed through it and it's truly amazing how good annotation of the dataset can lead to impressive results.
I apologise in advance if the question seems ignorant: The blog post talked about fine-tuning models online. Given that BERT models can run comfortably on even iPhone hardware, were you able to finetune your models locally or did you have to do it online too? If so, are there any products that you recommend?
This is really cool -- thanks for posting it! I'll have to skim through it at some point since a lot of my work is in classifications models and mirrors the results you've seen
gillesjacobs|1 year ago
LLM approaches were evaluated on my own time and but published (I left research after obtaining my PhD).
SpaceManNabs|1 year ago
Excluding the part in the middle because I don't wanna repost potential issues for you. I just wanted to comment that that is terrible. People often talk about the siloed nature of research in industry, without considering that academia supports the draconian publishing system. I understand IP protection, but IP protection doesn't have to mean no access. This is such a huge issue in the bio- world (biostats, genetics, etc).
uolmir|1 year ago
pandatigox|1 year ago
I have skimmed through it and it's truly amazing how good annotation of the dataset can lead to impressive results.
I apologise in advance if the question seems ignorant: The blog post talked about fine-tuning models online. Given that BERT models can run comfortably on even iPhone hardware, were you able to finetune your models locally or did you have to do it online too? If so, are there any products that you recommend?
Mockapapella|1 year ago
rovr138|1 year ago
wuschel|1 year ago
rovr138|1 year ago