(no title)
nimbusega | 1 year ago
It gets the top 100 stories, sends their html to GPT-4 to extract the main content (this was not producing good enough results with html parsing) and then gets an embedding using the title and content.
Likes/dislikes are stored in local storage and compared against all stories using cosine similarity to find the most relevant stories.
It costs about $10/day to run. I was thinking of offering additional value for a small subscription. Maybe more pages of the newspaper, full story content/comments, a weekly digest or ePub export or something?
ketzo|1 year ago
Not sure if it's a "premium feature" so to speak, but would be very cool to extend this to comments generally.
jkestner|1 year ago
nimbusega|1 year ago
Maybe 'See Comments' here could load the comments on the same page? In a newspaper like style.
gsky|1 year ago
It should not cost more than a dollar a day.
Take AWS and azure credits and run it for free for years
13hunteo|1 year ago
[deleted]
jzombie|1 year ago
You're referring to using the embeddings for cosine similarity?
I am doing something similar with stocks. Taking several decades worth of 10-Q statements for a majority of stocks and weighted ETF holdings and using an autoencoder to generate embeddings that I run cosine and euclidean algorithms on via Rust WASM.
mahin|1 year ago
tiborsaas|1 year ago
How well does it work?
tagawa|1 year ago
Regarding HTML to GPT-4, I seem to remember commenters here saying they got better results by converting the HTML to Markdown first, then sending to an LLM. Might save a bit of money too.
mahin|1 year ago