top | item 42827555

Show HN: Use Hugging Face smolagents to grab a journal's RSS link

1 points| samesense | 1 year ago |github.com

1 comment

order

samesense|1 year ago

Here's a python script to find the rss url on a science journal's website. It leverages smolagents and meta-llama/Llama-3.3-70B-Instruct. The journal’s html is pulled with a custom smolagent tool powered by playwright. Html parsing is handled by a CodeAgent given access to bs4.

I've tested with nature, mdpi, and sciencedirect so far.

I built it b/c I tired of manually scanning each journal's html for rss feeds, and I wanted to experiment with agents. It took a while to get the prompt right.

Suggestions welcome.