top | item 46639173


subbu_devhub | 1 month ago

@focusedone I tried reader mode in several browsers and it was hit or miss; it just did not work for me. Honestly, I also wanted to convert pages to Markdown, not just plain text.

I tried several reader modes and ran into a number of issues, including: * several portions of the main content were missing * navigation bits got picked up in reader mode * comments and other unrelated sections showed up

I really did try these before investing time in this.

focusedone|1 month ago

Oh, cool! How does this do with the intentionally obfuscated sites?

subbu_devhub|1 month ago

@focusedone if you look at the code here https://github.com/subranag/declutter/blob/main/src/page.ts it uses some specific techniques that are recommended for simulating normal browsing behaviour. It does not work 100% of the time, but it works on most sites.

For example: * simulate scrolling after the page loads * simulate plugins * simulate location, etc.

Once all of this is done, hopefully the HTML content becomes readable.
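
To make the "simulate scrolling after the page loads" idea concrete, here is a minimal sketch. It is not the code from the linked `page.ts`; it only illustrates the general technique under some assumptions. `ScrollablePage`, `nextScrollOffset` and `simulateScrolling` are hypothetical names, and the interface is a stand-in for whatever browser-automation page object is actually in use. The plugin and location simulation are not covered here.

```typescript
// Hypothetical adapter: a stand-in for a real browser-automation page object.
interface ScrollablePage {
  scrollHeight(): Promise<number>;     // current document height in pixels
  scrollTo(y: number): Promise<void>;  // move the top of the viewport to y
}

// Next offset to scroll to, or null once the bottom of the page is reached.
function nextScrollOffset(
  current: number,
  pageHeight: number,
  viewportHeight: number
): number | null {
  if (viewportHeight <= 0) throw new Error("viewportHeight must be positive");
  const bottom = Math.max(0, pageHeight - viewportHeight);
  return current >= bottom ? null : Math.min(current + viewportHeight, bottom);
}

const sleep = (ms: number) =>
  new Promise<void>((resolve) => setTimeout(resolve, ms));

// Scroll one viewport at a time, pausing so lazily loaded content can appear.
// The height is re-read after every step because the page may have grown.
// Returns the number of scroll steps performed (capped by maxSteps).
async function simulateScrolling(
  page: ScrollablePage,
  viewportHeight: number,
  pauseMs = 250,
  maxSteps = 50
): Promise<number> {
  let offset: number | null = 0;
  let steps = 0;
  while (offset !== null && steps < maxSteps) {
    await page.scrollTo(offset);
    steps++;
    await sleep(pauseMs);
    const height = await page.scrollHeight();
    offset = nextScrollOffset(offset, height, viewportHeight);
  }
  return steps;
}
```

With a real browser, the adapter would presumably be implemented by running `window.scrollTo(0, y)` and reading `document.body.scrollHeight` inside the page. The `maxSteps` cap is there because infinite-scroll pages never reach a true bottom.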