top | item 39443419

(no title)

annowiki | 2 years ago

How do you get around 403/401's from WSJ/Reuters/Axios? Because I've tried user agent manipulation and it seems like I'd have to use selenium and headless to deal with them.

discuss

eddd-ddde|2 years ago

Sometimes you also need "Accept: html" I have noticed.

jonatron|2 years ago

If curl-impersonate works, it's probably TLS fingerprinting.