This repository has been archived by the owner on Mar 9, 2021. It is now read-only.

Not able to extract content #59

Open
saketmalpure opened this issue Jun 29, 2017 · 1 comment

Comments

@saketmalpure

Not able to extract content from some websites, such as quora.com and possibly others.
The HEAD request at this line in the HtmlFetcher class returns 403.

@bark4mark

Quora returns 403 for a HEAD request. If you call fetchAndExtract with resolve set to false, it will work. I can open a pull request for this that adds an option to fall back to a GET request when the HEAD request fails.
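
A rough sketch of what such a fallback could look like, to make the idea concrete. This is not the library's code: `HeadFallbackSketch`, `StatusProbe`, `statusWithFallback`, and `httpProbe` are hypothetical names made up for illustration, and the timeouts and the "status >= 400 means HEAD was rejected" rule are assumptions. Only the standard `java.net.HttpURLConnection` API is used.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.net.HttpURLConnection;
import java.net.URL;

final class HeadFallbackSketch {

    /** Hypothetical abstraction: returns the HTTP status code for a request method. */
    interface StatusProbe {
        int status(String method);
    }

    /**
     * Tries HEAD first. If the server answers with an error status (>= 400),
     * as Quora does with 403, retries the same URL with GET.
     */
    static int statusWithFallback(StatusProbe probe) {
        int headStatus = probe.status("HEAD");
        if (headStatus < 400) {
            return headStatus;
        }
        return probe.status("GET");
    }

    /** A probe backed by HttpURLConnection (timeout values are assumptions). */
    static StatusProbe httpProbe(String url) {
        return method -> {
            try {
                HttpURLConnection conn =
                        (HttpURLConnection) new URL(url).openConnection();
                conn.setRequestMethod(method);
                conn.setInstanceFollowRedirects(false);
                conn.setConnectTimeout(5000);
                conn.setReadTimeout(5000);
                try {
                    return conn.getResponseCode();
                } finally {
                    conn.disconnect();
                }
            } catch (IOException e) {
                // Wrap so the lambda fits the unchecked StatusProbe interface.
                throw new UncheckedIOException(e);
            }
        };
    }
}
```

With a real URL this would be called as `HeadFallbackSketch.statusWithFallback(HeadFallbackSketch.httpProbe(url))`. Keeping the decision logic separate from the network call means the fallback can be exercised with a fake probe, without depending on how any particular site responds.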
