Skip to content
This repository has been archived by the owner on Mar 9, 2021. It is now read-only.

Text content is removed when there is an image in news webpage. #34

Open
avi20072008 opened this issue Sep 21, 2013 · 1 comment
Open

Comments

@avi20072008
Copy link

Hi,

I have tried using snacktory and It works well on the webpages which do not contain images. I have tried using one of the newspapers and I found that whenever there is an image, snacktory removes text block close to the image.

Try this url : http://articles.timesofindia.indiatimes.com/2013-09-17/rest-of-world/42147651_1_tropical-depression-mexico-city-heavy-rains

@karussell
Copy link
Owner

Would be nice if you could digg into it and provide a fix via pull request :) !

rborer pushed a commit to finity-ai/snacktory that referenced this issue Jun 19, 2017
…on_apnews

Fixed content extraction for apnews.com
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants