This repository has been archived by the owner on Mar 9, 2021. It is now read-only.

Relevant content in XML island is not returned #6

Open
zivk opened this issue Mar 28, 2012 · 1 comment


zivk commented Mar 28, 2012

When the relevant article content sits inside an XML island, it is not returned. See, for example, the WSJ Japan article at http://jp.wsj.com/Finance-Markets/Foreign-Currency-Markets/node_400108, which contains the following fragment (shortened for clarity):

<p>
<?xml version="1.0" encoding="utf-8"?>
<section xmlns:image="http://ez.no/namespaces/ezpublish3/image/" ...>
<paragraph>(this is the relevant content) イスラエル銀行(中央銀行)は景気下支えを目的に過去5カ月間に ...</paragraph>
</section>
</p>
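
To illustrate that the text in such an island is recoverable by a lenient HTML parser, here is a minimal sketch. It is not snacktory code: it uses Python's standard `html.parser` rather than the library's own extractor, the `IslandTextCollector` class is a hypothetical helper, and the fragment is a stand-in for the one above with the elided `<section>` attributes left out.

```python
from html.parser import HTMLParser


class IslandTextCollector(HTMLParser):
    """Hypothetical helper: collects text found inside <paragraph> elements."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self._depth = 0            # nesting depth inside <paragraph>
        self.paragraphs = []       # text recovered from the XML island
        self.saw_xml_decl = False  # set when an embedded <?xml ...?> is seen

    def handle_pi(self, data):
        # The embedded XML declaration is reported as a processing instruction.
        if data.lower().startswith("xml"):
            self.saw_xml_decl = True

    def handle_starttag(self, tag, attrs):
        # Unknown (non-HTML) tags such as <paragraph> are still reported.
        if tag == "paragraph":
            self._depth += 1
            if self._depth == 1:
                self.paragraphs.append("")

    def handle_endtag(self, tag):
        if tag == "paragraph" and self._depth > 0:
            self._depth -= 1

    def handle_data(self, data):
        # Text may arrive in several chunks, so append to the current entry.
        if self._depth > 0:
            self.paragraphs[-1] += data


# Assumption: simplified copy of the fragment from this issue; the
# attributes elided in the report are omitted here, not reconstructed.
FRAGMENT = """<p>
<?xml version="1.0" encoding="utf-8"?>
<section xmlns:image="http://ez.no/namespaces/ezpublish3/image/">
<paragraph>イスラエル銀行(中央銀行)は景気下支えを目的に過去5カ月間に ...</paragraph>
</section>
</p>"""

collector = IslandTextCollector()
collector.feed(FRAGMENT)
collector.close()

for text in collector.paragraphs:
    print(text)
```

A regression test for the extractor could follow the same shape: feed it a page containing this fragment and check that the `<paragraph>` text appears in the extracted article text.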

@karussell
Owner

This should be fixed, but we need a test case before this issue can be closed.

rborer referenced this issue in finity-ai/snacktory Aug 27, 2015