Commit

Do not write end of doc lines in split_wiki
albertvillanova committed Jul 14, 2019
1 parent 85e5052 commit 185d883
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion nlputils.py
@@ -45,7 +45,7 @@ def split_wiki(path,lang):
            if len(title)>150: continue
            if f: f.close()
            f = (dest/f'{title}.txt').open('w')
        elif l.startswith('</doc>'): continue
        else: f.write(l)
    f.close()
    return dest
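
To illustrate the effect of this change, here is a minimal, self-contained sketch of the same idea. It is not the actual `split_wiki` from `nlputils.py`: the function name `split_docs`, the `TITLE_RE` pattern, and the sample input are assumptions made for illustration, and the title-length check from the hunk is left out. It assumes each article starts with a `<doc id="..." ... title="...">` header line and ends with a `</doc>` line, and shows that the closing line is skipped instead of being written to the per-article file.

```python
import re
import tempfile
from pathlib import Path

# Hypothetical header pattern: assumes each article opens with a line such as
# <doc id="1" url="..." title="Some title">.
TITLE_RE = re.compile(r'<doc id="\d+" url="[^"]*" title="([^"]+)">')

def split_docs(lines, dest):
    """Write each article to dest/<title>.txt, skipping <doc ...> and </doc> lines."""
    dest = Path(dest)
    dest.mkdir(parents=True, exist_ok=True)
    f = None
    for l in lines:
        if l.startswith('<doc id="'):
            m = TITLE_RE.match(l)
            if m is None: continue          # malformed header: keep the current file
            if f: f.close()
            title = m.group(1).replace('/', '_')
            f = (dest / f'{title}.txt').open('w')
        elif l.startswith('</doc>'): continue  # the change in this commit
        elif f: f.write(l)
    if f: f.close()
    return dest

# Made-up sample input in the assumed format.
sample = [
    '<doc id="1" url="https://example.org/?curid=1" title="Alpha">\n',
    'Alpha\n',
    'First article.\n',
    '</doc>\n',
    '<doc id="2" url="https://example.org/?curid=2" title="Beta/Gamma">\n',
    'Second article.\n',
    '</doc>\n',
]
out = split_docs(sample, Path(tempfile.mkdtemp()) / 'docs')
print(sorted(p.name for p in out.iterdir()))
```

With the `elif` branch in place, neither output file contains a trailing `</doc>` line; without it, that line would fall through to the write branch and end up at the bottom of every article.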
