Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Hyphen at end of line #431

Open
funderburkjim opened this issue Jul 8, 2019 · 3 comments
Open

Hyphen at end of line #431

funderburkjim opened this issue Jul 8, 2019 · 3 comments
Labels

Comments

@funderburkjim
Copy link
Contributor

There is logic in the display of AP90 that aims to handle line-breaks with hyphens.
But sometimes this logic gives the wrong result. This user correction gives such a case.
In headword vid . Here is the scan in AP90 (hard to read):
image

Better image occurs in the 1957 version:
image

So Apte is giving perfect (viveda) and periphrastic perfect (vidAMcakAra), separated by a hyphen.
In AP90, this '-' occurs at the end of a line.

In the display of AP90, the words before and after the line-ending dash are joined as one word with no dash. In our example this results in (vivedavidAMcakAra); but this is inappropriate here since the dash is used to separate two perfect forms rather than to continue a samAsa across the line break.

I fiddled with the placement of the '-' so the display now shows viveda vidAMcakAra with a space (the dash is still absent). It would be better to retain the dash, but at least the display doesn't look like a single word here.

This shows the peril of doing automatic adjustments -- sometimes a misleading output is the result. I'm labeling this as a bug. I see no way at the moment to do a perfect rendition of '-' at end of lines in AP90.

@funderburkjim
Copy link
Contributor Author

Here is the text in digitization:

old
{#vid#}¦ I. 2 P. ({#vetti#} or {#veda, viveda-#}
<>{#vidAMcakAra, avedIt, vetsyati, vettuM, vidita;#}
new
{#vid#}¦ I. 2 P. ({#vetti#} or {#veda, viveda #} -         <<< note there is space before/after '-'
<>{#vidAMcakAra, avedIt, vetsyati, vettuM, vidita;#}

@gasyoun
Copy link
Member

gasyoun commented Jul 9, 2019

I see no way at the moment to do a perfect rendition of '-' at end of lines in AP90.

If even you do not, will never find a way out of it. Guess there are hundreds of such or smaller bugs.

@funderburkjim
Copy link
Contributor Author

The only way would be to add special markup in digitization. To do this perfectly would not be impossible but would be labor intensive.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants