-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Bold and spans with styles not handled #5
Comments
Hi @jodamo5! Thanks so much for the detailed issue! The underlying issue tying all these behaviors together is that library is not very smart about parsing the text. It's not aware of HTML at all, and currently parses the text as normal strings, then constructs a string and (dangerously 😬) creates the output HTML by setting the output string as the output element's I'll provide some more info for each of your examples. One Word Bold - Works FineThe reason that this works fine is because there are no spaces for the engine to analyze as separate words. More than one word bold - Issue OccursThis issue is similar to the problem with attributes. As far as the lexer is concerned, the items end up being something like:
The only workaround I can think of for this is to wrap each separate word in its own Grouped words in Bold - More ProblemsThis one is luckily somewhat easy to fix: you need to put the grouping outside of the tags, so the parser is able to tell it's a group, i.e.
Because the engine isn't aware of HTML, if the first character in a word isn't a grouping character, it doesn't realize that it's supposed to be a group, and it ends up parsing it as normal text. If the grouping character is first, with the HTML inside it, then everything works out. Issues with Coloured TextThis is similar to issue with more than one word bolded: the engine parses the attributes as separate items, and the resulting HTML is something. If you only want a single word colored, you can wrap the whole item in a group, then put the tag, e.g.:
If a group of words need to be colored, just make sure to the tags inside the group. If multiple words need to be colored but can't be grouped, you'll unfortunately have to repeat the single-word grouping trick for each element 😞
You could potentially use some CSS to make this less annoying, maybe by using a specific tag ( .gloss__line em {
color: #0cf;
}
demo (but you'll have to add the styling yourself in the browser dev tools) That being said, I think the parser should be more aware of HTML, and we should definitely not be blindly setting the Let me know if you have any more questions! |
Thanks @bdchauvette That's great to see the easy solution for the grouped bold words of putting the Rather than fully understanding HTML I wonder if a more simple approach could achieve improved reliability while also giving bold or Phase 1Before glossing use javascript to strip all HTML tags except Phase 2As an additional enhancement the inner element of any I think phase 1 would be easy enough. I don't know how difficult phone 2 would be, but I think it would be easier than getting it to process full HTML. What do you think? |
For Phase 1, I definitely agree that we should be glossing based on the raw, unformatted text. I haven't tried this yet, but we should be able to use the For Phase 2, I'd like to avoid a solution that hard codes support for only certain tags. Ideally, we'd be able to reapply any HTML elements to the parsed gloss. I think we should be able to do this by walking the original DOM tree, and mapping the Alternatively, it might be cleaner to do the tree walking first, and build an AST-like structure that contains the various elements as we go. Once we have the AST, we could use that to render out the final gloss. |
Yes, I agree. Your outline of Phase 2 sounds ideal. And Phase 1 will be a great interim step. |
Thank you very much for creating this library. It is extremely good!
I have found a problem when when making text bold or colored. Hoping this can be fixed.
Issues with Bold Text
I made a string of the translated text bold. On the front end it made both the multiple lines of text bold (the original text plus the translated text bold).
Using bold for multiple words also throws out the alignment of the words slightly.
Example
As an example here is the original result with no bold used (so you can see how the alignment is supposed to look):
![leipzig_-no_bold-_result](https://user-images.githubusercontent.com/7741749/79302912-b9800c80-7f41-11ea-9f3a-1eb077b12c7c.png)
Then here is a screenshot showing the text that I made bold in the editor:
This was by just wrapping the bold text in
<strong></strong>
tags.Here is a screenshot of the outcome:
Points to note:
Demo Examples
To make it easy to replicate, the same issue can be seen on the http://bdchauvette.net/leipzig.js/demo/ page with these examples:
One Word Bold - Works Fine
If only one word is made bold then the bold works fine:
Link to this demo
More than one word bold - Issue Occurs
But as soon as more than one word is bold, the problem happens:
![leipzig-demo-two-words-bold](https://user-images.githubusercontent.com/7741749/79321296-3bcaf980-7f5f-11ea-9fce-04219dca8c0a.png)
Link to this demo
Grouped words in Bold - More Problems
And if words are grouped with brackets inside the strong tags it really messes up - it ignores then meaning of the brackets, so the words go out of alignment, and shows the brackets on the front end:
![leipzig-demo-grouped-words-bold](https://user-images.githubusercontent.com/7741749/79321807-0541ae80-7f60-11ea-83d5-d896bc25035c.png)
Link to this demo
Issues with Coloured Text
If a section of the text is set as a different colour, using a
<span>
tag with an inline style, then the style code is outputted on the front end.Link to this demo
Ideal Outcome
Ideally, a string of text could be bold or coloured and only the text selected as bold or coloured in the backend would output on the front end.
As a fallback, if it is too much work to fix the text colour options, then at least bold could be used, and other html tags with styles would be stripped out and not outputted onto the frontend.
The text was updated successfully, but these errors were encountered: