Add Lanman to simple-search #18

funderburkjim · 2020-11-29T21:24:01Z

Two additional steps are required to make the Lanman dictionary available in simple-search:

- add LAN to dictnames.js
- regenerate hwnorm1c.sqlite (see "Add Lanman (to 7 places)", hwnorm1#17)

For example, now this simple-search url works:

https://www.sanskrit-lexicon.uni-koeln.de/simple/lan/shiva

gasyoun · 2020-11-29T22:46:01Z

> https://www.sanskrit-lexicon.uni-koeln.de/simple/lan/shiva

Hurray. But how can one know from the homepage that such nice URLs exist?

funderburkjim · 2020-12-01T22:35:05Z

Obviously, they can't know that.

I still don't think the simple search is quite ready for full disclosure.

For instance, I ran into some problems, such as:

https://www.sanskrit-lexicon.uni-koeln.de/simple/lan/भद्र
- The Devanagari is not properly handled: citation = %E0%A4%AD%E0%A4%A6%E0%A5%8D%E0%A4%B0

Similarly with IAST:
https://www.sanskrit-lexicon.uni-koeln.de/simple/lan/rāma
- citation = r%C4%81ma

Even for standard HK input, the search sometimes fails. I've noticed this for 'long' words:
https://www.sanskrit-lexicon.uni-koeln.de/simple/lan/pratyakzadarSana
In this example, 'working' never finishes, or you get 'error Internal Server Error'
which is probably due to a timeout.

I'm dubious about wide usage of the simple link with such problems still unsolved.
And I currently think they are hard problems, especially the 'long word' problem.
Probably the Devanagari/IAST problems are not too hard.

And there are still a couple of problems you have pointed out (such as use of capital letters).

gasyoun · 2020-12-02T08:47:28Z

> I still don't think the simple search is quite ready for full disclosure.

Let's fill the gap, so it can be finally done after three years of development.

> %E0%A4%AD%E0%A4%A6%E0%A5%8D%E0%A4%B0

So it's a code issue, not Unicode. Same with r%C4%81ma

> In this example, 'working' never finishes, or you get 'error Internal Server Error'
> which is probably due to a timeout.

This one is harder - any clue?

> Probably the Devanagari/IAST problems are not too hard.

Exactly. I'm even eager to hire a developer to solve them, because simple search is really important to me.

funderburkjim · 2020-12-03T21:47:41Z

Use of Devanagari (or IAST) in url now works properly.

The fix was to apply PHP's 'urldecode' to the percent-encoded ('%...') part of the url.

Examples:

Ref: #18 (comment)
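
To illustrate the decoding step, here is a minimal sketch in Python rather than the site's PHP. The function name decode_citation is hypothetical and is not taken from the repository; urllib.parse.unquote stands in for a percent-decoding call, and the inputs are the citation strings reported earlier in this thread.

```python
from urllib.parse import unquote


def decode_citation(raw: str) -> str:
    """Turn a percent-encoded URL path segment back into Unicode text.

    Hypothetical helper for illustration only; the real fix lives in
    the site's PHP code, which is not shown in this thread.
    """
    # unquote decodes %XX escapes and interprets the bytes as UTF-8 by default.
    return unquote(raw)


if __name__ == "__main__":
    # Citation strings as reported in the thread.
    print(decode_citation("%E0%A4%AD%E0%A4%A6%E0%A5%8D%E0%A4%B0"))
    print(decode_citation("r%C4%81ma"))
    # Plain ASCII input passes through unchanged.
    print(decode_citation("shiva"))
```

Decoding once, at the point where the citation is read from the url, leaves the rest of the search free to work on ordinary Unicode strings.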

gasyoun · 2020-12-04T06:46:41Z

> Use of Devanagari (or IAST) in url now works properly.

Perfect.
Can I ask again for

https://www.sanskrit-lexicon.uni-koeln.de/s/lan/bhagnāśa

instead of:

https://www.sanskrit-lexicon.uni-koeln.de/simple/lan/bhagnāśa

funderburkjim · 2020-12-04T20:04:35Z

I prefer to keep 'simple' only.

'simple' is intuitive -- 's' is not intuitive
Making 's' an additional alternative to 'simple' would require:
- adding a line to .htaccess
- modifying the php parsing of the url, which currently happens in
  - list-0.2s_rw.php for cologne
  - list-0.2s_xampp_rw.php for local installations
- refactoring the code to avoid duplication, and
  then modifying the url parsing itself.
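
As a rough sketch of what a single shared parsing routine could look like, the Python below accepts either prefix. It is not the code in list-0.2s_rw.php; the pattern, the function name parse_simple_path, and the assumption that every url has the shape /simple/<dict>/<word> are hypothetical, chosen only to match the example urls in this thread.

```python
import re
from typing import Optional, Tuple

# Hypothetical pattern: /simple/<dict>/<word> or /s/<dict>/<word>.
# The trailing slash after the prefix keeps paths such as /simplex/... from matching.
SIMPLE_URL = re.compile(r"^/(?:simple|s)/([A-Za-z0-9]+)/(.+)$")


def parse_simple_path(path: str) -> Optional[Tuple[str, str]]:
    """Return (dictionary code, search word), or None if the path does not match."""
    match = SIMPLE_URL.match(path)
    if match is None:
        return None
    # Dictionary codes are lower-cased so that LAN and lan are treated alike
    # (an assumption, not confirmed by the thread).
    return match.group(1).lower(), match.group(2)


if __name__ == "__main__":
    print(parse_simple_path("/simple/lan/shiva"))
    print(parse_simple_path("/s/lan/bhagnāśa"))
    print(parse_simple_path("/other/lan/shiva"))
```

Keeping the prefix alternatives in one pattern is one way to avoid the duplication mentioned above: both the cologne and the local entry points could call the same function.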

Don't want to do this now. You can make a separate 'enhancement' issue request if this detail is important to you.

gasyoun · 2020-12-04T20:35:44Z

> 'simple' is intuitive -- 's' is not intuitive

It's 5 letters shorter. Longer URLs break.

> You can make a separate 'enhancement' issue request if this detail is important to you.

It is. Because it could become the default way of quoting Cologne URLs.

funderburkjim · 2020-12-05T02:30:21Z

Revised the simple-search algorithm. It is now much quicker, and I think it will always provide the same answers as before.

In particular, the 'long' word example now is quite speedy:

https://www.sanskrit-lexicon.uni-koeln.de/simple/lan/pratyakzadarSana

gasyoun · 2020-12-05T04:23:56Z

> In particular, the 'long' word example now is quite speedy

So now we can make the simple URLs public?

gasyoun · 2020-12-09T00:13:48Z

In the book there is a blank indent at the start of each paragraph. I guess we can replicate that, @funderburkjim

funderburkjim · 2020-12-10T22:21:33Z

Given the current markup of lan.txt, this would be fairly difficult.

Currently, the things that look like paragraphs are, in lan.txt, preceded by an empty div: <div n="1"/>.

So consecutive paragraphs look like

<div n="1"/>blah1 blah1 blah1
<div  n="2"/>blah2 blah2 blah2
<div n="1"/>blah3 blah3 blah3

We would have to change these to

<div n="1">blah1 blah1 blah1</div>
<div  n="2">blah2 blah2 blah2</div>
<div n="1">blah3 blah3 blah3</div>

and then we could add css text-indent for these divs

div {
  text-indent: 50px;
}

The hard part is closing the divs. The example above is over-simplified, as the 'blah...' part is more complicated.
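
For the over-simplified case only, where each paragraph sits on one line and contains no nested markup, the conversion can be sketched in Python as follows. The function name close_div and the one-paragraph-per-line assumption are hypothetical; the real lan.txt content is more complicated, so this does not solve the hard part.

```python
import re

# Matches a line that starts with an empty div such as <div n="1"/>,
# followed by the paragraph text on the same line.
EMPTY_DIV_LINE = re.compile(r'^<div\s+n="(\d+)"\s*/>(.*)$')


def close_div(line: str) -> str:
    """Wrap the text after an empty <div n="..."/> in a properly closed div.

    Assumes one paragraph per line; lines without a leading empty div
    are returned unchanged.
    """
    match = EMPTY_DIV_LINE.match(line)
    if match is None:
        return line
    level, text = match.groups()
    return f'<div n="{level}">{text}</div>'


if __name__ == "__main__":
    for sample in ['<div n="1"/>blah1 blah1 blah1',
                   '<div  n="2"/>blah2 blah2 blah2',
                   'plain line']:
        print(close_div(sample))
```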

Current opinion: Could be done along the lines just mentioned, but not worth the trouble.

gasyoun · 2020-12-11T07:53:15Z

> Current opinion: Could be done along the lines just mentioned, but not worth the trouble.

Agree, thanks for the detailed layout analysis.

funderburkjim added a commit that referenced this issue Dec 3, 2020

Allow Devanagari or IAST in /simple/xxx.

343ca03

Ref: #18 (comment)

funderburkjim mentioned this issue Dec 5, 2020

Request shorter URL for simple-search #19

Open

drdhaval2785 mentioned this issue Dec 20, 2020

todo list in 2021 (in descending order of importance) sanskrit-lexicon/COLOGNE#325

Open
