Statistics #2

drdhaval2785 · 2015-11-22T17:14:28Z

Preliminary version says the following statistics

Total entries without normalization are 434909 - hw1.txt
Total entries with anusvAra normalization are 421137 - hw2.txt
Total entries with duplication normalization are 418181 - hw3.txt

i.e. total 16728 decrease.

I am not saying whether all the deductions are correct or not.

The text was updated successfully, but these errors were encountered:

gasyoun · 2015-11-22T20:00:51Z

16.5k decrease is an interesting one. After @funderburkjim will add upasarga-dhatu combinations from PW and PWG we will get plenty of new words. The biggest gain (circa 60k words) will be gone when you learn to kill M, H at end of words.

drdhaval2785 · 2015-11-23T11:37:09Z

I learnt how to remove H and M at the end.
The question is to do it without loss of any genuine candidate.

gasyoun · 2015-11-23T12:00:38Z

If a single is lost we can handle it. Can't we?

funderburkjim mentioned this issue Oct 24, 2017

A viable 'ending am -> a' normalization rule #12

Open

drdhaval2785 added the Documentation label Dec 12, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Statistics #2

Statistics #2

drdhaval2785 commented Nov 22, 2015

gasyoun commented Nov 22, 2015

drdhaval2785 commented Nov 23, 2015

gasyoun commented Nov 23, 2015

Statistics #2

Statistics #2

Comments

drdhaval2785 commented Nov 22, 2015

gasyoun commented Nov 22, 2015

drdhaval2785 commented Nov 23, 2015

gasyoun commented Nov 23, 2015