Hindi ITN Support for Cardinal, Decimal, Ordinal, Fraction, Date, Time, Money and Measure (#223)

* Hindi ITN Support for Cardinal, Decimal, Ordinal, Fraction, Date, Time
  Signed-off-by: Tarushi V <[email protected]>
* Cleanup
  Signed-off-by: Tarushi V <[email protected]>
* Cleanup
  Signed-off-by: Tarushi V <[email protected]>
* Committing all changes made
  Signed-off-by: Tarushi V <[email protected]>
* Updated date.py and added more test cases to cardinal for improved accuracy
  Signed-off-by: Tarushi V <[email protected]>
* Updated date.py
  Signed-off-by: Tarushi V <[email protected]>
* Added hi to Jenkins and cleanup
  Signed-off-by: Tarushi V <[email protected]>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
  for more information, see https://pre-commit.ci
* Changes and cleanup based on feedback
  Signed-off-by: Tarushi V <[email protected]>
* Changes and cleanup based on feedback
  Signed-off-by: Tarushi V <[email protected]>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
  for more information, see https://pre-commit.ci
* Resolved conflicts
  Signed-off-by: Tarushi V <[email protected]>
* Committing code for measure.py
  Signed-off-by: Tarushi V <[email protected]>
* Cleanup
  Signed-off-by: Tarushi V <[email protected]>
* Cleanup
  Signed-off-by: Tarushi V <[email protected]>
* changes to run_evaluate.py
  Signed-off-by: Tarushi V <[email protected]>
* Hindi ITN for money.py
  Signed-off-by: Tarushi V <[email protected]>
* Changes and cleanup
  Signed-off-by: Tarushi V <[email protected]>
* Cleanup
  Signed-off-by: Tarushi V <[email protected]>
* Cleanup
  Signed-off-by: Tarushi V <[email protected]>
* Cleanup date verbalizer
  Signed-off-by: Tarushi V <[email protected]>
* Cleanup
  Signed-off-by: Tarushi V <[email protected]>
* Cleanup
  Signed-off-by: Tarushi V <[email protected]>
* Cleanup
  Signed-off-by: Tarushi V <[email protected]>
* Cleanup
  Signed-off-by: Tarushi V <[email protected]>
* Cleanup
  Signed-off-by: Tarushi V <[email protected]>
* Cleanup
  Signed-off-by: Tarushi V <[email protected]>
* pushing .gitignore file from main branch
  Signed-off-by: Tarushi V <[email protected]>

---------

Signed-off-by: Tarushi V <[email protected]>
Signed-off-by: tarushi2k2 <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
1 parent 3b3c3a3, commit 9aa9118
Showing 81 changed files with 3,616 additions and 5 deletions.
17 changes: 17 additions & 0 deletions
nemo_text_processing/inverse_text_normalization/hi/__init__.py
@@ -0,0 +1,17 @@
# Copyright (c) 2024, NVIDIA CORPORATION. All rights reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
#     http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.

from nemo_text_processing.inverse_text_normalization.hi.taggers.tokenize_and_classify import ClassifyFst
from nemo_text_processing.inverse_text_normalization.hi.verbalizers.verbalize import VerbalizeFst
from nemo_text_processing.inverse_text_normalization.hi.verbalizers.verbalize_final import VerbalizeFinalFst
13 changes: 13 additions & 0 deletions
nemo_text_processing/inverse_text_normalization/hi/data/__init__.py
@@ -0,0 +1,13 @@
# Copyright (c) 2024, NVIDIA CORPORATION. All rights reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
#     http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
13 changes: 13 additions & 0 deletions
nemo_text_processing/inverse_text_normalization/hi/data/date/__init__.py
@@ -0,0 +1,13 @@
# Copyright (c) 2024, NVIDIA CORPORATION. All rights reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
#     http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
34 changes: 34 additions & 0 deletions
nemo_text_processing/inverse_text_normalization/hi/data/date/date_days.tsv
@@ -0,0 +1,34 @@
१ एक
२ दो
३ तीन
४ चार
५ पाँच
६ छः
६ छ:
६ छह
६ छे
७ सात
८ आठ
९ नौ
१० दस
११ ग्यारह
१२ बारह
१३ तेरह
१४ चौदह
१५ पन्द्रह
१६ सोलह
१७ सत्रह
१८ अठारह
१९ उन्नीस
२० बीस
२१ इक्कीस
२२ बाईस
२३ तेईस
२४ चौबीस
२५ पच्चीस
२६ छब्बीस
२७ सत्ताईस
२८ अट्ठाईस
२९ उनतीस
३० तीस
३१ इकतीस
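The rows of date_days.tsv pair a written Devanagari numeral with a spoken form, and several spoken spellings can share one numeral (four variants map to ६). As a rough illustration of that many-to-one lookup, here is a minimal plain-Python sketch. It is not how the commit consumes the file: the function name `load_spoken_to_written` and the `SAMPLE_TSV` string are hypothetical, and the sketch assumes the real file is tab-separated, as the .tsv extension suggests.

```python
import csv
import io

# A few rows copied from date_days.tsv; the tab separator is an assumption
# based on the .tsv extension (the page rendering above shows spaces).
SAMPLE_TSV = "१\tएक\n२\tदो\n६\tछः\n६\tछह\n३१\tइकतीस\n"


def load_spoken_to_written(tsv_text):
    """Build a spoken-form -> written-numeral lookup from two-column TSV text.

    Hypothetical helper for illustration only; it is not part of the commit.
    """
    mapping = {}
    for row in csv.reader(io.StringIO(tsv_text), delimiter="\t"):
        if len(row) != 2:
            continue  # skip blank or malformed rows
        written, spoken = row
        mapping[spoken] = written
    return mapping


days = load_spoken_to_written(SAMPLE_TSV)

# Both spelling variants of "six" resolve to the same written numeral.
same_numeral = days["छः"] == days["छह"]

# Python's int() accepts Unicode decimal digits, so Devanagari numerals
# can be converted to integers directly, e.g. for range checks on days.
day_value = int(days["इकतीस"])
```

Keying the dictionary on the spoken form is what lets multiple spellings collapse onto one numeral, which mirrors the direction of inverse text normalization (spoken to written).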
14 changes: 14 additions & 0 deletions
nemo_text_processing/inverse_text_normalization/hi/data/date/months.tsv
@@ -0,0 +1,14 @@
जनवरी
फ़रवरी
फरवरी
मार्च
अप्रैल
अप्रील
मई
जून
जुलाई
अगस्त
सितंबर
अक्टूबर
नवंबर
दिसंबर