Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

sampling prob = 0.1 #6

Open
wants to merge 402 commits into
base: main
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
402 commits
Select commit Hold shift + click to select a range
b848327
Update args.py
yijiesun97 Mar 9, 2021
5b84e0c
Update backtranslate_sampling.py
yijiesun97 Mar 9, 2021
f926a95
Update backtranslate_sampling.py
yijiesun97 Mar 9, 2021
09a7e31
Update backtranslate_sampling.py
yijiesun97 Mar 9, 2021
0287a29
Update args.py
zhushen3128 Mar 9, 2021
c084e23
beam size = 5
zhushen3128 Mar 9, 2021
f3654a4
Update backtranslate_util.py
zhushen3128 Mar 9, 2021
fa87978
Update backtranslate_sampling.py
zhushen3128 Mar 9, 2021
a193616
Update backtranslate_util.py
yijiesun97 Mar 9, 2021
3b59c4f
Update backtranslate_sampling.py
yijiesun97 Mar 10, 2021
4e49c9f
update drop empty lines
zhushen3128 Mar 10, 2021
e8dfa3b
Update args.py
zhushen3128 Mar 10, 2021
1ea58d4
Update backtranslate_sampling.py
zhushen3128 Mar 10, 2021
0047b57
Update backtranslate_util.py
zhushen3128 Mar 10, 2021
fccadcc
Update backtranslate_util.py
yijiesun97 Mar 11, 2021
d890e4c
Update backtranslate_sampling.py
yijiesun97 Mar 11, 2021
a5f31a6
Update backtranslate_util.py
yijiesun97 Mar 11, 2021
66869c1
Update backtranslate_sampling.py
zhushen3128 Mar 11, 2021
b59b5d2
Update backtranslate_sampling.py
yijiesun97 Mar 11, 2021
a35361d
Update args.py
yijiesun97 Mar 11, 2021
09fd1c2
Update backtranslate_sampling.py
yijiesun97 Mar 11, 2021
b7f3e27
Update backtranslate_util.py
yijiesun97 Mar 11, 2021
bbc1a35
Update args.py
yijiesun97 Mar 11, 2021
754a86f
Update backtranslate_sampling.py
yijiesun97 Mar 11, 2021
9bed145
Update backtranslate_util.py
yijiesun97 Mar 11, 2021
c437844
Update backtranslate_util.py
yijiesun97 Mar 11, 2021
8a94d56
Update backtranslate_sampling.py
yijiesun97 Mar 11, 2021
b477ed1
Update backtranslate_sampling.py
yijiesun97 Mar 11, 2021
f895633
Update args.py
zhushen3128 Mar 11, 2021
cfe5af9
Update args.py
yijiesun97 Mar 11, 2021
f534c91
Update backtranslate_util.py
zhushen3128 Mar 11, 2021
df8274b
Update args.py
zhushen3128 Mar 11, 2021
ef5bcc2
Update backtranslate_sampling.py
zhushen3128 Mar 11, 2021
c53b2cb
Update backtranslate_sampling.py
yijiesun97 Mar 11, 2021
df46002
Update backtranslate_sampling.py
yijiesun97 Mar 11, 2021
6ddbaf1
Update args.py
yijiesun97 Mar 11, 2021
739e891
Update backtranslate_sampling.py
zhushen3128 Mar 11, 2021
69e26ea
Update backtranslate_util.py
zhushen3128 Mar 11, 2021
28fb725
Update backtranslate_sampling.py
zhushen3128 Mar 11, 2021
4a904a2
Add files via upload
yijiesun97 Mar 12, 2021
b7ce61f
Update args.py
yijiesun97 Mar 12, 2021
c74f703
Update args.py
yijiesun97 Mar 12, 2021
f081078
Update load_pretrained.py
yijiesun97 Mar 12, 2021
e1e9348
Delete args_beam_1.py
yijiesun97 Mar 12, 2021
30231f3
Delete args_beam_10.py
yijiesun97 Mar 12, 2021
06356ad
Delete args_beam_2.py
yijiesun97 Mar 12, 2021
84e5032
Delete args_beam_5.py
yijiesun97 Mar 12, 2021
9571772
Delete backtranslate_sampling_beam_1.py
yijiesun97 Mar 12, 2021
2019cd7
Update args.py
zhushen3128 Mar 12, 2021
4169f34
Update load_pretrained.py
zhushen3128 Mar 12, 2021
0bcbdc2
Update backtranslate_util.py
yijiesun97 Mar 12, 2021
1aa59f3
Update backtranslate_sampling.py
yijiesun97 Mar 12, 2021
21bf0f1
Update backtranslate_sampling.py
yijiesun97 Mar 12, 2021
144c1b7
Update backtranslate_sampling.py
yijiesun97 Mar 12, 2021
aaa6a22
Update backtranslate_sampling.py
yijiesun97 Mar 12, 2021
bcfe962
Create scratch.py
yijiesun97 Mar 12, 2021
9718c3d
Update backtranslate_util.py
yijiesun97 Mar 12, 2021
70d3882
Update scratch.py
yijiesun97 Mar 12, 2021
9ecef03
Update backtranslate_util.py
yijiesun97 Mar 12, 2021
d821bc6
Update backtranslate_sampling.py
yijiesun97 Mar 12, 2021
0120e8a
add progress bar
zhushen3128 Mar 12, 2021
e5ce79b
Update args.py
yijiesun97 Mar 12, 2021
fa1db06
Update backtranslate_sampling.py
yijiesun97 Mar 12, 2021
5416f32
Update backtranslate_util.py
yijiesun97 Mar 12, 2021
7868ee1
Update backtranslate_util.py
yijiesun97 Mar 12, 2021
1ac1c4a
Update backtranslate_util.py
yijiesun97 Mar 12, 2021
54e7c7b
Update backtranslate_sampling.py
yijiesun97 Mar 12, 2021
9e237dd
Update backtranslate_util.py
yijiesun97 Mar 12, 2021
78d6439
Update and rename backtranslate_sampling.py to backtranslate_nmt_samp…
yijiesun97 Mar 12, 2021
f41f15d
Update backtranslate_nmt_sampling.py
yijiesun97 Mar 12, 2021
73823f8
Update backtranslate_util.py
yijiesun97 Mar 12, 2021
9d26536
Update load_pretrained.py
zhushen3128 Mar 12, 2021
5c53e8c
Update load_pretrained.py
yijiesun97 Mar 12, 2021
a73b38b
Update load_pretrained.py
yijiesun97 Mar 12, 2021
cacae6a
Update load_pretrained.py
yijiesun97 Mar 12, 2021
a85e88d
Update args.py
yijiesun97 Mar 12, 2021
51ce9e9
Update load_pretrained.py
yijiesun97 Mar 12, 2021
7c69bcd
Update load_pretrained.py
yijiesun97 Mar 12, 2021
e42e0ab
Update load_pretrained.py
yijiesun97 Mar 12, 2021
a15a777
Create translate.py
yijiesun97 Mar 12, 2021
49b17b9
google api
zhushen3128 Mar 12, 2021
48a673c
Update translate.py
yijiesun97 Mar 12, 2021
3b8c1ee
Update google_translate.py
zhushen3128 Mar 12, 2021
290f3c7
Merge branch 'proj' of https://github.com/jamesli1997/robustqa into proj
zhushen3128 Mar 12, 2021
3bfcdb0
Update google_translate.py
zhushen3128 Mar 12, 2021
37f12eb
Update backtranslate_nmt_sampling.py
zhushen3128 Mar 12, 2021
046dc36
Update backtranslate_util.py
zhushen3128 Mar 12, 2021
27a1032
Update backtranslate_util.py
zhushen3128 Mar 12, 2021
dff9607
Update translate.py
zhushen3128 Mar 12, 2021
6ba07f4
Update backtranslate_util.py
yijiesun97 Mar 12, 2021
7ba58c1
Update backtranslate_nmt_sampling.py
yijiesun97 Mar 12, 2021
b00aaee
Update args.py
yijiesun97 Mar 12, 2021
2cf8099
Update backtranslate_nmt_sampling.py
yijiesun97 Mar 12, 2021
3482ad2
Update args.py
yijiesun97 Mar 12, 2021
512db9e
Update backtranslate_nmt_sampling.py
yijiesun97 Mar 12, 2021
4714a8d
Update args.py
yijiesun97 Mar 12, 2021
04b44a0
Update backtranslate_util.py
yijiesun97 Mar 12, 2021
371299b
Update backtranslate_nmt_sampling.py
yijiesun97 Mar 12, 2021
7fcb28b
Update args.py
yijiesun97 Mar 12, 2021
9196f26
Update backtranslate_nmt_sampling.py
yijiesun97 Mar 12, 2021
372359f
Update translate.py
zhushen3128 Mar 12, 2021
fbff085
Update args.py
yijiesun97 Mar 12, 2021
d97e35b
Update translate.py
yijiesun97 Mar 12, 2021
be80a1d
Update translate.py
yijiesun97 Mar 12, 2021
adf9654
Update translate.py
yijiesun97 Mar 12, 2021
0e13ea8
Update translate.py
zhushen3128 Mar 12, 2021
cf31d51
Update translate.py
zhushen3128 Mar 12, 2021
eac5588
Update translate.py
zhushen3128 Mar 12, 2021
688bdd1
Update backtranslate_util.py
yijiesun97 Mar 12, 2021
33e8734
Update backtranslate_util.py
zhushen3128 Mar 12, 2021
6a8b3a2
Update backtranslate_util.py
zhushen3128 Mar 12, 2021
ab60301
Update backtranslate_util.py
zhushen3128 Mar 12, 2021
8a3412d
Update translate.py
zhushen3128 Mar 12, 2021
4b775f2
Update backtranslate_util.py
zhushen3128 Mar 12, 2021
0423d03
Update backtranslate_nmt_sampling.py
zhushen3128 Mar 12, 2021
a554292
Update translate.py
zhushen3128 Mar 12, 2021
6287dda
Update backtranslate_util.py
yijiesun97 Mar 12, 2021
ebf1258
Update args.py
yijiesun97 Mar 12, 2021
bb61a99
Rename args.py to transformer_args.py
yijiesun97 Mar 12, 2021
3a6d07d
Update load_pretrained.py
yijiesun97 Mar 12, 2021
00b18b3
Update translate.py
yijiesun97 Mar 12, 2021
1a811c5
Create backtranslate_transformer_sampling.py
yijiesun97 Mar 12, 2021
b72fd09
Update args.py
yijiesun97 Mar 12, 2021
6d98d2c
Add files via upload
yijiesun97 Mar 12, 2021
9e6884e
Delete transformer/QA directory
yijiesun97 Mar 12, 2021
b43d266
Update transformer_args.py
yijiesun97 Mar 12, 2021
9c75004
Update args.py
yijiesun97 Mar 12, 2021
67145e9
transformer QA folder
zhushen3128 Mar 12, 2021
8f4cdc3
Merge branch 'proj' of https://github.com/jamesli1997/robustqa into proj
zhushen3128 Mar 12, 2021
e670f01
Update load_pretrained.py
yijiesun97 Mar 12, 2021
b32d7c8
Update translate.py
yijiesun97 Mar 12, 2021
caf592d
QA folder in transformer
zhushen3128 Mar 12, 2021
cf2c5c1
Merge branch 'proj' of https://github.com/jamesli1997/robustqa into proj
zhushen3128 Mar 12, 2021
ed97c90
Delete .DS_Store
zhushen3128 Mar 12, 2021
572d319
Delete gpu_requirements.txt
zhushen3128 Mar 12, 2021
86e9966
Update backtranslate_util.py
zhushen3128 Mar 13, 2021
d2aa24d
Update backtranslate_nmt_sampling.py
zhushen3128 Mar 13, 2021
9db4683
Update backtranslate_util.py
zhushen3128 Mar 13, 2021
1778039
Update backtranslate_util.py
zhushen3128 Mar 13, 2021
1eb3804
Update backtranslate_util.py
yijiesun97 Mar 13, 2021
607eb94
Update backtranslate_util.py
zhushen3128 Mar 13, 2021
fa4862c
Update load_pretrained.py
zhushen3128 Mar 13, 2021
ffe227b
added concat backtranslate
jamesli1997 Mar 13, 2021
2cb4a73
merging
jamesli1997 Mar 13, 2021
1157909
fixed bug in args
jamesli1997 Mar 13, 2021
272e750
Update backtranslate_nmt_sampling.py
yijiesun97 Mar 13, 2021
7026921
Update backtranslate_nmt_sampling.py
zhushen3128 Mar 13, 2021
04edc86
Update backtranslate_nmt_sampling.py
zhushen3128 Mar 13, 2021
7d740ff
Update google_translate.py
zhushen3128 Mar 13, 2021
2d584ca
Update load_pretrained.py
zhushen3128 Mar 13, 2021
ab68678
Update load_pretrained.py
zhushen3128 Mar 13, 2021
4dcf23d
Update load_pretrained.py
zhushen3128 Mar 13, 2021
0ad639b
Update load_pretrained.py
zhushen3128 Mar 13, 2021
920f0ea
Update load_pretrained.py
zhushen3128 Mar 13, 2021
419fcb4
added finetune
jamesli1997 Mar 13, 2021
822807d
merging
jamesli1997 Mar 13, 2021
fe57bc1
Update load_pretrained.py
yijiesun97 Mar 13, 2021
c192502
Update load_pretrained.py
yijiesun97 Mar 13, 2021
dbc6b78
Update load_pretrained.py
yijiesun97 Mar 13, 2021
4db2893
changed args
jamesli1997 Mar 13, 2021
2da35d0
fixed checkpoint for eval
jamesli1997 Mar 13, 2021
91d69b9
allow for only backtranslate finetune
jamesli1997 Mar 13, 2021
03ef2e4
Update backtranslate_util.py
zhushen3128 Mar 13, 2021
3a3ddea
Update backtranslate_nmt_sampling.py
zhushen3128 Mar 13, 2021
1cc3b84
Update backtranslate_nmt_sampling.py
zhushen3128 Mar 13, 2021
2f9565e
Update backtranslate_util.py
zhushen3128 Mar 13, 2021
e00e8cf
reformat
zhushen3128 Mar 13, 2021
58be148
Update backtranslate_nmt_sampling.py
zhushen3128 Mar 13, 2021
8cbd023
Update backtranslate_util.py
zhushen3128 Mar 13, 2021
0e9ed28
Update backtranslate_nmt_sampling.py
zhushen3128 Mar 13, 2021
26f1a05
Create google_translate_context.py
zhushen3128 Mar 13, 2021
cb9a553
Update google_translate_context.py
zhushen3128 Mar 13, 2021
030dccd
Update args.py
zhushen3128 Mar 13, 2021
7fa735b
Update args.py
zhushen3128 Mar 13, 2021
8f61999
Update args.py
zhushen3128 Mar 13, 2021
bf0d0b8
Update backtranslate_nmt_sampling.py
zhushen3128 Mar 13, 2021
793b9ed
run beam 5
zhushen3128 Mar 13, 2021
85ff030
Update load_pretrained.py
zhushen3128 Mar 13, 2021
c5b29c5
Update load_pretrained.py
zhushen3128 Mar 13, 2021
d96be73
Update load_pretrained.py
zhushen3128 Mar 13, 2021
b084b67
Update load_pretrained.py
zhushen3128 Mar 13, 2021
8ea3f98
Update backtranslate_nmt_sampling.py
yijiesun97 Mar 14, 2021
32e9afe
run beam 1
zhushen3128 Mar 14, 2021
d6eb07b
Update backtranslate_nmt_sampling.py
zhushen3128 Mar 14, 2021
dcd5aba
Update load_pretrained.py
yijiesun97 Mar 14, 2021
7bd037f
Update load_pretrained.py
yijiesun97 Mar 14, 2021
0830517
Update load_pretrained.py
zhushen3128 Mar 14, 2021
af85323
Update and rename args.py to google_args.py
yijiesun97 Mar 14, 2021
fdabf15
Update load_pretrained.py
zhushen3128 Mar 14, 2021
279bf5b
Update load_pretrained.py
yijiesun97 Mar 14, 2021
bd2e228
Update google_translate.py
yijiesun97 Mar 14, 2021
3c81ce4
Update google_translate_context.py
yijiesun97 Mar 14, 2021
ecc64b4
backsampling_nmt/transformer
zhushen3128 Mar 14, 2021
10de6b3
Merge branch 'proj' of https://github.com/jamesli1997/robustqa into proj
zhushen3128 Mar 14, 2021
a8dd42b
Update backtranslate_nmt_sampling.py
zhushen3128 Mar 14, 2021
b907d58
revert args and train
zhushen3128 Mar 14, 2021
29000e3
Update train.py
zhushen3128 Mar 14, 2021
da0d3e9
Update load_pretrained.py
zhushen3128 Mar 14, 2021
a8fc67c
begin generate beam 5 pickle
zhushen3128 Mar 14, 2021
6f321d8
Update backtranslate_transformer_sampling.py
yijiesun97 Mar 14, 2021
6d755ae
Update backtranslate_transformer_sampling.py
yijiesun97 Mar 14, 2021
f1fd90b
Update args.py
yijiesun97 Mar 14, 2021
e99a461
overwrite with finetune args/train; original args/train in supersede
jamesli1997 Mar 14, 2021
ee0d54b
added new translate method
jamesli1997 Mar 14, 2021
7d1a657
Added pip directory
jamesli1997 Mar 14, 2021
47f9766
test if finetune eval data = indomain
zhushen3128 Mar 14, 2021
9b82670
Update args.py
zhushen3128 Mar 14, 2021
82df503
Update args.py
yijiesun97 Mar 14, 2021
19e7c50
Update train.py
yijiesun97 Mar 14, 2021
7abd691
Update train.py
yijiesun97 Mar 15, 2021
ef340a2
Update args.py
yijiesun97 Mar 15, 2021
9832ce0
Update backtranslate_transformer_sampling.py
yijiesun97 Mar 15, 2021
5d4cde2
Update backtranslate_nmt_sampling.py
yijiesun97 Mar 15, 2021
03df775
Update args.py
yijiesun97 Mar 15, 2021
0a0f86c
Update backtranslate_transformer_sampling.py
yijiesun97 Mar 15, 2021
de59cf9
Update backtranslate_nmt_sampling.py
yijiesun97 Mar 15, 2021
8d7283e
Update backtranslate_transformer_sampling.py
yijiesun97 Mar 15, 2021
95724cf
Update backtranslate_nmt_sampling.py
yijiesun97 Mar 15, 2021
9b4b6f5
Update backtranslate_transformer_sampling.py
yijiesun97 Mar 15, 2021
8fe2b18
ood sh files
zhushen3128 Mar 15, 2021
b8100fe
Update transformer_args.py
zhushen3128 Mar 15, 2021
fe03d68
Update load_pretrained.py
zhushen3128 Mar 15, 2021
f51f2fa
Update load_pretrained.py
zhushen3128 Mar 15, 2021
cf92051
Update backtranslate_nmt_sampling.py
yijiesun97 Mar 15, 2021
2e12d36
add ood case
zhushen3128 Mar 15, 2021
ef1e6a1
Update backtranslate_util.py
yijiesun97 Mar 15, 2021
1ab325a
add ood case
zhushen3128 Mar 15, 2021
085dd0c
Update backtranslate_nmt_sampling.py
yijiesun97 Mar 15, 2021
7c0b10f
add ood case
zhushen3128 Mar 15, 2021
e92e9c0
Update backtranslate_nmt_sampling.py
yijiesun97 Mar 15, 2021
8e90bbf
Update args.py
yijiesun97 Mar 15, 2021
cdd6bce
Update backtranslate_util.py
zhushen3128 Mar 15, 2021
d7df501
allowed both indomain & oodomain val dataset for finetune
jamesli1997 Mar 15, 2021
d1dfe78
allow multiple aug pickle file
jamesli1997 Mar 15, 2021
f724d30
Update load_pretrained.py
zhushen3128 Mar 15, 2021
81579b6
added some printing command for debug
jamesli1997 Mar 15, 2021
4aab0aa
merging
jamesli1997 Mar 15, 2021
cb11a27
read pickle files from pickle_dir
jamesli1997 Mar 15, 2021
43ee6d4
added sampling for indomain finetune val & backtranslate finetune tra…
jamesli1997 Mar 15, 2021
275273e
added seed
jamesli1997 Mar 15, 2021
03d147f
Update train.py
yijiesun97 Mar 15, 2021
9c39329
Added separate sampling prob for oob/non-oob in finetune train
jamesli1997 Mar 15, 2021
c4a4fc0
fixed typo
jamesli1997 Mar 15, 2021
1d2b086
separate sampling prob for ood/non-ood in finetune train
zhushen3128 Mar 15, 2021
51c3504
Update args.py
zhushen3128 Mar 15, 2021
951e469
Update args.py
yijiesun97 Mar 16, 2021
d83b45d
Update args.py
yijiesun97 Mar 16, 2021
d6d4cf3
fixed pickle bug
jamesli1997 Mar 16, 2021
a18fe0d
Update backtranslate_transformer_sampling.py
zhushen3128 Mar 16, 2021
feae3d2
Update backtranslate_transformer_sampling.py
zhushen3128 Mar 16, 2021
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 2 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
@@ -0,0 +1,2 @@
datasets/
__pycache__
8 changes: 8 additions & 0 deletions .idea/.gitignore

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

6 changes: 6 additions & 0 deletions .idea/inspectionProfiles/profiles_settings.xml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

7 changes: 7 additions & 0 deletions .idea/misc.xml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

8 changes: 8 additions & 0 deletions .idea/modules.xml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

12 changes: 12 additions & 0 deletions .idea/robustqa.iml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

6 changes: 6 additions & 0 deletions .idea/vcs.xml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

Binary file added 2_layer_nmt/.DS_Store
Binary file not shown.
6 changes: 6 additions & 0 deletions 2_layer_nmt/.idea/inspectionProfiles/profiles_settings.xml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

7 changes: 7 additions & 0 deletions 2_layer_nmt/.idea/misc.xml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

8 changes: 8 additions & 0 deletions 2_layer_nmt/.idea/modules.xml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

8 changes: 8 additions & 0 deletions 2_layer_nmt/.idea/nmt_es_en.iml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

50 changes: 50 additions & 0 deletions 2_layer_nmt/.ipynb_checkpoints/Untitled-checkpoint.ipynb
Original file line number Diff line number Diff line change
@@ -0,0 +1,50 @@
{
"cells": [
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": []
},
{
"cell_type": "code",
"execution_count": 1,
"metadata": {},
"outputs": [],
"source": [
"import nltk"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"string = \"I have a B.S. degree in Engineering.\""
]
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.7.4"
}
},
"nbformat": 4,
"nbformat_minor": 2
}
Binary file added 2_layer_nmt/QA/.DS_Store
Binary file not shown.
56 changes: 56 additions & 0 deletions 2_layer_nmt/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,56 @@
# 2-Layer NMT

## Language
- Source Language: English

- Pivot Languages: Spanish (TED Talks from 2020 CS224N Assignment 4), Vietnamese (IWSLT 2015)

*Need to find the exact source of the Spanish Language*

### Vocab Size

- **English: 16K**

- **Spanish:13K**

- Spanish vocab size is determined by domain knowledge in linguistics.

- **Vietnamese: 8K**

- Vietnamese Vocab size is determined according to the benchmark listed here: [Stanford NLP (word-level)](https://nlp.stanford.edu/projects/nmt/), [Tensorflow NMT (word level)](https://github.com/tensorflow/nmt#iwslt-english-vietnamese)

## Hyperparameter

- 2 hidden layer (benchmark: [Tensorflow NMT](https://github.com/tensorflow/nmt#iwslt-english-vietnamese), [Stanford NLP](https://nlp.stanford.edu/projects/nmt/))

- learning rate: 7.5e-4 (by experiment)

- patience = 2 (by experiment)

- batch size = 64 (by experiment)

- dropout = 0.2 (benchmark: [Tensorflow NMT](https://github.com/tensorflow/nmt#iwslt-english-vietnamese), [Stanford NLP (word-level)](https://nlp.stanford.edu/projects/nmt/))

- hidden size = 512 (benchmark: [Tensorflow NMT](https://github.com/tensorflow/nmt#iwslt-english-vietnamese))

- embed size = 512 (benchmark: [Tensorflow NMT](https://github.com/tensorflow/nmt#iwslt-english-vietnamese))

## Back-Translation

_Use English - Spanish - English with beam size = 1 as an example_

- Build the vocab: `sh run_en_es_beam_1.sh vocab`

- Train the NMT model that translates English to Spanish: `sh run_en_es_beam_1.sh train`

- Train the NMT model that translates Spanish to English: `sh run_es_en_beam_1.sh train`

- Translate source English contexts to Spanish contexts: `sh run_en_es_beam_1.sh translate_context`

- Translate source English queries to Spanish queries: `sh run_en_es_beam_1.sh translate_queries`

After dropping the non-translatable contexts and queries, we are ready to generate the back-translated English contexts and queries:

- Back-translate Spanish contexts to English contexts: `sh run_es_en_beam_1.sh translate_context`

- Back-translate Spanish queries to English queries: `sh run_es_en_beam_1.sh translate_queries`
Empty file added 2_layer_nmt/__init__.py
Empty file.
Binary file added 2_layer_nmt/es_data/.DS_Store
Binary file not shown.
Loading