Skip to content

Commit 4b7dbf2

Browse files
committed
update readme
1 parent ee92100 commit 4b7dbf2

7 files changed

+13
-12
lines changed

.idea/workspace.xml

+4-4
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

README.md

+7-6
Original file line numberDiff line numberDiff line change
@@ -333,16 +333,19 @@ class Decoder(nn.Module):
333333
## 2. Experiments
334334

335335
I use Multi30K Dataset to train and evaluate model <br>
336-
You can check detail of dataset [here](https://arxiv.org/abs/1605.00459)
336+
You can check detail of dataset [here](https://arxiv.org/abs/1605.00459) <br>
337+
I follow original paper's parameter settings. (below) <br>
338+
339+
![conf](image/transformer-model-size.jpg)
340+
### 2.1 Transformer - Baseline
337341

338-
### 2.1 Transformer - Small
339342
* total parameters = 69,919,919
340343
* model size = 273.2MB
341344
* lr scheduling : ReduceLROnPlateau
342345

343346
#### 2.1.1 configuration
344347

345-
* batch_size = 128
348+
* batch_size = 64
346349
* max_len = 50
347350
* d_model = 512
348351
* n_layers = 6
@@ -358,10 +361,8 @@ You can check detail of dataset [here](https://arxiv.org/abs/1605.00459)
358361
* epoch = 1000
359362
* clip = 1
360363

361-
362364
#### 2.1.2 Training Result
363-
![table](saved/transformer-small/train_result.jpg)
364-
<br><br>
365+
365366
* Minimum Training Loss = 2.852672759656864
366367
* Minimum Validation Loss = 3.2048025131225586
367368
<br><br>

image/transformer-model-size.jpg

58.2 KB
Loading
Binary file not shown.
Binary file not shown.

result/test.txt

+1-1
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
[5.760715872049332, 5.430302619934082, 5.130149096250534, 5.124002903699875, 5.081889361143112, 5.049355149269104, 4.9583838284015656, 4.919993549585342, 4.879352629184723]
1+
[5.869744032621384, 5.464778572320938, 5.381483763456345, 5.198367655277252, 5.1747742891311646, 5.201104640960693]

result/train.txt

+1-1
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
[6.487048629096951, 5.643908386188457, 5.413760770259975, 5.2877601085780475, 5.231303647751325, 5.191577813698856, 5.153288614382303, 5.098250471022686, 5.04833714657418]
1+
[7.047564462418073, 5.7127689853113655, 5.4532069918342625, 5.285951928420213, 5.216039784679329, 5.174198560252589]

0 commit comments

Comments
 (0)