Semantic Specialization for Knowledge-based Word Sense Disambiguation
Paper
•
2304.11340
•
Published
| Argument name | Value | Description |
|---|---|---|
| max_epochs | 15 | Maximum number of training epochs |
| cfg_similarity_class.temperature ($\beta^{-1}$) | 0.015625 (=1/64) | Temperature parameter for the contrastive loss |
| batch_size ($N_B$) | 256 | Number of samples in each batch for the attract-repel and self-training objectives |
| coef_max_pool_margin_loss ($\alpha$) | 0.2 | Coefficient for the self-training loss |
| cfg_gloss_projection_head.n_layer | 2 | Number of FFNN layers for the projection heads |
| cfg_gloss_projection_head.max_l2_norm_ratio ($\epsilon$) | 0.015 | Hyperparameter for the distance constraint integrated in the projection heads |
data/bert_embeddings/bert-large-cased_WordNet_Gloss_Corpus.hdf5bert-large-cased_SemCor.hdf5bert-large-cased_WSDEval-ALL.hdf5@inproceedings{Mizuki:EACL2023,
title = "Semantic Specialization for Knowledge-based Word Sense Disambiguation",
author = "Mizuki, Sakae and Okazaki, Naoaki",
booktitle = "Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume",
series = {EACL},
month = may,
year = "2023",
address = "Dubrovnik, Croatia",
publisher = "Association for Computational Linguistics",
pages = "3449--3462",
}