Document Type : Research Paper

Authors

1 Ph.D Candidate in General Linguistics University of Sistan and Baluchestan

2 Assistant professor of English Language & Literature Department, University of Sistan and

3 Assistant professor of Computer engineering Department, Vali -e- Asr University of Rafsanjan

4 Assistant professor of English Language & Literature Department, University of Sistan and Baluchestan

Abstract

Word sense disambiguation is one of the important challenges in natural language processing (NLP) and computational linguistics. However, preposition sense disambiguation especially, in Persian has been neglected. Therefore, the present study seeks to define an algorithm based on semantic frames to disambiguate the prepositions; «az», «dar», «ba» and «ta» in Persian. The algorithm and its implementation are new in this project. The data includes 1000 sentences as a training corpus, 100 sentences as a developing data and 500 sentences as a test data collected from internet sources and social networks such as YouTube. The mentioned prepositions were sense annotated. Semantic frames based on frame-based language (Minsky, 1975) were constructed for the mentioned prepositions in the training corpus and for the words before and after the prepositions included in the entries as a data file. The algorithm in three paths based on the information in the frames assigns a meaning to the prepositions in the sentences. The results indicated the high precision (99.16%) of the system in disambiguating the prepositions in Persian.

Keywords

انوری، حسن (1382). فرهنگ فشرده سخن. تهران، انتشارات آگاه.
انوری، حسن و احمدی گیوی، حسن (1389). دستور زبان فارسی، تهران، انتشارات فاطمی.
خطیب رهبر، خلیل (1367). دستور زبان فارسی: کتاب حروف‌اضافه و ربط. تهران، انتشارات فاطمی.
صفوی، کوروش (1392). درآمدی بر معناشناسی، تهران، هرمس.
Agirre, E and Edmonds, Ph. 2007. Word sense disambiguation algorithms and applications.  Springer.com.    
Arenas, G.A., Villanueva, D., Rasgado, A.D. and Juarez, O. 2014.Using  frames to disambiguate prepositions, Science  Direct, Vol 40(2),598-  610.
Buecheler, S. 2000. Social movement in advanced capitalism, Oxford: Oxford University Press.
Fillmore, C. J. 1976. Frame semantics and the nature of language, In Annals   of the NewYork Academy: Conferences on the Origin and Development of Language and Speech, Vol.280, 20-32.
ــــــــــــــــــــــــــ 1977b. Topics in lexical semantics, In Current Issues in          Linguistics, Bloomington, Indian University Press, 76-138.
ـــــــــــــــــــــــ 1982b. Frame semantics, In the Linguistics Society of  Korea, eds Linguistics in the Morning Calm, Seol,  Hanshim, 37-111.
ـــــــــــــــــــــــــ 1985a. Frames and the semantics of understanding. Quaderni de Semantica, 6(2). 222-254.
ــــــــــــــــــــــــــ 1997. Toward a frame based lexicon: the semantics of RISK and its neighbors in frames, fields and its contrast, New Essays in     Semantic and Lexical Organization, HillsdayNewjersy, 75-102.
Goffman, E. 2013. The presentation of self in the online world, Harvard  University Press.
Litkowski, K. and Hargraves, O. 2007. Word sense disambiguation of  prepositions, In proceeding of the 4th International Workshop on Semantic Evaluations, 24-29.
Minsky, M. 1975. A framework for representing knowledge, The Psychology of Computer Vision, MIT press, (19).