Discrimination of Different Serbian Pronunciations from Shtokavian Dialect

Amelio A.
2017-01-01

Abstract

This paper proposes a new methodology for discriminating between different pronunciations in the Shtokavian dialect of the Serbian language. First, the written text (Unicode) is converted into codes according to the energy status of each character in the text line. The resulting set of codes is treated as a grayscale image. The local structures of this image are then explored with local binary operators, which yields a set of feature vectors that differentiates the various pronunciations of the Serbian language. The experiment is performed on fifty documents written in Serbian. A comparison with the n-gram method shows a clear advantage for the proposed method.
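The pipeline described in the abstract (character coding, then local binary operators, then a feature vector) can be illustrated with a minimal Python sketch. It is not the authors' implementation. The four-level coding by glyph height (`BASE`, `ASCENDER`, `DESCENDER`, `FULL`), the character sets, the 1-D neighbourhood, and the function names `encode`, `lbp_1d`, and `lbp_histogram` are all assumptions made for illustration; the abstract does not specify how the energy status is assigned or which local binary operator is used.

```python
# Illustrative sketch only: the coding scheme and the 1-D operator below are
# assumptions, not the method specified in the paper.

# Hypothetical energy-status codes based on the vertical extent of a glyph.
BASE, ASCENDER, DESCENDER, FULL = 0, 1, 2, 3

# Hypothetical character classes for Serbian Latin script.
ASCENDER_CHARS = set("bdfhklt") | set("šđžčć")
DESCENDER_CHARS = set("gpqy")
FULL_CHARS = set("j")


def encode(text):
    """Map each letter of `text` to an assumed energy-status code."""
    codes = []
    for ch in text:
        if not ch.isalpha():
            continue  # spaces, digits and punctuation are skipped
        if ch in FULL_CHARS:
            codes.append(FULL)
        elif ch in DESCENDER_CHARS:
            codes.append(DESCENDER)
        elif ch.isupper() or ch in ASCENDER_CHARS:
            codes.append(ASCENDER)
        else:
            codes.append(BASE)
    return codes


def lbp_1d(codes, radius=1):
    """Compute a 1-D local binary pattern for every interior position.

    Each neighbour within `radius` contributes one bit, set when the
    neighbour's code is greater than or equal to the centre code.
    """
    patterns = []
    for i in range(radius, len(codes) - radius):
        center = codes[i]
        neighbours = codes[i - radius:i] + codes[i + 1:i + radius + 1]
        pattern = 0
        for bit, value in enumerate(neighbours):
            if value >= center:
                pattern |= 1 << bit
        patterns.append(pattern)
    return patterns


def lbp_histogram(codes, radius=1):
    """Return the normalised histogram of patterns as a feature vector."""
    bins = 2 ** (2 * radius)
    hist = [0.0] * bins
    patterns = lbp_1d(codes, radius)
    for p in patterns:
        hist[p] += 1.0
    if patterns:
        hist = [h / len(patterns) for h in hist]
    return hist


if __name__ == "__main__":
    # The same word in Ekavian and Ijekavian pronunciation.
    for word in ("mleko", "mlijeko"):
        print(word, encode(word), lbp_histogram(encode(word)))
```

Under these assumptions, the two spellings of the same word produce code sequences of different length and shape, so their pattern histograms differ. Feature vectors of this kind could then be passed to any standard classifier or clustering method.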
Use this identifier to cite or link to this document: https://hdl.handle.net/11564/770240