Is there a way to divide an Ancient Greek text (UTF-8) into syllables in R? I need to count the number of unique syllables in a corpus.
I cannot find an algorithm to do so, and the rules are quite complicated to write it from scratch.
Is there a way to divide an Ancient Greek text (UTF-8) into syllables in R? I need to count the number of unique syllables in a corpus.
I cannot find an algorithm to do so, and the rules are quite complicated to write it from scratch.
Basing on https://cran.r-project.org/web/packages/sylly/vignettes/sylly_vignette.html#fn2, here is a solution
Some words are not hyphenated correctly, though.