I want to guess the human language of a string. I found the Unicode scripts in Regular Expressions could do the trick. But I don't know what the script name stands for. As far as I know, Han
stands for Chinese, but what about others?
Unicode scripts in Regular Expressions
675 views Asked by Shisoft At
2
There are 2 answers
0
On
Don't know if it helps, but this is a great resource for information on writing scripts and languages: Omniglot . It may be that you are expected to know about these different scripts when using that feature of regexp.
I think this is what I need. Thanks @Jesper.
ISO 15924 Code Lists
List of Unicode Script names and their shorthand aliases, copied from PropertyValueAliases.txt: