RE2 is great. Fast and deterministic.
However, it supports only UTF-8. My strings are natively UTF-16, and converting back and forth on every match would kill performance.
How difficult would it be to implement native UTF-16 support in RE2?
How difficult would it be to implement native UCS-2 support in RE2? (This should be easier, since UCS-2 has no surrogate pairs.)
That is, roughly how many hours would an average programmer need to do this?
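To show why I expect UCS-2 to be the easier of the two, here is a minimal sketch of the per-character decoding step each encoding would need. DecodeUcs2 and DecodeUtf16 are hypothetical names of my own, not anything in RE2, and I am only assuming that the matcher would consume its input one code point at a time.

```cpp
#include <cstddef>
#include <cstdint>

// Hypothetical helper (not part of RE2): read one code point from UCS-2.
// Every 16-bit unit is a complete code point, so the step is always 1.
// Returns the number of code units consumed, or 0 if the input is empty.
inline std::size_t DecodeUcs2(const char16_t* p, std::size_t n,
                              std::uint32_t* out) {
  if (n == 0) return 0;
  *out = p[0];
  return 1;
}

// Hypothetical helper (not part of RE2): read one code point from UTF-16.
// A high surrogate followed by a low surrogate encodes one code point
// above U+FFFF, so the step is 1 or 2 code units.
inline std::size_t DecodeUtf16(const char16_t* p, std::size_t n,
                               std::uint32_t* out) {
  if (n == 0) return 0;
  std::uint32_t hi = p[0];
  if (hi >= 0xD800 && hi <= 0xDBFF && n >= 2) {
    std::uint32_t lo = p[1];
    if (lo >= 0xDC00 && lo <= 0xDFFF) {
      *out = 0x10000 + ((hi - 0xD800) << 10) + (lo - 0xDC00);
      return 2;
    }
  }
  // BMP character or unpaired surrogate: pass the unit through unchanged.
  *out = hi;
  return 1;
}
```

With UCS-2 the step is fixed at one unit, which is why it looks like the smaller change. UTF-16 would need the variable-width step wherever the input is advanced.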
This has been bothering me for a couple of weeks, so I thought I would ask!
Russ Cox, the creator of RE2, was kind enough to post the patch for UCS-2 support. Some assertions, however, are not supported for UCS-2. Reply from Russ is posted verbatim:
Link to code: http://code.google.com/p/re2/source/list