Is it possible to use a different lexer?

206 views Asked by user2629532 At 20 July 2017 at 09:35

I would like to use a different lexer with TatSu, but still use TatSu's parser. Is this possible? For example, in the grammar:

expr = NUM | ID | (expr '+' expr) ;

is it possible to use an alternative lexer to provide NUM and ID?

There are 2 answers

**Apalala** · Answer 1 · 2017-07-20T21:27:00+00:00

In general, PEG parsers don't use a separate lexer because they don't need one. Lexical elements can be specified using the same grammar language.

TatSu, a PEG parser generator, doesn't support separate lexers either, yet the Buffer class provides facilities for avoiding partial matches of literal tokens and for specifying lexical elements using regular expressions:

expr = num | id | (expr '+' expr) ;
num = /\d+/ ;
id = /[a-zA-Z_]\w*/ ;
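To illustrate what it means for lexical elements to live in the same grammar, here is a minimal hand-written sketch in plain Python (standard library only, not TatSu's implementation). The class name `ScannerlessParser` and the helper names are hypothetical. It also assumes the rule is restructured as `expr = term {'+' term}` with `term = num | id`, so the sketch does not need left recursion.

```python
import re

# Lexical elements are ordinary regex rules, same patterns as in the grammar.
NUM = re.compile(r"\d+")
ID = re.compile(r"[a-zA-Z_]\w*")
WS = re.compile(r"\s*")


class ScannerlessParser:
    """Hypothetical PEG-style parser that matches lexemes on demand."""

    def __init__(self, text):
        self.text = text
        self.pos = 0

    def _skip_ws(self):
        self.pos = WS.match(self.text, self.pos).end()

    def _pattern(self, regex):
        # Try a regex at the current position; return the lexeme or None.
        self._skip_ws()
        m = regex.match(self.text, self.pos)
        if m is None:
            return None
        self.pos = m.end()
        return m.group()

    def _literal(self, lit):
        # Match a literal token such as '+'.
        self._skip_ws()
        if self.text.startswith(lit, self.pos):
            self.pos += len(lit)
            return True
        return False

    def term(self):
        # term = num | id ;  (ordered choice: num is tried first)
        tok = self._pattern(NUM)
        if tok is None:
            tok = self._pattern(ID)
        if tok is None:
            raise SyntaxError(f"expected num or id at offset {self.pos}")
        return tok

    def expr(self):
        # expr = term {'+' term} ;  builds a left-nested tuple tree
        node = self.term()
        while self._literal("+"):
            node = ("+", node, self.term())
        return node

    def parse(self):
        tree = self.expr()
        self._skip_ws()
        if self.pos != len(self.text):
            raise SyntaxError(f"unexpected input at offset {self.pos}")
        return tree


def parse(text):
    return ScannerlessParser(text).parse()
```

With this sketch, `parse("x + 42")` returns `("+", "x", "42")`. There is no token stream anywhere: each rule asks the input for the lexeme it needs at the current position.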

**Apalala** · Answer 2 · 2021-03-24T23:57:01+00:00

Recent versions of TatSu allow the use of a different lexer (called Tokenizer in TatSu).

The parser will probably have to rely on semantic actions to verify the grammar rules that correspond to tokens.
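As a rough picture of that division of labor, here is a standard-library sketch of a separate tokenizer feeding a parser that checks token kinds. It does not use or reproduce TatSu's Tokenizer interface. `Token`, `tokenize`, `TokenParser`, and `parse_tokens` are hypothetical names, and the `_expect` check only stands in for the verification a semantic action would do.

```python
import re
from typing import Iterator, NamedTuple


class Token(NamedTuple):
    kind: str  # "NUM", "ID", or "PLUS"
    text: str


# One alternative per token kind; leading whitespace is skipped.
TOKEN_RE = re.compile(r"\s*(?:(?P<NUM>\d+)|(?P<ID>[a-zA-Z_]\w*)|(?P<PLUS>\+))")


def tokenize(source: str) -> Iterator[Token]:
    """Standalone lexer: turns text into a stream of Token values."""
    pos = 0
    end = len(source.rstrip())
    while pos < end:
        m = TOKEN_RE.match(source, pos)
        if m is None:
            raise SyntaxError(f"bad character near offset {pos}")
        yield Token(m.lastgroup, m.group(m.lastgroup))
        pos = m.end()


class TokenParser:
    """Parser that never looks at characters, only at token kinds."""

    def __init__(self, tokens):
        self.tokens = list(tokens)
        self.pos = 0

    def _expect(self, *kinds):
        # Verify that the next token has a kind the current rule accepts.
        if self.pos >= len(self.tokens) or self.tokens[self.pos].kind not in kinds:
            raise SyntaxError(f"expected {' or '.join(kinds)} at token {self.pos}")
        tok = self.tokens[self.pos]
        self.pos += 1
        return tok

    def expr(self):
        # expr = (NUM | ID) {PLUS (NUM | ID)} ;
        node = self._expect("NUM", "ID")
        while self.pos < len(self.tokens) and self.tokens[self.pos].kind == "PLUS":
            self.pos += 1
            node = ("+", node, self._expect("NUM", "ID"))
        return node

    def parse(self):
        tree = self.expr()
        if self.pos != len(self.tokens):
            raise SyntaxError(f"unexpected token at index {self.pos}")
        return tree


def parse_tokens(source: str):
    return TokenParser(tokenize(source)).parse()
```

Here `parse_tokens("x + 42")` returns `("+", Token("ID", "x"), Token("NUM", "42"))`. The lexer decides what counts as NUM or ID, and the parser only checks that the kinds arrive in an order the grammar allows.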

There are some unfinished experiments from my work helping with the Python PEG parser at https://github.com/neogeny/pygl.
