Re: [racket] Survey of parsing libraries for Racket?

Neil Van Dyke Sat, 10 Mar 2012 13:25:45 -0800

Danny Yoo wrote at 03/10/2012 03:06 PM:

For people who have used these libraries, how was the experience?
Basically, I'm trying to find something powerful and stable to work
with.

I used "parser-tools" successfully the other day, to implement a parserfor a subset of PDF. It seemed great as a Lex&Yacc replacement (youcan't beat doing a parser toolkit as syntax extension), but not quitethe be-all and end-all. I haven't looked at the other Racket-basedparser tools.


Details follow...

I have used a bunch of different parser tools with other languages inthe past, and I'd say that "parser-tools" is a Lex&Yacc in Racket, witha few frills added for tokens, and it did give me all the hooks I neededto help things along with arbitrary code.

The parser grammar wasn't quite as readable as it would be if one couldput keyword lexemes literally in the grammar (some other parser toolkitsmap literals to tokens directly, and you may or may not define thekeywords separately). This is something you could layer atop with yourown pretty simple syntax extension, of course.

As I mentioned the other day, it didn't have some EBNF shorthand, whichI missed when I was writing the grammar, but found would have gotten inthe way when I built my AST. Again, EBNF is something you could layeratop with your own macro, especially if you have your macro implementyour own particular way of AST-building.

Some toolkits don't make the distinction that Lex and Yacc does, and youinstead use the same metalanguage to build up from characters to fullgrammars. If you want to do that, I suppose you might be able to layerthat atop "parser-tools" reasonably.

Language class is of course a consideration in whatever parser tool youuse. I don't recall for certain what class I needed, but it might havebeen only LL(1). At a glance, it looked doable in Yacc without anyconflicts, so I didn't have to look further. For the lexer, I needed toscan literals that involved balancing arbitrarily-nested parens, whichis not a job for regexps, but "parser-tools" gave me an easy hook tocode that part of the lexer manually.

"parser-tools" seems to have a lot of support for syntax position, whichI did not use for this project, but would for most projects.

I didn't look at whether "parser-tools" has error reporting/recoveryfeatures, but that's another thing that I've had some toolkits help withwhen parsing really nasty languages.

I also did not measure performance of "parser-tools", but it didn't seembad for what I was doing.

Holistically, the combination of "parser-tools" with Racket makes it thebest overall parsing toolkit I've used for a project, even though"parser-tools" didn't have all the conveniences I've found in sometoolkits that pair with much less nice languages.

Incidentally, I'm not sure of the performance implications, but I likethe idea of having the parser for a programming language translate thesyntax objects to sexp-like syntax objects promptly, and then"syntax-parse" the heck out of that newly sexp-encoded language to turnit into Racket code. I'm also doing a related, syntax-object-heavything in the McFly embedded documentation tool, in which McFly fills outthings like Scribble "defproc" signatures by parsing bits of informationfrom "lambda" argument forms, contracts, explicitly-provided pieces of"defproc", (later I'll add Typed Racket, too), etc., translating allthat info to a normalized form, and then using a simple unification ofthe various info before running the unified normalized form throughanother syntax transformer to output a Scribble "defproc". Surely notthe fastest way to do it, but I suspect it's in the noise when weconsider how much crunching Scribble already does.


Neil V.

--
http://www.neilvandyke.org/
____________________
 Racket Users list:
 http://lists.racket-lang.org/users

Re: [racket] Survey of parsing libraries for Racket?

Reply via email to