Max Brunsfeld
df05c75525
Backfill unit tests for follow sets
...
Also, add some comment docs for some compiler functions
2014-02-26 00:39:31 -08:00
Max Brunsfeld
e58a6d8ba7
Start work on error recovery
...
- In runtime, make parse errors part of the parse tree
- Add error state to lexers in which they can accept any token
2014-02-24 18:42:54 -08:00
Max Brunsfeld
713b3899c5
Simplify logic for extracting tokens from grammar
2014-02-23 10:00:49 -08:00
Max Brunsfeld
e87380a8b7
Make separate PreparedGrammar subclass of Grammar for internal use
2014-02-19 13:36:38 -08:00
Max Brunsfeld
acad97cfd2
Separate concepts of hidden and auxiliary symbols
...
This way, we can hide auxiliary symbols from library users,
but still allow them to use hidden symbols
2014-02-19 13:05:54 -08:00
Max Brunsfeld
2c56612650
Get makefile working
2014-02-18 09:07:00 -08:00
Max Brunsfeld
fd757c20d0
Move example grammars from fixtures -> examples dir
2014-02-17 13:18:09 -08:00
Max Brunsfeld
585f9f0054
Fix misplaced file
2014-02-17 12:59:24 -08:00
Max Brunsfeld
9e2dc14182
Extract public compiler API into its own header file
2014-02-17 12:40:46 -08:00
Max Brunsfeld
5c1a0982df
Change repeat to mean zero-or-more
2014-02-15 15:43:32 -08:00
Max Brunsfeld
bf07522026
Fix bug in FIRST set function
2014-02-13 18:52:17 -08:00
Max Brunsfeld
a77ca1ee08
Allow users to put their own auxiliary rules in grammars
2014-02-13 13:09:00 -08:00
Max Brunsfeld
70173512f1
Don't extract isolated blanks from grammars into tokens
2014-02-12 18:31:57 -08:00
Max Brunsfeld
9a737e2316
Use map, not unordered_map in Grammar
2014-02-12 08:30:50 -08:00
Max Brunsfeld
27f305d556
Put rule_can_be_blank function in its own file
2014-02-11 13:21:45 -08:00
Max Brunsfeld
1962c17f45
Remove transition_map class
2014-02-11 13:15:44 -08:00
Max Brunsfeld
3cb65c9c81
Make the EOF be an auxiliary symbol
...
This way, it couldn’t conflict if a user had a rule called “__END__”
2014-02-10 18:53:01 -08:00
Max Brunsfeld
15c9e2d398
Make ordering of cases deterministic in generated parsers
2014-02-10 18:38:01 -08:00
Max Brunsfeld
8baa1396fd
Properly merge transitions on overlapping character sets!
2014-02-10 13:20:43 -08:00
Max Brunsfeld
905a408998
Make separate functions for character-level and syntax-level rule transitions
2014-02-09 14:31:27 -08:00
Max Brunsfeld
d3d25f2683
Represent character sets as sets of character ranges
2014-02-05 18:56:04 -08:00
Max Brunsfeld
8cce11a52a
Rename Character -> CharacterSet, CharacterMatch -> CharacterRange
2014-02-03 13:05:51 -08:00
Max Brunsfeld
5ed5ae7514
In LR(1) items, only store consumed symbols as booleans
...
the booleans represent the symbols point to auxiliary tokens
or not. This is all we need to know for the purpose of building
parse tables. Any other information just leads to redundant
parse states.
2014-01-31 00:13:05 -08:00
Max Brunsfeld
0d3a941848
Add support for character ranges in regex patterns
2014-01-30 18:54:39 -08:00
Max Brunsfeld
60e2d00b4d
Parse simple character sets in pattern rules
2014-01-30 13:04:31 -08:00
Max Brunsfeld
28e10dc722
Fix bug in character rule equality
2014-01-30 13:04:10 -08:00
Max Brunsfeld
7f62e752be
Allow Character rules to handle arbitrary character sets
2014-01-30 08:34:20 -08:00
Max Brunsfeld
7d297f2f9e
Remove auxiliary rules from syntax tree
2014-01-28 22:09:37 -08:00
Max Brunsfeld
c3b65d22bf
Improve prepare_grammar specs
2014-01-28 18:44:14 -08:00
Max Brunsfeld
fd0d77ef8b
Separate auxiliary rules from user-specified rules
2014-01-28 13:27:30 -08:00
Max Brunsfeld
19e5b2a563
Make token extraction work for repeat rules
2014-01-28 12:52:29 -08:00
Max Brunsfeld
ca33c3942a
In parse table, store symbols as Symbol objects, not strings
2014-01-27 13:40:10 -08:00
Max Brunsfeld
3ca2e126be
Remove unnecessary public START and END constants
2014-01-25 21:34:46 -08:00
Max Brunsfeld
67fa81d079
Convert repeat rules into pairs of recursive rules
2014-01-24 18:27:29 -08:00
Max Brunsfeld
8208aae060
Fix transitions for sequences whose left side can be blank
2014-01-23 23:56:37 -08:00
Max Brunsfeld
5776846227
Make compile take a name argument
...
Start work on JSON fixture grammar
2014-01-23 13:48:07 -08:00
Max Brunsfeld
bb42543f1b
Fix broken compiler test
2014-01-23 13:10:37 -08:00
Max Brunsfeld
e174b89133
Treat end of input like a normal token
2014-01-22 23:04:11 -08:00
Max Brunsfeld
3c42de1718
Fix errors in item set transition computation
2014-01-21 23:45:02 -08:00
Max Brunsfeld
be5e406c96
Do less work when loading build_tables spec
2014-01-21 23:35:04 -08:00
Max Brunsfeld
1bf216b796
Rename next_{terminals,non_terminals} to first_set and follow_sets
...
This is to prepare for keeping track of lookahead symbols as part
of computing follow sets
2014-01-19 01:49:56 -08:00
Max Brunsfeld
29f73afbc5
Compute FIRST sets correctly
2014-01-13 12:57:48 -08:00
Max Brunsfeld
7a3092b765
Refactor parser error messages
...
- move message generation to TSParseError,
so that Parser doesn’t need to import <string>, and can
be compiled as plain C.
2014-01-11 17:59:45 -08:00
Max Brunsfeld
f342067293
Clean up specs
2014-01-11 17:08:32 -08:00
Max Brunsfeld
8881214f0d
Separate table building operations from Item, ItemSet value objects
2014-01-11 16:49:57 -08:00
Max Brunsfeld
92cec5758f
Reorganize compiler directory
2014-01-11 15:14:17 -08:00
Max Brunsfeld
55809f702d
Add parser error messages
2014-01-08 18:35:16 -08:00
Max Brunsfeld
0985fa3008
Collapse rules that contain only a single token
2014-01-05 01:19:32 -08:00
Max Brunsfeld
04d18b56ed
Cleanup
2014-01-04 15:30:05 -08:00
Max Brunsfeld
29c81167c0
Remove subclasses of Symbol for terminals and non-terminals
2014-01-04 15:01:06 -08:00