tree-sitter

Author	SHA1	Message	Date
Max Brunsfeld	2a9f51790f	Move is_token function to its own file	2014-09-07 13:49:44 -07:00
Max Brunsfeld	ed11ef557a	Fix expansion of repeat rules into recursive rules Previously, the way repeat rules were expanded, the auxiliary rule always needed to be reduced, even if the repeating content was empty. This caused problems in parse states where some items contained the repeat rule and some did not. To make those cases work, the repeat rule had to explicitly be marked as optional. With this change, that is no longer necessary.	2014-09-07 09:39:14 -07:00
Max Brunsfeld	c0a3f8d39c	Remove some macros from public parser header	2014-09-05 23:47:38 -07:00
Max Brunsfeld	d3204d3526	Include '_' in '\w' regex character class	2014-09-05 18:41:12 -07:00
Max Brunsfeld	545e575508	Revert "Remove the separator characters construct" This reverts commit `5cd07648fd`. The separators construct is useful as an optimization. It turns out that constructing a node for every chunk of whitespace in a document causes a significant performance regression. Conflicts: src/compiler/build_tables/build_lex_table.cc src/compiler/grammar.cc src/runtime/parser.c	2014-09-02 08:03:51 -07:00
Max Brunsfeld	e941f8c175	Fix error in document editing When breaking down the stack in parser.c, the previous code would not account for ubiquitous tokens. This was a problem for a long time, but wasn't noticed until ubiquitous tokens started being used to represent separator characters	2014-09-01 21:32:29 -07:00
Max Brunsfeld	5cd07648fd	Remove the separator characters construct Now, grammars can handle whitespace by making it another ubiquitous token, like comments. For now, this has the side effect of whitespace being included in the tree that precedes it. This was already an issue for other ubiquitous tokens though, so it needs to be fixed anyway.	2014-09-01 20:19:43 -07:00
Max Brunsfeld	346cf4fe5d	Remove LEX_PANIC macro	2014-08-26 13:12:12 -07:00
Max Brunsfeld	8f4939a3d3	unsigned char -> uint32_t in CharacterRange	2014-08-24 01:05:59 -07:00
Max Brunsfeld	9338249075	Remove implicit CharacterRange constructors Also fix misc smaller lint errors	2014-08-23 14:52:44 -07:00
Max Brunsfeld	0bb5663f0f	Refactor - represent char sets in terms of inclusions and exclusions	2014-08-23 14:25:45 -07:00
Max Brunsfeld	6f374fddff	Tweak format for pretty printing some classes	2014-08-23 13:52:00 -07:00
Max Brunsfeld	2963b08f79	Eliminate duplicates when constructing choice rules	2014-08-21 20:04:42 -07:00
Max Brunsfeld	fcdcdca303	Whitespace	2014-08-09 01:07:14 -07:00
Max Brunsfeld	1e79ed794b	Allow multiple top-level nodes Now, the root node of a document is always a document node. It will often have only one child node which corresponds to the grammar's start symbol, but not always. Currently, it may have more than one child if there are ubiquitous tokens such as comments at the beginning of the document. In the future, it will also be possible be possible to have multiple for the document to have multiple children if the document is partially parsed.	2014-08-09 00:00:20 -07:00
Max Brunsfeld	8da9219c3a	Remove redundant functions for Documents There's no need for a `string` function since one already exists for Nodes. Now the root node is always stored on the document. This means callers of `ts_document_root_node` don't need to release its return value.	2014-08-08 12:58:51 -07:00
Max Brunsfeld	41c4e7cd8f	Fix more namespace formatting	2014-08-08 08:35:26 -07:00
Max Brunsfeld	9366f11dcb	In generated C, only format printable characters as char literals	2014-08-07 08:12:15 -07:00
Max Brunsfeld	01571da30d	Handle more escaped characters in regexps	2014-08-03 21:57:21 -07:00
Max Brunsfeld	eecbcccee0	Remove generated parsers' dependency on the runtime library Generated parsers no longer export a parser constructor function. They now export an opaque Language object which can be set on Documents directly. This way, the logic for constructing parsers lives entirely in the runtime. The Languages are just structs which have no load-time dependency on the runtime	2014-07-30 23:40:02 -07:00
Max Brunsfeld	048a479b5f	Fix missing initializer warning in gcc	2014-07-29 13:04:19 -07:00
Max Brunsfeld	6a8addb84f	Fix segfault when grammar consists of a single token	2014-07-23 18:26:10 -07:00
Max Brunsfeld	98cc2f2264	Auto-format all source code with clang-format	2014-07-21 13:20:00 -07:00
Max Brunsfeld	6951acb13b	Fix error when grammar contains to error productions	2014-07-13 21:26:21 -07:00
Max Brunsfeld	4d14a65e22	In build_parse_table, switch recursion to explicit iteration	2014-07-13 18:06:37 -07:00
Max Brunsfeld	b217cd38fb	Handle built-in symbols correctly in conflict manager	2014-07-13 17:59:57 -07:00
Max Brunsfeld	44c4bf5f5e	Refactor add_ubiquitous_token_actions method	2014-07-11 13:21:44 -07:00
Max Brunsfeld	f4287c07d0	Fix ParseStateId / size_t confusion in parse table	2014-07-07 13:21:30 -07:00
Max Brunsfeld	77df7fe511	In lexer, always prefer the longest match Only use rules' precedence to decide between two tokens that match the same string	2014-07-03 08:57:35 -07:00
Max Brunsfeld	67590eddc7	Fix stream operator for new parse action types	2014-07-03 08:15:36 -07:00
Max Brunsfeld	18ae326459	Fix lint errors	2014-07-02 09:01:38 -07:00
Max Brunsfeld	6424eea62a	Refactor handling of ubiquitous tokens when building parse table	2014-07-01 21:43:26 -07:00
Max Brunsfeld	83a1b9439e	Fix handling of ubiquitous tokens used in grammar rules	2014-07-01 20:47:35 -07:00
Max Brunsfeld	7a6d3365c5	Remove helper function in rule_transitions.cc	2014-06-28 18:09:32 -07:00
Max Brunsfeld	3be648593e	merge_{sym,char}_transitions -> merge_{sym,char}_transition	2014-06-28 17:02:48 -07:00
Max Brunsfeld	9bad5dff3e	Avoid unnecessary std::map construction when merging transition sets	2014-06-26 13:42:42 -07:00
Max Brunsfeld	9686c57e90	Allow ubiquitous tokens to also be used in grammar rules	2014-06-26 08:52:42 -07:00
Max Brunsfeld	a9dff20658	Make grammars' separator characters configurable	2014-06-26 07:31:08 -07:00
Max Brunsfeld	7df35f9b8d	Make separate types for syntax and lexical grammars This way, the separator characters can be added as a field to lexical grammars only	2014-06-25 13:27:16 -07:00
Max Brunsfeld	240de1ca61	Reorder methods in c code generator	2014-06-18 08:21:43 -07:00
Max Brunsfeld	2c382b7363	Trim trailing whitespace	2014-06-16 21:33:35 -07:00
Max Brunsfeld	c312f985c8	Refactor c code generator It's been rewritten in a less functional style. String copies were actually taking significant time for large parsers.	2014-06-16 21:29:04 -07:00
Max Brunsfeld	1daaf4485f	Refactor item set transition functions	2014-06-16 13:37:34 -07:00
Max Brunsfeld	17040e32ec	Remove unused version of first_set function	2014-06-16 13:25:30 -07:00
Max Brunsfeld	39c1ab2d50	Refactor item_set_closure Inline unnecessary function	2014-06-16 13:20:39 -07:00
Max Brunsfeld	7a2c2c1c90	Store ParseItemSets as maps, w/ core items as keys ParseItem no longer has a lookahead_sym field; it now represents the 'core' of a parse item. The lookahead context is stored separately, as a set per core item. This makes iterating, copying and merging item sets more efficient, because before, the core items were repeated for each different lookahead symbol. Also, the memoization in sym_transitions(ParseItemSet) has been removed. Maybe I'll add it back later.	2014-06-16 08:35:20 -07:00
Max Brunsfeld	d203c15911	Remove unneeded blank rule in lex table builder	2014-06-11 17:43:07 -07:00
Max Brunsfeld	bb4d83ce47	Add regex postfix flags to javascript grammar - Refactor statement terminators in javascript grammar - Reorganize javascript language tests	2014-06-11 16:43:27 -07:00
Max Brunsfeld	3cd031af38	Add `keypattern` rule helper This way, pattern rules (e.g. golang's comment) can be easily given the same precedence as keyword rules.	2014-06-11 12:40:49 -07:00
Max Brunsfeld	174f306e2a	Fix precedence of comments vs '/' operator	2014-06-11 12:27:58 -07:00

1 2 3 4 5 ...

272 commits