tree-sitter

Author	SHA1	Message	Date
Max Brunsfeld	a69dfa08f3	Add spec for inserting text w/ unicode characters	2014-10-02 11:54:00 -07:00
Max Brunsfeld	ed11ef557a	Fix expansion of repeat rules into recursive rules Previously, the way repeat rules were expanded, the auxiliary rule always needed to be reduced, even if the repeating content was empty. This caused problems in parse states where some items contained the repeat rule and some did not. To make those cases work, the repeat rule had to explicitly be marked as optional. With this change, that is no longer necessary.	2014-09-07 09:39:14 -07:00
Max Brunsfeld	545e575508	Revert "Remove the separator characters construct" This reverts commit `5cd07648fd`. The separators construct is useful as an optimization. It turns out that constructing a node for every chunk of whitespace in a document causes a significant performance regression. Conflicts: src/compiler/build_tables/build_lex_table.cc src/compiler/grammar.cc src/runtime/parser.c	2014-09-02 08:03:51 -07:00
Max Brunsfeld	5cd07648fd	Remove the separator characters construct Now, grammars can handle whitespace by making it another ubiquitous token, like comments. For now, this has the side effect of whitespace being included in the tree that precedes it. This was already an issue for other ubiquitous tokens though, so it needs to be fixed anyway.	2014-09-01 20:19:43 -07:00
Max Brunsfeld	b91f48ced2	Call handle_error even when error occurs exactly where expected Previously, if an error happened right at the beginning of an error production, the error node would be immediately shifted onto the stack without calling the error handling function.	2014-08-27 18:44:27 -07:00
Max Brunsfeld	77941c85ff	Avoid building incomplete error nodes during lexing The lexer doesn't know the expected symbols, so it doesn't have enough information to construct error nodes. Now, when it encounters an invalid character, it returns NULL and the parser builds a correct error node.	2014-08-25 23:35:00 -07:00
Max Brunsfeld	1e79ed794b	Allow multiple top-level nodes Now, the root node of a document is always a document node. It will often have only one child node which corresponds to the grammar's start symbol, but not always. Currently, it may have more than one child if there are ubiquitous tokens such as comments at the beginning of the document. In the future, it will also be possible for the document to have multiple children if the document is partially parsed.	2014-08-09 00:00:20 -07:00
Max Brunsfeld	8da9219c3a	Remove redundant functions for Documents There's no need for a `string` function since one already exists for Nodes. Now the root node is always stored on the document. This means callers of `ts_document_root_node` don't need to release its return value.	2014-08-08 12:58:51 -07:00
Max Brunsfeld	7ba3953f7e	Simplify handling of ubiquitous tokens during reduce	2014-08-08 08:46:01 -07:00
Max Brunsfeld	b155994491	Fix indentation in specs	2014-08-07 08:11:21 -07:00
Max Brunsfeld	0d6d09cbd9	In generated parsers, export language as a function	2014-07-31 13:04:46 -07:00
Max Brunsfeld	eecbcccee0	Remove generated parsers' dependency on the runtime library Generated parsers no longer export a parser constructor function. They now export an opaque Language object which can be set on Documents directly. This way, the logic for constructing parsers lives entirely in the runtime. The Languages are just structs which have no load-time dependency on the runtime	2014-07-30 23:40:02 -07:00
Max Brunsfeld	b3385f20c8	Hide TSTree, expose TSNode	2014-07-17 23:29:11 -07:00
Max Brunsfeld	779bf0d745	Don't store tree's hidden children in a separate array Just mark hidden trees as such, and skip them when pretty-printing a tree	2014-07-17 13:36:53 -07:00
Max Brunsfeld	6e551d6d9f	Simplify handling of multiple top-level nodes after parsing	2014-07-14 20:46:20 -07:00
Max Brunsfeld	1c7d2d2d03	Add for-in loops and math assignment operators to js grammar	2014-07-07 13:35:55 -07:00
Max Brunsfeld	77df7fe511	In lexer, always prefer the longest match Only use rules' precedence to decide between two tokens that match the same string	2014-07-03 08:57:35 -07:00
Max Brunsfeld	c85841364e	Add throw statements to js grammar	2014-07-03 08:20:43 -07:00
Max Brunsfeld	83a1b9439e	Fix handling of ubiquitous tokens used in grammar rules	2014-07-01 20:47:35 -07:00
Max Brunsfeld	9686c57e90	Allow ubiquitous tokens to also be used in grammar rules	2014-06-26 08:52:42 -07:00
Max Brunsfeld	2c382b7363	Trim trailing whitespace	2014-06-16 21:33:35 -07:00
Max Brunsfeld	bb4d83ce47	Add regex postfix flags to javascript grammar - Refactor statement terminators in javascript grammar - Reorganize javascript language tests	2014-06-11 16:43:27 -07:00
Max Brunsfeld	082560dd6e	Fix operator precedence of '.' operator in js grammar	2014-06-11 14:01:38 -07:00
Max Brunsfeld	174f306e2a	Fix precedence of comments vs '/' operator	2014-06-11 12:27:58 -07:00
Max Brunsfeld	4ad6278334	Add finally, instanceof, typeof, in to js grammar	2014-06-11 11:49:06 -07:00
Max Brunsfeld	c91c5cb730	Add range statements to golang grammar	2014-06-10 14:11:25 -07:00
Max Brunsfeld	53bc633a22	Add var decl and if statements to golang grammar	2014-06-10 13:27:55 -07:00
Max Brunsfeld	1c93d5e1a6	Add declarations w/o initialization to golang grammar	2014-06-10 11:57:45 -07:00
Max Brunsfeld	3968f36a03	Reorganize golang specs	2014-06-10 11:39:36 -07:00
Max Brunsfeld	123d3b26d8	Add more expressions, statements to golang grammar	2014-06-10 11:33:05 -07:00
Max Brunsfeld	e93e254518	In lexer, prefer tokens to skipped separator characters This was causing newlines in go and javascript to be parsed as meaningless separator characters instead of statement terminators	2014-05-30 13:29:54 -07:00
Max Brunsfeld	2988cc5aa2	Show offending lookahead chars when pretty-printing trees w/ errors	2014-05-26 21:50:01 -07:00
Max Brunsfeld	6f45380f71	Move type-related tests for go grammar into their own file	2014-05-26 21:48:00 -07:00
Max Brunsfeld	4c9ac3dada	Fix parsing of empty strings in javascript and golang	2014-05-20 09:47:26 -07:00
Max Brunsfeld	2d0f90c7d5	Add try and while statements to js grammar	2014-05-09 21:36:18 -07:00
Max Brunsfeld	e4be585c43	Handle ubiquitous tokens at the beginning of programs As a final step before returning the finished parse tree, check if there are still multiple nodes on the stack. If so, make the inner nodes children of the top node.	2014-05-09 12:46:36 -07:00
Max Brunsfeld	4700e33746	Introduce 'ubiquitous_tokens' concept, for parsing comments and such	2014-05-06 12:54:04 -07:00
Max Brunsfeld	bae32adc7b	Add constructor calls, pre/postfix operators to js grammar	2014-05-04 13:36:19 -07:00
Max Brunsfeld	1bdd87535a	Add prefix math operators +, - to javascript grammar	2014-05-02 07:42:13 -07:00
Max Brunsfeld	a2c125998e	Add single quoted strings and regexes to javascript grammar	2014-05-01 12:43:53 -07:00
Max Brunsfeld	801f4bd0a8	Add returns, deletes and bool operators to js grammar	2014-04-25 22:08:11 -07:00
Max Brunsfeld	61692c8bb1	Add error recovery in function calls to javascript grammar	2014-04-24 13:22:54 -07:00
Max Brunsfeld	68c26a06b1	Add comments to javascript grammar	2014-04-24 13:22:23 -07:00
Max Brunsfeld	52c338ed60	Add some infix math operators to javascript grammar	2014-04-23 22:25:48 -07:00
Max Brunsfeld	7be8d469b8	Add ternary expressions to javascript grammar	2014-04-23 22:15:07 -07:00
Max Brunsfeld	a437d39773	Add rule precedence construct Still need to add some way of expressing left and right associativity	2014-04-15 08:40:46 -07:00
Max Brunsfeld	be1c8e0f17	Add dynamic property access to javascript grammar	2014-04-05 15:55:20 -07:00
Max Brunsfeld	2191a7d988	Add switch statements to javascript grammar	2014-04-04 13:10:33 -07:00
Max Brunsfeld	129d2b9314	Remove extra EOF actions in lexer	2014-04-04 08:44:35 -07:00
Max Brunsfeld	1cc7e32e2d	Fix handling of tokens consisting of separator characters The parser is no longer hard-coded to skip whitespace. Tokens such as newlines, whose characters overlap with the separator characters, can now be correctly recognized.	2014-04-03 19:10:09 -07:00

69 commits