tree-sitter

Author	SHA1	Message	Date
Max Brunsfeld	b833942bb8	Clean up Document spec	2016-12-21 11:42:32 -08:00
Max Brunsfeld	e6c82ead2c	Start work toward maintaining external scanner's state during incremental parses	2016-12-20 17:06:20 -08:00
Max Brunsfeld	0e595346be	Make lexer log output easier to read	2016-12-09 13:33:37 -08:00
Max Brunsfeld	535879a2bd	Represent byte, char and tree counts as 32 bit numbers. The parser spends the majority of its time allocating and freeing trees and stack nodes. Also, the memory footprint of the AST is a significant concern when using tree-sitter with large files. This library is already unlikely to work very well with source files larger than 4GB, so representing rows, columns, byte lengths and child indices as unsigned 32 bit integers seems like the right choice.	2016-11-14 12:19:13 -08:00
Max Brunsfeld	eed54d95e1	Merge branch 'master' into changed-ranges	2016-10-16 21:10:25 -07:00
Max Brunsfeld	b3140b2689	Implement ts_document_parse_and_get_changed_ranges	2016-10-15 22:31:21 -07:00
Timothy Clem	0ffebc3d0c	s/TSDebugger/TSLogger in spec name	2016-10-05 08:48:12 -07:00
Max Brunsfeld	3014101104	Fix inconsistencies in nodes sizes after edits	2016-09-19 13:35:08 -07:00
Max Brunsfeld	00528e50ce	Change edit API to be byte-based	2016-09-13 13:08:52 -07:00
Max Brunsfeld	131bbee160	Rename parse_and_diff -> parse_and_get_changed_ranges (Signed-off-by: Nathan Sobo <nathan@github.com>)	2016-09-08 17:51:34 -07:00
Max Brunsfeld	fce8d57152	Start work on document_parse_and_diff API	2016-09-08 17:51:20 -07:00
Max Brunsfeld	38241d466b	Rename .read_fn, .seek_fn -> .read, .seek	2016-09-06 21:39:10 -07:00
Max Brunsfeld	096ac2d4b6	Rename ts_document_set_debugger -> ts_document_set_logger	2016-09-06 17:40:26 -07:00
Max Brunsfeld	64a6c9db0e	Rename ts_document_make -> ts_document_new	2016-09-06 17:26:18 -07:00
Max Brunsfeld	87ca3cb099	Reuse nodes based on state matching, not sentential form validity. I think that state matching is the only correct strategy for incremental node reuse that is compatible with the new error recovery algorithm. It's also simpler than the sentential-form algorithm. With the compressed parse tables, state matching shouldn't be too conservative of a test.	2016-07-31 21:31:19 -07:00
Max Brunsfeld	09b019c530	Fix test for invalid blank input	2016-06-23 09:24:26 -07:00
Max Brunsfeld	c6e9b32d3f	Print all the same parse log messages for both debugging methods	2016-06-22 22:36:11 -07:00
Max Brunsfeld	1e353381ff	Don't create error node in lexer unless token is completely invalid. Before, any syntax error would cause the lexer to create an error leaf node. This could happen even with a valid input, if the parse stack had split and one particular version of the parse stack failed to parse. Now, an error leaf node is only created when the lexer cannot understand part of the input stream at all. When a normal syntax error occurs, the lexer just returns a token that is outside of the expected token set, and the parser handles the unexpected token.	2016-05-26 14:15:10 -07:00
Max Brunsfeld	2e35587161	Use new stack_pop_until function for repairing errors	2016-03-07 20:06:46 -08:00
Max Brunsfeld	bc8df9f5c5	Avoid recompiling test languages when possible	2016-03-03 12:05:04 -08:00
Max Brunsfeld	aef7582a2a	Start using the forward move to recover from errors. Some unit tests passing. Corpus tests still failing.	2016-03-02 21:08:42 -08:00
Max Brunsfeld	e7abfdd373	Prevent string assertion failures from creating later memory leak errors	2016-03-02 20:58:39 -08:00
Max Brunsfeld	b80a330a74	Fix assorted memory leaks in test code	2016-02-05 12:23:54 -08:00
Max Brunsfeld	7c44b0e387	Fix leaked lookahead trees in normal parsing	2016-01-29 17:31:43 -08:00
Max Brunsfeld	0f7dbea9a3	Unify test targets, use externally defined languages as fixtures	2016-01-15 11:19:24 -08:00
Max Brunsfeld	f2e7058ad9	Support UTF16 directly. This makes the API easier to use from javascript.	2015-12-28 13:53:22 -08:00
Max Brunsfeld	c495076adb	Record in parse table which actions can hide splits. Suppose a parse state S has multiple actions for a terminal lookahead symbol A. Then during incremental parsing, while in state S, the parser should not reuse a non-terminal lookahead B where FIRST(B) contains A, because reusing B might prematurely discard one of the possible actions that a batch parser would have attempted in state S, upon seeing A as a lookahead.	2015-12-17 13:11:56 -08:00
Max Brunsfeld	8a146a9bef	Reset lexer correctly when old input was blank	2015-12-03 10:00:39 -08:00
Max Brunsfeld	aba8af9e5b	Cleanup debug logging in parser	2015-09-22 19:35:13 -07:00
Max Brunsfeld	f37f73f92f	Add ability to edit multiple times between parses	2015-09-18 23:20:06 -07:00
Max Brunsfeld	252fa7b631	Add Document getter methods for input, language	2015-09-08 23:33:43 -07:00
Max Brunsfeld	ebd60213d9	Drop release functions from callback structs. The caller can just as easily take care of the cleanup explicitly.	2015-09-08 23:24:33 -07:00
Max Brunsfeld	53926c467e	Don't automatically hide singleton nodes	2015-09-02 16:36:29 -07:00
Max Brunsfeld	21258e6a9e	Remove 'document' wrapper node	2015-08-22 10:48:34 -07:00
Max Brunsfeld	54e40b8146	Rework AST access API: reduce heap allocation	2015-07-31 15:47:48 -07:00
Max Brunsfeld	766e3bab2c	Use 2-space continuation indent consistently in specs	2015-07-27 18:18:58 -07:00
Max Brunsfeld	f7e4445358	Handle goto actions after reductions in a more standard way. Rather than letting the reduced tree become the new lookahead symbol, and re-adding it to the stack via a subsequent shift action, just add it to the stack as part of the reduce action. This is more in line with the way LR is described traditionally.	2015-05-27 10:53:02 -07:00
Max Brunsfeld	8bd11e1b58	Fix parser debug messages in spec	2015-03-07 16:40:39 -08:00
Max Brunsfeld	ec31249fe6	Add commas in expected message in document debugging spec	2014-11-01 17:03:32 -07:00
Max Brunsfeld	de9a48d11f	Tweak debugging in parser and lexer	2014-10-22 20:10:08 -07:00
Max Brunsfeld	41a067fef9	Fix build warnings in document spec	2014-10-17 21:27:49 -07:00
Max Brunsfeld	8cf800ef5d	Unify debugging API for parsing and lexing	2014-10-17 17:52:54 -07:00
Max Brunsfeld	b5d022a70c	Fix missing field warnings for debugger structs	2014-10-14 22:50:24 -07:00
Max Brunsfeld	c594208ab8	Allow callbacks to be specified for debug output	2014-10-13 01:02:18 -07:00
Max Brunsfeld	f05762b4a0	Move parser tests into their own file	2014-09-10 18:49:53 -07:00
Max Brunsfeld	1ff7cedf40	Unify ubiquitous tokens and lexical separators in API	2014-09-07 22:16:45 -07:00
Max Brunsfeld	ed11ef557a	Fix expansion of repeat rules into recursive rules. Previously, the way repeat rules were expanded, the auxiliary rule always needed to be reduced, even if the repeating content was empty. This caused problems in parse states where some items contained the repeat rule and some did not. To make those cases work, the repeat rule had to explicitly be marked as optional. With this change, that is no longer necessary.	2014-09-07 09:39:14 -07:00
Max Brunsfeld	43ecac2a1d	Expose debug flag on document	2014-09-06 17:56:00 -07:00
Max Brunsfeld	3dea1261a6	Clean up document specs for incremental parsing	2014-09-03 18:48:10 -07:00
Max Brunsfeld	c72445d808	Fix inc parsing for nodes containing ubiq tokens	2014-09-03 13:17:06 -07:00

69 commits