tree-sitter/script
Matthew Krupcale ee9a3c0ebb lib: remove utf8proc dependency (#436)
* Remove dependency on utf8proc

This removes the only external dependency on utf8proc for UTF-8 decoding. It does so by implementing its own UTF-8 decoder. This decoder is both faster and has a simpler API.

 * .gitmodules: remove utf8proc submodule
 * docs/section-2-using-parsers.md: remove requirement for utf8proc submodule
 * docs/section-6-contributing.md: likewise
 * lib/Cargo.toml: remove utf8proc subdirectory package include
 * lib/README.md: remove utf8proc subdirectory description
 * lib/binding_rust/build.rs: remove utf8proc compiler include directory
 * lib/src/lexer.c: remove utf8proc dependencies and types
 * lib/src/lib.c: remove utf8proc dependency
 * lib/src/unicode.h: define types for Unicode decoders
 * lib/src/utf16.{c,h}: implement more readable UTF-16 decoder
 * lib/src/utf8.{c,h}: implement fast UTF-8 decoder
 * lib/utf8proc: remove utf8proc submodule directory
 * script/build-lib: remove utf8proc compiler include directory
 * script/build-wasm: likewise

* Optimize ts_lexer__get_lookahead.

Try to favor non-failure code path and assign lookahead values directly to lexer

 * lib/src/lexer.c: optimize for non-failure code path

* Fix some compiler errors

 * lib/src/lexer.c: cast from signed to unsigned for decode_next result
 * lib/src/utf16.c: fix non-constant initializers for older compilers

* Remove some missed remnants of utf8proc

 * docs/section-2-using-parsers.md: only two include paths necessary now
 * lib/src/lib.c: no need to define UTF8PROC_STATIC

* Use ICU's utf8 and utf16 decoding routines

* Remove unnecessary casts when calling icu macros

* Check buffer length before attempting to decode a unicode character

* Use new unicode function when parsing Queries

Co-Authored-By: Matthew Krupcale <mkrupcale@matthewkrupcale.com>

* Mark libicu files as vendored for GitHub's stats
2019-10-14 11:18:39 -07:00
..
util Rename lib.sh -> scan-build.sh 2017-07-06 10:32:41 -07:00
benchmark Avoid printing names of all skipped tests in benchmark script 2019-08-06 12:46:46 -07:00
benchmark.cmd Add benchmark script 2019-02-01 15:17:35 -08:00
build-fuzzers Remove invalid characters from grammar names 2019-02-05 13:37:59 +00:00
build-lib lib: remove utf8proc dependency (#436) 2019-10-14 11:18:39 -07:00
build-wasm lib: remove utf8proc dependency (#436) 2019-10-14 11:18:39 -07:00
check-mallocs Move parser generation code in to 'generate' module within CLI crate 2019-01-07 10:23:01 -08:00
fetch-emscripten Print emcc version after installing emscripten 2019-08-08 10:56:48 -07:00
fetch-fixtures Test against latest version of all fixture repos 2019-07-10 15:18:10 -05:00
fetch-fixtures.cmd Test against latest version of all fixture repos 2019-07-10 15:18:10 -05:00
generate-bindings Reorganize language bindings 2019-05-07 10:41:49 -07:00
generate-fixtures Make scripts work when repo path contains spaces 2019-08-12 15:13:41 -07:00
generate-fixtures-wasm Make scripts work when repo path contains spaces 2019-08-12 15:13:41 -07:00
generate-fixtures.cmd generate-fixtures: handle language repos that contain multiple grammars 2019-05-30 12:00:12 -07:00
generate-wasm-exports-list Make scripts work when repo path contains spaces 2019-08-12 15:13:41 -07:00
reproduce Build fuzzer in 'halt' and 'recover' modes 2018-03-02 10:13:13 -08:00
run-fuzzer Allow extra libFuzzer arguments to script/reproduce 2018-03-12 12:54:06 -07:00
serve-docs Fix layout issues in web-ui 2019-09-13 15:19:31 -07:00
show-symbol-sizes Add script for dumping the sizes of each symbol in a dynamic library 2019-06-16 11:17:53 -07:00
test test script: Find test binary correctly when -g is passed 2019-03-21 11:23:30 -07:00
test-wasm Reorganize language bindings 2019-05-07 10:41:49 -07:00
test.cmd test script: All tests are in the CLI package 2019-03-12 17:14:41 -07:00
version 0.14.5 2019-02-25 12:55:28 -08:00