| Age | Commit message | Author | |
|---|---|---|---|
| 4 days | grammar: More verbose error | Joel Klinghed | |
| 4 days | java: Correct java tokens comments | Joel Klinghed | |
| int_value is not set to any index. | |||
| 4 days | java: Cleanup generated token files | Joel Klinghed | |
| Remove unused include and fix double spacing. | |||
| 2025-09-29 | java: Add tokens support for Java 21 | Joel Klinghed | |
| Adds some new keywords. I opted to modify the java-8 grammar to use the new names, even though they are not going to match anything there. This makes the tokenizer easier to write. | |||
| 2025-09-29 | java: Add tokens | Joel Klinghed | |
| Only parses Java 8 tokens for now. | |||
| 2025-09-28 | str: Add trim() | Joel Klinghed | |
| 2025-09-28 | gen_ugc: Print out error in case of bad arguments | Joel Klinghed | |
| 2025-09-28 | ugc: Fix typo | Joel Klinghed | |
| 2025-09-28 | prefix_tree: Make a byte optimized version for really small trees | Joel Klinghed | |
| Currently supports 8-bit distances and 16-bit distances. | |||
| 2025-09-27 | Add simple prefix_tree | Joel Klinghed | |
| Will be used by tokenizer for short lists of strings | |||
| 2025-09-22 | Change io::Reader and company to return ReadError::Eof instead of 0. | Joel Klinghed | |
| It's debatable whether Eof should be considered an error or not. But it is pretty clear that it generally is a special response that needs special handling, so it is easier to keep it with the unexpected lot. It also carries over better to higher abstraction levels, such as the line reader. | |||
| 2025-09-18 | java::uescape: Unicode reader that knows about Java's \uXXXX escapes | Joel Klinghed | |
| 2025-09-17 | fixup! uio: Unicode reader | Joel Klinghed | |
| 2025-09-17 | fixup! uline: Add unicode line reader | Joel Klinghed | |
| 2025-09-17 | fixup! Add .clang-format | Joel Klinghed | |
| 2025-09-17 | uline: Add unicode line reader | Joel Klinghed | |
| 2025-09-17 | uio: Remove unnecessary wrappers | Joel Klinghed | |
| 2025-09-15 | Add .clang-format | Joel Klinghed | |
| Make it easier to keep a consistent style | |||
| 2025-09-15 | fixup! Improve test coverage of io and unicode | Joel Klinghed | |
| 2025-09-15 | decompress: Return better io error for BUF_ERROR | Joel Klinghed | |
| Use the new MaxTooSmall. As the comment notes, though, it might be that we are lacking input as well, but until I figure out how to test for that case and determine the cause, let's at least return a more specific error. | |||
| 2025-09-15 | fixup! Fix issues in buffer | Joel Klinghed | |
| 2025-09-15 | uio: Unicode reader | Joel Klinghed | |
| Reads UTF-8 and UTF-16 into UTF-8 or UTF-16 strings. If strict is true, fails at the first invalid character. If strict is false, invalid characters are replaced with U+FFFD. For the replacement, I changed the behavior of uN::read_replace to only jump one byte. Otherwise a common invalid case, when ISO-8859-1 or WIN-1252 is read as UTF-8, would skip many characters. If skip_bom is true, any BOM at the start of the stream is ignored. If skip_bom is false, any BOM will be included. The input format can be forced; if not, detect is used, which will try to guess and then fall back to UTF-8. | |||
| 2025-09-10 | Improve test coverage of io and unicode | Joel Klinghed | |
| 2025-09-10 | Fix issues in buffer | Joel Klinghed | |
| 2025-09-10 | Add unicode general category lookup | Joel Klinghed | |
| Generate the lookup tables from UnicodeData.txt. To do that, add gen_ugc, which uses csv, buffers, line, io and other modules to do the job. | |||
| 2025-09-10 | fixup! Make clang-tidy happy | Joel Klinghed | |
| 2025-09-08 | Disable RTTI and exceptions | Joel Klinghed | |
| We are not going to use them. | |||
| 2025-09-08 | Make clang-tidy happy | Joel Klinghed | |
| 2025-09-04 | args: Increase test coverage | Joel Klinghed | |
| 2025-09-04 | Add UTF-8, UTF-16 and Modified UTF-8 support | Joel Klinghed | |
| 2025-09-03 | Initial commit | Joel Klinghed | |
| Only a basic argument parser to start with. | |||
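
The "uio: Unicode reader" entry above describes non-strict decoding: an invalid byte becomes U+FFFD and the reader advances by one byte only, so that ISO-8859-1 or WIN-1252 text misread as UTF-8 loses as little as possible. The following is a minimal sketch of that idea, not the repository's code; `decode_one` and `decode_lossy` are hypothetical names chosen for illustration, and the actual `uN::read_replace` signature is not shown in this log.

```cpp
#include <cstddef>
#include <string>
#include <string_view>

// Hypothetical helper (an assumption, not the repository's API).
// Tries to decode one UTF-8 sequence starting at `pos`. On success, stores
// the code point in `out`, advances `pos` past the sequence and returns true.
// On failure, returns false and leaves `pos` unchanged.
bool decode_one(std::string_view in, std::size_t& pos, char32_t& out) {
  const auto b0 = static_cast<unsigned char>(in[pos]);
  std::size_t len = 0;
  char32_t cp = 0;
  if (b0 < 0x80) {
    len = 1;
    cp = b0;
  } else if ((b0 & 0xE0) == 0xC0) {
    len = 2;
    cp = b0 & 0x1F;
  } else if ((b0 & 0xF0) == 0xE0) {
    len = 3;
    cp = b0 & 0x0F;
  } else if ((b0 & 0xF8) == 0xF0) {
    len = 4;
    cp = b0 & 0x07;
  } else {
    return false;  // stray continuation byte or invalid lead byte
  }
  if (pos + len > in.size()) return false;  // truncated sequence
  for (std::size_t i = 1; i < len; ++i) {
    const auto b = static_cast<unsigned char>(in[pos + i]);
    if ((b & 0xC0) != 0x80) return false;  // not a continuation byte
    cp = (cp << 6) | (b & 0x3F);
  }
  // Smallest code point allowed for each sequence length (rejects overlongs).
  static constexpr char32_t kMin[] = {0, 0, 0x80, 0x800, 0x10000};
  if (cp < kMin[len] || cp > 0x10FFFF || (cp >= 0xD800 && cp <= 0xDFFF)) {
    return false;  // overlong, out of range, or surrogate
  }
  pos += len;
  out = cp;
  return true;
}

// Non-strict decoding: every invalid byte yields one U+FFFD and the input
// position moves forward by exactly one byte before decoding resumes.
std::u32string decode_lossy(std::string_view in) {
  std::u32string out;
  std::size_t pos = 0;
  while (pos < in.size()) {
    char32_t cp = 0;
    if (decode_one(in, pos, cp)) {
      out.push_back(cp);
    } else {
      out.push_back(char32_t{0xFFFD});
      ++pos;  // skip a single byte, not the length the lead byte claimed
    }
  }
  return out;
}
```

With this sketch, the ISO-8859-1 bytes `"\xE9tat"` decode to U+FFFD followed by `tat`. A decoder that skipped the three bytes announced by the lead byte 0xE9 would drop `ta` as well, which is the loss the commit message says it wants to avoid.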
