summaryrefslogtreecommitdiff
path: root/meson.build
AgeCommit message (Collapse)Author
8 hoursWIPHEADmainJoel Klinghed
2025-09-29java: Add tokens support for Java 21Joel Klinghed
Some new keywords, I opted to modify java-8 grammar to use the new names, even if they are not going to match anything. Makes the tokenizer easier to write.
2025-09-29java: Add tokensJoel Klinghed
Only parses Java 8 tokens for now.
2025-09-27Add simple prefix_treeJoel Klinghed
Will be used by tokenizer for short lists of strings
2025-09-18java::uescape: Unicode reader that knows about Java's \uXXXX escapesJoel Klinghed
2025-09-17uline: Add unicode line readerJoel Klinghed
2025-09-15uio: Unicode readerJoel Klinghed
Reads UTF-8 and UTF-16 into UTF-8 or UTF-16 strings. If strict is true, fails at first invalid character. If strict is false, invalid characters are replaced with U+FFFD. For the replacement, I changed behavior if uN::read_replace to only jump one byte. Otherwise a common invalid case when ISO-8859-1 or WIN-1252 are read as UTF-8 would skip many characters. If skip_bom is true any bom at start of stream is ignored. If skip_bom is false any bom will be included. Input format can be forced, if not detect is used which will try to guess and then fallback to UTF-8.
2025-09-10Improve test coverage of io and unicodeJoel Klinghed
2025-09-10Add unicode general category lookupJoel Klinghed
Generate the lookup tables from UnicodeData.txt, do to that, add gen_ugc, which uses csv, buffers, line, io and other modules to do the job.
2025-09-08Disable RTTI and exceptionsJoel Klinghed
Are not going to use them
2025-09-04Add UTF-8, UTF-16 and Modified UTF-8 supportJoel Klinghed
2025-09-03Initial commitJoel Klinghed
Only a basic argument parser to start with.