<feed xmlns='http://www.w3.org/2005/Atom'>
<title>jkc.git/meson.build, branch main</title>
<subtitle>Unnamed repository; edit this file 'description' to name the repository.
</subtitle>
<id>https://git.spawned.biz/jkc.git/atom?h=main</id>
<link rel='self' href='https://git.spawned.biz/jkc.git/atom?h=main'/>
<link rel='alternate' type='text/html' href='https://git.spawned.biz/jkc.git/'/>
<updated>2026-05-02T08:54:02Z</updated>
<entry>
<title>WIP</title>
<updated>2026-05-02T08:54:02Z</updated>
<author>
<name>Joel Klinghed</name>
<email>the_jk@spawned.biz</email>
</author>
<published>2026-05-01T16:45:23Z</published>
<link rel='alternate' type='text/html' href='https://git.spawned.biz/jkc.git/commit/?id=19005581a0d35233f862e57308734d3486569bb9'/>
<id>urn:sha1:19005581a0d35233f862e57308734d3486569bb9</id>
<content type='text'>
</content>
</entry>
<entry>
<title>java: Add tokens support for Java 21</title>
<updated>2025-09-29T07:50:47Z</updated>
<author>
<name>Joel Klinghed</name>
<email>the_jk@spawned.biz</email>
</author>
<published>2025-09-29T07:39:49Z</published>
<link rel='alternate' type='text/html' href='https://git.spawned.biz/jkc.git/commit/?id=d196d51e07f50f3510c43ad375c5559b58860023'/>
<id>urn:sha1:d196d51e07f50f3510c43ad375c5559b58860023</id>
<content type='text'>
Adds some new keywords. I opted to modify the java-8 grammar to use the
new names, even though they are not going to match anything there. This
makes the tokenizer easier to write.
</content>
</entry>
<entry>
<title>java: Add tokens</title>
<updated>2025-09-29T07:39:17Z</updated>
<author>
<name>Joel Klinghed</name>
<email>the_jk@spawned.biz</email>
</author>
<published>2025-09-28T20:53:30Z</published>
<link rel='alternate' type='text/html' href='https://git.spawned.biz/jkc.git/commit/?id=1e9e51dae1c01bab7562911b958c47528b8011c8'/>
<id>urn:sha1:1e9e51dae1c01bab7562911b958c47528b8011c8</id>
<content type='text'>
Only parses Java 8 tokens for now.
</content>
</entry>
<entry>
<title>Add simple prefix_tree</title>
<updated>2025-09-27T16:49:23Z</updated>
<author>
<name>Joel Klinghed</name>
<email>the_jk@spawned.biz</email>
</author>
<published>2025-09-27T16:25:10Z</published>
<link rel='alternate' type='text/html' href='https://git.spawned.biz/jkc.git/commit/?id=2f13baa843bd1fb5db6630a2823681ffaff9fb11'/>
<id>urn:sha1:2f13baa843bd1fb5db6630a2823681ffaff9fb11</id>
<content type='text'>
Will be used by the tokenizer for short lists of strings.
</content>
</entry>
<entry>
<title>java::uescape: Unicode reader that knows about Java's \uXXXX escapes</title>
<updated>2025-09-18T21:57:56Z</updated>
<author>
<name>Joel Klinghed</name>
<email>the_jk@spawned.biz</email>
</author>
<published>2025-09-18T21:57:56Z</published>
<link rel='alternate' type='text/html' href='https://git.spawned.biz/jkc.git/commit/?id=50348284f5d82ccfd65b0c803ba0ba895912ceff'/>
<id>urn:sha1:50348284f5d82ccfd65b0c803ba0ba895912ceff</id>
<content type='text'>
</content>
</entry>
<entry>
<title>uline: Add unicode line reader</title>
<updated>2025-09-16T22:48:46Z</updated>
<author>
<name>Joel Klinghed</name>
<email>the_jk@spawned.biz</email>
</author>
<published>2025-09-16T22:48:46Z</published>
<link rel='alternate' type='text/html' href='https://git.spawned.biz/jkc.git/commit/?id=2a9e59adb5db8630ab7bdbdeedac623e3397989b'/>
<id>urn:sha1:2a9e59adb5db8630ab7bdbdeedac623e3397989b</id>
<content type='text'>
</content>
</entry>
<entry>
<title>uio: Unicode reader</title>
<updated>2025-09-15T18:52:51Z</updated>
<author>
<name>Joel Klinghed</name>
<email>the_jk@spawned.biz</email>
</author>
<published>2025-09-15T18:52:51Z</published>
<link rel='alternate' type='text/html' href='https://git.spawned.biz/jkc.git/commit/?id=18a622f378b403788c67fc785d30f4609caa3fc7'/>
<id>urn:sha1:18a622f378b403788c67fc785d30f4609caa3fc7</id>
<content type='text'>
Reads UTF-8 and UTF-16 into UTF-8 or UTF-16 strings.
If strict is true, fails at first invalid character.
If strict is false, invalid characters are replaced with U+FFFD.
For the replacement, I changed the behavior of uN::read_replace to only
jump one byte. Otherwise, a common invalid case, where ISO-8859-1 or
WIN-1252 is read as UTF-8, would skip many characters.

If skip_bom is true, any BOM at the start of the stream is ignored.
If skip_bom is false, any BOM will be included.

Input format can be forced. If it is not, detect is used, which will
try to guess and then fall back to UTF-8.
</content>
</entry>
<entry>
<title>Improve test coverage of io and unicode</title>
<updated>2025-09-10T21:57:26Z</updated>
<author>
<name>Joel Klinghed</name>
<email>the_jk@spawned.biz</email>
</author>
<published>2025-09-10T21:57:26Z</published>
<link rel='alternate' type='text/html' href='https://git.spawned.biz/jkc.git/commit/?id=28c6425e4ed1cd2eab538e7cba08c18aa83d8af5'/>
<id>urn:sha1:28c6425e4ed1cd2eab538e7cba08c18aa83d8af5</id>
<content type='text'>
</content>
</entry>
<entry>
<title>Add unicode general category lookup</title>
<updated>2025-09-10T20:12:22Z</updated>
<author>
<name>Joel Klinghed</name>
<email>the_jk@spawned.biz</email>
</author>
<published>2025-09-10T20:12:22Z</published>
<link rel='alternate' type='text/html' href='https://git.spawned.biz/jkc.git/commit/?id=32e14551a90e85000e41b3f0445d34d58a1431e4'/>
<id>urn:sha1:32e14551a90e85000e41b3f0445d34d58a1431e4</id>
<content type='text'>
Generate the lookup tables from UnicodeData.txt. To do that,
add gen_ugc, which uses csv, buffers, line, io and other modules
to do the job.
</content>
</entry>
<entry>
<title>Disable RTTI and exceptions</title>
<updated>2025-09-08T21:10:04Z</updated>
<author>
<name>Joel Klinghed</name>
<email>the_jk@spawned.biz</email>
</author>
<published>2025-09-08T21:10:04Z</published>
<link rel='alternate' type='text/html' href='https://git.spawned.biz/jkc.git/commit/?id=f0ba930314e05c8d949ce92ffbda41fdb133198a'/>
<id>urn:sha1:f0ba930314e05c8d949ce92ffbda41fdb133198a</id>
<content type='text'>
We are not going to use them.
</content>
</entry>
</feed>
