Commit Graph

11 Commits

Author SHA1 Message Date
0dda4dc974 update README, and consider repository finished (at least for now) 2024-09-11 13:19:41 +01:00
db8fa8ae80 add getPrefixList to bro-tree (just folds over the tree in O(n) time, which is the best we can do for that data structure). Will add a benchmark comparison of this. 2024-09-11 11:25:33 +01:00
af504350c7 further improvements to conv-words.sml, escaping chars when necessary, as we may generate an invalid .sml file otherwise 2024-09-11 10:01:30 +01:00
442f9f88d2 better word tokenisation algorithm in bench/conv-words.sml 2024-09-11 09:50:07 +01:00
662be1da75 a bit of progress moving insert functions into zip-string-set.sml 2024-09-10 09:05:41 +01:00
ede0da6643 alternative isEmpty implementation in string-set.sml 2024-09-09 18:22:56 +01:00
dbc0fdc178 progress with zip-string-set.sml (refactored up to the exists function), and reduce code duplication for one case in string-set.sml 2024-09-09 17:39:45 +01:00
b7a365c6b8 implement 'exists' function on bro-tree as well, and benchmark it compared to string-set 2024-09-09 15:57:58 +01:00
fc0fd6923b use larger set of words in words.txt dataset 2024-09-09 15:19:18 +01:00
7d707c9af4 add insert benchmark comparing StringSet to BroTree (StringSet is just a few ms faster for this) 2024-09-09 14:54:25 +01:00
4a16daaa1a begin coding bro-tree for benchmarks 2024-09-09 14:20:25 +01:00