Looks like my converter is done, or very close to it. The only thing I can think of now is that old thread URLs need to be pointed to new ones.
Yak shaving expedition: Why am I looking up details about listing products on Amazon again?
Weird bug: Failure in a module I was using, turned out to be I was parsing outer quote markup before inner quote markup. Thus deleting parent nodes before the child had been processed. Depth first processing fixed this.
Dataset: riddled with errors, SNAFU. See also Amazon product listings.
Tools: fulltext search on 155k posts was surprisingly fast, O(seconds)
sqlite> select count(*) from posts where author = 'Morganni';
0
Nope.-- ?×V
Yak shaving expedition: Why am I looking up details about listing products on Amazon again?
Weird bug: Failure in a module I was using, turned out to be I was parsing outer quote markup before inner quote markup. Thus deleting parent nodes before the child had been processed. Depth first processing fixed this.
Dataset: riddled with errors, SNAFU. See also Amazon product listings.
Tools: fulltext search on 155k posts was surprisingly fast, O(seconds)
sqlite> select count(*) from posts where author = 'Morganni';
0
Nope.-- ?×V