I have a backup of all of the text content of the forums now. Excluding this post, obviously. I'll do another pass on just the latest threads when it gets closer to moving time.
We have a total of 155244 posts on 11511 threads. The total size of the database holding all of the HTML post content is 178MB. The size of the code I wrote to grab this content and store it in a sqlite database is only 225 lines.
(TL;DR A programmer bashes Tapatalk
I didn't save the entire HTML of the site, because on testing, all of the stuff around the threads makes a page 5-10x bigger. That's not even including the 33 external stylesheets or 18 external JS files. Tapatalk promotes itself as a mobile first company, yet it inflates the size of the content by at least 5 times. This is what is wrong with the internet today. Doing conversions with software always gives me a sense of how terrible it is (TVT-pmwiki: terrible; Mediawiki: generally not terrible). Tapatalk is "mostly terrible", simply because of how sloppy the conversion is -- hiding post titles in comment tags, hiding elements they don't like under .clutter {display: none}. Thankfully this also means that I could still recover most of the missing elements.-- ?×V
We have a total of 155244 posts on 11511 threads. The total size of the database holding all of the HTML post content is 178MB. The size of the code I wrote to grab this content and store it in a sqlite database is only 225 lines.
(TL;DR A programmer bashes Tapatalk
