That must be uncompressed becuase that sounds like a lot for mostly just text.@MrTimscampi made a backup using a web crawling program. He said the whole thing weighs 26GB and will be made available through torrent if I am not mistaken.
That must be uncompressed becuase that sounds like a lot for mostly just text.@MrTimscampi made a backup using a web crawling program. He said the whole thing weighs 26GB and will be made available through torrent if I am not mistaken.
Some of the threads in the Junior Idol section still have a lot of images in them. Rei, for example, has 90 pages in her thread with tons of uploaded sets still intact.That must be uncompressed becuase that sounds like a lot for mostly just text.
I appreciate that but my attitude is always "We'll see." Plus I may not have 26 gigs to use to download it and in any case a live version that can be browsed in real time is always superior.@MrTimscampi made a backup using a web crawling program. He said the whole thing weighs 26GB and will be made available through torrent if I am not mistaken.
I can tell you from the size this was already done... Otherwise it would be 100+GB.Also, 26 gigs does still seem high to me. I've saved a decent amount of random stuff And I've spent no more like than like a gig total on it. Due to their age, most of the pics here aren't that HQ.
One thing I could see ballooning the size is sig/avatar pics. It would be good to either deduplicate those (make all instances of them refer to one copy) or delete them. You could also deduplicate all of the basic forum javascript, CSS, etc.
The 2015 law was for Japan, presumably not wherever AO is hosted.Honestly, the fact that this site was able to host JI threads post-2015 is a miracle in-of-itself.
Yeah true. I guess I meant in a world where the internet is becoming less and less decentralized.The 2015 law was for Japan, presumably not wherever AO is hosted.
Probably a zero at this point. I do wonder what amount of traffic or percentage of users were on AO for the JI content compared to the site overall, at least in the heyday before 2015. It's probably not as much as I think but it's an interesting question.Genuinely not just trying to trash the site, but I'm curious so to all browsing treat this as an informal survey: Without the JI, how interested are you in AO anymore out of 10? I'm at like a 3.
i made a backup, its 38gb with inline attachements files only and of course over 38k posts, but without the download and torrent pages they deleted before, i can give the data in any form if its not avaiable until 2months, but before i must clear content like loli pics or unproper content and anonymous the user data@MrTimscampi made a backup using a web crawling program. He said the whole thing weighs 26GB and will be made available through torrent if I am not mistaken.
So... everything?clear content like loli pics or unproper content and anonymous the user data
lol i caught nothing, i only make usernames random like user1 user2, if you want put the post in a other forum software and someone is moderator people can still claim their posts if recognize their username like user1, i dont set up any forum or something i share only raw data if no does after a certain time, and when i share a certain file i dont want someone take it down afterwards like on nyaa or somelse thats why exclude some pics, i can share them in a second file seperate if you wish, and yes i saw some toddler loliboru pics and voyeur pics no nude, this i will seperateSo... everything?
I wonder what kind of user data you might have caught, to be forced to anonymize .
This sounds like a bad idea to me - the goal of a community archive is that posts can be attributed to users, moreover: users can be recognized and distinguished visually. Without that aspect, chat itself becomes meaningless, it's like you were mashing up output of ChatGPT. Just a stack of information glued randomly with somewhat correct language.lol i caught nothing, i only make usernames random like user1 user2, if you want put the post in a other forum software and someone is moderator people can still claim their posts if recognize their username like user1
I don't think the usernames need to be anonymized. It's very unlikely anybody used a username they're particularly attached to to look at junior gravure.lol i caught nothing, i only make usernames random like user1 user2, if you want put the post in a other forum software and someone is moderator people can still claim their posts if recognize their username like user1, i dont set up any forum or something i share only raw data if no does after a certain time, and when i share a certain file i dont want someone take it down afterwards like on nyaa or somelse thats why exclude some pics, i can share them in a second file seperate if you wish, and yes i saw some toddler loliboru pics and voyeur pics no nude, this i will seperate