When you use this (I use DTA for FFox that GUIs this) what settings would you use to capture all of ES, including attachments, but not outside linked pages?
The one time you ask me a question
, and I could answer 8yrs ago only because I've used this type of software 10+ years ago. What you had to do back then was insert the the website(s) or sub-forums it would track & then "the depth" it could drill down to catch all the topics or posts. Then you would just "turn on" the software switch to limit it to endless-sphere.com/forums or a list of external websites that would limit it to where it could go & specify the link depth at the external websites. There were even some duplication prevention features too. I just don't know what "the state of the art" is now with this type of software. I don't even know if these two programs I listed are the best to use for ES purposes.
amberwolf wrote:I'm sure I can find out by experimentation, but if you already have them, it'd save lots of time (and ES server bandwidth, if I had to keep restarting it from scratch during tests).
I'd suggest running it late at night, when bandwidth demand would be lower in the US. I'll be testing the software myself over the next few weeks, but I don't have specific answers for you now.
amberwolf wrote:I'd like to archive ES periodically if for no other reason than to keep a backup of it in case the ultimate bad thing happens to it. (but also so I have a copy of it for use during times like this, and for when I just need a bit of info that I hadn't saved locally yet, and also so that even if another member decides to hara-kiri all their posts again I'll still have the valuable information that is in them. I abandoned at least one project idea because of the N&V mess and lost data).
These are excellent reasons to keep website snapshot images on a periodic basis
This website is very important to hundreds if not thousands of people, so having some off-site copies is important for disaster situations AND to preserve what sometimes gets deleted over time due to user deletion, mod maintenance/editing, or the real possibility of archive loss/damage, etc.
It's very likely "the data" could get scrambled & then be backed-up scrambled too before it is found-out to be damaged & it's too late to recover the original pre-scrambled data. I hope ES has a good back-up plan with monthly archives going back years, imo