How NOT to Backup a Blogger Blog
Over at the Google Operating System blog, they offer a way to "backup" your blog. It is mostly a manual hack to load the entire blog into one page in a web browser, then save the resulting HTML, though a similar technique is offered for saving the contents of your XML feed.
There are a few problems with this technique:
- It depends on knowing how many posts are in the blog, up front.
- The steps and tools given are manual.
- Comments are handled separately.
What to do?
Enter BlogBackup, the unimaginatively named, fully automatic, backup software for your blog. Just point the command line tool at your blog feed and a directory where the backup output should go. It will automatically perform a full backup, including:
- Every blog post is saved to a separate file in an easily parsable format, including all of the meta-data provided by the feed (categories, tags, publish dates, author, etc.).
- Comments are saved in separate directories, organized around the post with which they are associated. Comments also include all of their meta-data.
- The content of blog posts and comments are copied to a separate text file for easy indexing by desktop search tools such as Spotlight.
So, go forth and automate.
