Great idea with pandoc, Lars... unfortunately, it did a terrible job :-(. It would be more work to fix up that output than to just grab the raw text from a browser window. All of Brian's paragraphs are HTML tables, so it tried to make an actual table and completely messed that up. And it left in some <div> tags, lol. Quite a mess.
Collector, I'm not sure you understand what I'm trying to do. I just want to extract the text and hopefully some document structure. That's really not as trivial as search and replace. I mean, I'm sure it's possible with some regular-expression magic, but it would end up being more work than just doing everything manually.
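For what it's worth, the non-regex route would probably look something like the rough sketch below, using Python's standard html.parser module. It assumes each paragraph sits in its own <td> cell, which is only a guess about how Brian's files are laid out. The names CellTextExtractor and extract_paragraphs are made up for illustration. It only recovers paragraph breaks, not headings or any other structure.

```python
from html.parser import HTMLParser


class CellTextExtractor(HTMLParser):
    """Collect the text of each outermost <td> as one paragraph.

    Assumption (not verified against the real files): every paragraph
    lives in its own table cell. All other tags (<div>, <b>, ...) are
    ignored and only their text is kept.
    """

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.paragraphs = []
        self._buffer = []
        self._depth = 0  # how many <td> elements we are currently inside

    def handle_starttag(self, tag, attrs):
        if tag == "td":
            self._depth += 1
        elif tag == "br" and self._depth:
            # Treat a line break inside a cell as plain whitespace.
            self._buffer.append(" ")

    def handle_endtag(self, tag):
        if tag == "td" and self._depth:
            self._depth -= 1
            if self._depth == 0:
                self._flush()

    def handle_data(self, data):
        # Text outside any table cell is dropped.
        if self._depth:
            self._buffer.append(data)

    def _flush(self):
        # Collapse runs of whitespace so source line breaks disappear.
        text = " ".join("".join(self._buffer).split())
        if text:
            self.paragraphs.append(text)
        self._buffer = []


def extract_paragraphs(html):
    """Return a list of paragraph strings found in table cells."""
    parser = CellTextExtractor()
    parser.feed(html)
    parser.close()
    return parser.paragraphs
```

Joining the result with blank lines (for example "\n\n".join(extract_paragraphs(page))) would give plain text with paragraph breaks. Whether that beats copying from the browser depends on how consistent the files really are.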
My hope (I guess) was that Brian had used some "markdown" document format to generate the HTML files, and those files were still around somewhere. But maybe he didn't; he might have just written them in an HTML editor like FrontPage or something.