Import HTML
The import_html.module was developed to assist in migrating old static-file sites (or spidered mirrors) into the Drupal CMS.
Since I first developed it, it has been used in many different contexts to help import legacy content for:
- Government sites
- Print magazine archives
- Legacy discussion board archives
- Upgrading dynamic news sites to Drupal
- Porting or mirroring old functionality (as a wrapper)
import_html is an open source (GNU) project under Drupal, and I've been maintaining it since Drupal 4.6. Extensive documentation is included with the distribution, but I also frequently take on small support contracts to help users tune it for their needs.