I have a 10 year old Mambo (http://www.mamboserver.com/
) MySQL database. Around 130 blog entries I'd like to convert to a PDF book. The site itself is dead, not available, not upgrade path exists as the site was created with an alpha version..
What'd be the recommend way to do this?
- Set up the MySQL DB
- Write Python+SQL to select the required info from DB: Category, article title, timestamp, content
- Write Python to edit/clean the HTML and store all the data into XML (lxml?)
- XML+XSL-FO+Apace FOP to convert it all to PDF
Python is the language I've propably used the most so possibly would make sense to use it.
Any other possible paths?
What would be the best stage&way to add images to the end result? The DB and articles itself don't have any images/image links included.