Saturday, October 15, 2005

Single PDFs for CDK News articles

This week was the CDK5AW event, a workshop for users and developers of the Chemistry Development Kit (CDK). After talking with other developers we agreed on creating PDF and HTML versions of single articles that appeared in the CDK News newsletter. Well, I haven't figured out how to create nice HTML (the latex2html does not give nice results, anyone ideas?), but for the PDF version I now have a pipeline.

For each article, a split.config file determines which pages from the CDK News issue PDF should be extracted. To do this, I used the PDF ToolKit, or pdftk for short (comes with Debian/Unbuntu by default). And using a Perl script to read this config files, the pipeline creates PDF files for each article. Currently, I'll only have it do the features articles; that is, not the ChangeLog, Editorial, Literature and FAQ. For those you'll need to download the full issue. If you don't like that, let me know :)

Ok, you will probably have noticed that the almost server is down (Googling for 'CDK News' allows you read the cache!), and I the PDF's will be uploaded there asap. For those not familiar with CDK News, the articles are FDL, so feel free to copy and distribute them. If you reuse the text and update it, which is allowed too, please let us know.