Wednesday, August 12, 2009

ODFPy - Python ODF manipulation library & tools

I was looking for some cli tools to convert odf to html, however, most ODF converters I found out there utilizes OpenOffice-Uno or OpenOffice macro for conversion of ODF documents - which means, they need a full blown openoffice to be installed and running before being able to convert the documents. Unoconv is one of them (available in Fedora repos). These type of tools are resource intensive, and can't really run without GUI, not quite what I want.

Searching for ODF through yum revealed a package which looks quite interesting to me:

odfpy.noarch : Python library for manipulating OpenDocument files

Being curious, I installed ODFPy, and to my surprise, the package also provide some scripts to automatically convert ODF to several text based formats - odf2xhtml,odf2mht,odf2xml and odf2war - just what I was looking for. Looking at the available libs, theres also

While not perfect, the conversion works fairly ok, plus, if stuff does not work correctly, the scripts are in Python and from a quick glance, they looks quite easy to adjust and enhance.

Might be fun if can integrate this with Plone. ^-^
Post a Comment
Locations of visitors to this page