This program extracts text from MS-Word files, trying to preserve as many special printable characters as possible. catdoc supports everything up to Word-97. Also supported are MS Write documents and RTF files.
It doesn't even try to preserve fancy Word formatting, because Word users usually don't care about document structure, and it is this very thing which is important to LaTeX users.
Also provided is xls2csv, which extracts data from Excel spreadsheets and outputs it in comma-separated-value format and catppt, which extracts data from PowerPoint presentations.
This package suggests tk because it also includes wordview, an optional Tk-based GUI for catdoc. The MIME config provided in this package will use wordview is X is running, or catdoc directly if it is not.
Popularity
This page has been viewed 5989 times, catdoc has been kliked 31 times, and 28 successes have been estimated.
klik by Simon Peter
Thanks to all contributors on #klik.
Thanks to debian for the software compilation and packaging.
Thanks to our hosting sponsor, atekon.
Thanks to all users who give feedback. THIS IS PURELY EXPERIMENTAL SOFTWARE.
We know that by far not all debian packages are klik-able yet. But we believe they should be ;-)