These are the packages we need on Ubuntu:
- sudo apt-get install html2text
- sudo apt-get install poppler-utils
- sudo apt-get install antiword
- sudo apt-get install python-excelerator
Then use the command line to convert each file to text:
- html2text test.html > html.txt
- pdftotext test.pdf pdf.txt
- antiword -m UTF-8.txt -t test.doc > doc.txt
- py_xls2txt test.xls > excel.txt
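If you have many files, the four commands above can be wrapped in a small script that picks the tool from the file extension. This is only a rough sketch: the function names `converter_for` and `to_text` are made up for illustration, and it assumes the packages listed above are installed. Unlike the commands above, it writes the output next to the input as `<name>.txt`.

```shell
#!/bin/sh
# Hypothetical helper: print the name of the converter for a file,
# chosen by its extension. Returns non-zero for unsupported types.
converter_for() {
    case "$1" in
        *.html|*.htm) echo html2text ;;
        *.pdf)        echo pdftotext ;;
        *.doc)        echo antiword ;;
        *.xls)        echo py_xls2txt ;;
        *)            return 1 ;;
    esac
}

# Hypothetical helper: convert one file to text, writing <name>.txt
# beside it. Uses the same command lines as the list above.
to_text() {
    src=$1
    dst="${src%.*}.txt"
    tool=$(converter_for "$src") || {
        echo "unsupported file type: $src" >&2
        return 1
    }
    case "$tool" in
        html2text)  html2text "$src" > "$dst" ;;
        pdftotext)  pdftotext "$src" "$dst" ;;
        antiword)   antiword -m UTF-8.txt -t "$src" > "$dst" ;;
        py_xls2txt) py_xls2txt "$src" > "$dst" ;;
    esac
}
```

After sourcing the script, `to_text test.pdf` would write `test.txt`. Note that two inputs with the same base name (say `test.html` and `test.pdf`) would overwrite each other's output, so keep the names distinct or change how `dst` is built.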
Good luck!