On this page I present some document conversion scripts and programs I wrote.
Note that the scripts using Microsoft Office or Corel WordpPerfect Office usually accept other known input formats too, or otherwise these scripts can easily be modified to do so.
| Input Format | Output Encoding | Script or Program |
|---|---|---|
| Plain text, any | Plain text, ASCII | 2ASCII.ps1 |
| Plain text, Unicode | 2Unicode.ps1 | |
| Plain text, UTF-8 | 2UTF8.ps1 |
| Input Format | Extension | Output Encoding | Script or Program | Requirements |
|---|---|---|---|---|
| EPUB | .epub | Any | Epub2Txt.exe | |
| OpenOffice | .odt | Any | ODT2Txt.exe | |
| Docx2Txt.exe | Microsoft Word | |||
| Pages | .pages | Any | Pages2Txt.bat | 7-Zip Tesseract OCR (uses OCR, hence single page simple layouts only) |
| ASCII | PDF2Txt.bat | GhostScript | ||
| Rich Text | .rtf | Any | Rtf2Txt.exe | |
| Word2Txt.exe | ||||
| Word | .doc | Any | Word2Txt.exe | |
| UTF-8 | Doc2Txt.exe | |||
| .docx | Any | Word2Txt.exe | ||
| Docx2Txt.exe | Microsoft Word | |||
| (all) | Any | Word2Txt.exe | Microsoft Word | |
| WordPerfect | .wpd | Any | Word2Txt.exe | |
| WPD2Txt.exe |
| Input Format | Extension | Output Format | Script or Program | Requirements |
|---|---|---|---|---|
| JPEG | .jpg, .jpeg | JPGs2PDF.bat | GhostScript | |
| OpenOffice | .odt | Word2PDF.vbs | Microsoft Word | |
| ODT2PDF.exe | Either LibreOffice and its SDK or OpenOffice and its SDK, and Java runtime | |||
| PDF2JPG2PDF.bat | GhostScript | |||
| Rich Text | .rtf | Word2PDF.vbs | Microsoft Word | |
| Word | .doc, .docx | Word2PDF.vbs | Microsoft Word | |
| WordPerfect | .wpd | WP2PDF.vbs | Corel WordPerfect Office |
| Input Format | Extension | Output Format | Script or Program | Requirements |
|---|---|---|---|---|
| Excel | .xls, .xlsx | XML | Excel2XML.vbs | Microsoft.XMLDOM |
| OpenOffice | .odt | Any known Office format | Word2Any.vbs | Microsoft Word |
| RTF | Word2RTF.vbs | Microsoft Word | ||
| Word .docx | ODT2Docx.exe | Either LibreOffice and its SDK or OpenOffice and its SDK, and Java runtime | ||
| XPS | Word2XPS.vbs | Microsoft Word | ||
| JPG | PDF2JPG.bat | GhostScript | ||
| Word | .doc, .docx | Any known Office format | Word2Any.vbs | Microsoft Word |
| HTML | WSaveAs.vbs | Microsoft Word | ||
| OpenOffice .odt | Word2OpenOffice.exe | Microsoft Word | ||
| RTF | Word2RTF.vbs | Microsoft Word | ||
| XPS | Word2XPS.vbs | Microsoft Word | ||
| WordPerfect | .wpd | HTML | WPSaveAs.vbs | Corel WordPerfect Office |
| Word .doc | WPD2Doc.vbs | Corel WordPerfect Office |
page last modified: 2024-09-12; loaded in 0.0049 seconds