TXT* chunks extractor
Synopsisdjvutxt [-page <page_num>] <djvu_file_in> [<txt_file_out>]Description --- File "djvutxt.cpp" illustrates how to use DjVuDocument, DjVuImage, DjVuTXT to retrieve textual information stored inside a TXT* chunk of a DjVu document.
TXT* chunks should have been created with the help of an OCR engine, and are used to allow indexing and searching of the DjVu document. The chunks contain the ASCII text itself, and layout information allowing the DjVu plugins to highlight found text.
This utility can be used to extract text from TXT* chunks and output it to a file or standard output.
Arguments:
- <djvu_file_in> - Name of input DjVu file.
- <txt_file_out> - Name of the file where ASCII text will be stored. - means standard output.
-page option can be used to select a particular page from the <djvu_file_in> for processing.
Alphabetic index Hierarchy of classes