Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.abbyy.com/llms.txt

Use this file to discover all available pages before exploring further.

TextEncodingTypeEnum enumeration constants are used to denote possible types of the output file encoding for export in HTML, TXT and CSV formats.
typedef enum {
 TET_Simple,
 TET_UTF8,
 TET_UTF16,
 TET_Auto
} TextEncodingTypeEnum;

Elements

Name

Description

TET_Auto

Encoding is selected automatically. The selection depends on the values of other properties of the parameters object:

TET_Simple

Simple encoding, one byte per symbol.

TET_UTF16

Native Unicode format where every symbol is represented by two-byte sequence.

TET_UTF8

Unicode UTF-8 format. UTF-8 is a code page that uses a string of bytes to represent a 16-bit Unicode string where ASCII text (<=U+007F) remains unchanged as a single byte, U+0080-07FF (including Latin, Greek, Cyrillic, Hebrew, and Arabic) is converted to a 2-byte sequence, and U+0800-FFFF (Chinese, Japanese, Korean, and others) becomes a 3-byte sequence.

Used in

ITextExportParams::EncodingType IHTMLExportParams::EncodingType IPlainText::SaveToTextFile