DocumentRecognitionSettings

DocumentRecognitionSettings class

Settings for PDF recognition. Contains properties that customize the recognition process.

public class DocumentRecognitionSettings : BaseRecognitionSettings

Constructors

| Name | Description |
| --- | --- |
| DocumentRecognitionSettings(int, int) | Initializes a new instance of the DocumentRecognitionSettings class with a short set of properties. |
| DocumentRecognitionSettings(int, int, Language, bool, bool, int) | Initializes a new instance of the DocumentRecognitionSettings class with the full set of properties. |
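
As a minimal sketch of the two-argument constructor listed above: the parameter names are not given on this page, so the example assumes the two `int` arguments are the first page and the number of pages (mirroring the StartPage and PagesNumber properties). The variable names `firstPage` and `pageCount` are hypothetical, and whether page indexes start at 0 or 1 should be checked against the library version in use.

```csharp
using Aspose.OCR;

// Assumption: argument order is (first page, number of pages).
// Assumption: page indexes are zero-based; verify for your version.
int firstPage = 0;
int pageCount = 3;

// Short-form constructor from the table above.
var settings = new DocumentRecognitionSettings(firstPage, pageCount);
```

The six-argument overload is left out here because this page does not say what each of its parameters means.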

Properties

| Name | Description |
| --- | --- |
| AllowedCharacters { get; set; } | Set of allowed characters. Determines which types of characters may appear in the recognition result. |
| AutoContrast { get; set; } | Enables an additional contrast correction algorithm that is applied to the image before recognition. |
| AutoDenoising { get; set; } | Enables an additional neural network that reduces image noise. Useful for images with scan artifacts, distortion, spots, flares, gradients, or foreign elements. |
| AutoSkew { set; } | Gets or sets a flag indicating whether automatic image skew correction is enabled. Enabled (true) by default. |
| DetectAreasMode { get; set; } | Selects the area detection mode that best fits the content type: document, photo, plain text, column, or image. |
| IgnoredCharacters { get; set; } | Sets a blacklist of characters to exclude from recognition. |
| Language { set; } | Gets or sets the language used for OCR. Determines the alphabet used during recognition. Multi-language by default. |
| LinesFiltration { get; set; } | Enables recognition of text in tables (regions surrounded by lines). |
| PagesNumber { get; set; } | Sets the number of pages to recognize in a multipage PDF file. |
| PreprocessingFilters { get; set; } | Prepares the image for OCR by configuring preprocessing methods. |
| SkewAngle { set; } | Gets or sets the angle, in degrees, for image rotation. Setting this value disables the AutoSkew property, so automatic skew correction is not applied. Zero by default. |
| StartPage { get; set; } | Sets the first page for recognition. |
| ThreadsCount { set; } | Gets or sets the number of threads for processing. The default, 0, processes the image with as many threads as there are processors. ThreadsCount = 1 processes the image in the main thread. |
| ThresholdValue { set; } | Gets or sets a custom threshold value for image binarization. Range: 1 to 255. |
| UpscaleSmallFont { get; set; } | Enables additional algorithms for small font recognition. Useful for images with small characters. |
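
The properties above can be combined in an object initializer. The following is only a sketch under stated assumptions: the property types are not shown on this page, so it assumes AutoContrast and UpscaleSmallFont are `bool`, ThreadsCount and ThresholdValue are `int`, and SkewAngle accepts a floating-point value. The constructor arguments are assumed to be the first page and the page count.

```csharp
using Aspose.OCR;

// Assumption: constructor arguments are (first page, number of pages).
var settings = new DocumentRecognitionSettings(0, 2)
{
    // Assumed bool: extra contrast correction before recognition.
    AutoContrast = true,

    // Assumed bool: additional algorithms for small characters.
    UpscaleSmallFont = true,

    // 1 means the image is processed in the main thread;
    // 0 (the default) uses one thread per processor.
    ThreadsCount = 1,

    // Custom binarization threshold; must stay within 1 to 255.
    ThresholdValue = 128,

    // Assumed floating-point, in degrees. Per the SkewAngle description,
    // setting it disables automatic skew correction (AutoSkew).
    SkewAngle = 1.5f
};
```

Because setting SkewAngle turns off AutoSkew, leave SkewAngle out when automatic skew correction is wanted. The settings object would then be passed to whichever recognition call accepts it, which this page does not describe.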
