Enum DocumentExtractionPreset

Namespace: IndicoV2-Ocr-Models
Assembly: IndicoV2-Abstractions-dll

public enum DocumentExtractionPreset

Fields

Detailed = 3

Provides detailed bounding box information on tokens and characters- Returns data in a nested format at the document level with all metadata included-

Legacy = 2

Provided to mimic the behavior of Indico’s older pdf_extraction function- Use this if your model was trained with data from the older pdf_extraction-

OnDocument = 4

Provides detailed information at the page-level in an unnested format-

Simple = 1

Provides a simple and fast response for native PDFs (3-5x faster)- Will NOT work with scanned PDFs-

Standard = 0

Provides page text and block text/position in a nested format-