Enum DocumentExtractionPreset
Namespace: IndicoV2-Ocr-Models
Assembly: IndicoV2-Abstractions-dll
public enum DocumentExtractionPreset
Fields
Detailed = 3
Provides detailed bounding box information on tokens and characters- Returns data in a nested format at the document level with all metadata included-
Legacy = 2
Provided to mimic the behavior of Indico’s older pdf_extraction function- Use this if your model was trained with data from the older pdf_extraction-
OnDocument = 4
Provides detailed information at the page-level in an unnested format-
Simple = 1
Provides a simple and fast response for native PDFs (3-5x faster)- Will NOT work with scanned PDFs-
Standard = 0
Provides page text and block text/position in a nested format-
Updated about 1 year ago