Superuser-Documents#

This describes additional Document settings which are only available to Superusers. Superuser Permissions are only available for self-hosted Konfuzio Installations.

Document:#

File Name:#

The file name of this Document.

File Producer:#

The name of the file producer. This is parsed from the metadata of the file.

Project:#

The Project of this Document.

Category:#

The Category of this Document.

Uploaded By:#

The user who uploaded the Document.

Assignee:#

See here

Number of Pages:#

The number of Pages of this Document.

Created at:#

The time when the Document was created.

Status:#

Status Data:#

The status of the Document in the pipeline, like “Done” or “Queuing for Extraction”. Full list can be found here

Dataset Status:#

See here

Labeling Available:#

Indicates if the SmartView is available for this Document.

Sync:#

Tracks if the Document has been uploaded in a synchronous way.

Is Public:#

Tracks if the Document is publicly accessible.

Has been rotated manually:#

Tracks if any of the Pages of the Document has been rotated manually.

Callback URL:#

See here

Callback Status Code:#

See here

AI:#

Top Extractions:#

The number of top (highest-confidence ones) Annotations.

Used AI Model Run:#

Tracks the Extraction AI usage. An AI Model Run represents the use of an AI on a Document.

Used Categorization AI model run:#

Tracks the Categorization AI usage. An AI Model Run represents the application of an AI on a Document.

Category Confidence:#

The confidence of the classification of a Document.

Extraction Log:#

The log of the Extraction AI Run.

Categorization Log:#

The log of the Categorization AI Run.

Files:#

File:#

The original file of this Document.

Sandwich File:#

A generated PDF file with text embeddings.

Time and Timing:#

Processing Time:#

The time (in seconds) which was needed to process this Document.

Generate Entities Time:#

The duration in seconds from start to end of the entity generation task.

Finalize OCR Time:#

The duration in seconds from start to end of the finalize_ocr (i.e. sandwich file) task.

OCR Time:#

The duration in seconds from start of the first OCR task until the last OCR task of this Document.

Categorization Time:#

The duration in seconds from start to end of the Categorization task.

Categorization AI loading time:#

The time it took to load the Categorization AI into memory.

Extraction Time:#

The duration in seconds from start to end of the Extraction task.

Extraction AI loading time:#

The time it took to load the Extraction AI into memmory.

Workflow Start Time:#

The time (unix timestamp) when the Document was uploaded.

Workflow End Time:#

The time (unix timestamp) when the Extraction has completed and the Document status is set to DONE.

Debug Information:#

API Version:#

The API version which was used to upload the Document.

Error Message:#

The error message in case Konfuzio was not able to process the Document.

Segmentation:#

The segmentation results (if segmentation was applied to this Document).

Summary:#

The summarization results (if summarization was applied to this Document).

Airtable Response:#

The API from Airtable (if the airtable integration is used).