DocDigitizer PowerCapture General Rules for Financial Documents

DocDigitizer PowerCapture General Rules for Financial Documents

Is it possible to have multiple subtypes for the same document?

No. In the case of multiple classifications on the same document, we return the first classification following the order described above.

Example: If the document has both an invoice and a receipt the sub-type will be invoice.

May a Financial Document be rejected?

Yes, All the documents that don’t have an indication of the supplier, recipient and values are rejected as "not included in the license".

What happens if the information is not readable?

Only readable data (fields) is extracted. If it’s not readable we return the field as empty

Does DocDigitizer PowerCapture extracts line items of any invoice layout?

No. DocDigitizer PowerCapture only extracts line items when they are presented in a tabular format on the document layout. For any other layout they will be returned empty.

If there is more than one table of line items, which one does DocDigitizer PowerCapture extract?

DocDigitizer PowerCapture always extracts the table that contains the greatest volume of information that can be extracted. This means that if there is a table with a summary of the items and another table with detailed information of the items, the extraction process opts for the table with more information. However, if a document has multiple detailed tables with different values and totals, we only extract the first detailed table.

Does DocDigitizer PowerCapture extracts all the information in the Item Description?

No. DocDigitizer PowerCapture extracts the first line of each line item and only if it has a value to pay associated with.

What kind of Taxes are supported?

All taxes that are presented on the document layout in the tax breakdown table.
Examples: VAT, GST, CGST, DPH, TVA, IVA.
Taxes presented outside the tax breakdown table, may not be properly extracted.

Example:

What we extract:

If there’s multiple tax lines in the tax breakdown table, how does DocDigitizer PowerCapture return the Item Rate and Item Tax on the description lines?

DocDigitizer PowerCapture returns the values if the Item Rate and Item tax are determined for each line.

If there’s only one Tax Rate on the tax breakdown table, how does DocDigitizer PowerCapture return the Item Rate and Item Tax on the description lines?

DocDigitizer PowerCapture applies the same Tax Rate to each line description.

What if there’s only one line description, what it’s introduced in the fields Tax Rate and Item Tax?

DocDigitizer PowerCapture can extract the information from the tax breakdown table because it’s only one line description.

When there’s a tax exemption in the document, what values are introduced in the fields regarding taxes?

In this case, DocDigitizer PowerCapture returns the values as 0.

When there is retention, how do we place the value of retention?

DocDigitizer PowerCapture returns the value exactly how it is in the document (could be positive or negative.)

What is the output format of currency?

We use standard currency acronyms:

Examples:

  • If € we return EUR
  • If $ we return USD
  • If £ we return GBP

Extraction by QR Codes

DocDigitizer supports the extraction of data from QR Codes. This means that in some financial-documents, if the document contains a QR Code with valid data, these values can be extracted.