No More Data Entry with our Data Extraction

Every receipt, bill, and statement that enters Hubdoc goes through our data extraction process. Computer software mines these documents for data, and then Hubdoc adds in some human quality assurance to certify that the correct information is extracted.

Extractor means no data entry or filing for you!

Extracted Information

The date, amount, and supplier's name are extracted and stored in Hubdoc along with the document. If present, the invoice number and due date are also extracted.


The extraction process generally takes fewer than 24 hours. Documents waiting to undergo the extraction process can be found in the 'Processing' tab. Once Hubdoc has completed the extraction, documents will move to the 'Review' tab. If Hubdoc is unable to extract information, documents will move to the 'Failed' tab.

Extraction Issues

A document could fail the extraction process for a few reasons. The most common are:

  • An image is too blurry, crumpled or faded to properly recognize the information.
  • A piece of mandatory information including the date, amount and/or supplier's name is missing from the image.
  • There are multiple receipts in the same image. With the exception of multiple receipts for the same transaction (for example: the original invoice from the supplier and the credit card transaction with the tip), our data extraction is unable to distinguish multiple receipts in the same image.
Was this article helpful?
6 out of 12 found this helpful
Questions? Raise a case on Xero Central. If you don't have a Xero login, click here.