Question 1

How does classification handle documents with no clear file type — scans, photos, multi-document PDFs?

Accepted Answer

Classification is content-based, so a scan that begins with the Form 1120-S header is classified as a corporate tax return regardless of its filename. Phone photos are read with lower extraction confidence than digital PDFs, and the classification verdict carries the confidence level. Multi-document PDFs — for example, a tax return immediately followed by financial statements in the same file — are detected and split into their constituent documents before classification. Each split document carries a reference back to the original composite file in the audit trail.
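The splitting step can be pictured with a minimal sketch. It assumes a page-level classifier has already produced a category and a confidence for each page. The `SplitDocument` type, the `split_by_category` function, and the category names are hypothetical illustrations, not the product's API.

```python
from dataclasses import dataclass
from itertools import groupby


@dataclass(frozen=True)
class SplitDocument:
    source_file: str   # original composite file, kept for the audit trail
    category: str      # hypothetical category label
    first_page: int    # 1-indexed, inclusive
    last_page: int     # 1-indexed, inclusive
    confidence: float  # lowest page-level confidence in the span


def split_by_category(source_file, pages):
    """Group consecutive pages that share a category into one document.

    `pages` is a list of (category, confidence) tuples in page order.
    Simplification: two adjacent documents of the same category would be
    merged here; a real splitter would also need boundary cues.
    """
    docs = []
    next_page = 1
    for category, run in groupby(pages, key=lambda page: page[0]):
        run = list(run)
        last_page = next_page + len(run) - 1
        confidence = min(conf for _, conf in run)
        docs.append(
            SplitDocument(source_file, category, next_page, last_page, confidence)
        )
        next_page = last_page + 1
    return docs


# Assumed input: a 4-page upload holding a tax return and financial statements.
pages = [
    ("tax_return", 0.97),
    ("tax_return", 0.91),
    ("financial_statement", 0.88),
    ("financial_statement", 0.93),
]
documents = split_by_category("borrower_upload.pdf", pages)
```

Each resulting `SplitDocument` keeps `source_file`, which mirrors the reference back to the composite file described above.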

Question 2

Can we customize the document taxonomy for our institution?

Accepted Answer

Yes. The 70+ category taxonomy is a starting point, not a fixed schema. Categories your institution doesn't use can be hidden; categories specific to your portfolio — particular SBA forms, industry-specific licensure documents, your standard guarantee forms — can be added. The required-document checklist by product type is also configured to your institution; SBA 7(a) checklists, owner-occupied CRE checklists, and equipment-finance checklists all run from the same intake layer with different required sets.
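As a rough illustration of what that configuration might look like, the sketch below derives an institution's effective taxonomy from a base set and checks that every product checklist only requires categories that still exist. All category names, product keys, and function names are assumptions made for the example.

```python
# Hypothetical subset of the base taxonomy.
BASE_TAXONOMY = {
    "business_tax_return",
    "personal_financial_statement",
    "rent_roll",
    "equipment_invoice",
    "crop_insurance_policy",
}


def build_taxonomy(base, hidden=(), added=()):
    """Return the effective category set: base minus hidden plus added."""
    return (set(base) - set(hidden)) | set(added)


def checklist_gaps(taxonomy, required_by_product):
    """Return, per product, required categories missing from the taxonomy."""
    gaps = {}
    for product, required in required_by_product.items():
        unknown = set(required) - taxonomy
        if unknown:
            gaps[product] = sorted(unknown)
    return gaps


# Illustrative institution: hides one category, adds one SBA-specific form.
taxonomy = build_taxonomy(
    BASE_TAXONOMY,
    hidden={"crop_insurance_policy"},
    added={"sba_form_1919"},
)

# Different required sets per product, all drawn from the same taxonomy.
REQUIRED_BY_PRODUCT = {
    "sba_7a": {"business_tax_return", "personal_financial_statement", "sba_form_1919"},
    "owner_occupied_cre": {"business_tax_return", "rent_roll"},
    "equipment_finance": {"business_tax_return", "equipment_invoice"},
}

gaps = checklist_gaps(taxonomy, REQUIRED_BY_PRODUCT)
```

Validating checklists against the taxonomy is a design choice of this sketch: it catches a checklist that still requires a category the institution has hidden.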

Question 3

What happens when the borrower sends a document that isn't on our checklist?

Accepted Answer

It's classified, added to the credit file, and visible to the analyst — it just doesn't satisfy a checklist requirement. Extra documents are common (and often useful) at the bottom of a broker forward; the intake layer preserves them rather than discarding them. If a document is recognized as a category your institution doesn't use, it's classified as "other" with a content description, available for analyst review without driving any required-doc logic.
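The behavior described here can be sketched as a small reconciliation function. It is an assumption-laden illustration: the `reconcile` function, the category names, and the filenames are hypothetical, and the content description attached to "other" documents is omitted for brevity.

```python
def reconcile(required, known_categories, received):
    """Reconcile received documents against a required-document set.

    `received` is a list of (filename, category) pairs.
    Returns (satisfied, missing, extras). Every document is kept:
    it either satisfies a requirement or lands in `extras`.
    """
    satisfied = set()
    extras = []
    for filename, category in received:
        if category not in known_categories:
            # Category not used by the institution: keep it, labeled "other".
            category = "other"
        if category in required:
            satisfied.add(category)
        else:
            extras.append((filename, category))
    missing = set(required) - satisfied
    return satisfied, missing, extras


required = {"business_tax_return", "personal_financial_statement", "debt_schedule"}
known = required | {"insurance_certificate"}
received = [
    ("2023_1120S.pdf", "business_tax_return"),
    ("pfs_smith.pdf", "personal_financial_statement"),
    ("coi.pdf", "insurance_certificate"),     # known, but not required
    ("scan0042.pdf", "franchise_agreement"),  # not in the institution's taxonomy
]

satisfied, missing, extras = reconcile(required, known, received)
```

In this example the two extra files stay in the credit file for analyst review, while only `debt_schedule` is reported as missing.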

Question 4

How are entity and ownership relationships resolved across documents?

Accepted Answer

On a multi-entity deal, which is common in commercial lending, entity attribution starts from the application's disclosed entity list. Each document's legal name is matched against the disclosed entities; a document that names a previously undeclared entity (a new affiliate, an unmentioned related party) is flagged for analyst review, and no new entity record is created silently. Guarantor personal financial statements (PFS) are attributed to the named guarantors; K-1s route to the right partner based on the partnership return. The audit trail shows which documents drove which entity attribution.
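A minimal sketch of the matching step follows, assuming exact matching on a normalized legal name. The function names and sample entities are hypothetical. Real matching would likely need more than this (for example, treating "L.L.C." and "LLC" as equivalent), which the sketch does not attempt.

```python
import re


def normalize_name(name):
    """Casefold and drop punctuation so 'Acme Holdings, LLC' matches 'ACME HOLDINGS LLC'."""
    no_punctuation = re.sub(r"[^\w\s]", " ", name.casefold())
    return " ".join(no_punctuation.split())


def attribute_documents(disclosed_entities, documents):
    """Attribute documents to disclosed entities by legal name.

    `documents` is a list of (filename, legal_name) pairs.
    Returns (attributed, flagged). Documents naming an entity that was not
    disclosed are flagged for review; no entity is created for them.
    """
    by_key = {normalize_name(entity): entity for entity in disclosed_entities}
    attributed = {}
    flagged = []
    for filename, legal_name in documents:
        entity = by_key.get(normalize_name(legal_name))
        if entity is None:
            flagged.append((filename, legal_name))
        else:
            attributed.setdefault(entity, []).append(filename)
    return attributed, flagged


disclosed = ["Acme Holdings, LLC", "Acme Realty Inc."]
documents = [
    ("return_2023.pdf", "ACME HOLDINGS LLC"),
    ("lease.pdf", "Acme Realty Inc"),
    ("bank_stmt.pdf", "Acme Logistics LLC"),  # not on the disclosed list
]

attributed, flagged = attribute_documents(disclosed, documents)
```

Here `bank_stmt.pdf` ends up in `flagged` rather than creating a third entity, which matches the review-first behavior described above.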

Question 5

What about ongoing borrower documents — annual reviews, covenant testing, renewals?

Accepted Answer

The intake layer handles ongoing document arrivals the same way it handles new originations. Annual financial statements for an existing borrower are classified, period-tagged, and attached to the borrower's record — and the annual-review or renewal workflow picks them up. Covenant-testing documents (compliance certificates, quarterly financials) are recognized and routed to the covenant monitoring surface. The intake taxonomy is the same; the downstream routing depends on which workflow is active for the borrower.
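One way to picture "same taxonomy, different routing" is a lookup keyed on the borrower's active workflow and the document category. The workflow names, category names, destinations, and the fallback to analyst review are all assumptions made for this sketch.

```python
# Hypothetical routing table: (active workflow, category) -> destination.
ROUTES = {
    ("origination", "annual_financial_statement"): "underwriting_file",
    ("annual_review", "annual_financial_statement"): "annual_review_queue",
    ("renewal", "annual_financial_statement"): "renewal_queue",
    ("annual_review", "compliance_certificate"): "covenant_monitoring",
    ("annual_review", "quarterly_financials"): "covenant_monitoring",
}


def route(active_workflow, category):
    """Pick a destination for an already-classified document.

    Classification is identical across workflows; only the destination
    differs. Unmapped combinations fall back to analyst review, which is
    an assumption of this sketch.
    """
    return ROUTES.get((active_workflow, category), "analyst_review")


same_document_new_loan = route("origination", "annual_financial_statement")
same_document_existing = route("annual_review", "annual_financial_statement")
```

The same category resolves to two different destinations depending on which workflow is active for the borrower.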

Question 6

Does intake make decisions about document sufficiency, or just classify?

Accepted Answer

Intake classifies documents and reconciles them against the checklist. It does not make underwriting sufficiency judgments, such as whether a particular PFS is detailed enough, whether interim financials are recent enough to rely on, or whether a tax return is a draft or a filed copy. Those are analyst judgments. The intake layer surfaces the documents and their attributes; the analyst decides whether the package is underwritable.

Borrower packages, indexed on intake.

Intake that turns a 30-file zip into a complete, organized credit file.

What document intake handles

70+ document categories

Period and entity detection

Duplicate and version detection

Missing-document checklist

Multi-entity routing

Renaming and audit trail

From folder of PDFs to organized credit file

Files arrive in any format and any structure

Each file classified by content, not filename

Period, entity, and ownership context applied

Checklist reconciled and file presented

The credit file your analyst opens

Organized document tree

Required-document checklist

Duplicate and version report

Routing to downstream analysis

Intake built for the messy reality of commercial borrower packages

What lenders ask before they switch

How does classification handle documents with no clear file type — scans, photos, multi-document PDFs?

Can we customize the document taxonomy for our institution?

What happens when the borrower sends a document that isn't on our checklist?

How are entity and ownership relationships resolved across documents?

What about ongoing borrower documents — annual reviews, covenant testing, renewals?

Does intake make decisions about document sufficiency, or just classify?

Built to work together, not in isolation

Bank Statement Analysis

Financial Spreading

Borrower Intelligence

See it run on a real borrower file.