06.10.2020 Views

Inside NIRMA Fall 2020

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

DOCUMENT<br />

DIGITIZATION<br />

QC<br />

S<br />

ome industries tolerate<br />

Errors and Omissions (EO’s)<br />

in their document digitization<br />

projects more than others. In<br />

fact, some clients irresponsibly assume,<br />

without properly checking, that their<br />

Digitization Service Provider delivered<br />

all records, all meeting the<br />

requirements. Unfortunately, a late<br />

detection may have devastating, often<br />

irreparable consequences.<br />

Clients should demand solid answers<br />

from their providers about how their<br />

QC methodologies apply to the project,<br />

then trust and verify. Answers may<br />

include:<br />

A. Random sampling: Although<br />

essential in most projects for testing<br />

purposes, random sampling<br />

overpromises and under delivers.<br />

B. 100% QC: an elusive,<br />

underestimated notion often<br />

challenging due to time, cost and<br />

other constraints.<br />

C. Counting pages: Partially useful,<br />

but error prone, unreliable.<br />

D. Weak answers signal trouble. If<br />

clients do not act, egg is on their<br />

face.<br />

This article aspires to illustrate some<br />

better answers, and to lay a foundation<br />

for a production model to prevent,<br />

detect, remediate, reduce, trace and<br />

monitor EOs in a document digitization<br />

project.<br />

The basic concept is to identify<br />

critical points where EOs may occur<br />

and establish preventive and corrective<br />

semi-automated methodologies around<br />

certain metrics. By “semi-automated” I<br />

mean a combination of smart<br />

automation and human oversight. I use<br />

methodologies that enable valuable<br />

checks and balances in four workflow<br />

stages:<br />

1. Origination: At the point of<br />

custody acceptance.<br />

2. Production (Physical Context): At<br />

the beginning or end of each task<br />

throughout the production<br />

workflow.<br />

3. Published (Logical Context): When<br />

pages are classified, indexed and<br />

organized as logical documents.<br />

4. Submittal: When deliverables are<br />

subject to final acceptance.<br />

By cross-referencing, comparing and<br />

analyzing aggregate numbers based on<br />

data represented in these four<br />

coordinate systems, EOs become more<br />

conspicuous, allowing for remediation<br />

processes to be effective.<br />

A simplified production workflow<br />

includes many tasks (*), what could<br />

possibly go wrong?:<br />

<br />

<br />

By Manuel Bulwa,<br />

Integrated Scanning of America<br />

www.isausa.com<br />

Some documents may not have<br />

made it entirely through the<br />

production workflow.<br />

Pages can be missed, mutilated,<br />

folded, obstructed, out of sequence,<br />

out of scale, illegible, wrongly split<br />

or merged, overlapped, files may be<br />

corrupted, …<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

Page groups can be wrongly<br />

classified, not indexed, duplicated,<br />

“buried”, …<br />

Incorrect, missing or extraneous<br />

document structures.<br />

Incomplete, missing or “made up”<br />

documents, …<br />

Deliverables may be incomplete,<br />

redundant, not fully complying with<br />

SOW, …<br />

Originals may be returned or<br />

disposed of without a valid digital<br />

counterpart.<br />

Originals may not be returned in<br />

full.<br />

Originals may not be returned as<br />

expected.<br />

Other.<br />

What are the causes of EOs?<br />

Each EO is associated to a root cause<br />

(*).<br />

Who is responsible for<br />

consequences?<br />

Both service provider and client have<br />

separate responsibilities(*).<br />

How can EOs be detected and<br />

corrected?: (*)<br />

<br />

<br />

<br />

Planning, testing and training<br />

Timing<br />

Use production level equipment<br />

Continued on next page.<br />

Back to Content | <strong>Inside</strong> <strong>NIRMA</strong> <strong>NIRMA</strong>.org <strong>Fall</strong> <strong>2020</strong> 11

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!