In our upcoming release of DotImage 4.0 we are including a new add-on module called DotImage Advanced Document Cleanup.  This is something that our scientists have been working on for quite some time, and it will add tremendous value to our product offering and for customers who choose to license it.  This module can be used to clean up scanned documents.

Here are some examples of what it can do:

Hole Punch Removal

This algorithm works with many types of hole punches, including the square spiral-bound holes.  It will automatically calculate the hole size, or allow the user to specify the minimum and maximum hole diameter.
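
To give a feel for what a minimum and maximum hole diameter can mean in practice, here is a toy sketch in Python.  It is not the DotImage implementation or API.  It assumes a bitonal page stored as a list of rows (1 = black, 0 = white), and the function name, the parameters, and the min_fill heuristic are all made up for illustration.  It erases any solid black blob whose bounding box fits inside the diameter range.

```python
from collections import deque


def remove_hole_punches(img, min_diameter, max_diameter, min_fill=0.6):
    """Return a copy of img with hole-sized solid blobs erased.

    img is a list of rows; 1 = black pixel, 0 = white pixel.
    All names and thresholds here are illustrative assumptions.
    """
    height, width = len(img), len(img[0])
    out = [row[:] for row in img]
    seen = [[False] * width for _ in range(height)]

    for y in range(height):
        for x in range(width):
            if img[y][x] != 1 or seen[y][x]:
                continue
            # Flood-fill one 4-connected blob of black pixels.
            blob, queue = [], deque([(y, x)])
            seen[y][x] = True
            while queue:
                cy, cx = queue.popleft()
                blob.append((cy, cx))
                for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                    if (0 <= ny < height and 0 <= nx < width
                            and img[ny][nx] == 1 and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))

            ys = [p[0] for p in blob]
            xs = [p[1] for p in blob]
            blob_h = max(ys) - min(ys) + 1
            blob_w = max(xs) - min(xs) + 1
            fill = len(blob) / (blob_h * blob_w)

            # A punch hole is roughly as tall as it is wide and mostly solid.
            if (min_diameter <= blob_h <= max_diameter
                    and min_diameter <= blob_w <= max_diameter
                    and fill >= min_fill):
                for by, bx in blob:
                    out[by][bx] = 0
    return out
```

A real detector would also have to deal with things like where the holes sit on the page and holes that are only partly filled, which this sketch ignores.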

Auto Invert Text

This algorithm is very useful prior to OCR.  It does a great job detecting white-on-black regions that OCR engines will typically ignore.
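
As a rough illustration of the idea (again, a simplified sketch and not how DotImage does it), the Python below splits a bitonal page into fixed-size tiles and inverts any tile that is mostly black.  The tile size, the threshold, and the function name are assumptions chosen for the example; a real detector would find the actual region boundaries instead of using a fixed grid.

```python
def auto_invert_regions(img, tile=8, threshold=0.5):
    """Invert tiles that are mostly black so their text becomes black on white.

    img is a list of rows; 1 = black pixel, 0 = white pixel.
    The fixed tile grid is a simplifying assumption.
    """
    height, width = len(img), len(img[0])
    out = [row[:] for row in img]

    for top in range(0, height, tile):
        for left in range(0, width, tile):
            rows = range(top, min(top + tile, height))
            cols = range(left, min(left + tile, width))
            black = sum(img[y][x] for y in rows for x in cols)
            # Mostly black means the "ink" here is probably the background.
            if black > threshold * len(rows) * len(cols):
                for y in rows:
                    for x in cols:
                        out[y][x] = 1 - img[y][x]
    return out
```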

Border Removal

Useful for removing borders from pages that are scanned on a flatbed scanner.  Improves compression and OCR accuracy.
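
Here is a minimal Python sketch of one way to think about it, assuming the border shows up as nearly solid black rows and columns at the page edges.  The function name and the dark_ratio parameter are hypothetical, and this is not the DotImage algorithm.  It walks inward from each edge while the row or column is mostly black, then whitens everything outside the remaining rectangle.

```python
def remove_border(img, dark_ratio=0.9):
    """Whiten mostly-black rows and columns found at the page edges.

    img is a list of rows; 1 = black pixel, 0 = white pixel.
    dark_ratio is an illustrative threshold, not a DotImage setting.
    """
    height, width = len(img), len(img[0])
    out = [row[:] for row in img]

    def row_is_dark(y):
        return sum(img[y]) >= dark_ratio * width

    def col_is_dark(x):
        return sum(img[y][x] for y in range(height)) >= dark_ratio * height

    # Walk inward from each edge until the row or column is no longer dark.
    top = 0
    while top < height and row_is_dark(top):
        top += 1
    bottom = height
    while bottom > top and row_is_dark(bottom - 1):
        bottom -= 1
    left = 0
    while left < width and col_is_dark(left):
        left += 1
    right = width
    while right > left and col_is_dark(right - 1):
        right -= 1

    # Everything outside the content rectangle becomes white.
    for y in range(height):
        for x in range(width):
            if not (top <= y < bottom and left <= x < right):
                out[y][x] = 0
    return out
```

Turning the border white rather than leaving it black is also what helps compression, since large solid-white areas are cheap to encode.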

Line Removal

Useful for improving OCR accuracy.  This algorithm will reconnect characters that were broken by the line removal.  There are 5 different line removal commands, including solid line removal, dotted line removal, broken line removal, and form line removal.
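
The sketch below shows, in Python, a very simplified take on solid line removal.  It is not the DotImage implementation; it assumes a bitonal image, a horizontal line exactly one pixel thick, and a made-up min_length parameter.  Long horizontal runs of black are erased, except for pixels that have black directly above and below them, which are treated as character strokes crossing the line and left in place so the characters stay connected.

```python
def remove_horizontal_lines(img, min_length):
    """Erase long one-pixel-thick horizontal runs, keeping crossing strokes.

    img is a list of rows; 1 = black pixel, 0 = white pixel.
    min_length is a hypothetical parameter for this sketch.
    """
    height, width = len(img), len(img[0])
    out = [row[:] for row in img]

    for y in range(height):
        x = 0
        while x < width:
            if img[y][x] != 1:
                x += 1
                continue
            # Measure the horizontal run of black pixels starting here.
            start = x
            while x < width and img[y][x] == 1:
                x += 1
            if x - start < min_length:
                continue  # too short to be a ruled line
            for cx in range(start, x):
                above = y > 0 and img[y - 1][cx] == 1
                below = y + 1 < height and img[y + 1][cx] == 1
                # Black both above and below: a stroke crosses the line here.
                if not (above and below):
                    out[y][cx] = 0
    return out
```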

Other algorithms available in this product include Auto Border Crop, Auto Negate Page, Blank Page Detection, Blob Removal, Margin Crop, Noise Removal, and Speck Removal.  I'll provide more examples in future posts.

All of these algorithms and more are available in the beta that was released on Friday.  You can try them out with the demo that is installed with the product (Advanced Document Cleanup Demo).  Licensing will be $1999 for the SDK, which includes 20 runtimes.  Additional runtimes cost $99 per machine or $999 per server.  This is in addition to DotImage Document Imaging, which is a required dependency.

We have tested these algorithms on many images, but this is a beta product and we're looking for our customers to test them on other images and report any problems.  Problems can be posted in our beta forums.

Now I'm off to Tech-Ed.  Hope to see some of you there!!