Ubuntu QA:
BlogBrainstorm
Log in
Ubuntu QA
The Ubuntu community has contributed 12252 ideas, 57766 comments, 1176667 votes

Idea #9997: Scanner Use - Automated Seek for Optimum Settings



up
37
down
Written by Tree MendUs the 18 Jun 08 at 10:30. Category: Hardware support.
Related to: Nothing/Others. Status: New
Description
Problem ;

In scanner software there can be many different settings to adjust, to get the best result.
Each adjustment may need some playing with, and then a test scan done to check the result, or have look at an estimated preview.

This can take quite a bit of time.


Objective ;

The user is usually trying to optimise results in terms of parameters like ;
1) Clarity (sharpness, contrast, etc)
2) Colour Balance
3) OCR success rate
4) Minimal spelling mistakes in OCR

Solution ;

These (performance) parameters can be measured.

Thumb Nailing tests (like in photography) are a handy way to preview the results of a range of settings and compare them. They can also be used to compare the variation of one setting (horizontally on the page/preview display), while also varying another setting (vertically).

Previews may be calculated as "estimates" of what the result "may" look like. But thumbnails could be "done for real", using different settings for different parts of the image, or for the same small representative part of the image.

The user could choose whether they want to use the preview estimates or scanned thumbnail results for testing.

The user could click on the preview estimates or scanned thumbnail results to choose which one gives the best result. This is a bit like aligning the inkjet cartridges on most printers.
The user could also select the other results which are acceptable, which would help the software "learn" what the range of guesses should be in any future (or next) guesses.

The program could also ask the user to identify preview estimates that were inaccurate when compared to their actual resulting thumbnail - or the software could maybe make those matches itself.

For OCR, the spell checker can be used to check the OCR of the letters, and settings varied to give the best results using the spell checker. In areas of the page where OCR is needed, results may be improved by blurring rather than sharpening, and letter detection based on min and max line/curve width.

After a while, it might be possible to insert a page, the scanner does a high res scan, detects the quality of media, and knowing the objective of the scan (picture, text, or both) would detect the optimum settings, and set the scan to the optimum settings (option to ask the user if they are happy).

This would result in pictorial scans that would need minimal(if any) post scan processing (with loss of quality and wasted time).
This would result in OCR scans with very high recognition rates compared to other systems.

The optimum settings, and ranges for best guesses are useful data that could be sent back to the Ubuntu or Scanner software website, so these can be made available for statistical comparison, incorporation as optional default settings, and download by other users.

Attachments
No attachments.


Duplicates


Comments
Auzy wrote on the 18 Jun 08 at 12:21
So a benchmark system for scanners. I think this would be a great idea.

It would be good if we could provide users with accurate ETA's for operations, and set optimal parameters for different scanners.

andruk wrote on the 19 Jun 08 at 07:37
I, as usual, agree with Auzy.

+1

Warbo wrote on the 19 Jun 08 at 11:16
This sounds very useful, but also cool at the same time. It's like the artificial selection process used in crafting genetic algorithms (compare results, those acceptable are chosen as "fit" and survive, the next batch of permutations is based on those surviving the last generation). The longer spent fine-tuning the better the results get :)

Tree MendUs wrote on the 23 Jul 08 at 06:01
andruk, Thank you for agreeing with Auzy.

Warbo, "genetic algorithms" sounds interesting.
It just seems to me that computers are a little under-used regarding the ability to 'optimise" results.

We tell them what to do.
We (it is us who) check their results.
We make decisions for improvement, then we tell them what to do again.

What if ...
we let them learn from the decisions that get made, either by emulation (a kind of statistical copying) or by tuition (we tell them the decision process - programmed decisions - systems analysis).

Then we tell them what to do.
They check the results.
They tell themselves what to do.
We let them do that a number of times, and then check that they are making the right decisions.

They can make it easy for use to check the decisions for optimum, by giving us a range of variations on the result.
we choose what we like form the range. We may even learn that there are factors to be considered when making decisions, that we had forgotten to include in the decision making process.
So we even get a chance
to correct

Our Own Thinking.


Post your comment