
Dataset Information


A new method of mark detection for software-based optical mark recognition.


ABSTRACT: Software optical mark recognition (SOMR) is the process whereby information entered on a survey form or questionnaire is converted into a machine-readable format using specialized software. SOMR normally requires input fields to be completely darkened, have no internal labels, or be filled with a soft pencil; otherwise mark detection will be inaccurate. Forms can also have print and scan artefacts that further increase the error rate. This article presents a new method of mark detection that improves on existing techniques based on pixel counting and simple thresholding. Its main advantage is that it can be used under a variety of conditions while maintaining a level of accuracy sufficient for scientific applications. Field testing showed no software misclassification in 5695 samples filled by trained personnel, and only two misclassifications in 6000 samples filled by untrained respondents. Sensitivity, specificity, and accuracy were 99.73%, 99.98%, and 99.94% respectively, even in the presence of print and scan artefacts, which was superior to the other methods tested. In a separate direct comparison of mark detection, sensitivity, specificity, and accuracy on clean forms were 99.7%, 100.0%, and 100.0% (new method); 96.3%, 96.0%, and 96.1% (pixel counting); and 99.9%, 99.8%, and 99.8% (simple thresholding). On forms with print artefacts they were 100.0%, 99.1%, and 99.3% (new method); 98.4%, 95.6%, and 96.2% (pixel counting); and 100.0%, 38.3%, and 51.4% (simple thresholding). This method is designed for bubble and box fields, while other types such as handwriting fields require separate error control measures.
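As a rough illustration of the baseline the abstract compares against, the sketch below shows a generic pixel-counting detector together with the three reported metrics. It is not the article's new method, which this record does not describe. The names `fill_ratio`, `is_marked`, and `detection_metrics`, and the parameters `dark_threshold` and `min_fill`, are hypothetical and chosen here only for illustration; the default values are arbitrary assumptions.

```python
from typing import Sequence, Tuple

# A field is a small grayscale image: rows of pixels, 0 = black, 255 = white.
Field = Sequence[Sequence[int]]


def fill_ratio(field: Field, dark_threshold: int = 128) -> float:
    """Return the fraction of pixels darker than dark_threshold (assumed value)."""
    pixels = [p for row in field for p in row]
    if not pixels:
        raise ValueError("field has no pixels")
    dark = sum(1 for p in pixels if p < dark_threshold)
    return dark / len(pixels)


def is_marked(field: Field, dark_threshold: int = 128, min_fill: float = 0.4) -> bool:
    """Pixel counting: call a field marked when enough of it is dark.

    min_fill is an assumed cut-off, not a value taken from the article.
    """
    return fill_ratio(field, dark_threshold) >= min_fill


def detection_metrics(
    predicted: Sequence[bool], actual: Sequence[bool]
) -> Tuple[float, float, float]:
    """Return (sensitivity, specificity, accuracy) for marked/unmarked calls."""
    if len(predicted) != len(actual):
        raise ValueError("predicted and actual must have the same length")
    pairs = list(zip(predicted, actual))
    tp = sum(1 for p, a in pairs if p and a)          # marked, detected
    tn = sum(1 for p, a in pairs if not p and not a)  # blank, left blank
    fp = sum(1 for p, a in pairs if p and not a)      # blank, called marked
    fn = sum(1 for p, a in pairs if not p and a)      # marked, missed
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one marked and one unmarked field")
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    accuracy = (tp + tn) / len(pairs)
    return sensitivity, specificity, accuracy
```

A fixed fill ratio of this kind treats any dark pixel as part of a mark, so a printed internal label or a print artefact inside a blank field raises its fill ratio. That is one plausible reason the abstract notes that SOMR normally requires fields without internal labels.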

SUBMITTER: Loke SC 

PROVIDER: S-EPMC6226159 | biostudies-other | 2018

REPOSITORIES: biostudies-other


Publications

A new method of mark detection for software-based optical mark recognition.

Loke SC, Kasmiran KA, Haron SA

PLoS One, 2018-11-09, issue 11



Similar Datasets

| S-EPMC3111373 | biostudies-literature
| S-EPMC6385457 | biostudies-literature
| S-EPMC7423836 | biostudies-literature
| S-EPMC6670212 | biostudies-literature
| S-EPMC8384131 | biostudies-literature
| S-EPMC9729193 | biostudies-literature
| S-EPMC2049055 | biostudies-literature
| S-EPMC8321889 | biostudies-literature
2017-10-03 | GSE102476 | GEO
| S-EPMC4914605 | biostudies-literature