| Preface | 6 |
---|
| Contents | 8 |
---|
| Contributors | 17 |
---|
| 1 Reading Systems: An Introduction to Digital Document Processing | 20 |
---|
| 1.1 Introduction | 20 |
| 1.2 Text Sensing | 22 |
| 1.3 Sensor Scope | 22 |
| 1.4 Sensor Grid | 25 |
| 1.5 Pre-processing | 25 |
| 1.6 Invariance to Affine Transforms | 26 |
| 1.7 Invariance to Ink-Trace Thickness | 28 |
| 1.8 Shape Features | 29 |
| 1.9 Processing Type | 31 |
| 1.10 Computing Architecture | 32 |
| 1.11 Computing Strategy | 32 |
| 1.12 Knowledge Base | 33 |
| 1.13 Cognitive Reliability | 34 |
| 1.14 Response in Case of Difficult Input | 34 |
| 1.15 Classification Accuracy | 35 |
| 1.16 Energy and Mental Concentration | 36 |
| 1.17 Processing Speed | 36 |
| 1.18 Volume Processing | 36 |
| 1.19 Summary of Human Versus Machine Reading | 37 |
| 1.20 Conclusion | 45 |
| References | 45 |
| 2 Document Structure and Layout Analysis | 48 |
---|
| 2.1 Introduction | 48 |
| 2.2 Pre-processing | 50 |
| 2.3 Representing Document Structure and Layout | 53 |
| 2.4 Document Layout Analysis | 55 |
| 2.5 Understanding Document Structure | 61 |
| 2.6 Performance Evaluation | 62 |
| 2.7 Handwritten Document Analysis | 64 |
| 2.8 Summary | 65 |
| References | 66 |
| 3 OCR Technologies for Machine Printed and Hand Printed Japanese Text | 68 |
---|
| 3.1 Introduction | 68 |
| 3.2 Pre-Processing | 68 |
| 3.3 Feature Extraction | 77 |
| 3.4 Classification | 80 |
| 3.5 Dimension Reduction | 82 |
| 3.6 Performance Evaluation of OCR Technologies | 83 |
| 3.7 Learning Algorithms | 86 |
| 3.8 Conclusion | 88 |
| References | 89 |
| 4 Multi-Font Printed Tibetan OCR | 91 |
---|
| 4.1 Introduction | 91 |
| 4.2 Properties of Tibetan Characters and Scripts | 92 |
| 4.3 Isolated Tibetan Character Recognition | 96 |
| 4.4 Tibetan Document Segmentation | 106 |
| 4.5 Experiment Results | 112 |
| 4.6 Summary | 114 |
| Acknowledgments | 114 |
| References | 114 |
| 5 On OCR of a Printed Indian Script | 117 |
---|
| 5.1 Introduction | 117 |
| 5.2 Origin and Properties of Indian Scripts | 118 |
| 5.3 Document Pre-Processing | 122 |
| 5.4 Character Recognition | 125 |
| 5.5 Performance Analysis | 132 |
| 5.6 Conclusion | 135 |
| Acknowledgments | 135 |
| References | 136 |
| 6 A Bayesian Network Approach for On-line Handwriting Recognition | 138 |
---|
| 6.1 Introduction | 138 |
| 6.2 Modelling of Character Components and Their Relationships | 141 |
| 6.3 Recognition and Training Algorithms | 147 |
| 6.4 Experimental Results and Analysis | 149 |
| 6.5 Conclusions | 156 |
| References | 157 |
| 7 New Advances and New Challenges in On- Line Handwriting Recognition and Electronic Ink Management | 159 |
---|
| 7.1 Introduction | 159 |
| 7.2 On-Line Handwriting Recognition Systems | 160 |
| 7.3 New Trends in On-Line Handwriting Recognition | 160 |
| 7.4 New Trends in Electronic Ink Management Systems | 164 |
| 7.5 Conclusion, Open Problems and New Challenges | 172 |
| References | 173 |
| 8 Off-Line Roman Cursive Handwriting Recognition | 181 |
---|
| 8.1 Introduction | 181 |
| 8.2 Methodology | 182 |
| 8.3 Emerging Topics | 187 |
| 8.4 Outlook and Conclusions | 191 |
| Acknowledgment | 192 |
| References | 192 |
| 9 Robustness Design of Industrial Strength Recognition Systems | 200 |
---|
| 9.1 Characterization of Robustness | 200 |
| 9.2 Complex Recognition System: Postal Address Recognition | 202 |
| 9.3 Performance Influencing Factors | 204 |
| 9.4 Robustness Design Principles | 209 |
| 9.5 Robustness Strategy for Implementation | 218 |
| 9.6 Conclusions | 224 |
| Acknowledgments | 224 |
| References | 225 |
| 10 Arabic Cheque Processing System: Issues and Future Trends | 228 |
---|
| 10.1 Introduction | 228 |
| 10.2 Datasets | 229 |
| 10.3 Legal Amount Processing | 230 |
| 10.4 Courtesy Amount Processing | 237 |
| 10.5 Conclusion and Future Perspective | 245 |
| References | 247 |
| 11 OCR of Printed Mathematical Expressions | 250 |
---|
| 11.1 Introduction | 250 |
| 11.2 Identification of Expressions in Document Images | 252 |
| 11.3 Recognition of Expression Symbols | 256 |
| 11.4 Interpretation of Expression Structure | 260 |
| 11.5 Performance Evaluation | 266 |
| 11.6 Conclusion and Future Research | 270 |
| References | 271 |
| 12 The State of the Art of Document Image Degradation Modelling | 275 |
---|
| 12.1 Introduction | 275 |
| 12.2 Document Image Degradations | 276 |
| 12.3 The Measurement of Image Quality | 278 |
| 12.4 Document Image Degradation Models | 280 |
| 12.5 Applications of Models | 284 |
| 12.6 Public-Domain Software and Image Databases | 286 |
| 12.7 Open Problems | 287 |
| Acknowledgments | 289 |
| References | 289 |
| 13 Advances in Graphics Recognition | 294 |
---|
| 13.1 Introduction | 294 |
| 13.2 Application Scenarios | 297 |
| 13.3 Early Processing | 300 |
| 13.4 Symbol Recognition and Indexing | 301 |
| 13.5 Architectures and Meta-data Modelling | 302 |
| 13.6 On-Line Graphics Recognition and Sketching Interfaces | 304 |
| 13.7 Performance Evaluation | 306 |
| 13.8 An Application Scenario: Interpretation of Architectural Sketches | 307 |
| 13.9 Conclusions: Sketching the Future | 308 |
| Acknowledgment | 310 |
| Reference
|