Optical Character Recognition: Fundamentals and Applications
By Fouad Sabry
()
About this ebook
What Is Optical Character Recognition
OCR, also known as optical character recognition, is the process of electronically or mechanically converting images of typed, handwritten, or printed text into machine-encoded text. This can be done from a scanned document, a photo of a document, a scene photo, or from subtitle text that is superimposed on an image.
How You Will Benefit
(I) Insights, and validations about the following topics:
Chapter 1: Optical character recognition
Chapter 2: Typeface
Chapter 3: Handwriting recognition
Chapter 4: Image scanner
Chapter 5: Optical mark recognition
Chapter 6: Computer font
Chapter 7: Intelligent character recognition
Chapter 8: Tesseract (software)
Chapter 9: Comparison of optical character recognition software
Chapter 10: OCR Systems
(II) Answering the public top questions about optical character recognition.
(III) Real world examples for the usage of optical character recognition in many fields.
(IV) 17 appendices to explain, briefly, 266 emerging technologies in each industry to have 360-degree full understanding of optical character recognition' technologies.
Who This Book Is For
Professionals, undergraduate and graduate students, enthusiasts, hobbyists, and those who want to go beyond basic knowledge or information for any kind of optical character recognition.
Read more from Fouad Sabry
Related to Optical Character Recognition
Titles in the series (100)
Multilayer Perceptron: Fundamentals and Applications for Decoding Neural Networks Rating: 0 out of 5 stars0 ratingsRestricted Boltzmann Machine: Fundamentals and Applications for Unlocking the Hidden Layers of Artificial Intelligence Rating: 0 out of 5 stars0 ratingsHopfield Networks: Fundamentals and Applications of The Neural Network That Stores Memories Rating: 0 out of 5 stars0 ratingsConvolutional Neural Networks: Fundamentals and Applications for Analyzing Visual Imagery Rating: 0 out of 5 stars0 ratingsControl System: Fundamentals and Applications Rating: 0 out of 5 stars0 ratingsStatistical Classification: Fundamentals and Applications Rating: 0 out of 5 stars0 ratingsKernel Methods: Fundamentals and Applications Rating: 0 out of 5 stars0 ratingsHybrid Neural Networks: Fundamentals and Applications for Interacting Biological Neural Networks with Artificial Neuronal Models Rating: 0 out of 5 stars0 ratingsAlternating Decision Tree: Fundamentals and Applications Rating: 0 out of 5 stars0 ratingsFeedforward Neural Networks: Fundamentals and Applications for The Architecture of Thinking Machines and Neural Webs Rating: 0 out of 5 stars0 ratingsArtificial Neural Networks: Fundamentals and Applications for Decoding the Mysteries of Neural Computation Rating: 0 out of 5 stars0 ratingsCompetitive Learning: Fundamentals and Applications for Reinforcement Learning through Competition Rating: 0 out of 5 stars0 ratingsPerceptrons: Fundamentals and Applications for The Neural Building Block Rating: 0 out of 5 stars0 ratingsRecurrent Neural Networks: Fundamentals and Applications from Simple to Gated Architectures Rating: 0 out of 5 stars0 ratingsEmbodied Cognition: Fundamentals and Applications Rating: 0 out of 5 stars0 ratingsHebbian Learning: Fundamentals and Applications for Uniting Memory and Learning Rating: 0 out of 5 stars0 ratingsAttractor Networks: Fundamentals and Applications in Computational Neuroscience Rating: 0 out of 5 stars0 ratingsHierarchical Control System: Fundamentals and Applications Rating: 0 out of 5 stars0 ratingsBio Inspired Computing: Fundamentals and Applications for Biological Inspiration in the Digital World Rating: 0 out of 5 stars0 ratingsLong Short Term Memory: Fundamentals and Applications for Sequence Prediction Rating: 0 out of 5 stars0 ratingsRadial Basis Networks: Fundamentals and Applications for The Activation Functions of Artificial Neural Networks Rating: 0 out of 5 stars0 ratingsGroup Method of Data Handling: Fundamentals and Applications for Predictive Modeling and Data Analysis Rating: 0 out of 5 stars0 ratingsArtificial Immune Systems: Fundamentals and Applications Rating: 0 out of 5 stars0 ratingsNouvelle Artificial Intelligence: Fundamentals and Applications for Producing Robots With Intelligence Levels Similar to Insects Rating: 0 out of 5 stars0 ratingsBackpropagation: Fundamentals and Applications for Preparing Data for Training in Deep Learning Rating: 0 out of 5 stars0 ratingsK Nearest Neighbor Algorithm: Fundamentals and Applications Rating: 0 out of 5 stars0 ratingsNaive Bayes Classifier: Fundamentals and Applications Rating: 0 out of 5 stars0 ratingsLearning Intelligent Distribution Agent: Fundamentals and Applications Rating: 0 out of 5 stars0 ratingsAgent Architecture: Fundamentals and Applications Rating: 0 out of 5 stars0 ratingsEmbodied Cognitive Science: Fundamentals and Applications Rating: 0 out of 5 stars0 ratings
Related ebooks
Optical Character Recognition: Unlocking the Power of Computer Vision for Optical Character Recognition Rating: 0 out of 5 stars0 ratingsIntelligent Word Recognition: Fundamentals and Applications Rating: 0 out of 5 stars0 ratingsIntelligent Character Recognition: Advancing Machine Perception in Computer Vision Rating: 0 out of 5 stars0 ratingsHandwriting Recognition: Fundamentals and Applications Rating: 0 out of 5 stars0 ratingsSmart Camera: Revolutionizing Visual Perception with Computer Vision Rating: 0 out of 5 stars0 ratingsLearn Computer Vision Using OpenCV: With Deep Learning CNNs and RNNs Rating: 0 out of 5 stars0 ratingsDictionary of Computer Terms Rating: 1 out of 5 stars1/5Face Detection and Recognition on Mobile Devices Rating: 1 out of 5 stars1/5Learn IoT Programming Using Node-RED: Begin to Code Full Stack IoT Apps and Edge Devices with Raspberry Pi, NodeJS, and Grafana Rating: 0 out of 5 stars0 ratingsRobot Operating System (ROS): The Complete Reference (Volume 6) Rating: 0 out of 5 stars0 ratingsColor Profile: Exploring Visual Perception and Analysis in Computer Vision Rating: 0 out of 5 stars0 ratingsOptical Braille Recognition: Empowering Accessibility Through Visual Intelligence Rating: 0 out of 5 stars0 ratingsMastering OpenCV Android Application Programming Rating: 0 out of 5 stars0 ratingsAutomatic Number Plate Recognition: Fundamentals and Applications Rating: 0 out of 5 stars0 ratingsImage Collection Exploration: Unveiling Visual Landscapes in Computer Vision Rating: 0 out of 5 stars0 ratingsLearning OpenCV 3 Computer Vision with Python - Second Edition Rating: 0 out of 5 stars0 ratingsAutomatic Image Annotation: Enhancing Visual Understanding through Automated Tagging Rating: 0 out of 5 stars0 ratingsAutomatic Target Recognition: Advances in Computer Vision Techniques for Target Recognition Rating: 0 out of 5 stars0 ratingsLearn OpenCV with Python by Examples Rating: 0 out of 5 stars0 ratingsARM® Cortex® M4 Cookbook Rating: 4 out of 5 stars4/5Automatic Target Recognition: Fundamentals and Applications Rating: 0 out of 5 stars0 ratingsLearning D3.js Mapping Rating: 0 out of 5 stars0 ratingsRust for Network Programming and Automation Rating: 0 out of 5 stars0 ratingsAutomatic Image Annotation: Fundamentals and Applications Rating: 0 out of 5 stars0 ratingsPipelined Processor Farms: Structured Design for Embedded Parallel Systems Rating: 0 out of 5 stars0 ratingsDigital Technologies – an Overview of Concepts, Tools and Techniques Associated with it Rating: 0 out of 5 stars0 ratings
Intelligence (AI) & Semantics For You
2084: Artificial Intelligence and the Future of Humanity Rating: 4 out of 5 stars4/5101 Midjourney Prompt Secrets Rating: 3 out of 5 stars3/5Summary of Super-Intelligence From Nick Bostrom Rating: 5 out of 5 stars5/5ChatGPT Ultimate User Guide - How to Make Money Online Faster and More Precise Using AI Technology Rating: 0 out of 5 stars0 ratingsDark Aeon: Transhumanism and the War Against Humanity Rating: 5 out of 5 stars5/5The Secrets of ChatGPT Prompt Engineering for Non-Developers Rating: 5 out of 5 stars5/5Mastering ChatGPT: 21 Prompts Templates for Effortless Writing Rating: 5 out of 5 stars5/5ChatGPT For Fiction Writing: AI for Authors Rating: 5 out of 5 stars5/5ChatGPT For Dummies Rating: 0 out of 5 stars0 ratingsArtificial Intelligence: A Guide for Thinking Humans Rating: 4 out of 5 stars4/5What Makes Us Human: An Artificial Intelligence Answers Life's Biggest Questions Rating: 5 out of 5 stars5/5Dancing with Qubits: How quantum computing works and how it can change the world Rating: 5 out of 5 stars5/5Chat-GPT Income Ideas: Pioneering Monetization Concepts Utilizing Conversational AI for Profitable Ventures Rating: 4 out of 5 stars4/5Impromptu: Amplifying Our Humanity Through AI Rating: 5 out of 5 stars5/5Our Final Invention: Artificial Intelligence and the End of the Human Era Rating: 4 out of 5 stars4/5Creating Online Courses with ChatGPT | A Step-by-Step Guide with Prompt Templates Rating: 4 out of 5 stars4/5ChatGPT: The Future of Intelligent Conversation Rating: 4 out of 5 stars4/5A Quickstart Guide To Becoming A ChatGPT Millionaire: The ChatGPT Book For Beginners (Lazy Money Series®) Rating: 4 out of 5 stars4/5How To Become A Data Scientist With ChatGPT: A Beginner's Guide to ChatGPT-Assisted Programming Rating: 5 out of 5 stars5/5TensorFlow in 1 Day: Make your own Neural Network Rating: 4 out of 5 stars4/5
Reviews for Optical Character Recognition
0 ratings0 reviews
Book preview
Optical Character Recognition - Fouad Sabry
Chapter 1: Optical character recognition
From a scanned document, a photo of the document, a scene-photo (such as the text on signs and billboards in a landscape photo), or subtitle text superimposed on an image, optical character recognition (OCR) is the electronic or mechanical conversion of images of typed, handwritten, or printed text into machine-encoded text (for example: from a television broadcast).
It is a common method of digitizing printed texts for electronic editing, searching, compact storage, online display, and use in machine processes like cognitive computing, machine translation, (extracted) text-to-speech, and other suitable documentation, such as passport documents, invoices, bank statements, computerized receipts, business cards, mail, printouts of static-data, and other suitable documentation. Pattern recognition, AI, and computer vision all contribute to OCR.
Earlier versions only supported a single typeface and required training with photos of each character. Modern systems often handle many digital picture file formats and can provide a high level of identification accuracy for most typefaces. Some implementations may generate a copy of the page with all the formatting details preserved, such as graphics, columns, and other non-textual elements.
Telegraphy and the development of reading aids for the blind are two possible antecedents of modern optical character recognition.
For examining microfilm archives using an optical code recognition system, Emanuel Goldberg created what he dubbed a Statistical Machine
in the 1920s and 1930s. In 1931, he received U.S. Patent 1,838,389 for his creation. IBM now owns the patent.
After developing omni-font OCR, which could read text written in almost any typeface, Ray Kurzweil founded Kurzweil Computer Products, Inc. in 1974. (Kurzweil is often credited with inventing omni-font OCR, but it was in use by companies, including CompuScan, in the late 1960s and 1970s.) The optimum use for this technology, Kurzweil reasoned, would be to develop a reading machine for the blind, which would enable the visually impaired to have a computer read aloud whatever text the user inputs. Two key technologies, the CCD flatbed scanner and the text-to-speech synthesizer, had to be developed for this gadget to become a reality. The final product was introduced during a press conference on January 13, 1976, chaired by Kurzweil and the National Federation of the Blind. The first commercial version of the optical character recognition software was released by Kurzweil Computer Products in 1978. As one of the first adopters, LexisNexis purchased the software in order to include news articles and legal briefs into its fledgling online databases. Kurzweil sold his firm to Xerox two years later because of the latter's desire to commercialize text conversion from paper to computer. Scansoft, which had been spun off by Xerox, later merged with Nuance Communications.
On the 2000s, OCR was made accessible in the cloud, on mobile devices, and in real-time translation of foreign-language signs using a smartphone via services like WebOCR. With the rise of internet-connected mobile devices like smartphones and smartwatches, OCR is being put to use in apps that pull text from images taken with the device's camera. If the device doesn't have OCR capabilities integrated into the OS, an OCR API will be used to read the picture file and extract the text. The OCR API sends the extracted text back to the program on the device, together with information on where in the original picture the text was recognized, so that the app may do something with it (like convert it to voice or display it).
Latin, Cyrillic, Arabic, Hebrew, Indic, Bengali (Bangla), Devanagari, Tamil, Chinese, Japanese, and Korean characters are all supported by a wide variety of commercial and open source OCR systems.
Receipt OCR, Invoice OCR, Check OCR, and Legal Billing Document OCR are just some of the numerous types of domain-specific OCR applications that have been built on top of OCR engines.
You may put them to use in:
Keying in information from corporate records such checks, passports, invoices, bank statements, and receipts
License plate reading software
At airports, for scanning passports and extracting data
Key data extraction from insurance papers automatically
The ability to read traffic signs
Adding contact details from a business card
Accelerate the process of converting printed materials into text, such scanning books for Project Gutenberg.
Make it possible to search digital copies of printed books and magazines like Google Books.
Using real-time handwriting recognition to operate a computer (pen computing)
Getting around CAPTCHA anti-bot systems, even if they are meant to stop OCR. CAPTCHA anti-bot system strength testing is another possible goal.
Equipment designed to aid the sight impaired
Vehicle instructions are written by locating CAD pictures in a database that are relevant to the ever-evolving vehicle design.
The process of transforming scanned documents into searchable PDFs
Typewritten text is what optical character recognition (OCR) focuses on. It does this by analyzing each individual glyph or letter.
Word-by-word scanning of printed text is the focus of optical word recognition (for languages that use a space as a word divider). (Commonly abbreviated to OCR
)
Handwritten printscript or cursive writing is another target of intelligent character recognition (ICR), which focuses on each individual glyph or letter.
Word-by-word analysis of handwriting in printscript or cursive is another goal of intelligent word recognition (IWR). This is particularly helpful for languages that employ cursive writing without separating the glyphs.
Optical character recognition is often a offline
procedure that evaluates a preserved document. Online OCR APIs are offered by certain cloud-based services. It is possible to provide data about a writer's hand movements into a handwriting recognition system. This method is superior than solely relying on the forms of glyphs and words since it can record actions like the order in which parts are drawn, the direction taken, and the rhythm of when the pen is laid down and picked up. This supplementary data has the potential to improve the overall precision of the procedure. On-line character recognition, dynamic character recognition, real-time character recognition, and intelligent character recognition are all terms used to describe this technology.
The success rate of optical character recognition (OCR) software is sometimes increased by pre-processing
of pictures. Methods Can Include:
Text lines may be made completely horizontal or vertical by using a de-skew tool if the original document was not scanned in the correct orientation.
Despeckle: to get rid of