What is OCR & why it makes your life easier
Optical character recognition, or OCR, is the mechanical or electronic conversion of scanned images of handwritten, typed or printed text into machine-encoded text.
In this blog article, you’ll learn about:
- What the heck is OCR
- How does optical character recognition work – explained for non techies
- Why OCR is the new marketing gadget
Just keep on reading and you will get the answers you’re looking for and not end up confused.
Explaining a complex technology can result in a text that is horrible to read: full of technical vocabulary, confusing explanations and badly chosen examples. We cannot describe OCR without using any terminology at all, but we will keep it to a minimum. The good news is that you do not need to be a hardcore techie to learn what OCR is and how it works.
What is OCR?
As already mentioned, OCR stands for optical character recognition. The technology deals with the problem of recognizing all different kinds of characters. Both handwritten and printed characters can be recognized and converted into machine-readable text.
Think of any kind of serial number or code consisting of numbers and letters that you need digitized. By using OCR you can transform those codes into digital output. The technology makes use of several techniques. Put very simply, the captured image is preprocessed, and the characters are then extracted and recognized. We will get to these techniques a little later in this article.
What OCR does not take into account is the actual nature of the object that you want to scan. It simply “takes a look” at the text that you aim to transform. If you want the device to recognize both the nature of the object and the text on it, you need to combine different technologies. Take a look at what you can do by combining OCR and augmented reality, for example.
Different techniques of OCR
Let’s have a look at three steps of optical character recognition: image preprocessing, character recognition itself and the post-processing of the output.
OCR software often preprocesses images to improve the chances of successful recognition. The aim of image preprocessing is to improve the image data: unwanted distortions are suppressed and specific image features are enhanced, both of which matter for the steps that follow.
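For the more technically curious, here is a minimal Python sketch of one common preprocessing technique: binarization, which turns a grayscale image into pure "ink" and "background" pixels using Otsu's threshold. This is an illustration only. The article does not state which preprocessing methods any particular OCR product uses, and the tiny 4x4 image and the function names are made up for the example.

```python
def otsu_threshold(pixels):
    """Return the grey level (0-255) that best separates dark from light pixels."""
    histogram = [0] * 256
    for value in pixels:
        histogram[value] += 1

    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(histogram))

    best_threshold, best_variance = 0, -1.0
    weight_dark, sum_dark = 0, 0
    for level in range(256):
        weight_dark += histogram[level]
        sum_dark += level * histogram[level]
        if weight_dark == 0:
            continue  # no dark pixels yet
        weight_light = total - weight_dark
        if weight_light == 0:
            break  # every pixel is already in the dark class
        mean_dark = sum_dark / weight_dark
        mean_light = (sum_all - sum_dark) / weight_light
        # Between-class variance: large when the two groups are far apart.
        variance = weight_dark * weight_light * (mean_dark - mean_light) ** 2
        if variance > best_variance:
            best_variance, best_threshold = variance, level
    return best_threshold


def binarize(image):
    """Map a grayscale grid to 1 (ink) and 0 (background)."""
    flat = [value for row in image for value in row]
    threshold = otsu_threshold(flat)
    return [[1 if value <= threshold else 0 for value in row] for row in image]


# Hypothetical grayscale patch: three dark pixels on a light background.
image = [
    [200, 210, 205, 198],
    [202,  40,  35, 207],
    [199,  38, 203, 201],
    [204, 206, 200, 209],
]
binary = binarize(image)
```

After this step the slight brightness differences in the background are gone, and only the three dark pixels remain marked as ink.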
For the actual character recognition part, it is important to understand what feature extraction is. When the input data to an algorithm is too large to be processed directly, only a reduced set of features is selected. The selected features are expected to be the important ones, while those suspected to be redundant are discarded. Working with the reduced set of data instead of the initial large one improves performance.
For the process of OCR this is important because the algorithm has to detect specific portions or shapes of a digitized image or video stream.
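One simple way to picture feature extraction is "zoning": instead of looking at every pixel, the image of a character is split into a few zones and only the share of ink in each zone is kept. The Python sketch below is an illustrative assumption, not the method of any specific OCR engine. The glyph and the function name are invented for the example, and the code assumes the glyph's height and width divide evenly into the zones.

```python
def zone_features(glyph, zone_rows=2, zone_cols=2):
    """Split a binary glyph into zones and return the ink density of each zone."""
    height, width = len(glyph), len(glyph[0])
    zone_h, zone_w = height // zone_rows, width // zone_cols
    features = []
    for zr in range(zone_rows):
        for zc in range(zone_cols):
            # Count ink pixels (value 1) inside the current zone.
            ink = sum(
                glyph[r][c]
                for r in range(zr * zone_h, (zr + 1) * zone_h)
                for c in range(zc * zone_w, (zc + 1) * zone_w)
            )
            features.append(ink / (zone_h * zone_w))
    return features


# Hypothetical 4x4 binary image of the letter "L" (1 = ink, 0 = background).
LETTER_L = [
    [1, 0, 0, 0],
    [1, 0, 0, 0],
    [1, 0, 0, 0],
    [1, 1, 1, 1],
]
features = zone_features(LETTER_L)  # [0.5, 0.0, 0.75, 0.5]
```

Sixteen pixel values are reduced to four numbers that still capture the shape: a lot of ink on the left and along the bottom, none in the top right.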
Post-processing is a further error-correction step that helps keep OCR accuracy high. Accuracy can be improved further if the output is restricted by a lexicon: the algorithm can then fall back on a list of words that are allowed to occur in the scanned document.
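The lexicon idea can be sketched in a few lines of Python using the standard library's `difflib` module. This is only a sketch under assumptions: the word list, the similarity cutoff of 0.75 and the function name are chosen for illustration, and real OCR systems may use quite different matching strategies.

```python
import difflib


def correct_with_lexicon(word, lexicon, cutoff=0.75):
    """Replace a recognized word with its closest lexicon entry, if one is close enough."""
    if word in lexicon:
        return word
    matches = difflib.get_close_matches(word, lexicon, n=1, cutoff=cutoff)
    # Keep the original word when nothing in the lexicon is similar enough.
    return matches[0] if matches else word


# Hypothetical lexicon for an invoice-scanning use case.
LEXICON = ["invoice", "total", "amount", "date"]

fixed_first = correct_with_lexicon("lnvoice", LEXICON)   # "l" misread for "i"
fixed_second = correct_with_lexicon("t0tal", LEXICON)    # "0" misread for "o"
unchanged = correct_with_lexicon("qwerty", LEXICON)      # nothing similar, left alone
```

The cutoff is the important design choice here: set it too low and unrelated words get "corrected" into lexicon entries, set it too high and genuine misreadings slip through.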
Depending on the application, OCR is used not only for proper words but also for numbers and codes. To deal better with different types of input, OCR providers started to develop specialized OCR systems that can handle these particular kinds of images. To further improve recognition accuracy, they combined various optimization techniques, for example business rules, standard expressions or the rich information contained in color images. This strategy of merging various optimization techniques is called “application oriented OCR” or “customized OCR”. It is used in fields like business card OCR, invoice OCR or ID card OCR.
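To give a feel for how such rules help, the Python sketch below assumes a made-up code format of three letters followed by four digits. Because the format is known in advance, typical look-alike confusions (such as the letter O and the digit 0) can be repaired by position, and anything that still does not fit the pattern is rejected. The format, the confusion table and the function name are all hypothetical.

```python
import re

# Assumed format for this example only: three uppercase letters, then four digits.
CODE_PATTERN = re.compile(r"[A-Z]{3}[0-9]{4}")

# Characters that are easily confused with each other in print.
TO_DIGIT = str.maketrans("OISB", "0158")
TO_LETTER = str.maketrans("0158", "OISB")


def normalize_code(raw):
    """Repair look-alike characters by position; return None if the code is still invalid."""
    text = raw.strip().upper()
    if len(text) != 7:
        return None
    # The first three positions must be letters, the last four must be digits.
    candidate = text[:3].translate(TO_LETTER) + text[3:].translate(TO_DIGIT)
    return candidate if CODE_PATTERN.fullmatch(candidate) else None


repaired = normalize_code("A8C12O4")   # "8" read for "B", "O" read for "0"
rejected = normalize_code("AB01Z34")   # "Z" cannot be a digit, so no match
```

Returning `None` instead of a best guess lets the app ask the user to scan again rather than silently accepting a wrong code.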
Possibilities using OCR
Optical character recognition software has a wide range of possible uses. As already mentioned, OCR can be combined with other technologies such as augmented reality. But the technology is already very powerful on its own.
Here are a few examples of possible use cases including OCR software:
Passports and IDs have a machine readable zone (MRZ) that can be scanned. OCR can speed up the process of identifying and registering people at borders or other checkpoints. It is therefore useful for immigration officers and other security personnel.
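One reason MRZ scanning is reliable is that the MRZ contains check digits, so a scanner can verify what it has read. The Python sketch below follows the check digit scheme defined in ICAO Doc 9303 (repeating weights 7, 3, 1, with letters valued 10 to 35 and the filler character `<` valued 0). The article itself does not describe this step, so treat it as background illustration; the function names are our own, and the sample values come from the ICAO specimen document.

```python
def mrz_char_value(char):
    """Numeric value of one MRZ character according to ICAO Doc 9303."""
    if char in "0123456789":
        return int(char)
    if "A" <= char <= "Z":
        return ord(char) - ord("A") + 10  # A=10 ... Z=35
    if char == "<":
        return 0  # filler character
    raise ValueError(f"invalid MRZ character: {char!r}")


def mrz_check_digit(field):
    """Compute the check digit for an MRZ field using the repeating weights 7, 3, 1."""
    weights = (7, 3, 1)
    total = sum(mrz_char_value(c) * weights[i % 3] for i, c in enumerate(field))
    return total % 10


def field_is_consistent(field, printed_check_digit):
    """True if the scanned field matches the check digit printed next to it."""
    return mrz_check_digit(field) == int(printed_check_digit)


document_number_ok = field_is_consistent("L898902C3", "6")  # specimen document number
birth_date_ok = field_is_consistent("740812", "2")          # specimen date of birth
misread_detected = not field_is_consistent("L898902C8", "6")  # "3" misread as "8"
```

If a check fails, the scanner knows at least one character was misread and can try again on the next camera frame.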
There are a lot of innovative mobile marketing campaigns out there. Many companies use codes to engage their customers in a little competition. Think of all the voucher codes that customers can redeem by typing them in, or the numbers printed on the inside of a bottle cap that you need to collect. All those campaigns can make use of OCR by integrating the software into an app that often already exists. That way they lower the hurdle of online registration and spare customers from typing in a series of numbers and letters.
Have a look at how Karlsberg used OCR in their marketing campaign:
The International Bank Account Number (IBAN) serves to identify bank accounts across borders. IBANs come in different lengths and can consist of numbers as well as letters. To ease cross-border transactions, banking apps can integrate OCR software. That way their customers can scan an IBAN instead of tediously typing it in.
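A scanned IBAN can also be checked automatically, because every IBAN carries two check digits that are validated with a "mod 97" calculation. The Python sketch below shows that standard check. It is a simplified illustration: it does not verify country-specific lengths or formats, the function name is our own, and the IBANs used are widely published sample numbers rather than real accounts.

```python
def is_valid_iban(raw):
    """Check the structure and the mod-97 checksum of an IBAN (no country-specific rules)."""
    iban = raw.replace(" ", "").upper()
    if not (15 <= len(iban) <= 34) or not (iban.isascii() and iban.isalnum()):
        return False
    # Move the country code and check digits to the end.
    rearranged = iban[4:] + iban[:4]
    # Replace each letter with two digits (A=10 ... Z=35); digits stay as they are.
    digits = "".join(str(int(ch, 36)) for ch in rearranged)
    return int(digits) % 97 == 1


sample_ok = is_valid_iban("GB82 WEST 1234 5698 7654 32")    # published sample IBAN
misread = is_valid_iban("GB82 WEST 1234 5698 7654 33")      # last digit misread
```

Combined with OCR, a check like this means a single misread character is caught before the transfer form is ever filled in.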
There are a lot of optical character recognition tools that specialize in one specific use case, for example credit card scanning or document scanning. But OCR can be useful in so many different parts of our lives that it gets annoying to use different software for every use case.
Tesseract is an open source OCR engine that has gained popularity among OCR developers. Even though it can sometimes be painful to implement and modify, for the longest time there weren’t many free and powerful OCR alternatives on the market.
Anyline offers an OCR SDK that you can also download for free and which, in contrast to Tesseract, works perfectly on mobile.
Anyline is a Gartner Cool Vendor
Gartner, the worldwide research institute famous for creating the hype cycle model to predict the maturity of emerging technologies, named Anyline as a Cool Vendor for supply chain execution in 2018.
Gartner Cool Vendors exist across all major areas of technology innovation. Those that stand out, do so because they offer some disruptive capability or opportunity. Anyline has been listed as a Cool Vendor because of their mobile OCR technology. This technology is backed by artificial intelligence and machine learning to provide the most accurate scan possible.
“This is a huge success for us. Gartner are one of the best known research institutes in the tech industry. Their work on the “Magic Quadrant” and the “Hype Cycle” have helped to guide companies for years. Growing enterprises around the world rely on the analysis and reports they provide to choose only the best service providers. To be recognized by them is to receive the gold standard of approval within our field.”
– Anyline CEO, Lukas Kinigadner.
Anyline was recognized by Gartner because our technology supports the digitization of all kinds of data. By removing typing and manual data entry from everyday processes, you can save time, money and resources while also improving your data quality.
Test Your Use Case with the Anyline OCR Scanner App
You can test Anyline’s mobile scanning technology for yourself with our demo app. The Anyline OCR Scanner is a free app available on the App Store and Google Play that will help you to assess your scanning use cases. It includes more than 10 scanning solutions, works offline and has a scan accuracy of more than 99%!
Try it for Yourself – Start Your Free Anyline Demo Now
Anyline’s mobile scanning technology is available to integrate with your mobile app as a cross-platform SDK. Test out our fully supported solutions on Android, iOS and Windows devices today. Download our SDK and begin your 30-day free trial to see how mobile scanning can revolutionize your services and processes!
The SDK demo includes access to all of the following solutions:
- License plate scanning
- Passport MRZ scanning
- Utility meter scanning
- Serial number scanning
- Document scanning
- Driving license scanning
- And more!
Don’t see a solution for your scanning use case? Get in touch now to discuss your solution!
Follow Us For All The Latest Scanning News!
You can also join our mailing list to get all Anyline announcements sent straight to your inbox. Sign-up is simple and we don’t send too many mails!
Finally, if you have questions, suggestions or feedback on this post, please don’t hesitate to reach out to us. Contact us via social media or send us an email at [email protected]!