Sunday, 5 January 2014

Optical Character Recognition

OCR Technology - +Maven Scientists 


Introduction

Optical Character Recognition technology, usually abbreviated as OCR.
It is the mechanical or electronic conversion of scanned or photoed images of typewritten or printed text into machine-encoded/computer-readable text. It is widely used as a form of data entry from some sort of original paper data source, whether passport documents, invoices, bank statement, receipts, business card, mail, or any number of printed records. It is a common method of digitizing printed texts so that they can be electronically edited, searched, stored more compactly, displayed on-line, and used in machine processes such as machine translation, text-to-speech, key data extraction and text mining. OCR is a field of research in pattern recognition, artificial intelligence and computer vision.

Note: In lame words, OCR technology is used to extract the text from image into text format.


In early versions of OCR, it needed to be programmed with images of each character, and worked on one font at a time. "Intelligent" systems with a high degree of recognition accuracy for most fonts are now common. Some commercial systems are capable of reproducing formatted output that closely approximates the original scanned page including images, columns and other non-textual components.

Biggest Advantage

The biggest advantage of this technology is for the blind users.

Companies using OCR

Big tech giant companies such as Microsoft, Google and Amazon are already using this technology in their products amongst many other.

The products in which you already use this technology are:
1. Google Books
2. Adobe Reader
3. Microsoft Windows Phone 8 (Bing Search)


Get Ebooks delivered to your email id

Comments

Subscribe to our channel

Facebook

Powered by Blogger.

Home | Contact Us | DMCA | Terms of Service | Privacy | Advertise

Maven Scientists