Disclaimer: This is not a startup company -Â These are some bright young students, they only needs encouragement and motivation. Maybe their current solution is not that good but still we all start with low quality solutions and improve it over the period of time.Â Lets look at this from that point of view.
Optical Character Recognition is a unique approach developed for recognizing isolated character that requires less complex calculations but still giving adequate results. In case of document image recognition, an additional step of detecting lines of text and possible set of character among those lines is a requisite. There are numerous methods available for character recognition. From numerical and statistical approach to AI based approach in an increasing order of their recognition accuracy, respectively. None of the approaches stated has recognition accuracy of 100%.Even the humans are not credited with absolute recognition accuracy. The main objective of the recognition software is to help its user in more physically tiring and cumbersome work of actually typing the whole document especially for a user. The error correction still resides with its user only. Hence, a recognition accuracy of even about 90% gives very satisfactory results. Apart from all this, the image quality also plays a very important role in the recognition accuracy.
So, a research project named Urdu OCR – A Digital Dream from Usman Institute of Technology fulfilling the needs. The team members of this project are Abdul Wahab, Shuwair Sardar, and Muhammad Abdul Sammad Khan. First prize winner of Combat 2008 (Software Competition – PAF Kiet) and Software Exhibition (Software Competition – SZABIST). Great work guys!
Urdu OCR is developed for first time. It has not been developed yet. The need of this product is in the printing media like Urdu news paper and magazines. It is useful in converting the books of Urdu in digital format, the large amount of useful and heritage data in Urdu language which are in vanishing form can be saved in digital format. It can produce electronic books and digital Urdu library online.