August 2, 2010 – HCL Infosystems will collaborate with the Government of India to digitize the data generated by the Census of India project for 2010-11. The project involves the epic task of digitizing data collected across the country, including maintenance, indexing, scanning, and storage at a central repository.

DQ Channels brought this news to our attention in their article, “HCL Infosystems bags an order from Census of India”. The project will involve processing all captured data, including images, and will also require extracting information in English. Specialized software and other tools such as Intelligent Character Recognition (ICR) will be deployed for extraction and cross-verification.

That is a huge task ahead of them. Let’s hope they have solid taxonomy software that can be used across all their systems and supports many languages, as Access Innovations’ software does.

Melody K. Smith

Sponsored by Data Harmony, a unit of Access Innovations, the world leader in indexing and making content findable.