Topic based language models for OCR correction
Anurag Bhardwaj | Faisal Farooq | Huaigu Cao | Venu Govindaraju
Proceedings of the Second Workshop on Analytics for Noisy Unstructured Text Data, AND 2008, Singapore, July 24, 2008