doi dblpExploiting content redundancy for web information extractionPankaj Gulhane | Rajeev Rastogi | Srinivasan H. Sengamedu | Ashwin TengliProceedings of the 19th International Conference on World Wide Web, WWW 2010, Raleigh, North Carolina, USA, April 26-30, 2010