doi dblp Exploiting content redundancy for web information extraction Pankaj Gulhane | Rajeev Rastogi | Srinivasan H. Sengamedu | Ashwin Tengli Proceedings of the 19th International Conference on World Wide Web, WWW 2010, Raleigh, North Carolina, USA, April 26-30, 2010