Proceedings of the 16th International Conference on World Wide Web, WWW 2007, Banff, Alberta, Canada, May 8-12, 2007

Detecting near-duplicates for web crawling

Gurmeet Singh Manku, Arvind Jain, Anish Das Sarma