Main » SIGIR » 2008 » Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2008, Singapore, July 20-24, 2008 »

SpotSigs: robust and efficient near duplicate detection in large web collections

Martin Theobald, Jonathan Siddharth, Andreas Paepcke