Content extraction using diverse feature sets. Matthew E. Peters, Dan Lecocq. 22nd International World Wide Web Conference, WWW '13, Rio de Janeiro, Brazil, May 13-17, 2013, Companion Volume.