SHINGLING + MINHASH: BASIC NEAR DUPLICATE DOCUMENT DETECTION

Introduction

Picture it…New York City 2014, two documents walk into a bar. We are tasked with determining whether the documents are duplicates of each other or just near duplicates. How would we do this if we weren't allowed