Method and system of similarity-based deduplicationпатент