What happens when I upload a duplicate clause or duplicate document to Truffle Hunt?

As in other ClauseBuddy tools, Truffle Hunt takes duplicates into account.

In the Truffle Hunt result list, de-duplication works as follows:

  • All clauses are converted to lowercase (so that differences in capitalisation are ignored)

  • Punctuation is removed (so that differences in punctuation are ignored)

  • Stop words are removed (so that differences in words like "and", "the", "will", etc. are ignored)
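The three steps above can be sketched in Python as follows. This is only an illustration of the idea, not Truffle Hunt's actual implementation: the function name `normalise_clause` and the contents of `STOP_WORDS` are assumptions, and the real stop-word list is not documented here.

```python
import string

# Hypothetical stop-word list: only "and", "the" and "will" are named in this
# article, the remaining entries are assumptions added for illustration.
STOP_WORDS = {"and", "the", "will", "a", "an", "of", "or", "to"}


def normalise_clause(text: str) -> str:
    """Reduce a clause to a simplified form used only for comparison."""
    # Step 1: lowercase, so differences in capitalisation are ignored.
    lowered = text.lower()
    # Step 2: remove ASCII punctuation, so differences in punctuation are ignored.
    no_punct = lowered.translate(str.maketrans("", "", string.punctuation))
    # Step 3: drop stop words, so differences in such words are ignored.
    words = [word for word in no_punct.split() if word not in STOP_WORDS]
    return " ".join(words)


first = normalise_clause("The Supplier will deliver the Goods, and the Services.")
second = normalise_clause("supplier deliver goods; services")
print(first == second)  # the two clauses count as duplicates
```

Under these assumptions, two clauses are treated as identical when their normalised forms are equal, even if the original wording differs in capitalisation, punctuation or stop words.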

In practice, this means that if you upload the same document again, it still ends up in the basket. That should not be an issue: after all, Truffle Hunt is an inspiration module.

However, an identical clause will not be displayed twice in the results.

For example, suppose you upload the exact same document 8 times to a basket. A clause from that document will then appear only once in the results, not 8 times.
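The same scenario can be simulated with a short Python sketch. It is a simplified model under stated assumptions, not Truffle Hunt's real code: `clause_key`, `unique_clauses` and the sample clauses are hypothetical, the stop-word set contains only the words named in this article, and keeping the first occurrence of each clause is an assumption as well.

```python
import re

# Hypothetical stop-word set, limited to the examples named in this article.
STOP_WORDS = {"and", "the", "will"}


def clause_key(text):
    """Comparison key: lowercase, no punctuation, no stop words."""
    words = re.sub(r"[^\w\s]", "", text.lower()).split()
    return " ".join(word for word in words if word not in STOP_WORDS)


def unique_clauses(basket):
    """Return each distinct clause once, keeping its first occurrence."""
    seen = set()
    results = []
    for document in basket:
        for clause in document:
            key = clause_key(clause)
            if key not in seen:
                seen.add(key)
                results.append(clause)
    return results


# Made-up sample document with two clauses, uploaded 8 times.
document = [
    "The Buyer will pay within 30 days.",
    "This Agreement is governed by Belgian law.",
]
basket = [document] * 8

results = unique_clauses(basket)
print(len(basket), "documents in the basket")
print(len(results), "clauses in the results")
```

In this model the basket still holds all 8 copies, while the results contain each of the two clauses exactly once.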
