The metric option allows you to plug in a custom similarity metric.
Set it to NULL to use the default internal metric.
The default metric is based on sampling hashes of ranges of data in
the file, which is a pretty good similarity approximation that should
work fairly well for both text and binary data while still being
pretty fast with a fixed memory overhead.
The metric option allows you to plug in a custom similarity metric.
Set it to NULL to use the default internal metric.
The default metric is based on sampling hashes of ranges of data in the file, which is a pretty good similarity approximation that should work fairly well for both text and binary data while still being pretty fast with a fixed memory overhead.