“Computing similarity is one of the main tools of data science.” Foster Provost & Tom Fawcett ( 2014 )