This is a tool that uses dimensionality reduction to sort files according to the similarities and dissimilarities of their contents.
N - No of samples D - Dimension of input data dims - Dimension of output data X - memory space of size ND Y - memory space of size Ndims perplexity - shanon entropy
./fsort /path