What I mean is: Instead of training the K-Means with all the data maybe there is a method to find just the important vectors (those vectors who affect most the clusters) and use these important vectors(from training data) to traing the algorithm.I hope you understood me.