Tweaking difficulty curves, colors of interface elements, and so on, and then using statistics to see whether each change had a statistically significant effect on the metrics you care about, such as the number of daily users, revenue per daily user, or total time logged.
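
To make the "using statistics" part concrete, here is a rough sketch of one common way to check a single experiment: a two-proportion z-test comparing how many players in a control group and a variant group did something desirable, for example came back the next day. The function name, the choice of metric, and all the numbers are made up for illustration; nothing here is taken from a real game or a specific analytics tool.

```python
import math

def two_proportion_z_test(successes_a, n_a, successes_b, n_b):
    """Return (z, two-sided p-value) for the difference between two proportions.

    Hypothetical helper for illustration; group A is the control, group B the variant.
    """
    p_a = successes_a / n_a
    p_b = successes_b / n_b
    # Pooled proportion under the null hypothesis that the change had no effect.
    pooled = (successes_a + successes_b) / (n_a + n_b)
    std_err = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_b - p_a) / std_err
    # Two-sided p-value from the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value

# Made-up numbers: players who returned the next day, out of players shown each version.
z, p = two_proportion_z_test(successes_a=400, n_a=2000, successes_b=460, n_b=2000)
print(f"z = {z:.2f}, p = {p:.4f}")
```

A small p-value (conventionally below 0.05) suggests the difference is unlikely to be noise alone. This sketch assumes large, randomly assigned groups and a yes/no metric; it would divide by zero if nobody or everybody in both groups converted, and a continuous metric like time logged or revenue per user would call for a different test.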