If you need a very large data set to test your methods, our data set of 133,885 species is a good choice. You can download it from figshare.