Question about dataset for GCTA