Face recognition is popular in the field of pattern recognition and image processing. However, traditional recognition technologies spend too long there are a lot of images to be recognized or trained for great accuracy in the recognition. Parallel computing is an effective way to improve the processing speed. With the improvement of GPU performance, its widely applied in computing-concentrated data operations. This paper presents a study of performance speedup achieved by applying GPU for face recognition based on PCA (Principal Component Analysis) algorithm. We successfully accelerated the testing phase by 6868-folds compared to a sequential C implementation when it has 100 test images and 2400 training images.