Spatial audio has been demonstrated to enhance performance in a variety of listening tasks. The utility of visually reinforcing spatialized audio with depictions of voice locations in collaborative applications, however, has been questioned. In this experiment, we compared the accuracy, response time, confidence in task performance, and subjective mental workload of 18 participants in a voice-identification task under three different display conditions: 1) traditional mono audio; 2) spatial audio; 3) spatial audio with a visual representation of voice locations. Each format was investigated using four and eight unique stimuli voices. Results showed greater voice-identification accuracy for the spatial-plus-visual format than for the spatialand mono-only formats, and that visualization benefits increased with voice number. Spatialization was also found to increase confidence in task performance. Response time and mental workload remained unchanged across display conditions. These results indicate visualizations may benefit users of large, unfamiliar audio spaces.