Emotional sounds of crowds: spectrogram-based analysis using deep learning