N. Ding and J. Z. Simon, Emergence of neural encoding of auditory objects while listening to competing speakers, Proceedings of the National Academy of Sciences, vol.109, issue.29, pp.11854-11859, 2012.

S. Akram, J. Z. Simon, S. A. Shamma, and B. Babadi, A state-space model for decoding auditory attentional modulation from MEG in a competing-speaker environment, Advances in Neural Information Processing Systems, pp.460-468, 2014.

C. Brodbeck, A. Presacco, and J. Z. Simon, Neural source dynamics of brain responses to continuous stimuli: speech processing from acoustics to comprehension, NeuroImage, vol.172, pp.162-174, 2018.

N. Mesgarani, S. V. David, J. B. Fritz, and S. A. Shamma, Influence of context and behavior on stimulus reconstruction from neural activity in primary auditory cortex, Journal of neurophysiology, 2009.

B. N. Pasley, S. V. David, N. Mesgarani, A. Flinker, S. A. Shamma et al., Reconstructing speech from human auditory cortex, PLoS biology, vol.10, issue.1, p.1001251, 2012.

N. Mesgarani and E. F. Chang, Selective cortical representation of attended speaker in multi-talker speech perception, Nature, vol.485, issue.7397, p.233, 2012.

J. A. O'Sullivan, A. J. Power, N. Mesgarani, S. Rajaram, J. J. Foxe et al., Attentional selection in a cocktail party environment can be decoded from single-trial EEG, Cerebral Cortex, vol.25, issue.7, pp.1697-1706, 2014.

M. J. Crosse, G. M. Di Liberto, A. Bednar, and E. C. Lalor, The multivariate temporal response function (mTRF) toolbox: a MATLAB toolbox for relating neural signals to continuous stimuli, Frontiers in human neuroscience, vol.10, p.604, 2016.

K. L. Hyde, I. Peretz, and R. J. Zatorre, Evidence for the role of the right auditory cortex in fine pitch resolution, Neuropsychologia, vol.46, issue.2, pp.632-639, 2008.

S. Kumar, W. Sedley, K. V. Nourski, H. Kawasaki, H. Oya et al., Predictive coding and pitch processing in the auditory cortex, Journal of Cognitive Neuroscience, vol.23, issue.10, pp.3084-3094, 2011.

Y. Nan and A. D. Friederici, Differential roles of right temporal cortex and Broca's area in pitch processing: evidence from music and Mandarin, Human brain mapping, vol.34, issue.9, pp.2045-2054, 2013.

C. J. Plack, D. Barker, and D. A. Hall, Pitch coding and pitch processing in the human brain, Hearing Research, vol.307, pp.53-64, 2014.

A. Caclin, M. Giard, B. K. Smith, and S. McAdams, Interactive processing of timbre dimensions: A Garner interference study, Brain research, vol.1138, pp.159-170, 2007.

S. Deike, B. Gaschler-Markefski, A. Brechmann, and H. Scheich, Auditory stream segregation relying on timbre involves left auditory cortex, Neuroreport, vol.15, issue.9, pp.1511-1514, 2004.

K. N. Goydke, E. Altenmüller, J. Möller, and T. F. Münte, Changes in emotional tone and instrumental timbre are reflected by the mismatch negativity, Cognitive Brain Research, vol.21, issue.3, pp.351-359, 2004.

I. Sturm, Analyzing the perception of natural music with EEG and ECoG, 2016.

F. Cong, A. H. Phan, Q. Zhao, A. K. Nandi, V. Alluri et al., Analysis of ongoing EEG elicited by natural music stimuli using nonnegative tensor factorization, EUSIPCO, pp.494-498, 2012.

I. Sturm, M. Treder, D. Miklody, H. Purwins, S. Dähne et al., Extracting the neural representation of tone onsets for separate voices of ensemble music using multivariate EEG analysis, vol.25, p.366, 2015.

S. Stober, T. Prätzlich, and M. Müller, Brain beats: Tempo extraction from EEG data, ISMIR, pp.276-282, 2016.

M. H. Thaut, Rhythm, human temporality, and brain function, pp.171-191, 2005.

A. Ofner and S. Stober, Shared generative representation of auditory concepts and EEG to reconstruct perceived and imagined music, ISMIR, pp.392-399, 2018.

I. Sturm, S. Dähne, B. Blankertz, and G. Curio, Multi-variate EEG analysis as a novel tool to examine brain responses to naturalistic music stimuli, PLoS ONE, vol.10, issue.10, p.141281, 2015.

R. S. Schaefer, P. Desain, and J. Farquhar, Shared processing of perception and imagery of music in decomposed EEG, Neuroimage, vol.70, pp.317-326, 2013.

M. S. Treder, H. Purwins, D. Miklody, I. Sturm, and B. Blankertz, Decoding auditory attention to instruments in polyphonic music using single-trial EEG classification, Journal of neural engineering, vol.11, issue.2, p.26009, 2014.

B. Kaneshiro, D. T. Nguyen, J. P. Dmochowski, A. M. Norcia, and J. Berger, Naturalistic music EEG dataset - Hindi (NMED-H), 2016.

S. Losorelli, D. T. Nguyen, J. P. Dmochowski, and B. Kaneshiro, NMED-T: A tempo-focused dataset of cortical and behavioral responses to naturalistic music, 2017.

S. Stober, A. Sternin, A. M. Owen, and J. A. Grahn, Towards music imagery information retrieval: Introducing the OpenMIIR dataset of EEG recordings from music perception and imagination, ISMIR, pp.763-769, 2015.

H. Akbari, B. Khalighinejad, J. L. Herrero, A. D. Mehta, and N. Mesgarani, Towards reconstructing intelligible speech from the human auditory cortex, Scientific reports, vol.9, issue.1, p.874, 2019.

B. Blankertz, S. Lemm, M. Treder, S. Haufe, and K. Müller, Single-trial analysis and classification of ERP components - a tutorial, NeuroImage, vol.56, issue.2, pp.814-825, 2011.

E. W. Noreen, Computer-intensive methods for testing hypotheses, 1989.

A. Yeh, More accurate tests for the statistical significance of result differences, Proceedings of the 18th conference on Computational linguistics, vol.2, pp.947-953, 2000.