D. Bahdanau, K. Cho, and Y. Bengio, Neural machine translation by jointly learning to align and translate, 2014.

Band-in-a-Box (BIAB) file archive

G. Brunner, A. Konrad, Y. Wang, and R. Wattenhofer, MIDI-VAE: Modeling dynamics and instrumentation of music with applications to style transfer, ISMIR, 2018.

G. Brunner, Y. Wang, R. Wattenhofer, and S. Zhao, Symbolic music genre transfer with CycleGAN, 2018.

K. Cho, B. van Merriënboer, Ç. Gülçehre, D. Bahdanau, F. Bougares et al., Learning phrase representations using RNN encoder-decoder for statistical machine translation, EMNLP, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01433235

O. Cífka, Supplementary material: Supervised symbolic music style translation using synthetic data. Zenodo, 2019.

S. Dai, Z. Zhang, and G. Xia, Music style transfer: A position paper, 2018.

J. Driedger, T. Prätzlich, and M. Müller, Let it Bee - towards NMF-inspired audio mosaicing, ISMIR, 2015.

D. Eck and J. Schmidhuber, Finding temporal structure in music: blues improvisation with LSTM recurrent networks, NNSP, 2002.

A. A. Efros and W. T. Freeman, Image quilting for texture synthesis and transfer, SIGGRAPH, 2001.

S. Flossmann and G. Widmer, Toward a multilevel model of expressive piano performance, ISPS, 2011.

L. A. Gatys, A. S. Ecker, and M. Bethge, Image style transfer using convolutional neural networks, CVPR, pp.2414-2423, 2016.

E. Grinstein, N. Q. K. Duong, A. Ozerov, and P. Pérez, Audio style transfer, ICASSP, pp.586-590, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01626389

G. Hadjeres and F. Pachet, DeepBach: a steerable model for Bach chorales generation, ICML, 2017.

G. Hadjeres, J. Sakellariou, and F. Pachet, Style imitation and chord invention in polyphonic music with exponential families, 2016.

P. Isola, J. Zhu, T. Zhou, and A. A. Efros, Image-to-image translation with conditional adversarial networks, CVPR, pp.5967-5976, 2017.

D. P. Kingma and J. Ba, Adam: A method for stochastic optimization, CoRR, abs/1412.6980, 2015.

D. P. Kingma and M. Welling, Auto-encoding variational Bayes, CoRR, abs/1312.6114, 2014.

G. Lample, A. Conneau, L. Denoyer, and M. Ranzato, Unsupervised machine translation using monolingual corpora only, 2017.

K. Lee and M. Slaney, Acoustic chord transcription and key extraction from audio using key-dependent HMMs trained on synthesized audio, IEEE Transactions on Audio, Speech, and Language Processing, vol.16, pp.291-301, 2008.

M. Liu, T. Breuel, and J. Kautz, Unsupervised image-to-image translation networks, NIPS, 2017.

W. Lu and L. Su, Transferring the style of homophonic music using recurrent neural networks and autoregressive models, ISMIR, 2018.

I. Malik and C. H. Ek, Neural translation of musical style, 2017.

M. Mauch and S. Dixon, pYIN: A fundamental frequency estimator using probabilistic threshold distributions, ICASSP, pp.659-663, 2014.

C. McKay, Automatic genre classification of MIDI recordings, 2004.

C. McKay and I. Fujinaga, The Bodhidharma system and the results of the MIREX 2005 symbolic genre classification contest, ISMIR, 2005.

N. Mor, L. Wolf, A. Polyak, and Y. Taigman, A universal music translation network, 2018.

Proceedings of the 20th ISMIR Conference, 2019.

F. Pachet and P. Roy, Non-conformant harmonization: the Real Book in the style of Take 6, ICCC, 2014.

G. Ros, L. Sellart, J. Materzynska, D. Vázquez, and A. M. López, The SYNTHIA dataset: A large collection of synthetic images for semantic segmentation of urban scenes, CVPR, pp.3234-3243, 2016.

J. Sakellariou, F. Tria, V. Loreto, and F. Pachet, Maximum entropy models capture melodic styles, Scientific Reports, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01585581

J. Salamon, R. M. Bittner, J. Bonada, J. J. Bosch, E. Gómez et al., An analysis/synthesis framework for automatic F0 annotation of multitrack datasets, 2017.

I. Simon and S. Oore, Performance RNN: Generating music with expressive timing and dynamics. Magenta Blog, 2017.

B. L. Sturm, J. F. Santos, O. Ben-Tal, and I. Korshunova, Music transcription modelling and composition using deep learning, 2016.

C. J. Tralie, Cover song synthesis by analogy, ISMIR, 2018.

G. Varol, J. Romero, X. Martin, N. Mahmood, M. J. Black et al., Learning from synthetic humans, CVPR, pp.4627-4635, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01505711

G. Widmer, S. Flossmann, and M. Grachten, YQX plays Chopin. AI Magazine, vol.30, pp.35-48, 2009.

X. Xie, F. Tian, and H. S. Seah, Feature guided texture synthesis (FGTS) for artistic style transfer, DIMEA, 2007.

J. Zhao, Y. Kim, K. Zhang, A. M. Rush, and Y. LeCun, Adversarially regularized autoencoders, ICML, 2018.

J. Zhu, T. Park, P. Isola, and A. A. Efros, Unpaired image-to-image translation using cycle-consistent adversarial networks, ICCV, pp.2242-2251, 2017.

A. Zils and F. Pachet, Musical mosaicing, COST G-6 Conference on Digital Audio Effects (DAFX-01), 2001.
