Y. Aksoy, T. Oh, S. Paris, M. Pollefeys, and W. Matusik, Semantic soft segmentation, Proc. SIGGRAPH), vol.37, 2018.

X. Bai, J. Wang, D. Simons, and G. Sapiro, Video snapcut: robust video object cutout using localized classifiers, In ACM Transactions on Graphics, vol.28, p.70, 2009.

H. Bay, A. Ess, T. Tuytelaars, and L. Van-gool, Computer Vision and Image Understanding, vol.110, pp.346-359, 2008.

M. Bertalmio, G. Sapiro, V. Caselles, and C. Ballester, Image inpainting, Proceedings of the 27th Annual Conference on Computer Graphics and Interactive Techniques, SIGGRAPH '00, pp.417-424, 2000.
URL : https://hal.archives-ouvertes.fr/hal-00522652

A. Bokov and D. Vatolin, 100+ times faster video completion by optical-flow-guided variational refinement, 25th IEEE International Conference on Image Processing (ICIP), pp.2122-2126, 2018.

N. Bonneel, K. Sunkavalli, J. Tompkin, D. Sun, S. Paris et al., Interactive intrinsic video editing, ACM Transactions on Graphics (TOG), vol.33, issue.6, p.197, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01264124

S. Caelles, Y. Chen, J. Pont-tuset, and L. Van-gool, Semantically-guided video object segmentation, 2017.

S. Caelles, K. Maninis, J. Pont-tuset, L. Leal-taixé, D. Cremers et al., One-shot video object segmentation, Computer Vision and Pattern Recognition (CVPR, 2017.

S. Caelles, A. Montes, K. Maninis, Y. Chen, L. Van-gool et al., The 2018 davis challenge on video object segmentation, 2018.

L. Chen, G. Papandreou, I. Kokkinos, K. Murphy, and A. L. Yuille, Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs, 2016.

L. Chen, Y. Zhu, G. Papandreou, F. Schroff, and H. Adam, Encoder-decoder with atrous separable convolution for semantic image segmentation, 2018.

Y. Chen, J. Pont-tuset, A. Montes, and L. Van-gool, Blazingly fast video object segmentation with pixelwise metric learning, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.1189-1198, 2018.

J. Cheng, Y. Tsai, W. Hung, S. Wang, and M. Yang, Fast and accurate online video object segmentation via tracking parts, 2018.

W. Chiu and M. Fritz, Multi-class video cosegmentation with a generative multi-video model, Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on, pp.321-328, 2013.

S. Choi, T. Kim, and W. Yu, Robust video stabilization to outlier motion using adaptive ransac, IEEE/RSJ International Conference on, pp.1897-1902, 2009.

Y. Chuang, A. Agarwala, B. Curless, D. H. Salesin, and R. Szeliski, Video matting of complex scenes, ACM Transactions on Graphics (ToG), vol.21, issue.3, pp.243-248, 2002.

A. Colombari, A. Fusiello, and V. Murino, Segmentation and tracking of multiple video objects, Pattern Recognition, vol.40, issue.4, pp.1307-1317, 2007.

A. Criminisi, P. Pérez, and K. Toyama, Region filling and object removal by exemplar-based image inpainting, IEEE Transactions on image processing, vol.13, issue.9, pp.1200-1212, 2004.

Z. Cui, O. Wang, P. Tan, and J. Wang, Time slice video synthesis by robust video alignment, ACM Transactions on Graphics (TOG), vol.36, issue.4, p.131, 2017.

J. Dai, K. He, and J. Sun, Instance-Aware Semantic Segmentation via Multi-task Network Cascades, (CVPR) IEEE Conference on Computer Vision and Pattern Recognition, pp.3150-3158, 2016.

A. Dehghan, S. M. Assari, and M. Shah, Gmmcp tracker: Globally optimal generalized maximum multi clique problem for multiple object tracking, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.4091-4099, 2015.

B. Drayer and T. Brox, Object detection, tracking, and motion segmentation for object-level video segmentation, 2016.

I. Drori, D. Cohen-or, and H. Yeshurun, Fragmentbased image completion, In ACM Transactions on graphics, vol.22, pp.303-312, 2003.

A. A. Efros and T. K. Leung, Texture synthesis by non-parametric sampling, The Proceedings of the Seventh IEEE International Conference on, vol.2, pp.1033-1038, 1999.

M. Everingham, S. A. Eslami, L. Van-gool, C. K. Williams, J. Winn et al., The pascal visual object classes challenge: A retrospective, International journal of computer vision, vol.111, issue.1, pp.98-136, 2015.

M. Granados, K. I. Kim, J. Tompkin, J. Kautz, and C. Theobalt, Background inpainting for videos with dynamic objects and a free-moving camera, European Conference on Computer Vision, pp.682-695, 2012.

M. Granados, J. Tompkin, K. Kim, O. Grau, J. Kautz et al., How not to be seenobject removal from videos of crowded scenes, Computer Graphics Forum, vol.31, pp.219-228, 2012.

H. Grossauer, Inpainting of movies using optical flow, Mathematical Models for Registration and Applications to Medical Imaging, pp.151-162

. Springer, , 2006.

K. He, G. Gkioxari, P. Dollár, and R. Girshick, Mask r-cnn, 2017 IEEE International Conference on, pp.2980-2988, 2017.

J. Herling and W. Broll, Pixmix: A real-time approach to high-quality diminished reality, 2012 IEEE International Symposium on, pp.141-150, 2012.

Y. Hu, J. Huang, and A. Schwing, Maskrnn: Instance level video object segmentation, Advances in Neural Information Processing Systems, pp.324-333, 2017.

J. Huang, S. B. Kang, N. Ahuja, and J. Kopf, Temporally coherent completion of dynamic video, ACM Transactions on Graphics (TOG), vol.35, issue.6, p.196, 2016.

S. Iizuka, E. Simo-serra, and H. Ishikawa, Globally and locally consistent image completion, ACM Transactions on Graphics (TOG), vol.36, issue.4, p.107, 2017.

E. Ilg, N. Mayer, T. Saikia, M. Keuper, A. Dosovitskiy et al., Flownet 2.0: Evolution of optical flow estimation with deep networks, 2017 IEEE conference on computer vision and pattern recognition (CVPR), pp.1647-1655, 2017.

S. D. Jain and K. Grauman, Click carving: Segmenting objects in video with point clicks, 2016.

J. Jia, Y. Tai, T. Wu, and C. Tang, Video repairing under variable illumination using cyclic motions, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.28, issue.5, pp.832-839, 2006.

A. Khoreva, R. Benenson, E. Ilg, T. Brox, and B. Schiele, Lucid data dreaming for object tracking, 2017.

S. Korman and S. Avidan, Coherency sensitive hashing, Computer Vision (ICCV), 2011 IEEE International Conference on, pp.1607-1614, 2011.

T. Le, A. Almansa, Y. Gousseau, and S. Masnou, Motion-consistent video inpainting, ICIP 2017: IEEE International Conference on Image Processing, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01492536

T. T. Le, A. Almansa, Y. Gousseau, and S. Masnou, Removing objects from videos with a few strokes, SIGGRAPH Asia, p.22, 2018.
URL : https://hal.archives-ouvertes.fr/hal-02099051

M. Leake, A. Davis, A. Truong, and M. Agrawala, Computational video editing for dialogue-driven scenes, ACM Transactions on Graphics (TOG), vol.36, issue.130, 2017.

Y. J. Lee, J. Kim, and K. Grauman, Key-segments for video object segmentation, Computer Vision (ICCV), 2011 IEEE International Conference on, pp.1995-2002, 2011.

A. Levin, D. Lischinski, and Y. Weiss, A closedform solution to natural image matting, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.30, issue.2, pp.228-242, 2008.

E. Levinkov, J. Tompkin, N. Bonneel, S. Kirchhoff, B. Andres et al., Interactive multicut video segmentation, Pacific Graphics, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01378800

F. Li, T. Kim, A. Humayun, D. Tsai, and J. M. Rehg, Video segmentation by tracking many figure-ground segments, Proceedings of the IEEE International Conference on Computer Vision, pp.2192-2199, 2013.

X. Li, Y. Qi, Z. Wang, K. Chen, Z. Liu et al., Video object segmentation with re-identification, The 2017 DAVIS Challenge on Video Object Segmentation-CVPR Workshops, 2017.

J. Long, E. Shelhamer, and T. Darrell, Fully convolutional networks for semantic segmentation, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.3431-3440, 2015.

J. Luiten, P. Voigtlaender, and B. Leibe, Premvos: Proposal-generation, refinement and merging for video object segmentation, 2018.

N. Märki, F. Perazzi, O. Wang, and A. Sorkine-hornung, Bilateral space video segmentation, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.743-751, 2016.

S. Masnou and J. Morel, Level lines based disocclusion, ICIP 98. Proceedings. 1998 International Conference on, pp.259-263, 1998.

Y. Matsushita, E. Ofek, W. Ge, X. Tang, and H. Shum, Full-frame video stabilization with motion inpainting, IEEE Transactions on pattern analysis and Machine Intelligence, vol.28, issue.7, pp.1150-1163, 2006.

F. Meyer, Topographic distance and watershed lines, Signal Processing, vol.38, issue.1, pp.113-125, 1994.

N. S. Nagaraja, F. R. Schmidt, and T. Brox, Video segmentation with just a few strokes, ICCV, pp.3235-3243, 2015.

A. Newson, A. Almansa, M. Fradet, Y. Gousseau, and P. Pérez, Video inpainting of complex scenes, SIAM Journal on Imaging Sciences, vol.7, issue.4, pp.1993-2019, 2014.
URL : https://hal.archives-ouvertes.fr/hal-00937795

A. Newson, A. Almansa, Y. Gousseau, and P. Pérez, Non-local patch-based image inpainting, Image Processing On Line, vol.7, pp.373-385, 2017.

J. Odobez and P. Bouthemy, Robust Multiresolution Estimation of Parametric Motion Models, Journal of Visual Communication and Image Representation, vol.6, issue.4, pp.348-365, 1995.

S. W. Oh, J. Lee, K. Sunkavalli, and S. J. Kim, Fast video object segmentation by reference-guided mask propagation, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp.7376-7385, 2018.

A. Papazoglou and V. Ferrari, Fast object segmentation in unconstrained video, Proceedings of the IEEE International Conference on Computer Vision, pp.1777-1784, 2013.

D. Pathak, P. Krahenbuhl, J. Donahue, T. Darrell, and A. A. Efros, Context encoders: Feature learning by inpainting, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.2536-2544, 2016.

K. A. Patwardhan, G. Sapiro, and M. Bertalmio, Video inpainting of occluding and occluded objects, Image Processing, 2005. ICIP 2005. IEEE International Conference on, vol.2, p.69, 2005.

K. A. Patwardhan, G. Sapiro, and M. Bertalmío, Video inpainting under constrained camera motion, IEEE Transactions on Image Processing, vol.16, issue.2, pp.545-553, 2007.

F. Perazzi, A. Khoreva, R. Benenson, B. Schiele, and A. Sorkine-hornung, Learning video object segmentation from static images, Computer Vision and Pattern Recognition, 2017.

F. Perazzi, J. Pont-tuset, B. Mcwilliams, L. Van-gool, M. Gross et al., A benchmark dataset and evaluation methodology for video object segmentation, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.724-732, 2016.

F. Perazzi, J. Pont-tuset, B. Mcwilliams, L. Van-gool, M. Gross et al., A benchmark dataset and evaluation methodology for video object segmentation, Computer Vision and Pattern Recognition, 2016.

P. Pérez, M. Gangnet, and A. Blake, Poisson image editing, ACM Transactions on graphics (TOG), vol.22, issue.3, pp.313-318, 2003.

J. Pont-tuset, F. Perazzi, S. Caelles, P. Arbeláez, A. Sorkine-hornung et al., The 2017 davis challenge on video object segmentation, 2017.

S. A. Ramakanth and R. V. Babu, Featurematch: A general annf estimation technique and its applications, IEEE Transactions on Image Processing, vol.23, issue.5, pp.2193-2205, 2014.

S. A. Ramakanth and R. V. Babu, Seamseg: Video object segmentation using patch seams, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.376-383, 2014.

A. Zamir, A. Dehghan, and M. Shah, Gmcptracker: Global multi-object tracking using generalized minimum clique graphs, Computer Vision-ECCV 2012, pp.343-356, 2012.

J. Sánchez, Comparison of Motion Smoothing Strategies for Video Stabilization using Parametric Models, Image Processing On Line, vol.7, pp.309-346, 2017.

G. Seguin, P. Bojanowski, R. Lajugie, and I. Laptev, Instance-level video segmentation from object tracks, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.3678-3687, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01255765

T. Shiratori, Y. Matsushita, X. Tang, and S. B. Kang, Video completion by motion field transfer, Computer Vision and Pattern Recognition, vol.1, pp.411-418, 2006.

N. C. Tang, C. Hsu, C. Su, T. K. Shih, and H. M. Liao, Video inpainting on digitized vintage films via maintaining spatiotemporal continuity, IEEE Trans. Multimedia, vol.13, issue.4, pp.602-614, 2011.

P. Tokmakov, K. Alahari, and C. Schmid, Learning motion patterns in videos, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01427480

Y. Tsai, M. Yang, and M. J. Black, Video segmentation via object flow, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.3899-3908, 2016.

H. V. Vo, N. Q. Duong, and P. Pérez, Structural inpainting, Proceedings of the 26th ACM International Conference on Multimedia, MM '18, pp.1948-1956, 2018.

P. Voigtlaender and B. Leibe, Online adaptation of convolutional neural networks for the 2017 davis challenge on video object segmentation, 2017.

S. Wang, H. Lu, F. Yang, and M. Yang, Superpixel tracking, Computer Vision (ICCV), 2011 IEEE International Conference on, pp.1323-1330, 2011.

Y. Wexler, E. Shechtman, and M. Irani, Space-time completion of video, IEEE Transactions, vol.29, issue.3, 2007.

S. Xie and Z. Tu, Holistically-nested edge detection, International Journal of Computer Vision, pp.1-16, 2017.

B. Xu, S. Pathak, H. Fujii, A. Yamashita, and T. Le,

H. Asama, Spatio-temporal video completion in spherical image sequences, IEEE Robotics and Automation Letters, vol.2, issue.4, pp.2032-2039, 2017.

N. Xu, B. Price, S. Cohen, J. Yang, and T. Huang, Deep grabcut for object selection, 2017.

N. Xu, L. Yang, Y. Fan, D. Yue, Y. Liang et al., Youtube-vos: A large-scale video object segmentation benchmark, 2018.

H. Yang, L. Shao, F. Zheng, L. Wang, and Z. Song, Recent advances and trends in visual tracking: A review, Neurocomputing, vol.74, issue.18, pp.3823-3831, 2011.

L. Yang, Y. Wang, X. Xiong, J. Yang, and A. K. Katsaggelos, Efficient video object segmentation via network modulation. algorithms, vol.29, p.15, 2018.

M. Y. Yang, M. Reso, J. Tang, W. Liao, and B. Rosenhahn, Temporally object-based video cosegmentation, International Symposium on Visual Computing, pp.198-209, 2015.

Y. Yang, G. Sundaramoorthi, and S. Soatto, Selfocclusions and disocclusions in causal video object segmentation, Proceedings of the IEEE International Conference on Computer Vision, pp.4408-4416, 2015.

S. You, R. T. Tan, R. Kawakami, and K. Ikeuchi, Robust and fast motion estimation for video completion, MVA, pp.181-184, 2013.

F. Zhang, X. Wu, H. Zhang, J. Wang, and S. Hu, Robust background identification for dynamic video editing, ACM Transactions on Graphics (TOG), vol.35, issue.6, p.197, 2016.