Tuesday, October 29, 2019, 1030–1300 Poster 1.1 (Hall B) | |||||
---|---|---|---|---|---|
Session Title/Poster Group | Poster # | Presentation Time | Title | Author(s) | Paper ID |
Deep Learning | 1 | 10:30 | FaceForensics++: Learning to Detect Manipulated Facial Images | Andreas Rössler, Davide Cozzolino, Luisa Verdoliva, Christian Riess, Justus Thies, Matthias Nießner | 4159 |
2 | 10:30 | DeepVCP: An End-to-End Deep Neural Network for Point Cloud Registration | Weixin Lu, Guowei Wan, Yao Zhou, Xiangyu Fu, Pengfei Yuan, Shiyu Song | 3735 | |
3 | 10:30 | Shape Reconstruction Using Differentiable Projections and Deep Priors | Matheus Gadelha, Rui Wang, Subhransu Maji | 3627 | |
4 | 10:30 | Fine-Grained Segmentation Networks: Self-Supervised Segmentation for Improved Long-Term Visual Localization | Måns Larsson, Erik Stenborg, Carl Toft, Lars Hammarstrand, Torsten Sattler, Fredrik Kahl | 4075 | |
5 | 10:30 | SANet: Scene Agnostic Network for Camera Localization | Luwei Yang, Ziqian Bai, Chengzhou Tang, Honghua Li, Yasutaka Furukawa, Ping Tan | 3140 | |
6 | 10:30 | Total Denoising: Unsupervised Learning of 3D Point Cloud Cleaning | Pedro Hermosilla, Tobias Ritschel, Timo Ropinski | 5698 | |
7 | 10:30 | Hierarchical Self-Attention Network for Action Localization in Videos | Rizard Renanda Adhi Pramono, Yie-Tarng Chen, Wen-Hsien Fang | 4600 | |
8 | 10:30 | Goal-Driven Sequential Data Abstraction | Umar Riaz Muhammad, Yongxin Yang, Timothy M. Hospedales, Tao Xiang, Yi-Zhe Song | 2421 | |
9 | 10:30 | Jointly Aligning Millions of Images With Deep Penalised Reconstruction Congealing | Roberto Annunziata, Christos Sagonas, Jacques Cali | 4672 | |
10 | 10:30 | Drop to Adapt: Learning Discriminative Features for Unsupervised Domain Adaptation | Seungmin Lee, Dongwan Kim, Namil Kim, Seong-Gyun Jeong | 3788 | |
11 | 10:30 | NLNL: Negative Learning for Noisy Labels | Youngdong Kim, Junho Yim, Juseung Yun, Junmo Kim | 3795 | |
12 | 10:30 | Adversarial Robustness vs. Model Compression, or Both? | Shaokai Ye, Kaidi Xu, Sijia Liu, Hao Cheng, Jan-Henrik Lambrechts, Huan Zhang, Aojun Zhou, Kaisheng Ma, Yanzhi Wang, Xue Lin | 4102 | |
13 | 10:30 | On the Design of Black-Box Adversarial Examples by Leveraging Gradient-Free Optimization and Operator Splitting Method | Pu Zhao, Sijia Liu, Pin-Yu Chen, Nghia Hoang, Kaidi Xu, Bhavya Kailkhura, Xue Lin | 6428 | |
14 | 10:30 | DewarpNet: Single-Image Document Unwarping With Stacked 3D and 2D Regression Networks | Sagnik Das, Ke Ma, Zhixin Shu, Dimitris Samaras, Roy Shilkrot | 3243 | |
15 | 10:30 | Learning Robust Facial Landmark Detection via Hierarchical Structured Ensemble | Xu Zou, Sheng Zhong, Luxin Yan, Xiangyun Zhao, Jiahuan Zhou, Ying Wu | 84 | |
16 | 10:30 | Remote Heart Rate Measurement From Highly Compressed Facial Videos: An End-to-End Deep Learning Solution With Video Enhancement | Zitong Yu, Wei Peng, Xiaobai Li, Xiaopeng Hong, Guoying Zhao | 456 | |
17 | 10:30 | Face-to-Parameter Translation for Game Character Auto-Creation | Tianyang Shi, Yi Yuan, Changjie Fan, Zhengxia Zou, Zhenwei Shi, Yong Liu | 2957 | |
18 | 10:30 | Visual Deprojection: Probabilistic Recovery of Collapsed Dimensions | Guha Balakrishnan, Adrian V. Dalca, Amy Zhao, John V. Guttag, Frédo Durand, William T. Freeman | 5128 | |
19 | 10:30 | StructureFlow: Image Inpainting via Structure-Aware Appearance Flow | Yurui Ren, Xiaoming Yu, Ruonan Zhang, Thomas H. Li, Shan Liu, Ge Li | 407 | |
20 | 10:30 | Learning Fixed Points in Generative Adversarial Networks: From Image-to-Image Translation to Disease Detection and Localization | Md Mahfuzur Rahman Siddiquee, Zongwei Zhou, Nima Tajbakhsh, Ruibin Feng, Michael B. Gotway, Yoshua Bengio, Jianming Liang | 184 | |
21 | 10:30 | Generative Adversarial Training for Weakly Supervised Cloud Matting | Zhengxia Zou, Wenyuan Li, Tianyang Shi, Zhenwei Shi, Jieping Ye | 5164 | |
22 | 10:30 | PAMTRI: Pose-Aware Multi-Task Learning for Vehicle Re-Identification Using Highly Randomized Synthetic Data | Zheng Tang, Milind Naphade, Stan Birchfield, Jonathan Tremblay, William Hodge, Ratnesh Kumar, Shuo Wang, Xiaodong Yang | 6414 | |
23 | 10:30 | Generative Adversarial Networks for Extreme Learned Image Compression | Eirikur Agustsson, Michael Tschannen, Fabian Mentzer, Radu Timofte, Luc Van Gool | 2939 | |
24 | 10:30 | Instance-Guided Context Rendering for Cross-Domain Person Re-Identification | Yanbei Chen, Xiatian Zhu, Shaogang Gong | 102 | |
25 | 10:30 | What Else Can Fool Deep Learning? Addressing Color Constancy Errors on Deep Neural Network Performance | Mahmoud Afifi, Michael S. Brown | 105 | |
26 | 10:30 | Beyond Cartesian Representations for Local Descriptors | Patrick Ebel, Anastasiia Mishchuk, Kwang Moo Yi, Pascal Fua, Eduard Trulls | 2277 | |
27 | 10:30 | Distilling Knowledge From a Deep Pose Regressor Network | Muhamad Risqi U. Saputra, Pedro P. B. de Gusmao, Yasin Almalioglu, Andrew Markham, Niki Trigoni | 5088 | |
28 | 10:30 | Instance-Level Future Motion Estimation in a Single Image Based on Ordinal Regression | Kyung-Rae Kim, Whan Choi, Yeong Jun Koh, Seong-Gyun Jeong, Chang-Su Kim | 6968 | |
29 | 10:30 | Vision-Infused Deep Audio Inpainting | Hang Zhou, Ziwei Liu, Xudong Xu, Ping Luo, Xiaogang Wang | 754 | |
30 | 10:30 | HAWQ: Hessian AWare Quantization of Neural Networks With Mixed-Precision | Zhen Dong, Zhewei Yao, Amir Gholami, Michael W. Mahoney, Kurt Keutzer | 6519 | |
31 | 10:30 | Evaluating Robustness of Deep Image Super-Resolution Against Adversarial Attacks | Jun-Ho Choi, Huan Zhang, Jun-Hyuk Kim, Cho-Jui Hsieh, Jong-Seok Lee | 6668 | |
32 | 10:30 | Overcoming Catastrophic Forgetting With Unlabeled Data in the Wild | Kibok Lee, Kimin Lee, Jinwoo Shin, Honglak Lee | 3510 | |
33 | 10:30 | Symmetric Cross Entropy for Robust Learning With Noisy Labels | Yisen Wang, Xingjun Ma, Zaiyi Chen, Yuan Luo, Jinfeng Yi, James Bailey | 2580 | |
34 | 10:30 | Few-Shot Learning With Embedded Class Models and Shot-Free Meta Training | Avinash Ravichandran, Rahul Bhotika, Stefano Soatto | 3571 | |
35 | 10:30 | Dual Directed Capsule Network for Very Low Resolution Image Recognition | Maneet Singh, Shruti Nagpal, Richa Singh, Mayank Vatsa | 1067 | |
36 | 10:30 | Recognizing Part Attributes With Insufficient Data | Xiangyun Zhao, Yi Yang, Feng Zhou, Xiao Tan, Yuchen Yuan, Yingze Bao, Ying Wu | 49 | |
37 | 10:30 | USIP: Unsupervised Stable Interest Point Detection From 3D Point Clouds | Jiaxin Li, Gim Hee Lee | 5548 | |
38 | 10:30 | Mixed High-Order Attention Network for Person Re-Identification | Binghui Chen, Weihong Deng, Jiani Hu | 2040 | |
39 | 10:30 | Budget-Aware Adapters for Multi-Domain Learning | Rodrigo Berriel, Stéphane Lathuillère, Moin Nabi, Tassilo Klein, Thiago Oliveira-Santos, Nicu Sebe, Elisa Ricci | 4541 | |
40 | 10:30 | Compact Trilinear Interaction for Visual Question Answering | Tuong Do, Thanh-Toan Do, Huy Tran, Erman Tjiputra, Quang D. Tran | 2177 | |
41 | 10:30 | Towards Latent Attribute Discovery From Triplet Similarities | Ishan Nigam, Pavel Tokmakov, Deva Ramanan | 940 | |
42 | 10:30 | GeoStyle: Discovering Fashion Trends and Events | Utkarsh Mall, Kevin Matzen, Bharath Hariharan, Noah Snavely, Kavita Bala | 5348 | |
43 | 10:30 | Towards Adversarially Robust Object Detection | Haichao Zhang, Jianyu Wang | 2155 | |
44 | 10:30 | Recover and Identify: A Generative Dual Model for Cross-Resolution Person Re-Identification | Yu-Jhe Li, Yun-Chun Chen, Yen-Yu Lin, Xiaofei Du, Yu-Chiang Frank Wang | 157 | |
Recognition | 45 | 10:30 | Automatic and Robust Skull Registration Based on Discrete Uniformization | Junli Zhao, Xin Qi, Chengfeng Wen, Na Lei, Xianfeng Gu | 6829 |
46 | 10:30 | Few-Shot Image Recognition With Knowledge Transfer | Zhimao Peng, Zechao Li, Junge Zhang, Yan Li, Guo-Jun Qi, Jinhui Tang | 965 | |
47 | 10:30 | Fine-Grained Action Retrieval Through Multiple Parts-of-Speech Embeddings | Michael Wray, Diane Larlus, Gabriela Csurka, Dima Damen | 3496 | |
48 | 10:30 | Vehicle Re-Identification in Aerial Imagery: Dataset and Approach | Peng Wang, Bingliang Jiao, Lu Yang, Yifei Yang, Shizhou Zhang, Wei Wei, Yanning Zhang | 2043 | |
49 | 10:30 | Bridging the Domain Gap for Ground-to-Aerial Image Matching | Krishna Regmi, Mubarak Shah | 3768 | |
50 | 10:30 | A Robust Learning Approach to Domain Adaptive Object Detection | Mehran Khodabandeh, Arash Vahdat, Mani Ranjbar, William G. Macready | 1356 | |
51 | 10:30 | Graph-Based Object Classification for Neuromorphic Vision Sensing | Yin Bi, Aaron Chadha, Alhabib Abbas, Eirina Bourtsoulatze, Yiannis Andreopoulos | 5892 | |
52 | 10:30 | Gaussian YOLOv3: An Accurate and Fast Object Detector Using Localization Uncertainty for Autonomous Driving | Jiwoong Choi, Dayoung Chun, Hyun Kim, Hyuk-Jae Lee | 4602 | |
53 | 10:30 | Sharpen Focus: Learning With Attention Separability and Consistency | Lezi Wang, Ziyan Wu, Srikrishna Karanam, Kuan-Chuan Peng, Rajat Vikram Singh, Bo Liu, Dimitris N. Metaxas | 1215 | |
54 | 10:30 | Learning Semantic-Specific Graph Representation for Multi-Label Image Recognition | Tianshui Chen, Muxin Xu, Xiaolu Hui, Hefeng Wu, Liang Lin | 4293 | |
55 | 10:30 | DeceptionNet: Network-Driven Domain Randomization | Sergey Zakharov, Wadim Kehl, Slobodan Ilic | 3378 | |
56 | 10:30 | Pose-Guided Feature Alignment for Occluded Person Re-Identification | Jiaxu Miao, Yu Wu, Ping Liu, Yuhang Ding, Yi Yang | 504 | |
57 | 10:30 | Robust Person Re-Identification by Modelling Feature Uncertainty | Tianyuan Yu, Da Li, Yongxin Yang, Timothy M. Hospedales, Tao Xiang | 1657 | |
58 | 10:30 | Co-Segmentation Inspired Attention Networks for Video-Based Person Re-Identification | Arulkumar Subramaniam, Athira Nambiar, Anurag Mittal | 3518 | |
59 | 10:30 | A Delay Metric for Video Object Detection: What Average Precision Fails to Tell | Huizi Mao, Xiaodong Yang, William J. Dally | 3572 | |
60 | 10:30 | IL2M: Class Incremental Learning With Dual Memory | Eden Belouadah, Adrian Popescu | 6021 | |
61 | 10:30 | Once a MAN: Towards Multi-Target Attack via Learning Multi-Target Adversarial Network Once | Jiangfan Han, Xiaoyi Dong, Ruimao Zhang, Dongdong Chen, Weiming Zhang, Nenghai Yu, Ping Luo, Xiaogang Wang | 899 | |
Segmentation, Grouping, & Shape | 62 | 10:30 | Asymmetric Non-Local Neural Networks for Semantic Segmentation | Zhen Zhu, Mengde Xu, Song Bai, Tengteng Huang, Xiang Bai | 673 |
63 | 10:30 | CCNet: Criss-Cross Attention for Semantic Segmentation | Zilong Huang, Xinggang Wang, Lichao Huang, Chang Huang, Yunchao Wei, Wenyu Liu | 1960 | |
64 | 10:30 | Convex Shape Prior for Multi-Object Segmentation Using a Single Level Set Function | Shousheng Luo, Xue-Cheng Tai, Limei Huo, Yang Wang, Roland Glowinski | 5935 | |
65 | 10:30 | Surface Networks via General Covers | Niv Haim, Nimrod Segol, Heli Ben-Hamu, Haggai Maron, Yaron Lipman | 4047 | |
66 | 10:30 | SSAP: Single-Shot Instance Segmentation With Affinity Pyramid | Naiyu Gao, Yanhu Shan, Yupei Wang, Xin Zhao, Yinan Yu, Ming Yang, Kaiqi Huang | 2385 | |
67 | 10:30 | Learning Propagation for Arbitrarily-Structured Data | Sifei Liu, Xueting Li, Varun Jampani, Shalini De Mello, Jan Kautz | 1882 | |
68 | 10:30 | MultiSeg: Semantically Meaningful, Scale-Diverse Segmentations From Minimal User Input | Jun Hao Liew, Scott Cohen, Brian Price, Long Mai, Sim-Heng Ong, Jiashi Feng | 3166 | |
69 | 10:30 | Robust Motion Segmentation From Pairwise Matches | Federica Arrigoni, Tomas Pajdla | 2127 | |
70 | 10:30 | InstaBoost: Boosting Instance Segmentation via Probability Map Guided Copy-Pasting | Hao-Shu Fang, Jianhua Sun, Runzhong Wang, Minghao Gou, Yong-Lu Li, Cewu Lu | 2259 | |
71 | 10:30 | Attention Bridging Network for Knowledge Transfer | Kunpeng Li, Yulun Zhang, Kai Li, Yuanyuan Li, Yun Fu | 2717 | |
Face & Body | 72 | 10:30 | Racial Faces in the Wild: Reducing Racial Bias by Information Maximization Adaptation Network | Mei Wang, Weihong Deng, Jiani Hu, Xunqiang Tao, Yaohai Huang | 2792 |
73 | 10:30 | Uncertainty Modeling of Contextual-Connections Between Tracklets for Unconstrained Video-Based Face Recognition | Jingxiao Zheng, Ruichi Yu, Jun-Cheng Chen, Boyu Lu, Carlos D. Castillo, Rama Chellappa | 1647 | |
74 | 10:30 | Spatio-Temporal Fusion Based Convolutional Sequence Learning for Lip Reading | Xingxuan Zhang, Feng Cheng, Shilin Wang | 1501 | |
75 | 10:30 | Occlusion-Aware Networks for 3D Human Pose Estimation in Video | Yu Cheng, Bo Yang, Bo Wang, Wending Yan, Robby T. Tan | 2284 | |
76 | 10:30 | Context-Aware Feature and Label Fusion for Facial Action Unit Intensity Estimation With Partially Labeled Data | Yong Zhang, Haiyong Jiang, Baoyuan Wu, Yanbo Fan, Qiang Ji | 1180 | |
77 | 10:30 | Distill Knowledge From NRSfM for Weakly Supervised 3D Pose Learning | Chaoyang Wang, Chen Kong, Simon Lucey | 6608 | |
78 | 10:30 | MONET: Multiview Semi-Supervised Keypoint Detection via Epipolar Divergence | Yuan Yao, Yasamin Jafarian, Hyun Soo Park | 4171 | |
79 | 10:30 | Talking With Hands 16.2M: A Large-Scale Dataset of Synchronized Body-Finger Motion and Audio for Conversational Motion Analysis and Synthesis | Gilwoo Lee, Zhiwei Deng, Shugao Ma, Takaaki Shiratori, Siddhartha S. Srinivasa, Yaser Sheikh | 1390 | |
80 | 10:30 | Occlusion Robust Face Recognition Based on Mask Learning With Pairwise Differential Siamese Network | Lingxue Song, Dihong Gong, Zhifeng Li, Changsong Liu, Wei Liu | 2865 | |
81 | 10:30 | Teacher Supervises Students How to Learn From Partially Labeled Images for Facial Landmark Detection | Xuanyi Dong, Yi Yang | 1020 | |
82 | 10:30 | A2J: Anchor-to-Joint Regression Network for 3D Articulated Pose Estimation From a Single Depth Image | Fu Xiong, Boshen Zhang, Yang Xiao, Zhiguo Cao, Taidong Yu, Joey Tianyi Zhou, Junsong Yuan | 3060 | |
83 | 10:30 | TexturePose: Supervising Human Mesh Estimation With Texture Consistency | Georgios Pavlakos, Nikos Kolotouros, Kostas Daniilidis | 6512 | |
84 | 10:30 | FreiHAND: A Dataset for Markerless Capture of Hand Pose and Shape From Single RGB Images | Christian Zimmermann, Duygu Ceylan, Jimei Yang, Bryan Russell, Max Argus, Thomas Brox | 5626 | |
85 | 10:30 | Markerless Outdoor Human Motion Capture Using Multiple Autonomous Micro Aerial Vehicles | Nitin Saini, Eric Price, Rahul Tallamraju, Raffi Enficiaud, Roman Ludwig, Igor Martinovic, Aamir Ahmad, Michael J. Black | 5116 | |
86 | 10:30 | Aggregation via Separation: Boosting Facial Landmark Detector With Semi-Supervised Style Translation | Shengju Qian, Keqiang Sun, Wayne Wu, Chen Qian, Jiaya Jia | 1870 | |
Action & Video | 87 | 10:30 | Toyota Smarthome: Real-World Activities of Daily Living | Srijan Das, Rui Dai, Michal Koperski, Luca Minciullo, Lorenzo Garattoni, Francois Bremond, Gianpiero Francesca | 1959 |
88 | 10:30 | Relation Parsing Neural Network for Human-Object Interaction Detection | Penghao Zhou, Mingmin Chi | 1592 | |
89 | 10:30 | DistInit: Learning Video Representations Without a Single Labeled Video | Rohit Girdhar, Du Tran, Lorenzo Torresani, Deva Ramanan | 1823 | |
90 | 10:30 | Zero-Shot Anticipation for Instructional Activities | Fadime Sener, Angela Yao | 902 | |
91 | 10:30 | Making the Invisible Visible: Action Recognition Through Walls and Occlusions | Tianhong Li, Lijie Fan, Mingmin Zhao, Yingcheng Liu, Dina Katabi | 3592 | |
92 | 10:30 | Recursive Visual Sound Separation Using Minus-Plus Net | Xudong Xu, Bo Dai, Dahua Lin | 3035 | |
Motion & Tracking | 93 | 10:30 | Unsupervised Video Interpolation Using Cycle Consistency | Fitsum A. Reda, Deqing Sun, Aysegul Dundar, Mohammad Shoeybi, Guilin Liu, Kevin J. Shih, Andrew Tao, Jan Kautz, Bryan Catanzaro | 3636 |
94 | 10:30 | Deformable Surface Tracking by Graph Matching | Tao Wang, Haibin Ling, Congyan Lang, Songhe Feng, Xiaohui Hou | 1113 | |
95 | 10:30 | Deep Meta Learning for Real-Time Target-Aware Visual Tracking | Janghoon Choi, Junseok Kwon, Kyoung Mu Lee | 4309 | |
96 | 10:30 | Looking to Relations for Future Trajectory Forecast | Chiho Choi, Behzad Dariush | 1342 | |
97 | 10:30 | Anchor Diffusion for Unsupervised Video Object Segmentation | Zhao Yang, Qiang Wang, Luca Bertinetto, Weiming Hu, Song Bai, Philip H. S. Torr | 1343 | |
98 | 10:30 | Tracking Without Bells and Whistles | Philipp Bergmann, Tim Meinhardt, Laura Leal-Taixé | 2441 | |
99 | 10:30 | Self-Supervised Moving Vehicle Tracking With Stereo Sound | Chuang Gan, Hang Zhao, Peihao Chen, David Cox, Antonio Torralba | 1083 | |
Scene Understanding | 100 | 10:30 | Perspective-Guided Convolution Networks for Crowd Counting | Zhaoyi Yan, Yuchen Yuan, Wangmeng Zuo, Xiao Tan, Yezhen Wang, Shilei Wen, Errui Ding | 1032 |
101 | 10:30 | End-to-End Wireframe Parsing | Yichao Zhou, Haozhi Qi, Yi Ma | 1586 | |
102 | 10:30 | Incremental Class Discovery for Semantic Segmentation With RGBD Sensing | Yoshikatsu Nakajima, Byeongkeun Kang, Hideo Saito, Kris Kitani | 4851 | |
103 | 10:30 | SSF-DAN: Separated Semantic Feature Based Domain Adaptation Network for Semantic Segmentation | Liang Du, Jingang Tan, Hongye Yang, Jianfeng Feng, Xiangyang Xue, Qibao Zheng, Xiaoqing Ye, Xiaolin Zhang | 176 | |
104 | 10:30 | SpaceNet MVOI: A Multi-View Overhead Imagery Dataset | Nicholas Weir, David Lindenbaum, Alexei Bastidas, Adam Van Etten, Sean McPherson, Jacob Shermeyer, Varun Kumar, Hanlin Tang | 3068 | |
105 | 10:30 | Multi-Level Bottom-Top and Top-Bottom Feature Fusion for Crowd Counting | Vishwanath A. Sindagi, Vishal M. Patel | 1098 | |
106 | 10:30 | Learning Lightweight Lane Detection CNNs by Self Attention Distillation | Yuenan Hou, Zheng Ma, Chunxiao Liu, Chen Change Loy | 251 | |
107 | 10:30 | SplitNet: Sim2Sim and Task2Task Transfer for Embodied Visual Navigation | Daniel Gordon, Abhishek Kadian, Devi Parikh, Judy Hoffman, Dhruv Batra | 3164 | |
3D From Multiview & Sensors | 108 | 10:30 | Cascaded Parallel Filtering for Memory-Efficient Image-Based Localization | Wentao Cheng, Weisi Lin, Kan Chen, Xinfeng Zhang | 6740 |
109 | 10:30 | Pixel2Mesh++: Multi-View 3D Mesh Generation via Deformation | Chao Wen, Yinda Zhang, Zhuwen Li, Yanwei Fu | 1656 | |
110 | 10:30 | A Differential Volumetric Approach to Multi-View Photometric Stereo | Fotios Logothetis, Roberto Mecca, Roberto Cipolla | 5099 | |
111 | 10:30 | Revisiting Radial Distortion Absolute Pose | Viktor Larsson, Torsten Sattler, Zuzana Kukelova, Marc Pollefeys | 4766 | |
112 | 10:30 | Estimating the Fundamental Matrix Without Point Correspondences With Application to Transmission Imaging | Tobias Würfl, André Aichert, Nicole Maaß, Frank Dennerlein, Andreas Maier | 5105 | |
113 | 10:30 | QUARCH: A New Quasi-Affine Reconstruction Stratum From Vague Relative Camera Orientation Knowledge | Devesh Adlakha, Adlane Habed, Fabio Morbidi, Cédric Demonceaux, Michel de Mathelin | 4811 | |
114 | 10:30 | Homography From Two Orientation- and Scale-Covariant Features | Dániel Baráth, Zuzana Kukelova | 2962 | |
Applications. Medical, & Robotics | 115 | 10:30 | Hiding Video in Audio via Reversible Generative Models | Hyukryul Yang, Hao Ouyang, Vladlen Koltun, Qifeng Chen | 2545 |
116 | 10:30 | GSLAM: A General SLAM Framework and Benchmark | Yong Zhao, Shibiao Xu, Shuhui Bu, Hongkai Jiang, Pengcheng Han | 4414 | |
117 | 10:30 | Elaborate Monocular Point and Line SLAM With Robust Initialization | Sang Jun Lee, Sung Soo Hwang | 3321 | |
118 | 10:30 | Adaptive Density Map Generation for Crowd Counting | Jia Wan, Antoni Chan | 2578 | |
119 | 10:30 | Attention-Aware Polarity Sensitive Embedding for Affective Image Retrieval | Xingxu Yao, Dongyu She, Sicheng Zhao, Jie Liang, Yu-Kun Lai, Jufeng Yang | 4614 | |
120 | 10:30 | Zero-Shot Emotion Recognition via Affective Structural Embedding | Chi Zhan, Dongyu She, Sicheng Zhao, Ming-Ming Cheng, Jufeng Yang | 4618 | |
121 | 10:30 | FW-GAN: Flow-Navigated Warping GAN for Video Virtual Try-On | Haoye Dong, Xiaodan Liang, Xiaohui Shen, Bowen Wu, Bing-Cheng Chen, Jian Yin | 1624 | |
122 | 10:30 | Interactive Sketch & Fill: Multiclass Sketch-to-Image Translation | Arnab Ghosh, Richard Zhang, Puneet K. Dokania, Oliver Wang, Alexei A. Efros, Philip H. S. Torr, Eli Shechtman | 4121 | |
123 | 10:30 | Attention-Based Autism Spectrum Disorder Screening With Privileged Modality | Shi Chen, Qi Zhao | 1785 | |
124 | 10:30 | Image Aesthetic Assessment Based on Pairwise Comparison A Unified Approach to Score Regression, Binary Classification, and Personalization | Jun-Tae Lee, Chang-Su Kim | 2448 | |
125 | 10:30 | Delving Into Robust Object Detection From Unmanned Aerial Vehicles: A Deep Nuisance Disentanglement Approach | Zhenyu Wu, Karthik Suresh, Priya Narayanan, Hongyu Xu, Heesung Kwon, Zhangyang Wang | 2556 | |
126 | 10:30 | Bit-Flip Attack: Crushing Neural Network With Progressive Bit Search | Adnan Siraj Rakin, Zhezhi He, Deliang Fan | 6178 | |
127 | 10:30 | Employing Deep Part-Object Relationships for Salient Object Detection | Yi Liu, Qiang Zhang, Dingwen Zhang, Jungong Han | 5649 | |
128 | 10:30 | Self-Supervised Deep Depth Denoising | Vladimiros Sterzentsenko, Leonidas Saroglou, Anargyros Chatzitofis, Spyridon Thermos, Nikolaos Zioulis, Alexandros Doumanoglou, Dimitrios Zarpalas, Petros Daras | 6215 | |
129 | 10:30 | Cost-Aware Fine-Grained Recognition for IoTs Based on Sequential Fixations | Hanxiao Wang, Venkatesh Saligrama, Stan Sclaroff, Vitaly Ablavsky | 2392 | |
130 | 10:30 | Layout-Induced Video Representation for Recognizing Agent-in-Place Actions | Ruichi Yu, Hongcheng Wang, Ang Li, Jingxiao Zheng, Vlad I. Morariu, Larry S. Davis | 1859 | |
131 | 10:30 | Anomaly Detection in Video Sequence With Appearance-Motion Correspondence | Trong-Nguyen Nguyen, Jean Meunier | 5180 |
Tuesday, October 29, 2019, 1330–1530 Oral 1.2A (Hall D1) Judy Hoffman (Facebook AI Research; Georgia Tech), Min Sun (National Tsing Hua Univ.) | |||||
---|---|---|---|---|---|
Session Title/Poster Group | Poster # | Presentation Time | Title | Author(s) | Paper ID |
Architectures, Multi-Task Learning, Domain Adaptation | 1 | 13:30 | Exploring Randomly Wired Neural Networks for Image Recognition [Video] | Saining Xie, Alexander Kirillov, Ross Girshick, Kaiming He | 634 |
2 | 13:35 | Progressive Differentiable Architecture Search: Bridging the Depth Gap Between Search and Evaluation [Video] | Xin Chen, Lingxi Xie, Jun Wu, Qi Tian | 2397 | |
3 | 13:40 | Multinomial Distribution Learning for Effective Neural Architecture Search [Video] | Xiawu Zheng, Rongrong Ji, Lang Tang, Baochang Zhang, Jianzhuang Liu, Qi Tian | 1865 | |
4 | 13:48 | Searching for MobileNetV3 [Video] | Andrew Howard, Mark Sandler, Grace Chu, Liang-Chieh Chen, Bo Chen, Mingxing Tan, Weijun Wang, Yukun Zhu, Ruoming Pang, Vijay Vasudevan, Quoc V. Le, Hartwig Adam | 5162 | |
5 | 13:53 | Data-Free Quantization Through Weight Equalization and Bias Correction [Video] | Markus Nagel, Mart van Baalen, Tijmen Blankevoort, Max Welling | 6174 | |
6 | 13:58 | A Camera That CNNs: Towards Embedded Neural Networks on Pixel Processor Arrays [Video] | Laurie Bose, Jianing Chen, Stephen J. Carey, Piotr Dudek, Walterio Mayol-Cuevas | 5201 | |
7 | 14:06 | Knowledge Distillation via Route Constrained Optimization [Video] | Xiao Jin, Baoyun Peng, Yichao Wu, Yu Liu, Jiaheng Liu, Ding Liang, Junjie Yan, Xiaolin Hu | 1252 | |
8 | 14:11 | Distillation-Based Training for Multi-Exit Architectures [Video] | Mary Phuong, Christoph H. Lampert | 3548 | |
9 | 14:16 | Similarity-Preserving Knowledge Distillation [Video] | Frederick Tung, Greg Mori | 5161 | |
10 | 14:24 | Many Task Learning With Task Routing [Video] | Gjorgji Strezoski, Nanne van Noord, Marcel Worring | 1330 | |
11 | 14:29 | Stochastic Filter Groups for Multi-Task CNNs: Learning Specialist and Generalist Convolution Kernels [Video] | Felix J.S. Bragman, Ryutaro Tanno, Sebastien Ourselin, Daniel C. Alexander, Jorge Cardoso | 3436 | |
12 | 14:34 | Transferability and Hardness of Supervised Classification Tasks [Video] | Anh T. Tran, Cuong V. Nguyen, Tal Hassner | 6640 | |
13 | 14:42 | Moment Matching for Multi-Source Domain Adaptation [Video] | Xingchao Peng, Qinxun Bai, Xide Xia, Zijun Huang, Kate Saenko, Bo Wang | 4125 | |
14 | 14:47 | Unsupervised Domain Adaptation via Regularized Conditional Alignment [Video] | Safa Cicek, Stefano Soatto | 4113 | |
15 | 14:52 | Larger Norm More Transferable: An Adaptive Feature Norm Approach for Unsupervised Domain Adaptation [Video] | Ruijia Xu, Guanbin Li, Jihan Yang, Liang Lin | 959 | |
16 | 15:00 | UM-Adapt: Unsupervised Multi-Task Adaptation Using Adversarial Cross-Task Distillation [Video] | Jogendra Nath Kundu, Nishank Lakkakula, R. Venkatesh Babu | 2419 | |
17 | 15:05 | Episodic Training for Domain Generalization [Video] | Da Li, Jianshu Zhang, Yongxin Yang, Cong Liu, Yi-Zhe Song, Timothy M. Hospedales | 1488 | |
18 | 15:10 | Domain Adaptation for Structured Output via Discriminative Patch Representations [Video] | Yi-Hsuan Tsai, Kihyuk Sohn, Samuel Schulter, Manmohan Chandraker | 609 | |
19 | 15:18 | Semi-Supervised Learning by Augmented Distribution Alignment [Video] | Qin Wang, Wen Li, Luc Van Gool | 3523 | |
20 | 15:23 | S4L: Self-Supervised Semi-Supervised Learning [Video] | Xiaohua Zhai, Avital Oliver, Alexander Kolesnikov, Lucas Beyer | 4757 |
Tuesday, October 29, 2019, 1330–1530 Oral 1.2B (Hall D2) Ko Nishino (Kyoto Univ.), Yong Jae Lee (Univ. of California, Davis) | |||||
---|---|---|---|---|---|
Session Title/Poster Group | Poster # | Presentation Time | Title | Author(s) | Paper ID |
Multi-View Geometry, 3D Scene Understanding | 21 | 13:30 | Privacy Preserving Image Queries for Camera Localization [Video] | Pablo Speciale, Johannes L. Schönberger, Sudipta N. Sinha, Marc Pollefeys | 3611 |
22 | 13:35 | Calibration Wizard: A Guidance System for Camera Calibration Based on Modelling Geometric and Corner Uncertainty [Video] | Songyou Peng, Peter Sturm | 3435 | |
23 | 13:40 | Gated2Depth: Real-Time Dense Lidar From Gated Images [Video] | Tobias Gruber, Frank Julca-Aguilar, Mario Bijelic, Felix Heide | 3718 | |
24 | 13:48 | X-Section: Cross-Section Prediction for Enhanced RGB-D Fusion [Video] | Andrea Nicastro, Ronald Clark, Stefan Leutenegger | 3933 | |
25 | 13:53 | Learning an Event Sequence Embedding for Dense Event-Based Deep Stereo [Video] | Stepan Tulyakov, Francois Fleuret, Martin Kiefel, Peter Gehler, Michael Hirsch | 4599 | |
26 | 13:58 | Point-Based Multi-View Stereo Network [Video] | Rui Chen, Songfang Han, Jing Xu, Hao Su | 3151 | |
27 | 14:06 | Discrete Laplace Operator Estimation for Dynamic 3D Reconstruction [Video] | Xiangyu Xu, Enrique Dunn | 981 | |
28 | 14:11 | Deep Non-Rigid Structure From Motion [Video] | Chen Kong, Simon Lucey | 3127 | |
29 | 14:16 | Equivariant Multi-View Networks [Video] | Carlos Esteves, Yinshuang Xu, Christine Allen-Blanchette, Kostas Daniilidis | 5370 | |
30 | 14:24 | Interpolated Convolutional Networks for 3D Point Cloud Understanding [Video] | Jiageng Mao, Xiaogang Wang, Hongsheng Li | 2615 | |
31 | 14:29 | Revisiting Point Cloud Classification: A New Benchmark Dataset and Classification Model on Real-World Data [Video] | Mikaela Angelina Uy, Quang-Hieu Pham, Binh-Son Hua, Thanh Nguyen, Sai-Kit Yeung | 2353 | |
32 | 14:34 | PointCloud Saliency Maps [Video] | Tianhang Zheng, Changyou Chen, Junsong Yuan, Bo Li, Kui Ren | 2290 | |
33 | 14:42 | ShellNet: Efficient Point Cloud Convolutional Neural Networks Using Concentric Shells Statistics [Video] | Zhiyuan Zhang, Binh-Son Hua, Sai-Kit Yeung | 582 | |
34 | 14:47 | Unsupervised Deep Learning for Structured Shape Matching [Video] | Jean-Michel Roufosse, Abhishek Sharma, Maks Ovsjanikov | 2437 | |
35 | 14:52 | Linearly Converging Quasi Branch and Bound Algorithms for Global Rigid Registration [Video] | Nadav Dym, Shahar Ziv Kovalsky | 2303 | |
36 | 15:00 | Consensus Maximization Tree Search Revisited [Video] | Zhipeng Cai, Tat-Jun Chin, Vladlen Koltun | 4168 | |
37 | 15:05 | Quasi-Globally Optimal and Efficient Vanishing Point Estimation in Manhattan World [Video] | Haoang Li, Ji Zhao, Jean-Charles Bazin, Wen Chen, Zhe Liu, Yun-Hui Liu | 4865 | |
38 | 15:10 | An Efficient Solution to the Homography-Based Relative Pose Problem With a Common Reference Direction [Video] | Yaqing Ding, Jian Yang, Jean Ponce, Hui Kong | 3379 | |
39 | 15:18 | A Quaternion-Based Certifiably Optimal Solution to the Wahba Problem With Outliers [Video] | Heng Yang, Luca Carlone | 3476 | |
40 | 15:23 | PLMP - Point-Line Minimal Problems in Complete Multi-View Visibility [Video] | Timothy Duff, Kathlén Kohn, Anton Leykin, Tomas Pajdla | 5582 |
Tuesday, October 29, 2019, 1530–1800 Poster 1.2 (Hall B) | |||||
---|---|---|---|---|---|
Session Title/Poster Group | Poster # | Presentation Time | Title | Author(s) | Paper ID |
Deep Learning | 41 | 15:30 | Variational Few-Shot Learning | Jian Zhang, Chenglong Zhao, Bingbing Ni, Minghao Xu, Xiaokang Yang | 1006 |
42 | 15:30 | Generative Adversarial Minority Oversampling | Sankha Subhra Mullick, Shounak Datta, Swagatam Das | 2963 | |
43 | 15:30 | Memorizing Normality to Detect Anomaly: Memory-Augmented Deep Autoencoder for Unsupervised Anomaly Detection | Dong Gong, Lingqiao Liu, Vuong Le, Budhaditya Saha, Moussa Reda Mansour, Svetha Venkatesh, Anton van den Hengel | 3198 | |
44 | 15:30 | Topological Map Extraction From Overhead Images | Zuoyue Li, Jan Dirk Wegner, Aurélien Lucchi | 3034 | |
45 | 15:30 | Exploiting Temporal Consistency for Real-Time Video Depth Estimation | Haokui Zhang, Chunhua Shen, Ying Li, Yuanzhouhan Cao, Yu Liu, Youliang Yan | 1457 | |
46 | 15:30 | The Sound of Motions | Hang Zhao, Chuang Gan, Wei-Chiu Ma, Antonio Torralba | 2561 | |
47 | 15:30 | SC-FEGAN: Face Editing Generative Adversarial Network With User’s Sketch and Color | Youngjoo Jo, Jongyoul Park | 1667 | |
48 | 15:30 | Exploring Overall Contextual Information for Image Captioning in Human-Like Cognitive Style | Hongwei Ge, Zehang Yan, Kai Zhang, Mingde Zhao, Liang Sun | 6385 | |
49 | 15:30 | Order-Aware Generative Modeling Using the 3D-Craft Dataset | Zhuoyuan Chen, Demi Guo, Tong Xiao, Saining Xie, Xinlei Chen, Haonan Yu, Jonathan Gray, Kavya Srinet, Haoqi Fan, Jerry Ma, Charles R. Qi, Shubham Tulsiani, Arthur Szlam, C. Lawrence Zitnick | 6532 | |
50 | 15:30 | Crowd Counting With Deep Structured Scale Integration Network | Lingbo Liu, Zhilin Qiu, Guanbin Li, Shufan Liu, Wanli Ouyang, Liang Lin | 2862 | |
51 | 15:30 | Bidirectional One-Shot Unsupervised Domain Mapping | Tomer Cohen, Lior Wolf | 2633 | |
52 | 15:30 | Evolving Space-Time Neural Architectures for Videos | AJ Piergiovanni, Anelia Angelova, Alexander Toshev, Michael S. Ryoo | 3505 | |
53 | 15:30 | Universally Slimmable Networks and Improved Training Techniques | Jiahui Yu, Thomas S. Huang | 904 | |
54 | 15:30 | AutoDispNet: Improving Disparity Estimation With AutoML | Tonmoy Saikia, Yassine Marrakchi, Arber Zela, Frank Hutter, Thomas Brox | 4901 | |
55 | 15:30 | Deep Meta Functionals for Shape Representation | Gidi Littwin, Lior Wolf | 2637 | |
56 | 15:30 | Differentiable Kernel Evolution | Yu Liu, Jihao Liu, Ailing Zeng, Xiaogang Wang | 3361 | |
57 | 15:30 | Batch Weight for Domain Adaptation With Mass Shift | Mikolaj Bińkowski, Devon Hjelm, Aaron Courville | 6579 | |
58 | 15:30 | SRM: A Style-Based Recalibration Module for Convolutional Neural Networks | HyunJae Lee, Hyo-Eun Kim, Hyeonseob Nam | 3857 | |
59 | 15:30 | Switchable Whitening for Deep Representation Learning | Xingang Pan, Xiaohang Zhan, Jianping Shi, Xiaoou Tang, Ping Luo | 382 | |
60 | 15:30 | Adaptative Inference Cost With Convolutional Neural Mixture Models | Adria Ruiz, Jakob Verbeek | 4085 | |
61 | 15:30 | On Network Design Spaces for Visual Recognition | Ilija Radosavovic, Justin Johnson, Saining Xie, Wan-Yen Lo, Piotr Dollár | 2551 | |
62 | 15:30 | Improved Techniques for Training Adaptive Deep Networks | Hao Li, Hong Zhang, Xiaojuan Qi, Ruigang Yang, Gao Huang | 5797 | |
63 | 15:30 | Resource Constrained Neural Network Architecture Search: Will a Submodularity Assumption Help? | Yunyang Xiong, Ronak Mehta, Vikas Singh | 3772 | |
64 | 15:30 | ACNet: Strengthening the Kernel Skeletons for Powerful CNN via Asymmetric Convolution Blocks | Xiaohan Ding, Yuchen Guo, Guiguang Ding, Jungong Han | 5264 | |
65 | 15:30 | A Comprehensive Overhaul of Feature Distillation | Byeongho Heo, Jeesoo Kim, Sangdoo Yun, Hyojin Park, Nojun Kwak, Jin Young Choi | 3700 | |
Recognition | 66 | 15:30 | Transferable Semi-Supervised 3D Object Detection From RGB-D Data | Yew Siang Tang, Gim Hee Lee | 1844 |
67 | 15:30 | DPOD: 6D Pose Object Detector and Refiner | Sergey Zakharov, Ivan Shugurov, Slobodan Ilic | 3375 | |
68 | 15:30 | STD: Sparse-to-Dense 3D Object Detector for Point Cloud | Zetong Yang, Yanan Sun, Shu Liu, Xiaoyong Shen, Jiaya Jia | 2501 | |
69 | 15:30 | DUP-Net: Denoiser and Upsampler Network for 3D Adversarial Point Clouds Defense | Hang Zhou, Kejiang Chen, Weiming Zhang, Han Fang, Wenbo Zhou, Nenghai Yu | 4560 | |
70 | 15:30 | Learning Rich Features at High-Speed for Single-Shot Object Detection | Tiancai Wang, Rao Muhammad Anwer, Hisham Cholakkal, Fahad Shahbaz Khan, Yanwei Pang, Ling Shao | 5011 | |
71 | 15:30 | Detecting Unseen Visual Relations Using Analogies | Julia Peyre, Ivan Laptev, Cordelia Schmid, Josef Sivic | 2531 | |
72 | 15:30 | Disentangling Monocular 3D Object Detection | Andrea Simonelli, Samuel Rota Bulò, Lorenzo Porzi, Manuel López-Antequera, Peter Kontschieder | 3921 | |
73 | 15:30 | STM: SpatioTemporal and Motion Encoding for Action Recognition | Boyuan Jiang, MengMeng Wang, Weihao Gan, Wei Wu, Junjie Yan | 3832 | |
74 | 15:30 | Dynamic Context Correspondence Network for Semantic Alignment | Shuaiyi Huang, Qiuyue Wang, Songyang Zhang, Shipeng Yan, Xuming He | 4434 | |
75 | 15:30 | Fooling Network Interpretation in Image Classification | Akshayvarun Subramanya, Vipin Pillai, Hamed Pirsiavash | 5129 | |
76 | 15:30 | Unconstrained Foreground Object Search | Yinan Zhao, Brian Price, Scott Cohen, Danna Gurari | 1855 | |
77 | 15:30 | Embodied Amodal Recognition: Learning to Move to Perceive Objects | Jianwei Yang, Zhile Ren, Mingze Xu, Xinlei Chen, David J. Crandall, Devi Parikh, Dhruv Batra | 320 | |
78 | 15:30 | SpatialSense: An Adversarially Crowdsourced Benchmark for Spatial Relation Recognition | Kaiyu Yang, Olga Russakovsky, Jia Deng | 912 | |
79 | 15:30 | TensorMask: A Foundation for Dense Object Segmentation | Xinlei Chen, Ross Girshick, Kaiming He, Piotr Dollár | 669 | |
80 | 15:30 | Integral Object Mining via Online Attention Accumulation | Peng-Tao Jiang, Qibin Hou, Yang Cao, Ming-Ming Cheng, Yunchao Wei, Hong-Kai Xiong | 2481 | |
Segmentation, Grouping, & Shape | 81 | 15:30 | Accelerated Gravitational Point Set Alignment With Altered Physical Laws | Vladislav Golyanik, Christian Theobalt, Didier Stricker | 3620 |
82 | 15:30 | Domain Adaptation for Semantic Segmentation With Maximum Squares Loss | Minghao Chen, Hongyang Xue, Deng Cai | 4649 | |
83 | 15:30 | Domain Randomization and Pyramid Consistency: Simulation-to-Real Generalization Without Accessing Target Domain Data | Xiangyu Yue, Yang Zhang, Sicheng Zhao, Alberto Sangiovanni-Vincentelli, Kurt Keutzer, Boqing Gong | 2690 | |
84 | 15:30 | Semi-Supervised Skin Detection by Network With Mutual Guidance | Yi He, Jiayuan Shi, Chuan Wang, Haibin Huang, Jiaming Liu, Guanbin Li, Risheng Liu, Jue Wang | 3172 | |
85 | 15:30 | ACE: Adapting to Changing Environments for Semantic Segmentation | Zuxuan Wu, Xin Wang, Joseph E. Gonzalez, Tom Goldstein, Larry S. Davis | 1860 | |
86 | 15:30 | Efficient Segmentation: Learning Downsampling Near Semantic Boundaries | Dmitrii Marin, Zijian He, Peter Vajda, Priyam Chatterjee, Sam Tsai, Fei Yang, Yuri Boykov | 668 | |
87 | 15:30 | Recurrent U-Net for Resource-Constrained Segmentation | Wei Wang, Kaicheng Yu, Joachim Hugonot, Pascal Fua, Mathieu Salzmann | 845 | |
88 | 15:30 | Detecting the Unexpected via Image Resynthesis | Krzysztof Lis, Krishna Nakka, Pascal Fua, Mathieu Salzmann | 4794 | |
3D From Single View & RGBD | 89 | 15:30 | Self-Supervised Monocular Depth Hints | Jamie Watson, Michael Firman, Gabriel J. Brostow, Daniyar Turmukhambetov | 6191 |
90 | 15:30 | 3D Scene Reconstruction With Multi-Layer Depth and Epipolar Transformers | Daeyun Shin, Zhile Ren, Erik B. Sudderth, Charless C. Fowlkes | 759 | |
91 | 15:30 | How Do Neural Networks See Depth in Single Images? | Tom van Dijk, Guido de Croon | 4576 | |
92 | 15:30 | On Boosting Single-Frame 3D Human Pose Estimation via Monocular Videos | Zhi Li, Xuan Wang, Fei Wang, Peilin Jiang | 1394 | |
93 | 15:30 | Canonical Surface Mapping via Geometric Cycle Consistency | Nilesh Kulkarni, Abhinav Gupta, Shubham Tulsiani | 3758 | |
94 | 15:30 | GP2C: Geometric Projection Parameter Consensus for Joint 3D Pose and Focal Length Estimation in the Wild | Alexander Grabner, Peter M. Roth, Vincent Lepetit | 1919 | |
Face & Body | 95 | 15:30 | Moulding Humans: Non-Parametric 3D Human Shape Estimation From Single Images | Valentin Gabeur, Jean-Sébastien Franco, Xavier Martin, Cordelia Schmid, Grégory Rogez | 5728 |
96 | 15:30 | 3DPeople: Modeling the Geometry of Dressed Humans | Albert Pumarola, Jordi Sanchez-Riera, Gary P. T. Choi, Alberto Sanfeliu, Francesc Moreno-Noguer | 5869 | |
97 | 15:30 | Learning to Reconstruct 3D Human Pose and Shape via Model-Fitting in the Loop | Nikos Kolotouros, Georgios Pavlakos, Michael J. Black, Kostas Daniilidis | 6602 | |
98 | 15:30 | Optimizing Network Structure for 3D Human Pose Estimation | Hai Ci, Chunyu Wang, Xiaoxuan Ma, Yizhou Wang | 3717 | |
99 | 15:30 | Exploiting Spatial-Temporal Relationships for 3D Pose Estimation via Graph Convolutional Networks | Yujun Cai, Liuhao Ge, Jun Liu, Jianfei Cai, Tat-Jen Cham, Junsong Yuan, Nadia Magnenat Thalmann | 1881 | |
100 | 15:30 | Resolving 3D Human Pose Ambiguities With 3D Scene Constraints | Mohamed Hassan, Vasileios Choutas, Dimitrios Tzionas, Michael J. Black | 2115 | |
101 | 15:30 | Tex2Shape: Detailed Full Human Body Geometry From a Single Image | Thiemo Alldieck, Gerard Pons-Moll, Christian Theobalt, Marcus Magnor | 1288 | |
102 | 15:30 | PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization | Shunsuke Saito, Zeng Huang, Ryota Natsume, Shigeo Morishima, Angjoo Kanazawa, Hao Li | 3540 | |
103 | 15:30 | DF2Net: A Dense-Fine-Finer Network for Detailed 3D Face Reconstruction | Xiaoxing Zeng, Xiaojiang Peng, Yu Qiao | 5519 | |
104 | 15:30 | Monocular 3D Human Pose Estimation by Generation and Ordinal Ranking | Saurabh Sharma, Pavan Teja Varigonda, Prashast Bindal, Abhishek Sharma, Arjun Jain | 5051 | |
105 | 15:30 | Aligning Latent Spaces for 3D Hand Pose Estimation | Linlin Yang, Shile Li, Dongheui Lee, Angela Yao | 4946 | |
106 | 15:30 | HEMlets Pose: Learning Part-Centric Heatmap Triplets for Accurate 3D Human Pose Estimation | Kun Zhou, Xiaoguang Han, Nianjuan Jiang, Kui Jia, Jiangbo Lu | 4275 | |
107 | 15:30 | End-to-End Hand Mesh Recovery From a Monocular RGB Image | Xiong Zhang, Qiang Li, Hong Mo, Wenbo Zhang, Wen Zheng | 4647 | |
Motion & Tracking | 108 | 15:30 | Robust Multi-Modality Multi-Object Tracking | Wenwei Zhang, Hui Zhou, Shuyang Sun, Zhe Wang, Jianping Shi, Chen Change Loy | 285 |
109 | 15:30 | The Trajectron: Probabilistic Multi-Agent Trajectory Modeling With Dynamic Spatiotemporal Graphs | Boris Ivanovic, Marco Pavone | 1011 | |
110 | 15:30 | ‘Skimming-Perusal’ Tracking: A Framework for Real-Time and Robust Long-Term Tracking | Bin Yan, Haojie Zhao, Dong Wang, Huchuan Lu, Xiaoyun Yang | 1917 | |
111 | 15:30 | TASED-Net: Temporally-Aggregating Spatial Encoder-Decoder Network for Video Saliency Detection | Kyle Min, Jason J. Corso | 2157 | |
112 | 15:30 | Attacking Optical Flow | Anurag Ranjan, Joel Janai, Andreas Geiger, Michael J. Black | 496 | |
Computational Photography & Graphics | 113 | 15:30 | Pro-Cam SSfM: Projector-Camera System for Structure and Spectral Reflectance From Motion | Chunyu Li, Yusuke Monno, Hironori Hidaka, Masatoshi Okutomi | 2101 |
114 | 15:30 | Mop Moiré Patterns Using MopNet | Bin He, Ce Wang, Boxin Shi, Ling-Yu Duan | 2272 | |
115 | 15:30 | Kernel Modeling Super-Resolution on Real Low-Resolution Images | Ruofan Zhou, Sabine Süsstrunk | 3508 | |
116 | 15:30 | Learning to Jointly Generate and Separate Reflections | Daiqian Ma, Renjie Wan, Boxin Shi, Alex C. Kot, Ling-Yu Duan | 767 | |
117 | 15:30 | Deep Multi-Model Fusion for Single-Image Dehazing | Zijun Deng, Lei Zhu, Xiaowei Hu, Chi-Wing Fu, Xuemiao Xu, Qing Zhang, Jing Qin, Pheng-Ann Heng | 2339 | |
118 | 15:30 | Deep Learning for Seeing Through Window With Raindrops | Yuhui Quan, Shijie Deng, Yixin Chen, Hui Ji | 3676 | |
119 | 15:30 | Mask-ShadowGAN: Learning to Remove Shadows From Unpaired Data | Xiaowei Hu, Yitong Jiang, Chi-Wing Fu, Pheng-Ann Heng | 2609 | |
Low-Level Vision & Optimization | 120 | 15:30 | Spatio-Temporal Filter Adaptive Network for Video Deblurring | Shangchen Zhou, Jiawei Zhang, Jinshan Pan, Haozhe Xie, Wangmeng Zuo, Jimmy Ren | 1160 |
121 | 15:30 | Learning Deep Priors for Image Dehazing | Yang Liu, Jinshan Pan, Jimmy Ren, Zhixun Su | 3339 | |
122 | 15:30 | JPEG Artifacts Reduction via Deep Convolutional Sparse Coding | Xueyang Fu, Zheng-Jun Zha, Feng Wu, Xinghao Ding, John Paisley | 1942 | |
123 | 15:30 | Self-Guided Network for Fast Image Denoising | Shuhang Gu, Yawei Li, Luc Van Gool, Radu Timofte | 4910 | |
124 | 15:30 | Non-Local Intrinsic Decomposition With Near-Infrared Priors | Ziang Cheng, Yinqiang Zheng, Shaodi You, Imari Sato | 756 | |
Scene Understanding | 125 | 15:30 | VideoMem: Constructing, Analyzing, Predicting Short-Term and Long-Term Video Memorability | Romain Cohendet, Claire-Hélène Demarty, Ngoc Q. K. Duong, Martin Engilberge | 5131 |
126 | 15:30 | Rescan: Inductive Instance Segmentation for Indoor RGBD Scans | Maciej Halber, Yifei Shi, Kai Xu, Thomas Funkhouser | 2289 | |
127 | 15:30 | End-to-End CAD Model Retrieval and 9DoF Alignment in 3D Scans | Armen Avetisyan, Angela Dai, Matthias Nießner | 1933 | |
128 | 15:30 | Making History Matter: History-Advantage Sequence Training for Visual Dialog | Tianhao Yang, Zheng-Jun Zha, Hanwang Zhang | 5927 | |
129 | 15:30 | Stochastic Attraction-Repulsion Embedding for Large Scale Image Localization | Liu Liu, Hongdong Li, Yuchao Dai | 2795 | |
130 | 15:30 | Scene Graph Prediction With Limited Labels | Vincent S. Chen, Paroma Varma, Ranjay Krishna, Michael Bernstein, Christopher Ré, Li Fei-Fei | 3739 | |
Language & Reasoning | 131 | 15:30 | Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded | Ramprasaath R. Selvaraju, Stefan Lee, Yilin Shen, Hongxia Jin, Shalini Ghosh, Larry Heck, Dhruv Batra, Devi Parikh | 706 |
132 | 15:30 | Align2Ground: Weakly Supervised Phrase Grounding Guided by Image-Caption Alignment | Samyak Datta, Karan Sikka, Anirban Roy, Karuna Ahuja, Devi Parikh, Ajay Divakaran | 6175 | |
133 | 15:30 | Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding | Xuejing Liu, Liang Li, Shuhui Wang, Zheng-Jun Zha, Dechao Meng, Qingming Huang | 742 | |
134 | 15:30 | Hierarchy Parsing for Image Captioning | Ting Yao, Yingwei Pan, Yehao Li, Tao Mei | 5683 | |
135 | 15:30 | HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips | Antoine Miech, Dimitri Zhukov, Jean-Baptiste Alayrac, Makarand Tapaswi, Ivan Laptev, Josef Sivic | 45 | |
136 | 15:30 | Controllable Video Captioning With POS Sequence Guidance Based on Gated Fusion Network | Bairui Wang, Lin Ma, Wei Zhang, Wenhao Jiang, Jingwen Wang, Wei Liu | 1039 | |
3D From Multiview & Sensors | 137 | 15:30 | Multi-View Stereo by Temporal Nonparametric Fusion | Yuxin Hou, Juho Kannala, Arno Solin | 6035 |
138 | 15:30 | Floor-SP: Inverse CAD for Floorplans by Sequential Room-Wise Shortest Path | Jiacheng Chen, Chen Liu, Jiaye Wu, Yasutaka Furukawa | 1233 | |
139 | 15:30 | Polarimetric Relative Pose Estimation | Zhaopeng Cui, Viktor Larsson, Marc Pollefeys | 3638 | |
140 | 15:30 | Closed-Form Optimal Two-View Triangulation Based on Angular Errors | Seong Hun Lee, Javier Civera | 1911 | |
141 | 15:30 | Pix2Vox: Context-Aware 3D Reconstruction From Single and Multi-View Images | Haozhe Xie, Hongxun Yao, Xiaoshuai Sun, Shangchen Zhou, Shengping Zhang | 1148 | |
Image & Video Synthesis | 142 | 15:30 | Unsupervised Robust Disentangling of Latent Characteristics for Image Synthesis | Patrick Esser, Johannes Haux, Björn Ommer | 3555 |
143 | 15:30 | SROBB: Targeted Perceptual Loss for Single Image Super-Resolution | Mohammad Saeed Rad, Behzad Bozorgtabar, Urs-Viktor Marti, Max Basler, Hazim Kemal Ekenel, Jean-Philippe Thiran | 2968 | |
144 | 15:30 | An Internal Learning Approach to Video Inpainting | Haotian Zhang, Long Mai, Ning Xu, Zhaowen Wang, John Collomosse, Hailin Jin | 3273 | |
145 | 15:30 | Deep CG2Real: Synthetic-to-Real Translation via Image Disentanglement | Sai Bi, Kalyan Sunkavalli, Federico Perazzi, Eli Shechtman, Vladimir G. Kim, Ravi Ramamoorthi | 4212 | |
146 | 15:30 | Adversarial Defense via Learning to Generate Diverse Attacks | Yunseok Jang, Tianchen Zhao, Seunghoon Hong, Honglak Lee | 1811 | |
147 | 15:30 | Image Generation From Small Datasets via Batch Statistics Adaptation | Atsuhiro Noguchi, Tatsuya Harada | 3808 | |
148 | 15:30 | Lifelong GAN: Continual Learning for Conditional Image Generation | Mengyao Zhai, Lei Chen, Frederick Tung, Jiawei He, Megha Nawhal, Greg Mori | 3391 | |
Applications. Medical, & Robotics | 149 | 15:30 | Bayesian Relational Memory for Semantic Visual Navigation | Yi Wu, Yuxin Wu, Aviv Tamar, Stuart Russell, Georgia Gkioxari, Yuandong Tian | 4634 |
150 | 15:30 | Mono-SF: Multi-View Geometry Meets Single-View Depth for Monocular Scene Flow Estimation of Dynamic Traffic Scenes | Fabian Brickwedde, Steffen Abraham, Rudolf Mester | 5103 | |
151 | 15:30 | Prior Guided Dropout for Robust Visual Localization in Dynamic Environments | Zhaoyang Huang, Yan Xu, Jianping Shi, Xiaowei Zhou, Hujun Bao, Guofeng Zhang | 3203 | |
152 | 15:30 | Drive&Act: A Multi-Modal Dataset for Fine-Grained Driver Behavior Recognition in Autonomous Vehicles | Manuel Martin, Alina Roitberg, Monica Haurilet, Matthias Horne, Simon Reiß, Michael Voit, Rainer Stiefelhagen | 2 | |
153 | 15:30 | Depth Completion From Sparse LiDAR Data With Depth-Normal Constraints | Yan Xu, Xinge Zhu, Jianping Shi, Guofeng Zhang, Hujun Bao, Hongsheng Li | 2490 | |
154 | 15:30 | PRECOG: PREdiction Conditioned on Goals in Visual Multi-Agent Settings | Nicholas Rhinehart, Rowan McAllister, Kris Kitani, Sergey Levine | 1984 | |
155 | 15:30 | LPD-Net: 3D Point Cloud Learning for Large-Scale Place Recognition and Environment Analysis | Zhe Liu, Shunbo Zhou, Chuanzhe Suo, Peng Yin, Wen Chen, Hesheng Wang, Haoang Li, Yun-Hui Liu | 6783 | |
156 | 15:30 | Local Supports Global: Deep Camera Relocalization With Sequence Enhancement | Fei Xue, Xin Wang, Zike Yan, Qiuyuan Wang, Junqiu Wang, Hongbin Zha | 1455 | |
157 | 15:30 | Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry | Shunkai Li, Fei Xue, Xin Wang, Zike Yan, Hongbin Zha | 909 | |
158 | 15:30 | TextPlace: Visual Place Recognition and Topological Localization Through Reading Scene Texts | Ziyang Hong, Yvan Petillot, David Lane, Yishu Miao, Sen Wang | 4996 | |
159 | 15:30 | CamNet: Coarse-to-Fine Retrieval for Camera Re-Localization | Mingyu Ding, Zhe Wang, Jiankai Sun, Jianping Shi, Ping Luo | 4057 | |
160 | 15:30 | Situational Fusion of Visual Representation for Visual Navigation | William B. Shen, Danfei Xu, Yuke Zhu, Leonidas J. Guibas, Li Fei-Fei, Silvio Savarese | 806 | |
161 | 15:30 | Learning Aberrance Repressed Correlation Filters for Real-Time UAV Tracking | Ziyuan Huang, Changhong Fu, Yiming Li, Fuling Lin, Peng Lu | 6944 | |
162 | 15:30 | 6-DOF GraspNet: Variational Grasp Generation for Object Manipulation | Arsalan Mousavian, Clemens Eppner, Dieter Fox | 6250 | |
163 | 15:30 | DAGMapper: Learning to Map by Discovering Lane Topology | Namdar Homayounfar, Wei-Chiu Ma, Justin Liang, Xinyu Wu, Jack Fan, Raquel Urtasun | 3609 | |
164 | 15:30 | 3D-LaneNet: End-to-End 3D Multiple Lane Detection | Noa Garnett, Rafi Cohen, Tomer Pe'er, Roee Lahav, Dan Levi | 876 |
Wednesday, October 30, 2019, 0900–1030 Oral 2.1A (Hall D1) Ming-Ming Cheng (Nankai Univ.), Camille Couprie (Facebook) | |||||
---|---|---|---|---|---|
Session Title/Poster Group | Poster # | Presentation Time | Title | Author(s) | Paper ID |
Feature Representations, Similarity Learning | 1 | 09:00 | Sampling-Free Epistemic Uncertainty Estimation Using Approximated Variance Propagation [Video] | Janis Postels, Francesco Ferroni, Huseyin Coskun, Nassir Navab, Federico Tombari | 1774 |
2 | 09:05 | Universal Adversarial Perturbation via Prior Driven Uncertainty Approximation [Video] | Hong Liu, Rongrong Ji, Jie Li, Baochang Zhang, Yue Gao, Yongjian Wu, Feiyue Huang | 2912 | |
3 | 09:10 | Understanding Deep Networks via Extremal Perturbations and Smooth Masks [Video] | Ruth Fong, Mandela Patrick, Andrea Vedaldi | 3949 | |
4 | 09:18 | Unsupervised Pre-Training of Image Features on Non-Curated Data [Video] | Mathilde Caron, Piotr Bojanowski, Julien Mairal, Armand Joulin | 3438 | |
5 | 09:23 | Learning Local Descriptors With a CDF-Based Dynamic Soft Margin [Video] | Linguang Zhang, Szymon Rusinkiewicz | 791 | |
6 | 09:28 | Bayes-Factor-VAE: Hierarchical Bayesian Deep Auto-Encoder Models for Factor Disentanglement [Video] | Minyoung Kim, Yuting Wang, Pritish Sahu, Vladimir Pavlovic | 4549 | |
7 | 09:36 | Linearized Multi-Sampling for Differentiable Image Transformation [Video] | Wei Jiang, Weiwei Sun, Andrea Tagliasacchi, Eduard Trulls, Kwang Moo Yi | 811 | |
8 | 09:41 | AdaTransform: Adaptive Data Transformation [Video] | Zhiqiang Tang, Xi Peng, Tingfeng Li, Yizhe Zhu, Dimitris N. Metaxas | 715 | |
9 | 09:46 | CARAFE: Content-Aware ReAssembly of FEatures [Video] | Jiaqi Wang, Kai Chen, Rui Xu, Ziwei Liu, Chen Change Loy, Dahua Lin | 2495 | |
10 | 09:54 | AFD-Net: Aggregated Feature Difference Learning for Cross-Spectral Image Patch Matching [Video] | Dou Quan, Xuefeng Liang, Shuang Wang, Shaowei Wei, Yanfeng Li, Ning Huyan, Licheng Jiao | 4249 | |
11 | 09:59 | Deep Joint-Semantics Reconstructing Hashing for Large-Scale Unsupervised Cross-Modal Retrieval [Video] | Shupeng Su, Zhisheng Zhong, Chao Zhang | 1426 | |
12 | 10:04 | Unsupervised Neural Quantization for Compressed-Domain Similarity Search [Video] | Stanislav Morozov, Artem Babenko | 3300 | |
13 | 10:12 | Siamese Networks: The Tale of Two Manifolds [Video] | Soumava Kumar Roy, Mehrtash Harandi, Richard Nock, Richard Hartley | 4856 | |
14 | 10:17 | Learning Combinatorial Embedding Networks for Deep Graph Matching [Video] | Runzhong Wang, Junchi Yan, Xiaokang Yang | 41 | |
15 | 10:22 | Fashion Retrieval via Graph Reasoning Networks on a Similarity Pyramid [Video] | Zhanghui Kuang, Yiming Gao, Guanbin Li, Ping Luo, Yimin Chen, Liang Lin, Wayne Zhang | 2037 |
Wednesday, October 30, 2019, 0900–1030 Oral 2.1B (Hall D2) Hiroshi Ishikawa (Waseda Univ.), Jinwei Gu (SenseTime) | |||||
---|---|---|---|---|---|
Session Title/Poster Group | Poster # | Presentation Time | Title | Author(s) | Paper ID |
Low Level Vision | 16 | 09:00 | Wavelet Domain Style Transfer for an Effective Perception-Distortion Tradeoff in Single Image Super-Resolution [Video] | Xin Deng, Ren Yang, Mai Xu, Pier Luigi Dragotti | 6221 |
17 | 09:05 | Toward Real-World Single Image Super-Resolution: A New Benchmark and a New Model [Video] | Jianrui Cai, Hui Zeng, Hongwei Yong, Zisheng Cao, Lei Zhang | 507 | |
18 | 09:10 | RankSRGAN: Generative Adversarial Networks With Ranker for Image Super-Resolution [Video] | Wenlong Zhang, Yihao Liu, Chao Dong, Yu Qiao | 539 | |
19 | 09:18 | Progressive Fusion Video Super-Resolution Network via Exploiting Non-Local Spatio-Temporal Correlations [Video] | Peng Yi, Zhongyuan Wang, Kui Jiang, Junjun Jiang, Jiayi Ma | 733 | |
20 | 09:23 | Deep SR-ITM: Joint Learning of Super-Resolution and Inverse Tone-Mapping for 4K UHD HDR Applications [Video] | Soo Ye Kim, Jihyong Oh, Munchurl Kim | 5601 | |
21 | 09:28 | Dynamic PET Image Reconstruction Using Nonnegative Matrix Factorization Incorporated With Deep Image Prior [Video] | Tatsuya Yokota, Kazuya Kawai, Muneyuki Sakata, Yuichi Kimura, Hidekata Hontani | 4500 | |
22 | 09:36 | DSIC: Deep Stereo Image Compression [Video] | Jerry Liu, Shenlong Wang, Raquel Urtasun | 2148 | |
23 | 09:41 | Variable Rate Deep Image Compression With a Conditional Autoencoder [Video] | Yoojin Choi, Mostafa El-Khamy, Jungwon Lee | 3646 | |
24 | 09:46 | Real Image Denoising With Feature Attention [Video] | Saeed Anwar, Nick Barnes | 3246 | |
25 | 09:54 | Noise Flow: Noise Modeling With Conditional Normalizing Flows [Video] | Abdelrahman Abdelhamed, Marcus A. Brubaker, Michael S. Brown | 33 | |
26 | 09:59 | Bottleneck Potentials in Markov Random Fields [Video] | Ahmed Abbas, Paul Swoboda | 3559 | |
27 | 10:04 | Seeing Motion in the Dark [Video] | Chen Chen, Qifeng Chen, Minh N. Do, Vladlen Koltun | 2553 | |
28 | 10:09 | SENSE: A Shared Encoder Network for Scene-Flow Estimation [Video] | Huaizu Jiang, Deqing Sun, Varun Jampani, Zhaoyang Lv, Erik Learned-Miller, Jan Kautz | 917 |
Wednesday, October 30, 2019, 1030–1300 Poster 2.1 (Hall B) | |||||
---|---|---|---|---|---|
Session Title/Poster Group | Poster # | Presentation Time | Title | Author(s) | Paper ID |
Deep Learning | 29 | 10:30 | Adversarial Feedback Loop | Firas Shama, Roey Mechrez, Alon Shoshan, Lihi Zelnik-Manor | 868 |
30 | 10:30 | Dynamic-Net: Tuning the Objective Without Re-Training for Synthesis Tasks | Alon Shoshan, Roey Mechrez, Lihi Zelnik-Manor | 892 | |
31 | 10:30 | AutoGAN: Neural Architecture Search for Generative Adversarial Networks | Xinyu Gong, Shiyu Chang, Yifan Jiang, Zhangyang Wang | 4192 | |
32 | 10:30 | Co-Evolutionary Compression for Unpaired Image Translation | Han Shu, Yunhe Wang, Xu Jia, Kai Han, Hanting Chen, Chunjing Xu, Qi Tian, Chang Xu | 4342 | |
33 | 10:30 | Self-Supervised Representation Learning From Multi-Domain Data | Zeyu Feng, Chang Xu, Dacheng Tao | 5423 | |
34 | 10:30 | Controlling Neural Networks via Energy Dissipation | Michael Moeller, Thomas Möllenhoff, Daniel Cremers | 2246 | |
35 | 10:30 | Indices Matter: Learning to Index for Deep Image Matting | Hao Lu, Yutong Dai, Chunhua Shen, Songcen Xu | 1411 | |
36 | 10:30 | LAP-Net: Level-Aware Progressive Network for Image Dehazing | Yunan Li, Qiguang Miao, Wanli Ouyang, Zhenxin Ma, Huijuan Fang, Chao Dong, Yining Quan | 4253 | |
37 | 10:30 | Attention Augmented Convolutional Networks | Irwan Bello, Barret Zoph, Ashish Vaswani, Jonathon Shlens, Quoc V. Le | 6563 | |
38 | 10:30 | MetaPruning: Meta Learning for Automatic Neural Network Channel Pruning | Zechun Liu, Haoyuan Mu, Xiangyu Zhang, Zichao Guo, Xin Yang, Kwang-Ting Cheng, Jian Sun | 3403 | |
39 | 10:30 | Accelerate CNN via Recursive Bayesian Pruning | Yuefu Zhou, Ya Zhang, Yanfeng Wang, Qi Tian | 2413 | |
40 | 10:30 | HBONet: Harmonious Bottleneck on Two Orthogonal Dimensions | Duo Li, Aojun Zhou, Anbang Yao | 3761 | |
41 | 10:30 | O2U-Net: A Simple Noisy Label Detection Approach for Deep Neural Networks | Jinchi Huang, Lie Qu, Rongfei Jia, Binqiang Zhao | 5801 | |
42 | 10:30 | Continual Learning by Asymmetric Loss Approximation With Single-Side Overestimation | Dongmin Park, Seokil Hong, Bohyung Han, Kyoung Mu Lee | 4533 | |
43 | 10:30 | Label-PEnet: Sequential Label Propagation and Enhancement Networks for Weakly Supervised Instance Segmentation | Weifeng Ge, Sheng Guo, Weilin Huang, Matthew R. Scott | 3838 | |
44 | 10:30 | LIP: Local Importance-Based Pooling | Ziteng Gao, Limin Wang, Gangshan Wu | 3926 | |
45 | 10:30 | Global Feature Guided Local Pooling | Takumi Kobayashi | 4446 | |
46 | 10:30 | Conditional Coupled Generative Adversarial Networks for Zero-Shot Domain Adaptation | Jinghua Wang, Jianmin Jiang | 5338 | |
47 | 10:30 | Adversarial Defense by Restricting the Hidden Space of Deep Neural Networks | Aamir Mustafa, Salman Khan, Munawar Hayat, Roland Goecke, Jianbing Shen, Ling Shao | 2247 | |
48 | 10:30 | Hyperpixel Flow: Semantic Correspondence With Multi-Layer Neural Features | Juhong Min, Jongmin Lee, Jean Ponce, Minsu Cho | 5417 | |
49 | 10:30 | Information Entropy Based Feature Pooling for Convolutional Neural Networks | Weitao Wan, Jiansheng Chen, Tianpeng Li, Yiqing Huang, Jingqi Tian, Cheng Yu, Youze Xue | 5290 | |
50 | 10:30 | Patchwork: A Patch-Wise Attention Network for Efficient Object Detection and Segmentation in Video Streams | Yuning Chai | 5412 | |
51 | 10:30 | AttentionRNN: A Structured Spatial Attention Mechanism | Siddhesh Khandelwal, Leonid Sigal | 4217 | |
52 | 10:30 | Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks With Octave Convolution | Yunpeng Chen, Haoqi Fan, Bing Xu, Zhicheng Yan, Yannis Kalantidis, Marcus Rohrbach, Shuicheng Yan, Jiashi Feng | 91 | |
53 | 10:30 | Domain Intersection and Domain Difference | Sagie Benaim, Michael Khaitov, Tomer Galanti, Lior Wolf | 6403 | |
54 | 10:30 | Learned Video Compression | Oren Rippel, Sanjay Nair, Carissa Lew, Steve Branson, Alexander G. Anderson, Lubomir Bourdev | 5742 | |
55 | 10:30 | Local Relation Networks for Image Recognition | Han Hu, Zheng Zhang, Zhenda Xie, Stephen Lin | 2275 | |
56 | 10:30 | DiscoNet: Shapes Learning on Disconnected Manifolds for 3D Editing | Éloi Mehr, Ariane Jourdan, Nicolas Thome, Matthieu Cord, Vincent Guitteny | 3502 | |
57 | 10:30 | Deep Residual Learning in the JPEG Transform Domain | Max Ehrlich, Larry S. Davis | 110 | |
58 | 10:30 | Approximated Bilinear Modules for Temporal Modeling | Xinqi Zhu, Chang Xu, Langwen Hui, Cewu Lu, Dacheng Tao | 1479 | |
59 | 10:30 | Customizing Student Networks From Heterogeneous Teachers via Adaptive Knowledge Amalgamation | Chengchao Shen, Mengqi Xue, Xinchao Wang, Jie Song, Li Sun, Mingli Song | 2478 | |
60 | 10:30 | Data-Free Learning of Student Networks | Hanting Chen, Yunhe Wang, Chang Xu, Zhaohui Yang, Chuanjian Liu, Boxin Shi, Chunjing Xu, Chao Xu, Qi Tian | 1272 | |
61 | 10:30 | Deep Closest Point: Learning Representations for Point Cloud Registration | Yue Wang, Justin M. Solomon | 6534 | |
62 | 10:30 | Orientation-Aware Semantic Segmentation on Icosahedron Spheres | Chao Zhang, Stephan Liwicki, William Smith, Roberto Cipolla | 1943 | |
63 | 10:30 | Differentiable Learning-to-Group Channels via Groupable Convolutional Neural Networks | Zhaoyang Zhang, Jingyu Li, Wenqi Shao, Zhanglin Peng, Ruimao Zhang, Xiaogang Wang, Ping Luo | 1395 | |
64 | 10:30 | HarDNet: A Low Memory Traffic Network | Ping Chao, Chao-Yang Kao, Yu-Shan Ruan, Chien-Hsiang Huang, Youn-Long Lin | 5269 | |
65 | 10:30 | Dynamic Multi-Scale Filters for Semantic Segmentation | Junjun He, Zhongying Deng, Yu Qiao | 4825 | |
66 | 10:30 | Online Model Distillation for Efficient Video Inference | Ravi Teja Mullapudi, Steven Chen, Keyi Zhang, Deva Ramanan, Kayvon Fatahalian | 5274 | |
Recognition | 67 | 10:30 | Rethinking Zero-Shot Learning: A Conditional Visual Classification Perspective | Kai Li, Martin Renqiang Min, Yun Fu | 5378 |
68 | 10:30 | Task-Driven Modular Networks for Zero-Shot Compositional Learning | Senthil Purushwalkam, Maximilian Nickel, Abhinav Gupta, Marc'Aurelio Ranzato | 2140 | |
69 | 10:30 | Transductive Episodic-Wise Adaptive Metric for Few-Shot Learning | Limeng Qiao, Yemin Shi, Jia Li, Yaowei Wang, Tiejun Huang, Yonghong Tian | 4534 | |
70 | 10:30 | Deep Multiple-Attribute-Perceived Network for Real-World Texture Recognition | Wei Zhai, Yang Cao, Jing Zhang, Zheng-Jun Zha | 3942 | |
71 | 10:30 | RGB-Infrared Cross-Modality Person Re-Identification via Joint Pixel and Feature Alignment | Guan'an Wang, Tianzhu Zhang, Jian Cheng, Si Liu, Yang Yang, Zengguang Hou | 354 | |
72 | 10:30 | EvalNorm: Estimating Batch Normalization Statistics for Evaluation | Saurabh Singh, Abhinav Shrivastava | 5029 | |
73 | 10:30 | Beyond Human Parts: Dual Part-Aligned Representations for Person Re-Identification | Jianyuan Guo, Yuhui Yuan, Lang Huang, Chao Zhang, Jin-Ge Yao, Kai Han | 2833 | |
74 | 10:30 | Person Search by Text Attribute Query As Zero-Shot Learning | Qi Dong, Shaogang Gong, Xiatian Zhu | 1642 | |
75 | 10:30 | Semantic-Aware Knowledge Preservation for Zero-Shot Sketch-Based Image Retrieval | Qing Liu, Lingxi Xie, Huiyu Wang, Alan L. Yuille | 2796 | |
76 | 10:30 | Active Learning for Deep Detection Neural Networks | Hamed H. Aghdam, Abel Gonzalez-Garcia, Joost van de Weijer, Antonio M. López | 4697 | |
77 | 10:30 | One-Shot Neural Architecture Search via Self-Evaluated Template Network | Xuanyi Dong, Yi Yang | 126 | |
78 | 10:30 | Batch DropBlock Network for Person Re-Identification and Beyond | Zuozhuo Dai, Mingqiang Chen, Xiaodong Gu, Siyu Zhu, Ping Tan | 3242 | |
79 | 10:30 | Omni-Scale Feature Learning for Person Re-Identification | Kaiyang Zhou, Yongxin Yang, Andrea Cavallaro, Tao Xiang | 2964 | |
80 | 10:30 | Be Your Own Teacher: Improve the Performance of Convolutional Neural Networks via Self Distillation | Linfeng Zhang, Jiebo Song, Anni Gao, Jingwei Chen, Chenglong Bao, Kaisheng Ma | 3293 | |
81 | 10:30 | Diversity With Cooperation: Ensemble Methods for Few-Shot Classification | Nikita Dvornik, Cordelia Schmid, Julien Mairal | 4097 | |
82 | 10:30 | Enhancing 2D Representation via Adjacent Views for 3D Shape Retrieval | Cheng Xu, Zhaoqun Li, Qiang Qiu, Biao Leng, Jingfei Jiang | 4428 | |
83 | 10:30 | Adversarial Fine-Grained Composition Learning for Unseen Attribute-Object Recognition | Kun Wei, Muli Yang, Hao Wang, Cheng Deng, Xianglong Liu | 175 | |
84 | 10:30 | Auto-ReID: Searching for a Part-Aware ConvNet for Person Re-Identification | Ruijie Quan, Xuanyi Dong, Yu Wu, Linchao Zhu, Yi Yang | 491 | |
85 | 10:30 | Second-Order Non-Local Attention Networks for Person Re-Identification | Bryan (Ning) Xia, Yuan Gong, Yizhe Zhang, Christian Poellabauer | 3622 | |
Segmentation, Grouping, & Shape | 86 | 10:30 | Fast Computation of Content-Sensitive Superpixels and Supervoxels Using Q-Distances | Zipeng Ye, Ran Yi, Minjing Yu, Yong-Jin Liu, Ying He | 3346 |
87 | 10:30 | Progressive-X: Efficient, Anytime, Multi-Model Fitting Algorithm | Dániel Baráth, Jiří Matas | 5018 | |
88 | 10:30 | Structured Modeling of Joint Deep Feature and Prediction Refinement for Salient Object Detection | Yingyue Xu, Dan Xu, Xiaopeng Hong, Wanli Ouyang, Rongrong Ji, Min Xu, Guoying Zhao | 5196 | |
89 | 10:30 | Selectivity or Invariance: Boundary-Aware Salient Object Detection | Jinming Su, Jia Li, Yu Zhang, Changqun Xia, Yonghong Tian | 356 | |
90 | 10:30 | Online Unsupervised Learning of the 3D Kinematic Structure of Arbitrary Rigid Bodies | Urbano Miguel Nunes, Yiannis Demiris | 6400 | |
3D From Single View & RGBD | 91 | 10:30 | Few-Shot Generalization for Single-Image 3D Reconstruction via Priors | Bram Wallace, Bharath Hariharan | 3552 |
92 | 10:30 | Digging Into Self-Supervised Monocular Depth Estimation | Clément Godard, Oisin Mac Aodha, Michael Firman, Gabriel J. Brostow | 3670 | |
93 | 10:30 | Learning Object-Specific Distance From a Monocular Image | Jing Zhu, Yi Fang | 1517 | |
94 | 10:30 | Unsupervised 3D Reconstruction Networks | Geonho Cha, Minsik Lee, Songhwai Oh | 2406 | |
95 | 10:30 | 3D Point Cloud Generative Adversarial Network Based on Tree Structured Graph Convolutions | Dong Wook Shu, Sung Woo Park, Junseok Kwon | 3923 | |
96 | 10:30 | Visualization of Convolutional Neural Networks for Monocular Depth Estimation | Junjie Hu, Yan Zhang, Takayuki Okatani | 1477 | |
97 | 10:30 | 3D-RelNet: Joint Object and Relational Network for 3D Prediction | Nilesh Kulkarni, Ishan Misra, Shubham Tulsiani, Abhinav Gupta | 3139 | |
Action & Video | 98 | 10:30 | Co-Separating Sounds of Visual Objects | Ruohan Gao, Kristen Grauman | 1597 |
99 | 10:30 | BMN: Boundary-Matching Network for Temporal Action Proposal Generation | Tianwei Lin, Xiao Liu, Xin Li, Errui Ding, Shilei Wen | 3382 | |
100 | 10:30 | Weakly Supervised Temporal Action Localization Through Contrast Based Evaluation Networks | Ziyi Liu, Le Wang, Qilin Zhang, Zhanning Gao, Zhenxing Niu, Nanning Zheng, Gang Hua | 3215 | |
101 | 10:30 | Progressive Sparse Local Attention for Video Object Detection | Chaoxu Guo, Bin Fan, Jie Gu, Qian Zhang, Shiming Xiang, Véronique Prinet, Chunhong Pan | 838 | |
102 | 10:30 | Reasoning About Human-Object Interactions Through Dual Attention Networks | Tete Xiao, Quanfu Fan, Dan Gutfreund, Mathew Monfort, Aude Oliva, Bolei Zhou | 1706 | |
103 | 10:30 | DMM-Net: Differentiable Mask-Matching Network for Video Object Segmentation | Xiaohui Zeng, Renjie Liao, Li Gu, Yuwen Xiong, Sanja Fidler, Raquel Urtasun | 2725 | |
104 | 10:30 | Asymmetric Cross-Guided Attention Network for Actor and Action Video Segmentation From Natural Language Query | Hao Wang, Cheng Deng, Junchi Yan, Dacheng Tao | 2607 | |
105 | 10:30 | AGSS-VOS: Attention Guided Single-Shot Video Object Segmentation | Huaijia Lin, Xiaojuan Qi, Jiaya Jia | 3818 | |
106 | 10:30 | Global-Local Temporal Representations for Video Person Re-Identification | Jianing Li, Jingdong Wang, Qi Tian, Wen Gao, Shiliang Zhang | 1672 | |
107 | 10:30 | AdvIT: Adversarial Frames Identifier Based on Temporal Consistency in Videos | Chaowei Xiao, Ruizhi Deng, Bo Li, Taesung Lee, Benjamin Edwards, Jinfeng Yi, Dawn Song, Mingyan Liu, Ian Molloy | 3521 | |
Motion & Tracking | 108 | 10:30 | RANet: Ranking Attention Network for Fast Video Object Segmentation | Ziqin Wang, Jun Xu, Li Liu, Fan Zhu, Ling Shao | 1434 |
109 | 10:30 | Spatial-Temporal Relation Networks for Multi-Object Tracking | Jiarui Xu, Yue Cao, Zheng Zhang, Han Hu | 3971 | |
110 | 10:30 | Bridging the Gap Between Detection and Tracking: A Unified Approach | Lianghua Huang, Xin Zhao, Kaiqi Huang | 680 | |
111 | 10:30 | Learning the Model Update for Siamese Trackers | Lichao Zhang, Abel Gonzalez-Garcia, Joost van de Weijer, Martin Danelljan, Fahad Shahbaz Khan | 3123 | |
112 | 10:30 | Fast-deepKCF Without Boundary Effect | Linyu Zheng, Ming Tang, Yingying Chen, Jinqiao Wang, Hanqing Lu | 2169 | |
Computational Photography & Graphics | 113 | 10:30 | Program-Guided Image Manipulators | Jiayuan Mao, Xiuming Zhang, Yikai Li, William T. Freeman, Joshua B. Tenenbaum, Jiajun Wu | 1538 |
114 | 10:30 | Calibration of Axial Fisheye Cameras Through Generic Virtual Central Models | Pierre-André Brousseau, Sébastien Roy | 5053 | |
115 | 10:30 | Micro-Baseline Structured Light | Vishwanath Saragadam, Jian Wang, Mohit Gupta, Shree Nayar | 1639 | |
116 | 10:30 | l-Net: Reconstruct Hyperspectral Images From a Snapshot Measurement | Xin Miao, Xin Yuan, Yunchen Pu, Vassilis Athitsos | 3136 | |
117 | 10:30 | Deep Depth From Aberration Map | Masako Kashiwagi, Nao Mishima, Tatsuo Kozakaya, Shinsaku Hiura | 3897 | |
118 | 10:30 | A Dataset of Multi-Illumination Images in the Wild | Lukas Murmann, Michaël Gharbi, Miika Aittala, Frédo Durand | 3059 | |
119 | 10:30 | Monocular Neural Image Based Rendering With Continuous View Control | Xu Chen, Jie Song, Otmar Hilliges | 2660 | |
120 | 10:30 | Multi-View Image Fusion | Marc Comino Trinidad, Ricardo Martin Brualla, Florian Kainz, Janne Kontkanen | 4176 | |
Low-Level & Optimization | 121 | 10:30 | Enhancing Low Light Videos by Exploring High Sensitivity Camera Noise | Wei Wang, Xin Chen, Cheng Yang, Xiang Li, Xuemei Hu, Tao Yue | 5457 |
122 | 10:30 | Deep Restoration of Vintage Photographs From Scanned Halftone Prints | Qifan Gao, Xiao Shu, Xiaolin Wu | 5301 | |
123 | 10:30 | Context-Aware Image Matting for Simultaneous Foreground and Alpha Estimation | Qiqi Hou, Feng Liu | 93 | |
124 | 10:30 | CFSNet: Toward a Controllable Feature Space for Image Restoration | Wei Wang, Ruiming Guo, Yapeng Tian, Wenming Yang | 3439 | |
125 | 10:30 | Deep Blind Hyperspectral Image Fusion | Wu Wang, Weihong Zeng, Yue Huang, Xinghao Ding, John Paisley | 3787 | |
126 | 10:30 | Fully Convolutional Pixel Adaptive Image Denoiser | Sungmin Cha, Taesup Moon | 2608 | |
127 | 10:30 | Coherent Semantic Attention for Image Inpainting | Hongyu Liu, Bin Jiang, Yi Xiao, Chao Yang | 4737 | |
128 | 10:30 | Embedded Block Residual Network: A Recursive Restoration Model for Single-Image Super-Resolution | Yajun Qiu, Ruxin Wang, Dapeng Tao, Jun Cheng | 4785 | |
129 | 10:30 | Fast Image Restoration With Multi-Bin Trainable Linear Units | Shuhang Gu, Wen Li, Luc Van Gool, Radu Timofte | 4890 | |
Scene Understanding | 130 | 10:30 | Counting With Focus for Free | Zenglin Shi, Pascal Mettes, Cees G. M. Snoek | 1042 |
131 | 10:30 | SynDeMo: Synergistic Deep Feature Alignment for Joint Learning of Depth and Ego-Motion | Behzad Bozorgtabar, Mohammad Saeed Rad, Dwarikanath Mahapatra, Jean-Philippe Thiran | 3114 | |
132 | 10:30 | Diverse Image Synthesis From Semantic Layouts via Conditional IMLE | Ke Li, Tianhao Zhang, Jitendra Malik | 3807 | |
133 | 10:30 | Towards Bridging Semantic Gap to Improve Semantic Segmentation | Yanwei Pang, Yazhao Li, Jianbing Shen, Ling Shao | 818 | |
Language & Reasoning | 134 | 10:30 | Generating Diverse and Descriptive Image Captions Using Visual Paraphrases | Lixin Liu, Jiajun Tang, Xiaojun Wan, Zongming Guo | 4530 |
135 | 10:30 | Learning to Collocate Neural Modules for Image Captioning | Xu Yang, Hanwang Zhang, Jianfei Cai | 3394 | |
136 | 10:30 | Sequential Latent Spaces for Modeling the Intention During Diverse Image Captioning | Jyoti Aneja, Harsh Agrawal, Dhruv Batra, Alexander Schwing | 6554 | |
137 | 10:30 | Why Does a Visual Question Have Different Answers? | Nilavra Bhattacharya, Qing Li, Danna Gurari | 1111 | |
138 | 10:30 | G3raphGround: Graph-Based Language Grounding | Mohit Bajaj, Lanjun Wang, Leonid Sigal | 4410 | |
139 | 10:30 | Scene Text Visual Question Answering | Ali Furkan Biten, Rubèn Tito, Andrés Mafla, Lluis Gomez, Marçal Rusiñol, Ernest Valveny, C.V. Jawahar, Dimosthenis Karatzas | 5810 | |
140 | 10:30 | Compositional Video Prediction | Yufei Ye, Maneesh Singh, Abhinav Gupta, Shubham Tulsiani | 3120 | |
3D From Multiview & Sensors | 141 | 10:30 | Unsupervised Collaborative Learning of Keyframe Detection and Visual Odometry Towards Monocular Deep SLAM | Lu Sheng, Dan Xu, Wanli Ouyang, Xiaogang Wang | 2517 |
142 | 10:30 | MVSCRF: Learning Multi-View Stereo With Conditional Random Fields | Youze Xue, Jiansheng Chen, Weitao Wan, Yiqing Huang, Cheng Yu, Tianpeng Li, Jiayu Bao | 5280 | |
143 | 10:30 | Neural-Guided RANSAC: Learning Where to Sample Model Hypotheses | Eric Brachmann, Carsten Rother | 4033 | |
144 | 10:30 | Efficient Learning on Point Clouds With Basis Point Sets | Sergey Prokudin, Christoph Lassner, Javier Romero | 6267 | |
145 | 10:30 | Cross View Fusion for 3D Human Pose Estimation | Haibo Qiu, Chunyu Wang, Jingdong Wang, Naiyan Wang, Wenjun Zeng | 732 | |
146 | 10:30 | Shape-Aware Human Pose and Shape Reconstruction Using Multi-View Images | Junbang Liang, Ming C. Lin | 1779 | |
147 | 10:30 | Monocular Piecewise Depth Estimation in Dynamic Scenes by Exploiting Superpixel Relations | Yan Di, Henrique Morimitsu, Shan Gao, Xiangyang Ji | 367 | |
148 | 10:30 | Is This the Right Place? Geometric-Semantic Pose Verification for Indoor Visual Localization | Hajime Taira, Ignacio Rocco, Jiri Sedlar, Masatoshi Okutomi, Josef Sivic, Tomas Pajdla, Torsten Sattler, Akihiko Torii | 4509 | |
149 | 10:30 | DeepPruner: Learning Efficient Stereo Matching via Differentiable PatchMatch | Shivam Duggal, Shenlong Wang, Wei-Chiu Ma, Rui Hu, Raquel Urtasun | 2740 | |
Image & Video Synthesis | 150 | 10:30 | Convolutional Sequence Generation for Skeleton-Based Action Synthesis | Sijie Yan, Zhizhong Li, Yuanjun Xiong, Huahan Yan, Dahua Lin | 508 |
151 | 10:30 | Onion-Peel Networks for Deep Video Completion | Seoung Wug Oh, Sungho Lee, Joon-Young Lee, Seon Joo Kim | 2840 | |
152 | 10:30 | Copy-and-Paste Networks for Deep Video Inpainting | Sungho Lee, Seoung Wug Oh, DaeYeun Won, Seon Joo Kim | 2855 | |
153 | 10:30 | Content and Style Disentanglement for Artistic Style Transfer | Dmytro Kotovenko, Artsiom Sanakoyeu, Sabine Lang, Björn Ommer | 2684 |
Thursday, October 31, 2019, 0900–1030 Oral 3.1A (Hall D1) Ming-Yu Liu (NVIDIA), Eli Shechtman (Adobe Research) | |||||
---|---|---|---|---|---|
Session Title/Poster Group | Poster # | Presentation Time | Title | Author(s) | Paper ID |
Generative Modeling & Synthesis | 1 | 09:00 | Image2StyleGAN: How to Embed Images Into the StyleGAN Latent Space? [Video] | Rameen Abdal, Yipeng Qin, Peter Wonka | 3080 |
2 | 09:05 | Controllable Artistic Text Style Transfer via Shape-Matching GAN [Video] | Shuai Yang, Zhangyang Wang, Zhaowen Wang, Ning Xu, Jiaying Liu, Zongming Guo | 30 | |
3 | 09:10 | Understanding Generalized Whitening and Coloring Transform for Universal Style Transfer [Video] | Tai-Yin Chiu | 5752 | |
4 | 09:18 | Learning Implicit Generative Models by Matching Perceptual Features [Video] | Cicero Nogueira dos Santos, Youssef Mroueh, Inkit Padhi, Pierre Dognin | 5968 | |
5 | 09:23 | Free-Form Image Inpainting With Gated Convolution [Video] | Jiahui Yu, Zhe Lin, Jimei Yang, Xiaohui Shen, Xin Lu, Thomas S. Huang | 905 | |
6 | 09:28 | FiNet: Compatible and Diverse Fashion Image Inpainting [Video] | Xintong Han, Zuxuan Wu, Weilin Huang, Matthew R. Scott, Larry S. Davis | 1963 | |
7 | 09:36 | InGAN: Capturing and Retargeting the “DNA” of a Natural Image [Video] | Assaf Shocher, Shai Bagon, Phillip Isola, Michal Irani | 1017 | |
8 | 09:41 | Seeing What a GAN Cannot Generate [Video] | David Bau, Jun-Yan Zhu, Jonas Wulff, William Peebles, Hendrik Strobelt, Bolei Zhou, Antonio Torralba | 5158 | |
9 | 09:46 | COCO-GAN: Generation by Parts via Conditional Coordinating [Video] | Chieh Hubert Lin, Chia-Che Chang, Yu-Sheng Chen, Da-Cheng Juan, Wei Wei, Hwann-Tzong Chen | 229 | |
10 | 09:54 | Neural Turtle Graphics for Modeling City Road Layouts [Video] | Hang Chu, Daiqing Li, David Acuna, Amlan Kar, Maria Shugrina, Xinkai Wei, Ming-Yu Liu, Antonio Torralba, Sanja Fidler | 1223 | |
11 | 09:59 | Texture Fields: Learning Texture Representations in Function Space [Video] | Michael Oechsle, Lars Mescheder, Michael Niemeyer, Thilo Strauss, Andreas Geiger | 5950 | |
12 | 10:04 | PointFlow: 3D Point Cloud Generation With Continuous Normalizing Flows [Video] | Guandao Yang, Xun Huang, Zekun Hao, Ming-Yu Liu, Serge Belongie, Bharath Hariharan | 59 | |
13 | 10:12 | Meta-Sim: Learning to Generate Synthetic Datasets [Video] | Amlan Kar, Aayush Prakash, Ming-Yu Liu, Eric Cameracci, Justin Yuan, Matt Rusiniak, David Acuna, Antonio Torralba, Sanja Fidler | 3463 | |
14 | 10:17 | Specifying Object Attributes and Relations in Interactive Scene Generation [Video] | Oron Ashual, Lior Wolf | 2134 | |
15 | 10:22 | SinGAN: Learning a Generative Model From a Single Natural Image [Video] | Tamar Rott Shaham, Tali Dekel, Tomer Michaeli | 2245 |
Thursday, October 31, 2019, 0900–1030 Oral 3.1B (Hall D2) Gunhee Kim (Seoul National Univ.), Vicente Ordonez (Univ. of Virginia) | |||||
---|---|---|---|---|---|
Session Title/Poster Group | Poster # | Presentation Time | Title | Author(s) | Paper ID |
Vision, Language, & Text | 16 | 09:00 | VaTeX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research [Video] | Xin Wang, Jiawei Wu, Junkun Chen, Lei Li, Yuan-Fang Wang, William Yang Wang | 1135 |
17 | 09:05 | A Graph-Based Framework to Bridge Movies and Synopses [Video] | Yu Xiong, Qingqiu Huang, Lingfeng Guo, Hang Zhou, Bolei Zhou, Dahua Lin | 385 | |
18 | 09:10 | From Strings to Things: Knowledge-Enabled VQA Model That Can Read and Reason [Video] | Ajeet Kumar Singh, Anand Mishra, Shashank Shekhar, Anirban Chakraborty | 6036 | |
19 | 09:18 | Counterfactual Critic Multi-Agent Training for Scene Graph Generation [Video] | Long Chen, Hanwang Zhang, Jun Xiao, Xiangnan He, Shiliang Pu, Shih-Fu Chang | 1565 | |
20 | 09:23 | Robust Change Captioning [Video] | Dong Huk Park, Trevor Darrell, Anna Rohrbach | 5135 | |
21 | 09:28 | Attention on Attention for Image Captioning [Video] | Lun Huang, Wenmin Wang, Jie Chen, Xiao-Yong Wei | 5657 | |
22 | 09:36 | Dynamic Graph Attention for Referring Expression Comprehension [Video] | Sibei Yang, Guanbin Li, Yizhou Yu | 3058 | |
23 | 09:41 | Visual Semantic Reasoning for Image-Text Matching [Video] | Kunpeng Li, Yulun Zhang, Kai Li, Yuanyuan Li, Yun Fu | 4211 | |
24 | 09:46 | Phrase Localization Without Paired Training Examples [Video] | Josiah Wang, Lucia Specia | 4024 | |
25 | 09:54 | Learning to Assemble Neural Module Tree Networks for Visual Grounding [Video] | Daqing Liu, Hanwang Zhang, Feng Wu, Zheng-Jun Zha | 4090 | |
26 | 09:59 | A Fast and Accurate One-Stage Approach to Visual Grounding [Video] | Zhengyuan Yang, Boqing Gong, Liwei Wang, Wenbing Huang, Dong Yu, Jiebo Luo | 4151 | |
27 | 10:04 | Zero-Shot Grounding of Objects From Natural Language Queries [Video] | Arka Sadhu, Kan Chen, Ram Nevatia | 1092 | |
28 | 10:12 | Towards Unconstrained End-to-End Text Spotting [Video] | Siyang Qin, Alessandro Bissacco, Michalis Raptis, Yasuhisa Fujii, Ying Xiao | 5112 | |
29 | 10:17 | What Is Wrong With Scene Text Recognition Model Comparisons? Dataset and Model Analysis [Video] | Jeonghun Baek, Geewook Kim, Junyeop Lee, Sungrae Park, Dongyoon Han, Sangdoo Yun, Seong Joon Oh, Hwalsuk Lee | 5725 |
Thursday, October 31, 2019, 1030–1300 Poster 3.1 (Hall B) | |||||
---|---|---|---|---|---|
Session Title/Poster Group | Poster # | Presentation Time | Title | Author(s) | Paper ID |
Deep Learning | 30 | 10:30 | Sparse and Imperceivable Adversarial Attacks | Francesco Croce, Matthias Hein | 6410 |
31 | 10:30 | Enhancing Adversarial Example Transferability With an Intermediate Level Attack | Qian Huang, Isay Katsman, Horace He, Zeqi Gu, Serge Belongie, Ser-Nam Lim | 5296 | |
32 | 10:30 | Implicit Surface Representations As Layers in Neural Networks | Mateusz Michalkiewicz, Jhony K. Pontes, Dominic Jack, Mahsa Baktashmotlagh, Anders Eriksson | 4370 | |
33 | 10:30 | A Tour of Convolutional Networks Guided by Linear Interpreters | Pablo Navarrete Michelini, Hanwen Liu, Yunhua Lu, Xingqun Jiang | 1827 | |
34 | 10:30 | Small Steps and Giant Leaps: Minimal Newton Solvers for Deep Learning | João F. Henriques, Sebastien Ehrhardt, Samuel Albanie, Andrea Vedaldi | 5148 | |
35 | 10:30 | Semantic Adversarial Attacks: Parametric Transformations That Fool Deep Classifiers | Ameya Joshi, Amitangshu Mukherjee, Soumik Sarkar, Chinmay Hegde | 5195 | |
36 | 10:30 | Hilbert-Based Generative Defense for Adversarial Examples | Yang Bai, Yan Feng, Yisen Wang, Tao Dai, Shu-Tao Xia, Yong Jiang | 5540 | |
37 | 10:30 | On the Efficacy of Knowledge Distillation | Jang Hyun Cho, Bharath Hariharan | 5344 | |
38 | 10:30 | Sym-Parameterized Dynamic Inference for Mixed-Domain Image Translation | Simyung Chang, SeongUk Park, John Yang, Nojun Kwak | 3919 | |
39 | 10:30 | Better and Faster: Exponential Loss for Image Patch Matching | Shuang Wang, Yanfeng Li, Xuefeng Liang, Dou Quan, Bowu Yang, Shaowei Wei, Licheng Jiao | 4903 | |
40 | 10:30 | Physical Adversarial Textures That Fool Visual Object Tracking | Rey Reza Wiyatno, Anqi Xu | 5323 | |
41 | 10:30 | Wasserstein GAN With Quadratic Transport Cost | Huidong Liu, Xianfeng Gu, Dimitris Samaras | 6026 | |
42 | 10:30 | Scalable Verified Training for Provably Robust Image Classification | Sven Gowal, Krishnamurthy (Dj) Dvijotham, Robert Stanforth, Rudy Bunel, Chongli Qin, Jonathan Uesato, Relja Arandjelović, Timothy Mann, Pushmeet Kohli | 4660 | |
43 | 10:30 | Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks | Ruihao Gong, Xianglong Liu, Shenghu Jiang, Tianxiang Li, Peng Hu, Jiazhen Lin, Fengwei Yu, Junjie Yan | 4246 | |
44 | 10:30 | The LogBarrier Adversarial Attack: Making Effective Use of Decision Boundary Information | Chris Finlay, Aram-Alexandre Pooladian, Adam Oberman | 4144 | |
45 | 10:30 | Proximal Mean-Field for Neural Network Quantization | Thalaiyasingam Ajanthan, Puneet K. Dokania, Richard Hartley, Philip H. S. Torr | 3682 | |
46 | 10:30 | Improving Adversarial Robustness via Guided Complement Entropy | Hao-Yun Chen, Jhao-Hong Liang, Shih-Chieh Chang, Jia-Yu Pan, Yu-Ting Chen, Wei Wei, Da-Cheng Juan | 2240 | |
47 | 10:30 | A Geometry-Inspired Decision-Based Attack | Yujia Liu, Seyed-Mohsen Moosavi-Dezfooli, Pascal Frossard | 3860 | |
48 | 10:30 | Universal Perturbation Attack Against Image Retrieval | Jie Li, Rongrong Ji, Hong Liu, Xiaopeng Hong, Yue Gao, Qi Tian | 2070 | |
49 | 10:30 | Bayesian Optimized 1-Bit CNNs | Jiaxin Gu, Junhe Zhao, Xiaolong Jiang, Baochang Zhang, Jianzhuang Liu, Guodong Guo, Rongrong Ji | 1746 | |
50 | 10:30 | Rethinking ImageNet Pre-Training | Kaiming He, Ross Girshick, Piotr Dollár | 2571 | |
51 | 10:30 | Defending Against Universal Perturbations With Shared Adversarial Training | Chaithanya Kumar Mummadi, Thomas Brox, Jan Hendrik Metzen | 5651 | |
52 | 10:30 | Adaptive Activation Thresholding: Dynamic Routing Type Behavior for Interpretability in Convolutional Neural Networks | Yiyou Sun, Sathya N. Ravi, Vikas Singh | 1093 | |
53 | 10:30 | XRAI: Better Attributions Through Regions | Andrei Kapishnikov, Tolga Bolukbasi, Fernanda Viégas, Michael Terry | 6352 | |
54 | 10:30 | Guessing Smart: Biased Sampling for Efficient Black-Box Adversarial Attacks | Thomas Brunner, Frederik Diehl, Michael Truong Le, Alois Knoll | 6204 | |
Recognition | 55 | 10:30 | Mask-Guided Attention Network for Occluded Pedestrian Detection | Yanwei Pang, Jin Xie, Muhammad Haris Khan, Rao Muhammad Anwer, Fahad Shahbaz Khan, Ling Shao | 568 |
56 | 10:30 | Spectral Feature Transformation for Person Re-Identification | Chuanchen Luo, Yuntao Chen, Naiyan Wang, Zhaoxiang Zhang | 1165 | |
57 | 10:30 | Permutation-Invariant Feature Restructuring for Correlation-Aware Image Set-Based Recognition | Xiaofeng Liu, Zhenhua Guo, Site Li, Lingsheng Kong, Ping Jia, Jane You, B.V.K. Vijaya Kumar | 832 | |
58 | 10:30 | Improving Pedestrian Attribute Recognition With Weakly-Supervised Multi-Scale Attribute-Specific Localization | Chufeng Tang, Lu Sheng, Zhaoxiang Zhang, Xiaolin Hu | 2029 | |
59 | 10:30 | Correlation Congruence for Knowledge Distillation | Baoyun Peng, Xiao Jin, Jiaheng Liu, Dongsheng Li, Yichao Wu, Yu Liu, Shunfeng Zhou, Zhaoning Zhang | 1693 | |
60 | 10:30 | Dynamic Curriculum Learning for Imbalanced Data Classification | Yiru Wang, Weihao Gan, Jie Yang, Wei Wu, Junjie Yan | 4765 | |
61 | 10:30 | Video Face Clustering With Unknown Number of Clusters | Makarand Tapaswi, Marc T. Law, Sanja Fidler | 3489 | |
62 | 10:30 | Targeted Mismatch Adversarial Attack: Query With a Flower to Retrieve the Tower | Giorgos Tolias, Filip Radenovic, Ondřej Chum | 5008 | |
63 | 10:30 | Fashion++: Minimal Edits for Outfit Improvement | Wei-Lin Hsiao, Isay Katsman, Chao-Yuan Wu, Devi Parikh, Kristen Grauman | 4311 | |
64 | 10:30 | Semi-Supervised Pedestrian Instance Synthesis and Detection With Mutual Reinforcement | Si Wu, Sihao Lin, Wenhao Wu, Mohamed Azzam, Hau-San Wong | 3824 | |
65 | 10:30 | SILCO: Show a Few Images, Localize the Common Object | Tao Hu, Pascal Mettes, Jia-Hong Huang, Cees G. M. Snoek | 171 | |
66 | 10:30 | A Deep Step Pattern Representation for Multimodal Retinal Image Registration | Jimmy Addison Lee, Peng Liu, Jun Cheng, Huazhu Fu | 1378 | |
67 | 10:30 | Deep Graphical Feature Learning for the Feature Matching Problem | Zhen Zhang, Wee Sun Lee | 579 | |
68 | 10:30 | Minimum Delay Object Detection From Video | Dong Lao, Ganesh Sundaramoorthi | 2679 | |
69 | 10:30 | Learning With Average Precision: Training Image Retrieval With a Listwise Loss | Jérôme Revaud, Jon Almazán, Rafael S. Rezende, César Roberto de Souza | 4690 | |
70 | 10:30 | Learning to Find Common Objects Across Few Image Collections | Amirreza Shaban, Amir Rahimi, Shray Bansal, Stephen Gould, Byron Boots, Richard Hartley | 3345 | |
71 | 10:30 | Weakly Aligned Cross-Modal Learning for Multispectral Pedestrian Detection | Lu Zhang, Xiangyu Zhu, Xiangyu Chen, Xu Yang, Zhen Lei, Zhiyong Liu | 520 | |
72 | 10:30 | Deep Self-Learning From Noisy Labels | Jiangfan Han, Ping Luo, Xiaogang Wang | 900 | |
73 | 10:30 | DSConv: Efficient Convolution Operator | Marcelo Gennari do Nascimento, Roger Fawcett, Victor Adrian Prisacariu | 450 | |
Segmentation, Grouping, & Shape | 74 | 10:30 | Explicit Shape Encoding for Real-Time Instance Segmentation | Wenqiang Xu, Haiyang Wang, Fubo Qi, Cewu Lu | 1045 |
75 | 10:30 | IMP: Instance Mask Projection for High Accuracy Semantic Segmentation of Things | Cheng-Yang Fu, Tamara L. Berg, Alexander C. Berg | 3281 | |
76 | 10:30 | Video Instance Segmentation | Linjie Yang, Yuchen Fan, Ning Xu | 1972 | |
77 | 10:30 | Self-Supervised Difference Detection for Weakly-Supervised Semantic Segmentation | Wataru Shimoda, Keiji Yanai | 6886 | |
78 | 10:30 | SPGNet: Semantic Prediction Guidance for Scene Parsing | Bowen Cheng, Liang-Chieh Chen, Yunchao Wei, Yukun Zhu, Zilong Huang, Jinjun Xiong, Thomas S. Huang, Wen-Mei Hwu, Honghui Shi | 531 | |
79 | 10:30 | Gated-SCNN: Gated Shape CNNs for Semantic Segmentation | Towaki Takikawa, David Acuna, Varun Jampani, Sanja Fidler | 3458 | |
80 | 10:30 | DensePoint: Learning Densely Contextual Representation for Efficient Point Cloud Processing | Yongcheng Liu, Bin Fan, Gaofeng Meng, Jiwen Lu, Shiming Xiang, Chunhong Pan | 989 | |
81 | 10:30 | AMP: Adaptive Masked Proxies for Few-Shot Segmentation | Mennatullah Siam, Boris N. Oreshkin, Martin Jagersand | 4105 | |
82 | 10:30 | Universal Semi-Supervised Semantic Segmentation | Tarun Kalluri, Girish Varma, Manmohan Chandraker, C.V. Jawahar | 4481 | |
83 | 10:30 | Feature Weighting and Boosting for Few-Shot Segmentation | Khoi Nguyen, Sinisa Todorovic | 1826 | |
Statistics, Physics, Theory & Datasets | 84 | 10:30 | Accelerate Learning of Deep Hashing With Gradient Attention | Long-Kai Huang, Jianda Chen, Sinno Jialin Pan | 5705 |
85 | 10:30 | SVD: A Large-Scale Short Video Dataset for Near-Duplicate Video Retrieval | Qing-Yuan Jiang, Yi He, Gen Li, Jian Lin, Lei Li, Wu-Jun Li | 5384 | |
86 | 10:30 | Block Annotation: Better Image Annotation With Sub-Image Decomposition | Hubert Lin, Paul Upchurch, Kavita Bala | 5314 | |
87 | 10:30 | Probabilistic Deep Ordinal Regression Based on Gaussian Processes | Yanzhu Liu, Fan Wang, Adams Wai Kin Kong | 1681 | |
88 | 10:30 | Balanced Datasets Are Not Enough: Estimating and Mitigating Gender Bias in Deep Image Representations | Tianlu Wang, Jieyu Zhao, Mark Yatskar, Kai-Wei Chang, Vicente Ordonez | 3280 | |
89 | 10:30 | Teacher Guided Architecture Search | Pouya Bashivan, Mark Tensen, James J. DiCarlo | 4943 | |
3D From Single View & RGBD | 90 | 10:30 | FACSIMILE: Fast and Accurate Scans From an Image in Less Than a Second | David Smith, Matthew Loper, Xiaochen Hu, Paris Mavroidis, Javier Romero | 6506 |
91 | 10:30 | Delving Deep Into Hybrid Annotations for 3D Human Recovery in the Wild | Yu Rong, Ziwei Liu, Cheng Li, Kaidi Cao, Chen Change Loy | 2209 | |
92 | 10:30 | Human Mesh Recovery From Monocular Images via a Skeleton-Disentangled Representation | Yu Sun, Yun Ye, Wu Liu, Wenpeng Gao, Yili Fu, Tao Mei | 426 | |
93 | 10:30 | Three-D Safari: Learning to Estimate Zebra Pose, Shape, and Texture From Images “In the Wild” | Silvia Zuffi, Angjoo Kanazawa, Tanya Berger-Wolf, Michael J. Black | 6034 | |
94 | 10:30 | Object-Driven Multi-Layer Scene Decomposition From a Single Image | Helisa Dhamo, Nassir Navab, Federico Tombari | 877 | |
95 | 10:30 | Occupancy Flow: 4D Reconstruction by Learning Particle Dynamics | Michael Niemeyer, Lars Mescheder, Michael Oechsle, Andreas Geiger | 5948 | |
96 | 10:30 | Joint Monocular 3D Vehicle Detection and Tracking | Hou-Ning Hu, Qi-Zhi Cai, Dequan Wang, Ji Lin, Min Sun, Philipp Krähenbühl, Trevor Darrell, Fisher Yu | 839 | |
97 | 10:30 | FrameNet: Learning Local Canonical Frames of 3D Surfaces From a Single RGB Image | Jingwei Huang, Yichao Zhou, Thomas Funkhouser, Leonidas J. Guibas | 2149 | |
Face & Body | 98 | 10:30 | Fingerspelling Recognition in the Wild With Iterative Visual Attention | Bowen Shi, Aurora Martinez Del Rio, Jonathan Keane, Diane Brentari, Greg Shakhnarovich, Karen Livescu | 2557 |
99 | 10:30 | PointAE: Point Auto-Encoder for 3D Statistical Shape and Texture Modelling | Hang Dai, Ling Shao | 479 | |
100 | 10:30 | Multi-Garment Net: Learning to Dress 3D People From Images | Bharat Lal Bhatnagar, Garvita Tiwari, Christian Theobalt, Gerard Pons-Moll | 4019 | |
101 | 10:30 | Skeleton-Aware 3D Human Shape Reconstruction From Point Clouds | Haiyong Jiang, Jianfei Cai, Jianmin Zheng | 3988 | |
102 | 10:30 | AMASS: Archive of Motion Capture As Surface Shapes | Naureen Mahmood, Nima Ghorbani, Nikolaus F. Troje, Gerard Pons-Moll, Michael J. Black | 6302 | |
103 | 10:30 | Person-in-WiFi: Fine-Grained Person Perception Using WiFi | Fei Wang, Sanping Zhou, Stanislav Panev, Jinsong Han, Dong Huang | 2322 | |
104 | 10:30 | FAB: A Robust Facial Landmark Detection Framework for Motion-Blurred Videos | Keqiang Sun, Wayne Wu, Tinghao Liu, Shuo Yang, Quan Wang, Qiang Zhou, Zuochang Ye, Chen Qian | 778 | |
105 | 10:30 | Attentional Feature-Pair Relation Networks for Accurate Face Recognition | Bong-Nam Kang, Yonghyun Kim, Bongjin Jun, Daijin Kim | 1452 | |
106 | 10:30 | Face Alignment With Kernel Density Deep Neural Network | Lisha Chen, Hui Su, Qiang Ji | 4164 | |
Action & Video | 107 | 10:30 | Action Recognition With Spatial-Temporal Discriminative Filter Banks | Brais Martínez, Davide Modolo, Yuanjun Xiong, Joseph Tighe | 6483 |
108 | 10:30 | EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition | Evangelos Kazakos, Arsha Nagrani, Andrew Zisserman, Dima Damen | 1805 | |
109 | 10:30 | Weakly-Supervised Action Localization With Background Modeling | Phuc Xuan Nguyen, Deva Ramanan, Charless C. Fowlkes | 6496 | |
110 | 10:30 | Grouped Spatial-Temporal Aggregation for Efficient Action Recognition | Chenxu Luo, Alan L. Yuille | 2206 | |
111 | 10:30 | Temporal Structure Mining for Weakly Supervised Action Detection | Tan Yu, Zhou Ren, Yuncheng Li, Enxu Yan, Ning Xu, Junsong Yuan | 245 | |
112 | 10:30 | Temporal Recurrent Networks for Online Action Detection | Mingze Xu, Mingfei Gao, Yi-Ting Chen, Larry S. Davis, David J. Crandall | 405 | |
113 | 10:30 | StartNet: Online Detection of Action Start in Untrimmed Videos | Mingfei Gao, Mingze Xu, Larry S. Davis, Richard Socher, Caiming Xiong | 25 | |
114 | 10:30 | Video Classification With Channel-Separated Convolutional Networks | Du Tran, Heng Wang, Lorenzo Torresani, Matt Feiszli | 2739 | |
115 | 10:30 | Predicting the Future: A Jointly Learnt Model for Action Anticipation | Harshala Gammulle, Simon Denman, Sridha Sridharan, Clinton Fookes | 3197 | |
Low-Level & Optimization | 116 | 10:30 | Human-Aware Motion Deblurring | Ziyi Shen, Wenguan Wang, Xiankai Lu, Jianbing Shen, Haibin Ling, Tingfa Xu, Ling Shao | 2850 |
117 | 10:30 | Fast Video Object Segmentation via Dynamic Targeting Network | Lu Zhang, Zhe Lin, Jianming Zhang, Huchuan Lu, You He | 2065 | |
118 | 10:30 | Solving Vision Problems via Filtering | Sean I. Young, Aous T. Naman, Bernd Girod, David Taubman | 247 | |
119 | 10:30 | GAN-Based Projector for Faster Recovery With Convergence Guarantees in Linear Inverse Problems | Ankit Raj, Yuqi Li, Yoram Bresler | 6405 | |
120 | 10:30 | Scoot: A Perceptual Metric for Facial Sketches | Deng-Ping Fan, ShengChuan Zhang, Yu-Huan Wu, Yun Liu, Ming-Ming Cheng, Bo Ren, Paul L. Rosin, Rongrong Ji | 1527 | |
121 | 10:30 | Learning Filter Basis for Convolutional Neural Network Compression | Yawei Li, Shuhang Gu, Luc Van Gool, Radu Timofte | 1428 | |
122 | 10:30 | End-to-End Learning of Representations for Asynchronous Event-Based Data | Daniel Gehrig, Antonio Loquercio, Konstantinos G. Derpanis, Davide Scaramuzza | 1773 | |
123 | 10:30 | ERL-Net: Entangled Representation Learning for Single Image De-Raining | Guoqing Wang, Changming Sun, Arcot Sowmya | 5174 | |
124 | 10:30 | Perceptual Deep Depth Super-Resolution | Oleg Voynov, Alexey Artemov, Vage Egiazarian, Alexander Notchenko, Gleb Bobrovskikh, Evgeny Burnaev, Denis Zorin | 4118 | |
Scene Understanding | 125 | 10:30 | 3D Scene Graph: A Structure for Unified Semantics, 3D Space, and Camera | Iro Armeni, Zhi-Yang He, JunYoung Gwak, Amir R. Zamir, Martin Fischer, Jitendra Malik, Silvio Savarese | 6691 |
126 | 10:30 | Floorplan-Jigsaw: Jointly Estimating Scene Layout and Aligning Partial Scans | Cheng Lin, Changjian Li, Wenping Wang | 4835 | |
127 | 10:30 | Enforcing Geometric Constraints of Virtual Normal for Depth Prediction | Wei Yin, Yifan Liu, Chunhua Shen, Youliang Yan | 976 | |
128 | 10:30 | Deep Contextual Attention for Human-Object Interaction Detection | Tiancai Wang, Rao Muhammad Anwer, Muhammad Haris Khan, Fahad Shahbaz Khan, Yanwei Pang, Ling Shao, Jorma Laaksonen | 3102 | |
129 | 10:30 | Learning Compositional Neural Information Fusion for Human Parsing | Wenguan Wang, Zhijie Zhang, Siyuan Qi, Jianbing Shen, Yanwei Pang, Ling Shao | 2055 | |
130 | 10:30 | Attentional Neural Fields for Crowd Counting | Anran Zhang, Lei Yue, Jiayi Shen, Fan Zhu, Xiantong Zhen, Xianbin Cao, Ling Shao | 2223 | |
131 | 10:30 | Understanding Human Gaze Communication by Spatio-Temporal Graph Reasoning | Lifeng Fan, Wenguan Wang, Siyuan Huang, Xinyu Tang, Song-Chun Zhu | 421 | |
132 | 10:30 | Controllable Attention for Structured Layered Video Decomposition | Jean-Baptiste Alayrac, João Carreira, Relja Arandjelović, Andrew Zisserman | 4034 | |
133 | 10:30 | GANalyze: Toward Visual Definitions of Cognitive Image Properties | Lore Goetschalckx, Alex Andonian, Aude Oliva, Phillip Isola | 4167 | |
Language & Reasoning | 134 | 10:30 | Saliency-Guided Attention Network for Image-Sentence Matching | Zhong Ji, Haoran Wang, Jungong Han, Yanwei Pang | 4245 |
135 | 10:30 | CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval | Zihao Wang, Xihui Liu, Hongsheng Li, Lu Sheng, Junjie Yan, Xiaogang Wang, Jing Shao | 294 | |
136 | 10:30 | ACMM: Aligned Cross-Modal Memory for Few-Shot Image and Sentence Matching | Yan Huang, Liang Wang | 1205 | |
137 | 10:30 | Creativity Inspired Zero-Shot Learning | Mohamed Elhoseiny, Mohamed Elfeki | 2548 | |
138 | 10:30 | Generating Easy-to-Understand Referring Expressions for Target Identifications | Mikihiro Tanaka, Takayuki Itamochi, Kenichi Narioka, Ikuro Sato, Yoshitaka Ushiku, Tatsuya Harada | 3740 | |
139 | 10:30 | Language-Agnostic Visual-Semantic Embeddings | Jônatas Wehrmann, Douglas M. Souza, Maurício A. Lopes, Rodrigo C. Barros | 6209 | |
140 | 10:30 | Adversarial Representation Learning for Text-to-Image Matching | Nikolaos Sarafianos, Xiang Xu, Ioannis A. Kakadiaris | 60 | |
141 | 10:30 | Multi-Modality Latent Interaction Network for Visual Question Answering | Peng Gao, Haoxuan You, Zhanpeng Zhang, Xiaogang Wang, Hongsheng Li | 350 | |
3D From Multiview & Sensors | 142 | 10:30 | Key.Net: Keypoint Detection by Handcrafted and Learned CNN Filters | Axel Barroso-Laguna, Edgar Riba, Daniel Ponsa, Krystian Mikolajczyk | 3104 |
143 | 10:30 | Learning Two-View Correspondences and Geometry Using Order-Aware Network | Jiahui Zhang, Dawei Sun, Zixin Luo, Anbang Yao, Lei Zhou, Tianwei Shen, Yurong Chen, Long Quan, Hongen Liao | 2338 | |
144 | 10:30 | Learning Meshes for Dense Visual SLAM | Michael Bloesch, Tristan Laidlow, Ronald Clark, Stefan Leutenegger, Andrew J. Davison | 6287 | |
145 | 10:30 | EM-Fusion: Dynamic Object-Level SLAM With Probabilistic Data Association | Michael Strecke, Jörg Stückler | 5672 | |
146 | 10:30 | ClusterSLAM: A SLAM Backend for Simultaneous Rigid Body Clustering and Motion Estimation | Jiahui Huang, Sheng Yang, Zishuo Zhao, Yu-Kun Lai, Shi-Min Hu | 5594 | |
147 | 10:30 | Efficient and Robust Registration on the 3D Special Euclidean Group | Uttaran Bhattacharya, Venu Madhav Govindu | 1550 | |
148 | 10:30 | Algebraic Characterization of Essential Matrices and Their Averaging in Multiview Settings | Yoni Kasten, Amnon Geifman, Meirav Galun, Ronen Basri | 1621 | |
Image & Video Synthesis | 149 | 10:30 | Liquid Warping GAN: A Unified Framework for Human Motion Imitation, Appearance Transfer and Novel View Synthesis | Wen Liu, Zhixin Piao, Jie Min, Wenhan Luo, Lin Ma, Shenghua Gao | 4701 |
150 | 10:30 | RelGAN: Multi-Domain Image-to-Image Translation via Relative Attributes | Po-Wei Wu, Yu-Jing Lin, Che-Han Chang, Edward Y. Chang, Shih-Wei Liao | 3879 | |
151 | 10:30 | Attribute-Driven Spontaneous Motion in Unpaired Image Translation | Ruizheng Wu, Xin Tao, Xiaodong Gu, Xiaoyong Shen, Jiaya Jia | 5827 | |
152 | 10:30 | Everybody Dance Now | Caroline Chan, Shiry Ginosar, Tinghui Zhou, Alexei A. Efros | 2304 | |
153 | 10:30 | Multimodal Style Transfer via Graph Cuts | Yulun Zhang, Chen Fang, Yilin Wang, Zhaowen Wang, Zhe Lin, Yun Fu, Jimei Yang | 737 | |
154 | 10:30 | A Closed-Form Solution to Universal Style Transfer | Ming Lu, Hao Zhao, Anbang Yao, Yurong Chen, Feng Xu, Li Zhang | 464 | |
155 | 10:30 | Progressive Reconstruction of Visual Structure for Image Inpainting | Jingyuan Li, Fengxiang He, Lefei Zhang, Bo Du, Dacheng Tao | 3072 |
Thursday, October 31, 2019, 1330–1530 Oral 3.2A (Hall D1) Diane Larlus (Naver Labs Europe), David Crandall (Indiana Univ.) | |||||
---|---|---|---|---|---|
Session Title/Poster Group | Poster # | Presentation Time | Title | Author(s) | Paper ID |
Recognition, Detection, & Re-Identification | 1 | 13:30 | Variational Adversarial Active Learning [Video] | Samarth Sinha, Sayna Ebrahimi, Trevor Darrell | 6625 |
2 | 13:35 | Confidence Regularized Self-Training [Video] | Yang Zou, Zhiding Yu, Xiaofeng Liu, B.V.K. Vijaya Kumar, Jinsong Wang | 774 | |
3 | 13:40 | Anchor Loss: Modulating Loss Scale Based on Prediction Difficulty [Video] | Serim Ryou, Seong-Gyun Jeong, Pietro Perona | 3141 | |
4 | 13:48 | Local Aggregation for Unsupervised Learning of Visual Embeddings [Video] | Chengxu Zhuang, Alex Lin Zhai, Daniel Yamins | 4162 | |
5 | 13:53 | PR Product: A Substitute for Inner Product in Neural Networks [Video] | Zhennan Wang, Wenbin Zou, Chen Xu | 5532 | |
6 | 13:58 | CutMix: Regularization Strategy to Train Strong Classifiers With Localizable Features [Video] | Sangdoo Yun, Dongyoon Han, Seong Joon Oh, Sanghyuk Chun, Junsuk Choe, Youngjoon Yoo | 2927 | |
7 | 14:06 | Towards Interpretable Object Detection by Unfolding Latent Structures [Video] | Tianfu Wu, Xi Song | 6232 | |
8 | 14:11 | Scaling Object Detection by Transferring Classification Weights [Video] | Jason Kuen, Federico Perazzi, Zhe Lin, Jianming Zhang, Yap-Peng Tan | 1866 | |
9 | 14:16 | Scale-Aware Trident Networks for Object Detection [Video] | Yanghao Li, Yuntao Chen, Naiyan Wang, Zhaoxiang Zhang | 2005 | |
10 | 14:24 | Object-Aware Instance Labeling for Weakly Supervised Object Detection [Video] | Satoshi Kosugi, Toshihiko Yamasaki, Kiyoharu Aizawa | 2344 | |
11 | 14:29 | Generative Modeling for Small-Data Object Detection [Video] | Lanlan Liu, Michael Muelly, Jia Deng, Tomas Pfister, Li-Jia Li | 1983 | |
12 | 14:34 | Transductive Learning for Zero-Shot Object Detection [Video] | Shafin Rahman, Salman Khan, Nick Barnes | 2642 | |
13 | 14:42 | Self-Training and Adversarial Background Regularization for Unsupervised Domain Adaptive One-Stage Object Detection [Video] | Seunghyeon Kim, Jaehoon Choi, Taekyung Kim, Changick Kim | 3987 | |
14 | 14:47 | Memory-Based Neighbourhood Embedding for Visual Recognition [Video] | Suichan Li, Dapeng Chen, Bin Liu, Nenghai Yu, Rui Zhao | 3327 | |
15 | 14:52 | Self-Similarity Grouping: A Simple Unsupervised Cross Domain Adaptation Approach for Person Re-Identification [Video] | Yang Fu, Yunchao Wei, Guanshuo Wang, Yuqian Zhou, Honghui Shi, Thomas S. Huang | 622 | |
16 | 15:00 | Deep Reinforcement Active Learning for Human-in-the-Loop Person Re-Identification [Video] | Zimo Liu, Jingya Wang, Shaogang Gong, Huchuan Lu, Dacheng Tao | 2521 | |
17 | 15:05 | A Dual-Path Model With Adaptive Attention for Vehicle Re-Identification [Video] | Pirazh Khorramshahi, Amit Kumar, Neehar Peri, Sai Saketh Rambhatla, Jun-Cheng Chen, Rama Chellappa | 6404 | |
18 | 15:10 | Bayesian Loss for Crowd Count Estimation With Point Supervision [Video] | Zhiheng Ma, Xing Wei, Xiaopeng Hong, Yihong Gong | 380 | |
19 | 15:15 | Learning Spatial Awareness to Improve Crowd Counting [Video] | Zhi-Qi Cheng, Jun-Xiu Li, Qi Dai, Xiao Wu, Alexander G. Hauptmann | 3596 |
Thursday, October 31, 2019, 1330–1530 Oral 3.2B (Hall D2) Laura Leal-Taixé (Technische Univ. München), Bohyung Han (Seoul National Univ.) | |||||
---|---|---|---|---|---|
Session Title/Poster Group | Poster # | Presentation Time | Title | Author(s) | Paper ID |
Video & Action Understanding | 20 | 13:30 | GradNet: Gradient-Guided Network for Visual Object Tracking [Video] | Peixia Li, Boyu Chen, Wanli Ouyang, Dong Wang, Xiaoyun Yang, Huchuan Lu | 620 |
21 | 13:35 | FAMNet: Joint Learning of Feature, Affinity and Multi-Dimensional Assignment for Online Multiple Object Tracking [Video] | Peng Chu, Haibin Ling | 2798 | |
22 | 13:40 | Learning Discriminative Model Prediction for Tracking [Video] | Goutam Bhat, Martin Danelljan, Luc Van Gool, Radu Timofte | 3941 | |
23 | 13:48 | DynamoNet: Dynamic Action and Motion Network [Video] | Ali Diba, Vivek Sharma, Luc Van Gool, Rainer Stiefelhagen | 4 | |
24 | 13:53 | SlowFast Networks for Video Recognition [Video] | Christoph Feichtenhofer, Haoqi Fan, Jitendra Malik, Kaiming He | 720 | |
25 | 13:58 | Generative Multi-View Human Action Recognition [Video] | Lichen Wang, Zhengming Ding, Zhiqiang Tao, Yunyu Liu, Yun Fu | 3326 | |
26 | 14:06 | Multi-Agent Reinforcement Learning Based Frame Sampling for Effective Untrimmed Video Recognition [Video] | Wenhao Wu, Dongliang He, Xiao Tan, Shifeng Chen, Shilei Wen | 800 | |
27 | 14:11 | SCSampler: Sampling Salient Clips From Video for Efficient Action Recognition [Video] | Bruno Korbar, Du Tran, Lorenzo Torresani | 5172 | |
28 | 14:16 | Weakly Supervised Energy-Based Learning for Action Segmentation [Video] | Jun Li, Peng Lei, Sinisa Todorovic | 1820 | |
29 | 14:24 | What Would You Expect? Anticipating Egocentric Actions With Rolling-Unrolling LSTMs and Modality Attention [Video] | Antonino Furnari, Giovanni Maria Farinella | 4092 | |
30 | 14:29 | PIE: A Large-Scale Dataset and Models for Pedestrian Intention Estimation and Trajectory Prediction [Video] | Amir Rasouli, Iuliia Kotseruba, Toni Kunic, John K. Tsotsos | 1650 | |
31 | 14:34 | STGAT: Modeling Spatial-Temporal Interactions for Human Trajectory Prediction [Video] | Yingfan Huang, Huikun Bi, Zhaoxin Li, Tianlu Mao, Zhaoqi Wang | 1174 | |
32 | 14:42 | Learning Motion in Feature Space: Locally-Consistent Deformable Convolution Networks for Fine-Grained Action Detection [Video] | Khoi-Nguyen C. Mac, Dhiraj Joshi, Raymond A. Yeh, Jinjun Xiong, Rogerio S. Feris, Minh N. Do | 3133 | |
33 | 14:47 | Dual Attention Matching for Audio-Visual Event Localization [Video] | Yu Wu, Linchao Zhu, Yan Yan, Yi Yang | 3016 | |
34 | 14:52 | Uncertainty-Aware Audiovisual Activity Recognition Using Deep Bayesian Variational Inference [Video] | Mahesh Subedar, Ranganath Krishnan, Paulo Lopez Meyer, Omesh Tickoo, Jonathan Huang | 6568 | |
35 | 15:00 | Non-Local Recurrent Neural Memory for Supervised Sequence Modeling [Video] | Canmiao Fu, Wenjie Pei, Qiong Cao, Chaopeng Zhang, Yong Zhao, Xiaoyong Shen, Yu-Wing Tai | 4506 | |
36 | 15:05 | Temporal Attentive Alignment for Large-Scale Video Domain Adaptation [Video] | Min-Hung Chen, Zsolt Kira, Ghassan AlRegib, Jaekwon Yoo, Ruxin Chen, Jian Zheng | 5437 | |
37 | 15:10 | Action Assessment by Joint Relation Graphs [Video] | Jia-Hui Pan, Jibin Gao, Wei-Shi Zheng | 1277 | |
38 | 15:18 | Unsupervised Procedure Learning via Joint Dynamic Summarization [Video] | Ehsan Elhamifar, Zwe Naing | 1652 | |
39 | 15:23 | ViSiL: Fine-Grained Spatio-Temporal Video Similarity Learning [Video] | Giorgos Kordopatis-Zilos, Symeon Papadopoulos, Ioannis Patras, Ioannis Kompatsiaris | 2282 |
Thursday, October 31, 2019, 1530–1800 Poster 3.2 (Hall B) | |||||
---|---|---|---|---|---|
Session Title/Poster Group | Poster # | Presentation Time | Title | Author(s) | Paper ID |
Deep Learning | 40 | 15:30 | Unsupervised Learning of Landmarks by Descriptor Vector Exchange | James Thewlis, Samuel Albanie, Hakan Bilen, Andrea Vedaldi | 4870 |
41 | 15:30 | Learning Compositional Representations for Few-Shot Recognition | Pavel Tokmakov, Yu-Xiong Wang, Martial Hebert | 1982 | |
42 | 15:30 | Spectral Regularization for Combating Mode Collapse in GANs | Kanglin Liu, Wenming Tang, Fei Zhou, Guoping Qiu | 4768 | |
43 | 15:30 | Scaling and Benchmarking Self-Supervised Visual Representation Learning | Priya Goyal, Dhruv Mahajan, Abhinav Gupta, Ishan Misra | 1542 | |
44 | 15:30 | Learning an Effective Equivariant 3D Descriptor Without Supervision | Riccardo Spezialetti, Samuele Salti, Luigi Di Stefano | 6132 | |
45 | 15:30 | KPConv: Flexible and Deformable Convolution for Point Clouds | Hugues Thomas, Charles R. Qi, Jean-Emmanuel Deschaud, Beatriz Marcotegui, François Goulette, Leonidas J. Guibas | 5957 | |
46 | 15:30 | Neural Inter-Frame Compression for Video Coding | Abdelaziz Djelouah, Joaquim Campos, Simone Schaub-Meyer, Christopher Schroers | 3869 | |
47 | 15:30 | Task2Vec: Task Embedding for Meta-Learning | Alessandro Achille, Michael Lam, Rahul Tewari, Avinash Ravichandran, Subhransu Maji, Charless C. Fowlkes, Stefano Soatto, Pietro Perona | 5171 | |
48 | 15:30 | Deep Clustering by Gaussian Mixture Variational Autoencoders With Graph Embedding | Linxiao Yang, Ngai-Man Cheung, Jiaying Li, Jun Fang | 4343 | |
49 | 15:30 | SoftTriple Loss: Deep Metric Learning Without Triplet Sampling | Qi Qian, Lei Shang, Baigui Sun, Juhua Hu, Hao Li, Rong Jin | 5183 | |
50 | 15:30 | A Weakly Supervised Fine Label Classifier Enhanced by Coarse Supervision | Fariborz Taherkhani, Hadi Kazemi, Ali Dabouei, Jeremy Dawson, Nasser M. Nasrabadi | 4122 | |
51 | 15:30 | Gaussian Affinity for Max-Margin Class Imbalanced Learning | Munawar Hayat, Salman Khan, Syed Waqas Zamir, Jianbing Shen, Ling Shao | 4032 | |
52 | 15:30 | AttPool: Towards Hierarchical Feature Representation in Graph Convolutional Networks via Attention Mechanism | Jingjia Huang, Zhangheng Li, Nannan Li, Shan Liu, Ge Li | 4346 | |
53 | 15:30 | Deep Metric Learning With Tuplet Margin Loss | Baosheng Yu, Dacheng Tao | 1705 | |
54 | 15:30 | Normalized Wasserstein for Mixture Distributions With Applications in Adversarial Learning and Domain Adaptation | Yogesh Balaji, Rama Chellappa, Soheil Feizi | 4881 | |
55 | 15:30 | Fast and Practical Neural Architecture Search | Jiequan Cui, Pengguang Chen, Ruiyu Li, Shu Liu, Xiaoyong Shen, Jiaya Jia | 2012 | |
56 | 15:30 | Symmetric Graph Convolutional Autoencoder for Unsupervised Graph Representation Learning | Jiwoong Park, Minsik Lee, Hyung Jin Chang, Kyuewang Lee, Jin Young Choi | 735 | |
57 | 15:30 | Deep Elastic Networks With Model Selection for Multi-Task Learning | Chanho Ahn, Eunwoo Kim, Songhwai Oh | 2097 | |
58 | 15:30 | Metric Learning With HORDE: High-Order Regularizer for Deep Embeddings | Pierre Jacob, David Picard, Aymeric Histace, Edouard Klein | 1412 | |
59 | 15:30 | Adversarial Learning With Margin-Based Triplet Embedding Regularization | Yaoyao Zhong, Weihong Deng | 4562 | |
Recognition | 60 | 15:30 | Simultaneous Multi-View Instance Detection With Learned Geometric Soft-Constraints | Ahmed Samy Nassar, Sébastien Lefèvre, Jan Dirk Wegner | 3964 |
61 | 15:30 | CenterNet: Keypoint Triplets for Object Detection | Kaiwen Duan, Song Bai, Lingxi Xie, Honggang Qi, Qingming Huang, Qi Tian | 770 | |
62 | 15:30 | Online Hyper-Parameter Learning for Auto-Augmentation Strategy | Chen Lin, Minghao Guo, Chuming Li, Xin Yuan, Wei Wu, Junjie Yan, Dahua Lin, Wanli Ouyang | 4042 | |
63 | 15:30 | DANet: Divergent Activation for Weakly Supervised Object Localization | Haolan Xue, Chang Liu, Fang Wan, Jianbin Jiao, Xiangyang Ji, Qixiang Ye | 302 | |
64 | 15:30 | Selective Sparse Sampling for Fine-Grained Image Recognition | Yao Ding, Yanzhao Zhou, Yi Zhu, Qixiang Ye, Jianbin Jiao | 213 | |
65 | 15:30 | Dynamic Anchor Feature Selection for Single-Shot Object Detection | Shuai Li, Lingxiao Yang, Jianqiang Huang, Xian-Sheng Hua, Lei Zhang | 4798 | |
66 | 15:30 | Incremental Learning Using Conditional Adversarial Networks | Ye Xiang, Ying Fu, Pan Ji, Hua Huang | 5393 | |
67 | 15:30 | Bilateral Adversarial Training: Towards Fast Training of More Robust Models Against Adversarial Attacks | Jianyu Wang, Haichao Zhang | 3233 | |
68 | 15:30 | View Confusion Feature Learning for Person Re-Identification | Fangyi Liu, Lei Zhang | 5413 | |
69 | 15:30 | Auto-FPN: Automatic Network Architecture Adaptation for Object Detection Beyond Classification | Hang Xu, Lewei Yao, Wei Zhang, Xiaodan Liang, Zhenguo Li | 4453 | |
70 | 15:30 | PARN: Position-Aware Relation Networks for Few-Shot Learning | Ziyang Wu, Yuwei Li, Lihua Guo, Kui Jia | 2498 | |
71 | 15:30 | Multi-Adversarial Faster-RCNN for Unrestricted Object Detection | Zhenwei He, Lei Zhang | 5292 | |
72 | 15:30 | Object Guided External Memory Network for Video Object Detection | Hanming Deng, Yang Hua, Tao Song, Zongpu Zhang, Zhengui Xue, Ruhui Ma, Neil Robertson, Haibing Guan | 3352 | |
73 | 15:30 | An Empirical Study of Spatial Attention Mechanisms in Deep Networks | Xizhou Zhu, Dazhi Cheng, Zheng Zhang, Stephen Lin, Jifeng Dai | 3729 | |
74 | 15:30 | Attribute Attention for Semantic Disambiguation in Zero-Shot Learning | Yang Liu, Jishun Guo, Deng Cai, Xiaofei He | 141 | |
75 | 15:30 | CIIDefence: Defeating Adversarial Attacks by Fusing Class-Specific Image Inpainting and Image Denoising | Puneet Gupta, Esa Rahtu | 4848 | |
76 | 15:30 | ThunderNet: Towards Real-Time Generic Object Detection on Mobile Devices | Zheng Qin, Zeming Li, Zhaoning Zhang, Yiping Bao, Gang Yu, Yuxing Peng, Jian Sun | 1142 | |
77 | 15:30 | Dual Student: Breaking the Limits of the Teacher in Semi-Supervised Learning | Zhanghan Ke, Daoye Wang, Qiong Yan, Jimmy Ren, Rynson W.H. Lau | 2205 | |
78 | 15:30 | MVP Matching: A Maximum-Value Perfect Matching for Mining Hard Samples, With Application to Person Re-Identification | Han Sun, Zhiyuan Chen, Shiyang Yan, Lin Xu | 678 | |
Segmentation, Grouping, & Shape | 79 | 15:30 | Adaptive Context Network for Scene Parsing | Jun Fu, Jing Liu, Yuhang Wang, Yong Li, Yongjun Bao, Jinhui Tang, Hanqing Lu | 1772 |
80 | 15:30 | Constructing Self-Motivated Pyramid Curriculums for Cross-Domain Semantic Segmentation: A Non-Adversarial Approach | Qing Lian, Fengmao Lv, Lixin Duan, Boqing Gong | 2179 | |
81 | 15:30 | SparseMask: Differentiable Connectivity Learning for Dense Image Prediction | Huikai Wu, Junge Zhang, Kaiqi Huang | 711 | |
82 | 15:30 | Significance-Aware Information Bottleneck for Domain Adaptive Semantic Segmentation | Yawei Luo, Ping Liu, Tao Guan, Junqing Yu, Yi Yang | 1197 | |
83 | 15:30 | Relational Attention Network for Crowd Counting | Anran Zhang, Jiayi Shen, Zehao Xiao, Fan Zhu, Xiantong Zhen, Xianbin Cao, Ling Shao | 888 | |
84 | 15:30 | ACFNet: Attentional Class Feature Network for Semantic Segmentation | Fan Zhang, Yanqin Chen, Zhihang Li, Zhibin Hong, Jingtuo Liu, Feifei Ma, Junyu Han, Errui Ding | 3380 | |
85 | 15:30 | Frame-to-Frame Aggregation of Active Regions in Web Videos for Weakly Supervised Semantic Segmentation | Jungbeom Lee, Eunji Kim, Sungmin Lee, Jangho Lee, Sungroh Yoon | 2336 | |
86 | 15:30 | Boundary-Aware Feature Propagation for Scene Segmentation | Henghui Ding, Xudong Jiang, Ai Qun Liu, Nadia Magnenat Thalmann, Gang Wang | 636 | |
87 | 15:30 | Self-Ensembling With GAN-Based Data Augmentation for Domain Adaptation in Semantic Segmentation | Jaehoon Choi, Taekyung Kim, Changick Kim | 3992 | |
3D From Single View & RGBD | 88 | 15:30 | Explaining the Ambiguity of Object Detection and 6D Pose From Visual Data | Fabian Manhardt, Diego Martín Arroyo, Christian Rupprecht, Benjamin Busam, Tolga Birdal, Nassir Navab, Federico Tombari | 1768 |
89 | 15:30 | Accurate Monocular 3D Object Detection via Color-Embedded 3D Reconstruction for Autonomous Driving | Xinzhu Ma, Zhihui Wang, Haojie Li, Pengbo Zhang, Wanli Ouyang, Xin Fan | 4154 | |
90 | 15:30 | MonoLoco: Monocular 3D Pedestrian Localization and Uncertainty Estimation | Lorenzo Bertoni, Sven Kreiss, Alexandre Alahi | 6529 | |
91 | 15:30 | Unsupervised High-Resolution Depth Learning From Videos With Dual Networks | Junsheng Zhou, Yuwang Wang, Kaihuai Qin, Wenjun Zeng | 1700 | |
Face & Body | 92 | 15:30 | Bayesian Graph Convolution LSTM for Skeleton Based Action Recognition | Rui Zhao, Kang Wang, Hui Su, Qiang Ji | 6752 |
93 | 15:30 | DeCaFA: Deep Convolutional Cascade for Face Alignment in the Wild | Arnaud Dapogny, Kevin Bailly, Matthieu Cord | 1630 | |
94 | 15:30 | Probabilistic Face Embeddings | Yichun Shi, Anil K. Jain | 6328 | |
95 | 15:30 | Gaze360: Physically Unconstrained Gaze Estimation in the Wild | Petr Kellnhofer, Adrià Recasens, Simon Stent, Wojciech Matusik, Antonio Torralba | 4341 | |
96 | 15:30 | Unsupervised Person Re-Identification by Camera-Aware Similarity Consistency Learning | Ancong Wu, Wei-Shi Zheng, Jian-Huang Lai | 6696 | |
97 | 15:30 | Photo-Realistic Monocular Gaze Redirection Using Generative Adversarial Networks | Zhe He, Adrian Spurr, Xucong Zhang, Otmar Hilliges | 5925 | |
98 | 15:30 | Dynamic Kernel Distillation for Efficient Pose Estimation in Videos | Xuecheng Nie, Yuncheng Li, Linjie Luo, Ning Zhang, Jiashi Feng | 890 | |
99 | 15:30 | Single-Stage Multi-Person Pose Machines | Xuecheng Nie, Jiashi Feng, Jianfeng Zhang, Shuicheng Yan | 1417 | |
100 | 15:30 | SO-HandNet: Self-Organizing Network for 3D Hand Pose Estimation With Semi-Supervised Learning | Yujin Chen, Zhigang Tu, Liuhao Ge, Dejun Zhang, Ruizhi Chen, Junsong Yuan | 2928 | |
101 | 15:30 | Adaptive Wing Loss for Robust Face Alignment via Heatmap Regression | Xinyao Wang, Liefeng Bo, Li Fuxin | 3145 | |
102 | 15:30 | Single-Network Whole-Body Pose Estimation | Gines Hidalgo, Yaadhav Raaj, Haroon Idrees, Donglai Xiang, Hanbyul Joo, Tomas Simon, Yaser Sheikh | 6480 | |
Action & Video | 103 | 15:30 | Spatiotemporal Feature Residual Propagation for Action Prediction | He Zhao, Richard P. Wildes | 6471 |
104 | 15:30 | Identity From Here, Pose From There: Self-Supervised Disentanglement and Generation of Objects Using Unlabeled Videos | Fanyi Xiao, Haotian Liu, Yong Jae Lee | 2564 | |
105 | 15:30 | Relation Distillation Networks for Video Object Detection | Jiajun Deng, Yingwei Pan, Ting Yao, Wengang Zhou, Houqiang Li, Tao Mei | 5819 | |
106 | 15:30 | Video Compression With Rate-Distortion Autoencoders | Amirhossein Habibian, Ties van Rozendaal, Jakub M. Tomczak, Taco S. Cohen | 5662 | |
107 | 15:30 | Non-Local ConvLSTM for Video Compression Artifact Reduction | Yi Xu, Longwen Gao, Kai Tian, Shuigeng Zhou, Huyang Sun | 5576 | |
108 | 15:30 | Self-Supervised Learning With Geometric Constraints in Monocular Video: Connecting Flow, Depth, and Camera | Yuhua Chen, Cordelia Schmid, Cristian Sminchisescu | 1551 | |
109 | 15:30 | Learning Temporal Action Proposals With Fewer Labels | Jingwei Ji, Kaidi Cao, Juan Carlos Niebles | 1463 | |
110 | 15:30 | TSM: Temporal Shift Module for Efficient Video Understanding | Ji Lin, Chuang Gan, Song Han | 751 | |
111 | 15:30 | Graph Convolutional Networks for Temporal Action Localization | Runhao Zeng, Wenbing Huang, Mingkui Tan, Yu Rong, Peilin Zhao, Junzhou Huang, Chuang Gan | 797 | |
112 | 15:30 | Fast Object Detection in Compressed Video | Shiyao Wang, Hongchao Lu, Zhidong Deng | 3430 | |
Motion & Tracking | 113 | 15:30 | Predicting 3D Human Dynamics From Video | Jason Y. Zhang, Panna Felsen, Angjoo Kanazawa, Jitendra Malik | 6597 |
114 | 15:30 | Imitation Learning for Human Pose Prediction | Borui Wang, Ehsan Adeli, Hsu-kuang Chiu, De-An Huang, Juan Carlos Niebles | 6083 | |
115 | 15:30 | Human Motion Prediction via Spatio-Temporal Inpainting | Alejandro Hernandez, Jürgen Gall, Francesc Moreno-Noguer | 5823 | |
116 | 15:30 | Structured Prediction Helps 3D Human Motion Modelling | Emre Aksan, Manuel Kaufmann, Otmar Hilliges | 6296 | |
Computational Photography & Graphics | 117 | 15:30 | Learning Shape Templates With Structured Implicit Functions | Kyle Genova, Forrester Cole, Daniel Vlasic, Aaron Sarna, William T. Freeman, Thomas Funkhouser | 6467 |
118 | 15:30 | CompenNet++: End-to-End Full Projector Compensation | Bingyao Huang, Haibin Ling | 1105 | |
119 | 15:30 | Deep Parametric Indoor Lighting Estimation | Marc-André Gardner, Yannick Hold-Geoffroy, Kalyan Sunkavalli, Christian Gagné, Jean-François Lalonde | 5073 | |
120 | 15:30 | FSGAN: Subject Agnostic Face Swapping and Reenactment | Yuval Nirkin, Yosi Keller, Tal Hassner | 2619 | |
121 | 15:30 | Deep Single-Image Portrait Relighting | Hao Zhou, Sunil Hadap, Kalyan Sunkavalli, David W. Jacobs | 3589 | |
122 | 15:30 | PU-GAN: A Point Cloud Upsampling Adversarial Network | Ruihui Li, Xianzhi Li, Chi-Wing Fu, Daniel Cohen-Or, Pheng-Ann Heng | 2799 | |
123 | 15:30 | Neural 3D Morphable Models: Spiral Convolutional Networks for 3D Shape Representation Learning and Generation | Giorgos Bouritsas, Sergiy Bokhnyak, Stylianos Ploumpis, Michael Bronstein, Stefanos Zafeiriou | 6521 | |
Low-Level & Optimization | 124 | 15:30 | Joint Learning of Saliency Detection and Weakly Supervised Semantic Segmentation | Yu Zeng, Yunzhi Zhuge, Huchuan Lu, Lihe Zhang | 5235 |
125 | 15:30 | Towards High-Resolution Salient Object Detection | Yi Zeng, Pingping Zhang, Jianming Zhang, Zhe Lin, Huchuan Lu | 2390 | |
126 | 15:30 | Event-Based Motion Segmentation by Motion Compensation | Timo Stoffregen, Guillermo Gallego, Tom Drummond, Lindsay Kleeman, Davide Scaramuzza | 1829 | |
127 | 15:30 | Depth-Induced Multi-Scale Recurrent Attention Network for Saliency Detection | Yongri Piao, Wei Ji, Jingjing Li, Miao Zhang, Huchuan Lu | 1249 | |
128 | 15:30 | Stacked Cross Refinement Network for Edge-Aware Salient Object Detection | Zhe Wu, Li Su, Qingming Huang | 3222 | |
129 | 15:30 | Motion Guided Attention for Video Salient Object Detection | Haofeng Li, Guanqi Chen, Guanbin Li, Yizhou Yu | 3051 | |
130 | 15:30 | Semi-Supervised Video Salient Object Detection Using Pseudo-Labels | Pengxiang Yan, Guanbin Li, Yuan Xie, Zhen Li, Chuan Wang, Tianshui Chen, Liang Lin | 2126 | |
131 | 15:30 | Joint Learning of Semantic Alignment and Object Landmark Detection | Sangryul Jeon, Dongbo Min, Seungryong Kim, Kwanghoon Sohn | 6978 | |
132 | 15:30 | RainFlow: Optical Flow Under Rain Streaks and Rain Veiling Effect | Ruoteng Li, Robby T. Tan, Loong-Fah Cheong, Angelica I. Aviles-Rivero, Qingnan Fan, Carola-Bibiane Schönlieb | 2908 | |
133 | 15:30 | GridDehazeNet: Attention-Based Multi-Scale Network for Image Dehazing | Xiaohong Liu, Yongrui Ma, Zhihao Shi, Jun Chen | 1323 | |
134 | 15:30 | Learning to See Moving Objects in the Dark | Haiyang Jiang, Yinqiang Zheng | 2857 | |
Scene Understanding | 135 | 15:30 | SegSort: Segmentation by Discriminative Sorting of Segments | Jyh-Jing Hwang, Stella X. Yu, Jianbo Shi, Maxwell D. Collins, Tien-Ju Yang, Xiao Zhang, Liang-Chieh Chen | 383 |
136 | 15:30 | What Synthesis Is Missing: Depth Adaptation Integrated With Weak Supervision for Indoor Scene Parsing | Keng-Chi Liu, Yi-Ting Shen, Jan P. Klopp, Liang-Gee Chen | 2391 | |
137 | 15:30 | AdaptIS: Adaptive Instance Selection Network | Konstantin Sofiiuk, Olga Barinova, Anton Konushin | 3410 | |
138 | 15:30 | DADA: Depth-Aware Domain Adaptation in Semantic Segmentation | Tuan-Hung Vu, Himalaya Jain, Maxime Bucher, Matthieu Cord, Patrick Pérez | 159 | |
139 | 15:30 | Guided Curriculum Model Adaptation and Uncertainty-Aware Evaluation for Semantic Nighttime Image Segmentation | Christos Sakaridis, Dengxin Dai, Luc Van Gool | 921 | |
140 | 15:30 | SceneGraphNet: Neural Message Passing for 3D Indoor Scene Augmentation | Yang Zhou, Zachary While, Evangelos Kalogerakis | 3632 | |
141 | 15:30 | SkyScapes Fine-Grained Semantic Understanding of Aerial Scenes | Seyed Majid Azimi, Corentin Henry, Lars Sommer, Arne Schumann, Eleonora Vig | 3556 | |
Language & Reasoning | 142 | 15:30 | Transferable Representation Learning in Vision-and-Language Navigation | Haoshuo Huang, Vihan Jain, Harsh Mehta, Alexander Ku, Gabriel Magalhaes, Jason Baldridge, Eugene Ie | 6780 |
143 | 15:30 | Towards Unsupervised Image Captioning With Shared Multimodal Embeddings | Iro Laina, Christian Rupprecht, Nassir Navab | 242 | |
144 | 15:30 | ViCo: Word Embeddings From Visual Co-Occurrences | Tanmay Gupta, Alexander Schwing, Derek Hoiem | 2312 | |
145 | 15:30 | Seq-SG2SL: Inferring Semantic Layout From Scene Graph Through Sequence to Sequence Learning | Boren Li, Boyu Zhuang, Mingyang Li, Jian Gu | 4365 | |
146 | 15:30 | U-CAM: Visual Explanation Using Uncertainty Based Class Activation Maps | Badri N. Patro, Mayank Lunayach, Shivansh Patel, Vinay P. Namboodiri | 2214 | |
147 | 15:30 | See-Through-Text Grouping for Referring Image Segmentation | Ding-Jie Chen, Songhao Jia, Yi-Chen Lo, Hwann-Tzong Chen, Tyng-Luh Liu | 3315 | |
148 | 15:30 | VideoBERT: A Joint Model for Video and Language Representation Learning | Chen Sun, Austin Myers, Carl Vondrick, Kevin Murphy, Cordelia Schmid | 1714 | |
149 | 15:30 | Language Features Matter: Effective Language Representations for Vision-Language Tasks | Andrea Burns, Reuben Tan, Kate Saenko, Stan Sclaroff, Bryan A. Plummer | 3652 | |
3D From Multiview & Sensors | 150 | 15:30 | Semantic Stereo Matching With Pyramid Cost Volumes | Zhenyao Wu, Xinyi Wu, Xiaoping Zhang, Song Wang, Lili Ju | 585 |
151 | 15:30 | Learning Relationships for Multi-View 3D Object Recognition | Ze Yang, Liwei Wang | 4721 | |
152 | 15:30 | View N-Gram Network for 3D Object Retrieval | Xinwei He, Tengteng Huang, Song Bai, Xiang Bai | 1926 | |
153 | 15:30 | Expert Sample Consensus Applied to Camera Re-Localization | Eric Brachmann, Carsten Rother | 3443 | |
154 | 15:30 | Semantic Part Detection via Matching: Learning to Generalize to Novel Viewpoints From Limited Training Data | Yutong Bai, Qing Liu, Lingxi Xie, Weichao Qiu, Yan Zheng, Alan L. Yuille | 2132 | |
155 | 15:30 | Dynamic Points Agglomeration for Hierarchical Point Sets Learning | Jinxian Liu, Bingbing Ni, Caiyuan Li, Jiancheng Yang, Qi Tian | 1005 | |
Image & Video Synthesis | 156 | 15:30 | Attributing Fake Images to GANs: Learning and Analyzing GAN Fingerprints | Ning Yu, Larry S. Davis, Mario Fritz | 2010 |
157 | 15:30 | Dual Adversarial Inference for Text-to-Image Synthesis | Qicheng Lao, Mohammad Havaei, Ahmad Pesaranghader, Francis Dutil, Lisa Di Jorio, Thomas Fevens | 2547 | |
158 | 15:30 | View-LSTM: Novel-View Video Synthesis Through View Decomposition | Mohamed Ilyes Lakhal, Oswald Lanz, Andrea Cavallaro | 6009 | |
159 | 15:30 | HoloGAN: Unsupervised Learning of 3D Representations From Natural Images | Thu Nguyen-Phuoc, Chuan Li, Lucas Theis, Christian Richardt, Yong-Liang Yang | 5005 | |
160 | 15:30 | Unpaired Image-to-Speech Synthesis With Multimodal Information Bottleneck | Shuang Ma, Daniel McDuff, Yale Song | 6289 | |
161 | 15:30 | Improved Conditional VRNNs for Video Prediction | Lluis Castrejon, Nicolas Ballas, Aaron Courville | 4093 | |
162 | 15:30 | Visualizing the Invisible: Occluded Vehicle Segmentation and Recovery | Xiaosheng Yan, Feigege Wang, Wenxi Liu, Yuanlong Yu, Shengfeng He, Jia Pan | 3998 |
Friday, November 1, 2019, 0900–1030 Oral 4.1A (Hall D1) Natalia Neverova (Facebook AI Research), Jaesik Park (POSTECH) | |||||
---|---|---|---|---|---|
Session Title/Poster Group | Poster # | Presentation Time | Title | Author(s) | Paper ID |
Single-View 3D Modeling, Pose Estimation | 1 | 09:00 | Learning Single Camera Depth Estimation Using Dual-Pixels [Video] | Rahul Garg, Neal Wadhwa, Sameer Ansari, Jonathan T. Barron | 1334 |
2 | 09:05 | Domain-Adaptive Single-View 3D Reconstruction [Video] | Pedro O. Pinheiro, Negar Rostamzadeh, Sungjin Ahn | 6367 | |
3 | 09:10 | Transformable Bottleneck Networks [Video] | Kyle Olszewski, Sergey Tulyakov, Oliver Woodford, Hao Li, Linjie Luo | 809 | |
4 | 09:18 | RIO: 3D Object Instance Re-Localization in Changing Indoor Environments [Video] | Johanna Wald, Armen Avetisyan, Nassir Navab, Federico Tombari, Matthias Nießner | 1206 | |
5 | 09:23 | Pix2Pose: Pixel-Wise Coordinate Regression of Objects for 6D Pose Estimation [Video] | Kiru Park, Timothy Patten, Markus Vincze | 4067 | |
6 | 09:28 | CDPN: Coordinates-Based Disentangled Pose Network for Real-Time RGB-Based 6-DoF Object Pose Estimation [Video] | Zhigang Li, Gu Wang, Xiangyang Ji | 2230 | |
7 | 09:36 | C3DPO: Canonical 3D Pose Networks for Non-Rigid Structure From Motion [Video] | David Novotny, Nikhila Ravi, Benjamin Graham, Natalia Neverova, Andrea Vedaldi | 17 | |
8 | 09:41 | Learning to Reconstruct 3D Manhattan Wireframes From a Single Image [Video] | Yichao Zhou, Haozhi Qi, Yuexiang Zhai, Qi Sun, Zhili Chen, Li-Yi Wei, Yi Ma | 1585 | |
9 | 09:46 | Soft Rasterizer: A Differentiable Renderer for Image-Based 3D Reasoning [Video] | Shichen Liu, Tianye Li, Weikai Chen, Hao Li | 4199 | |
10 | 09:54 | Learnable Triangulation of Human Pose [Video] | Karim Iskakov, Egor Burkov, Victor Lempitsky, Yury Malkov | 5696 | |
11 | 09:59 | xR-EgoPose: Egocentric 3D Human Pose From an HMD Camera [Video] | Denis Tome, Patrick Peluse, Lourdes Agapito, Hernan Badino | 879 | |
12 | 10:04 | DeepHuman: 3D Human Reconstruction From a Single Image [Video] | Zerong Zheng, Tao Yu, Yixuan Wei, Qionghai Dai, Yebin Liu | 4289 | |
13 | 10:12 | A Neural Network for Detailed Human Depth Estimation From a Single Image [Video] | Sicong Tang, Feitong Tan, Kelvin Cheng, Zhaoyang Li, Siyu Zhu, Ping Tan | 3239 | |
14 | 10:17 | DenseRaC: Joint 3D Pose and Shape Estimation by Dense Render-and-Compare [Video] | Yuanlu Xu, Song-Chun Zhu, Tony Tung | 2160 | |
15 | 10:22 | Not All Parts Are Created Equal: 3D Pose Estimation by Modeling Bi-Directional Dependencies of Body Parts [Video] | Jue Wang, Shaoli Huang, Xinchao Wang, Dacheng Tao | 3185 |
Friday, November 1, 2019, 0900–1030 Oral 4.1B (Hall D2) Min H. Kim (KAIST), Imari Sato (National Institute of Informatics) | |||||
---|---|---|---|---|---|
Session Title/Poster Group | Poster # | Presentation Time | Title | Author(s) | Paper ID |
Computational Photography | 16 | 09:00 | Extreme View Synthesis [Video] | Inchang Choi, Orazio Gallo, Alejandro Troccoli, Min H. Kim, Jan Kautz | 238 |
17 | 09:05 | View Independent Generative Adversarial Network for Novel View Synthesis [Video] | Xiaogang Xu, Ying-Cong Chen, Jiaya Jia | 6929 | |
18 | 09:10 | Cascaded Context Pyramid for Full-Resolution 3D Semantic Scene Completion [Video] | Pingping Zhang, Wei Liu, Yinjie Lei, Huchuan Lu, Xiaoyun Yang | 2610 | |
19 | 09:18 | View-Consistent 4D Light Field Superpixel Segmentation [Video] | Numair Khan, Qian Zhang, Lucas Kasser, Henry Stone, Min H. Kim, James Tompkin | 6398 | |
20 | 09:23 | GLoSH: Global-Local Spherical Harmonics for Intrinsic Image Decomposition [Video] | Hao Zhou, Xiang Yu, David W. Jacobs | 2288 | |
21 | 09:28 | Surface Normals and Shape From Water [Video] | Satoshi Murai, Meng-Yu Jennifer Kuo, Ryo Kawahara, Shohei Nobuhara, Ko Nishino | 3867 | |
22 | 09:36 | Restoration of Non-Rigidly Distorted Underwater Images Using a Combination of Compressive Sensing and Local Polynomial Image Representations [Video] | Jerin Geo James, Pranay Agrawal, Ajit Rajwade | 2355 | |
23 | 09:41 | Learning Perspective Undistortion of Portraits [Video] | Yajie Zhao, Zeng Huang, Tianye Li, Weikai Chen, Chloe LeGendre, Xinglei Ren, Ari Shapiro, Hao Li | 3593 | |
24 | 09:46 | Towards Photorealistic Reconstruction of Highly Multiplexed Lensless Images [Video] | Salman S. Khan, Adarsh V. R., Vivek Boominathan, Jasper Tan, Ashok Veeraraghavan, Kaushik Mitra | 5952 | |
25 | 09:54 | Unconstrained Motion Deblurring for Dual-Lens Cameras [Video] | M. R. Mahesh Mohan, Sharath Girish, A. N. Rajagopalan | 6329 | |
26 | 09:59 | Stochastic Exposure Coding for Handling Multi-ToF-Camera Interference [Video] | Jongho Lee, Mohit Gupta | 3180 | |
27 | 10:04 | Convolutional Approximations to the General Non-Line-of-Sight Imaging Operator [Video] | Byeongjoo Ahn, Akshat Dave, Ashok Veeraraghavan, Ioannis Gkioulekas, Aswin C. Sankaranarayanan | 1211 | |
28 | 10:12 | Agile Depth Sensing Using Triangulation Light Curtains [Video] | Joseph R. Bartels, Jian Wang, William "Red" Whittaker, Srinivasa G. Narasimhan | 1675 | |
29 | 10:17 | Asynchronous Single-Photon 3D Imaging [Video] | Anant Gupta, Atul Ingle, Mohit Gupta | 1166 |
Friday, November 1, 2019, 1030–1300 Poster 4.1 (Hall B) | |||||
---|---|---|---|---|---|
Session Title/Poster Group | Poster # | Presentation Time | Title | Author(s) | Paper ID |
Deep Learning | 30 | 10:30 | Cross-Dataset Person Re-Identification via Unsupervised Pose Disentanglement and Adaptation | Yu-Jhe Li, Ci-Siang Lin, Yan-Bo Lin, Yu-Chiang Frank Wang | 1182 |
31 | 10:30 | A Learned Representation for Scalable Vector Graphics | Raphael Gontijo Lopes, David Ha, Douglas Eck, Jonathon Shlens | 6433 | |
32 | 10:30 | ELF: Embedded Localisation of Features in Pre-Trained CNN | Assia Benbihi, Matthieu Geist, Cédric Pradalier | 3882 | |
33 | 10:30 | Joint Group Feature Selection and Discriminative Filter Learning for Robust Visual Object Tracking | Tianyang Xu, Zhen-Hua Feng, Xiao-Jun Wu, Josef Kittler | 4112 | |
34 | 10:30 | Sampling Wisely: Deep Image Embedding by Top-K Precision Optimization | Jing Lu, Chaofan Xu, Wei Zhang, Ling-Yu Duan, Tao Mei | 5302 | |
35 | 10:30 | On the Global Optima of Kernelized Adversarial Representation Learning | Bashir Sadeghi, Runyi Yu, Vishnu Boddeti | 5016 | |
36 | 10:30 | Addressing Model Vulnerability to Distributional Shifts Over Image Transformation Sets | Riccardo Volpi, Vittorio Murino | 2511 | |
37 | 10:30 | Attract or Distract: Exploit the Margin of Open Set | Qianyu Feng, Guoliang Kang, Hehe Fan, Yi Yang | 37 | |
38 | 10:30 | MIC: Mining Interclass Characteristics for Improved Metric Learning | Karsten Roth, Biagio Brattoli, Björn Ommer | 2266 | |
39 | 10:30 | Self-Supervised Representation Learning via Neighborhood-Relational Encoding | Mohammad Sabokrou, Mohammad Khalooei, Ehsan Adeli | 1552 | |
40 | 10:30 | AWSD: Adaptive Weighted Spatiotemporal Distillation for Video Representation | Mohammad Tavakolian, Hamed R. Tavakoli, Abdenour Hadid | 2622 | |
41 | 10:30 | Bilinear Attention Networks for Person Retrieval | Pengfei Fang, Jieming Zhou, Soumava Kumar Roy, Lars Petersson, Mehrtash Harandi | 3749 | |
42 | 10:30 | Discriminative Feature Learning With Consistent Attention Regularization for Person Re-Identification | Sanping Zhou, Fei Wang, Zeyi Huang, Jinjun Wang | 2337 | |
43 | 10:30 | Semi-Supervised Domain Adaptation via Minimax Entropy | Kuniaki Saito, Donghyun Kim, Stan Sclaroff, Trevor Darrell, Kate Saenko | 1971 | |
44 | 10:30 | Boosting Few-Shot Visual Learning With Self-Supervision | Spyros Gidaris, Andrei Bursuc, Nikos Komodakis, Patrick Pérez, Matthieu Cord | 4124 | |
45 | 10:30 | FDA: Feature Disruptive Attack | Aditya Ganeshan, Vivek B.S., R. Venkatesh Babu | 2436 | |
46 | 10:30 | A Novel Unsupervised Camera-Aware Domain Adaptation Framework for Person Re-Identification | Lei Qi, Lei Wang, Jing Huo, Luping Zhou, Yinghuan Shi, Yang Gao | 3767 | |
47 | 10:30 | Cross-View Policy Learning for Street Navigation | Ang Li, Huiyi Hu, Piotr Mirowski, Mehrdad Farajtabar | 2470 | |
48 | 10:30 | Learning Across Tasks and Domains | Pierluigi Zama Ramirez, Alessio Tonioni, Samuele Salti, Luigi Di Stefano | 3494 | |
49 | 10:30 | EMPNet: Neural Localisation and Mapping Using Embedded Memory Points | Gil Avraham, Yan Zuo, Thanuja Dharmasiri, Tom Drummond | 3734 | |
50 | 10:30 | AVT: Unsupervised Learning of Transformation Equivariant Representations by Autoencoding Variational Transformations | Guo-Jun Qi, Liheng Zhang, Chang Wen Chen, Qi Tian | 2603 | |
51 | 10:30 | Deep Comprehensive Correlation Mining for Image Clustering | Jianlong Wu, Keyu Long, Fei Wang, Chen Qian, Cheng Li, Zhouchen Lin, Hongbin Zha | 1163 | |
52 | 10:30 | Unsupervised Multi-Task Feature Learning on Point Clouds | Kaveh Hassani, Mike Haley | 3107 | |
53 | 10:30 | Reciprocal Multi-Layer Subspace Learning for Multi-View Clustering | Ruihuang Li, Changqing Zhang, Huazhu Fu, Xi Peng, Tianyi Zhou, Qinghua Hu | 1200 | |
54 | 10:30 | Geometric Disentanglement for Generative Latent Shape Models | Tristan Aumentado-Armstrong, Stavros Tsogkas, Allan Jepson, Sven Dickinson | 4094 | |
55 | 10:30 | GAN-Tree: An Incrementally Learned Hierarchical Generative Framework for Multi-Modal Data Distributions | Jogendra Nath Kundu, Maharshi Gor, Dakshit Agrawal, R. Venkatesh Babu | 3932 | |
56 | 10:30 | GODS: Generalized One-Class Discriminative Subspaces for Anomaly Detection | Jue Wang, Anoop Cherian | 943 | |
57 | 10:30 | Neighborhood Preserving Hashing for Scalable Video Retrieval | Shuyan Li, Zhixiang Chen, Jiwen Lu, Xiu Li, Jie Zhou | 1001 | |
Recognition | 58 | 10:30 | Self-Training With Progressive Augmentation for Unsupervised Cross-Domain Person Re-Identification | Xinyu Zhang, Jiewei Cao, Chunhua Shen, Mingyu You | 2919 |
59 | 10:30 | SCRDet: Towards More Robust Detection for Small, Cluttered and Rotated Objects | Xue Yang, Jirui Yang, Junchi Yan, Yue Zhang, Tengfei Zhang, Zhi Guo, Xian Sun, Kun Fu | 1291 | |
60 | 10:30 | Cross-X Learning for Fine-Grained Visual Categorization | Wei Luo, Xitong Yang, Xianjie Mo, Yuheng Lu, Larry S. Davis, Jun Li, Jian Yang, Ser-Nam Lim | 2291 | |
61 | 10:30 | Maximum-Margin Hamming Hashing | Rong Kang, Yue Cao, Mingsheng Long, Jianmin Wang, Philip S. Yu | 1031 | |
62 | 10:30 | Conservative Wasserstein Training for Pose Estimation | Xiaofeng Liu, Yang Zou, Tong Che, Peng Ding, Ping Jia, Jane You, B.V.K. Vijaya Kumar | 406 | |
63 | 10:30 | Learning to Rank Proposals for Object Detection | Zhiyu Tan, Xuecheng Nie, Qi Qian, Nan Li, Hao Li | 4285 | |
64 | 10:30 | Vehicle Re-Identification With Viewpoint-Aware Metric Learning | Ruihang Chu, Yifan Sun, Yadong Li, Zheng Liu, Chi Zhang, Yichen Wei | 3036 | |
65 | 10:30 | WSOD2: Learning Bottom-Up and Top-Down Objectness Distillation for Weakly-Supervised Object Detection | Zhaoyang Zeng, Bei Liu, Jianlong Fu, Hongyang Chao, Lei Zhang | 3260 | |
66 | 10:30 | Localization of Deep Inpainting Using High-Pass Fully Convolutional Network | Haodong Li, Jiwu Huang | 4666 | |
67 | 10:30 | Clustered Object Detection in Aerial Images | Fan Yang, Heng Fan, Peng Chu, Erik Blasch, Haibin Ling | 4194 | |
68 | 10:30 | Unsupervised Graph Association for Person Re-Identification | Jinlin Wu, Yang Yang, Hao Liu, Shengcai Liao, Zhen Lei, Stan Z. Li | 2077 | |
69 | 10:30 | Learning a Mixture of Granularity-Specific Experts for Fine-Grained Categorization | Lianbo Zhang, Shaoli Huang, Wei Liu, Dacheng Tao | 5465 | |
70 | 10:30 | advPattern: Physical-World Attacks on Deep Person Re-Identification via Adversarially Transformable Patterns | Zhibo Wang, Siyan Zheng, Mengkai Song, Qian Wang, Alireza Rahimpour, Hairong Qi | 2525 | |
71 | 10:30 | ABD-Net: Attentive but Diverse Person Re-Identification | Tianlong Chen, Shaojin Ding, Jingyi Xie, Ye Yuan, Wuyang Chen, Yang Yang, Zhou Ren, Zhangyang Wang | 4330 | |
72 | 10:30 | From Open Set to Closed Set: Counting Objects by Spatial Divide-and-Conquer | Haipeng Xiong, Hao Lu, Chengxin Liu, Liang Liu, Zhiguo Cao, Chunhua Shen | 3286 | |
73 | 10:30 | Towards Precise End-to-End Weakly Supervised Object Detection Network | Ke Yang, Dongsheng Li, Yong Dou | 1581 | |
74 | 10:30 | Learn to Scale: Generating Multipolar Normalized Density Maps for Crowd Counting | Chenfeng Xu, Kai Qiu, Jianlong Fu, Song Bai, Yongchao Xu, Xiang Bai | 768 | |
75 | 10:30 | Ground-to-Aerial Image Geo-Localization With a Hard Exemplar Reweighting Triplet Loss | Sudong Cai, Yulan Guo, Salman Khan, Jiwei Hu, Gongjian Wen | 3372 | |
76 | 10:30 | Learning to Discover Novel Visual Categories via Deep Transfer Clustering | Kai Han, Andrea Vedaldi, Andrew Zisserman | 281 | |
77 | 10:30 | AM-LFS: AutoML for Loss Function Search | Chuming Li, Xin Yuan, Chen Lin, Minghao Guo, Wei Wu, Junjie Yan, Wanli Ouyang | 4712 | |
78 | 10:30 | Few-Shot Object Detection via Feature Reweighting | Bingyi Kang, Zhuang Liu, Xin Wang, Fisher Yu, Jiashi Feng, Trevor Darrell | 1616 | |
79 | 10:30 | Objects365: A Large-Scale, High-Quality Dataset for Object Detection | Shuai Shao, Zeming Li, Tianyuan Zhang, Chao Peng, Gang Yu, Xiangyu Zhang, Jing Li, Jian Sun | 596 | |
80 | 10:30 | Efficient and Accurate Arbitrary-Shaped Text Detection With Pixel Aggregation Network | Wenhai Wang, Enze Xie, Xiaoge Song, Yuhang Zang, Wenjia Wang, Tong Lu, Gang Yu, Chunhua Shen | 4305 | |
81 | 10:30 | Foreground-Aware Pyramid Reconstruction for Alignment-Free Occluded Person Re-Identification | Lingxiao He, Yinggang Wang, Wu Liu, He Zhao, Zhenan Sun, Jiashi Feng | 5853 | |
82 | 10:30 | Collect and Select: Semantic Alignment Metric Learning for Few-Shot Learning | Fusheng Hao, Fengxiang He, Jun Cheng, Lei Wang, Jianzhong Cao, Dacheng Tao | 3419 | |
Segmentation, Grouping, & Shape | 83 | 10:30 | Bayesian Adaptive Superpixel Segmentation | Roy Uziel, Meitar Ronen, Oren Freifeld | 1879 |
84 | 10:30 | CapsuleVOS: Semi-Supervised Video Object Segmentation Using Capsule Routing | Kevin Duarte, Yogesh S. Rawat, Mubarak Shah | 5136 | |
85 | 10:30 | Bae-Net: Branched Autoencoder for Shape Co-Segmentation | Zhiqin Chen, Kangxue Yin, Matthew Fisher, Siddhartha Chaudhuri, Hao Zhang | 4130 | |
86 | 10:30 | VV-Net: Voxel VAE Net With Group Convolutions for Point Cloud Segmentation | Hsien-Yu Meng, Lin Gao, Yu-Kun Lai, Dinesh Manocha | 1057 | |
87 | 10:30 | Miss Detection vs. False Alarm: Adversarial Learning for Small Object Segmentation in Infrared Images | Huan Wang, Luping Zhou, Lei Wang | 4248 | |
88 | 10:30 | Group-Wise Deep Object Co-Segmentation With Co-Attention Recurrent Neural Network | Bo Li, Zhengxing Sun, Qian Li, Yunjie Wu, Anqi Hu | 3462 | |
Statistics, Physics, Theory & Datasets | 89 | 10:30 | Human Attention in Image Captioning: Dataset and Analysis | Sen He, Hamed R. Tavakoli, Ali Borji, Nicolas Pugeault | 2248 |
90 | 10:30 | Variational Uncalibrated Photometric Stereo Under General Lighting | Bjoern Haefner, Zhenzhang Ye, Maolin Gao, Tao Wu, Yvain Quéau, Daniel Cremers | 1335 | |
91 | 10:30 | SPLINE-Net: Sparse Photometric Stereo Through Lighting Interpolation and Normal Estimation Networks | Qian Zheng, Yiming Jia, Boxin Shi, Xudong Jiang, Ling-Yu Duan, Alex C. Kot | 2980 | |
92 | 10:30 | Hyperspectral Image Reconstruction Using Deep External and Internal Learning | Tao Zhang, Ying Fu, Lizhi Wang, Hua Huang | 4846 | |
93 | 10:30 | Gravity as a Reference for Estimating a Person’s Height From Video | Didier Bieler, Semih Günel, Pascal Fua, Helge Rhodin | 6294 | |
94 | 10:30 | Shadow Removal via Shadow Image Decomposition | Hieu Le, Dimitris Samaras | 2450 | |
95 | 10:30 | OperatorNet: Recovering 3D Shapes From Difference Operators | Ruqi Huang, Marie-Julie Rakotosaona, Panos Achlioptas, Leonidas J. Guibas, Maks Ovsjanikov | 5880 | |
96 | 10:30 | Neural Inverse Rendering of an Indoor Scene From a Single Image | Soumyadip Sengupta, Jinwei Gu, Kihwan Kim, Guilin Liu, David W. Jacobs, Jan Kautz | 3544 | |
3D From Single View & RGBD | 97 | 10:30 | ForkNet: Multi-Branch Volumetric Semantic Completion From a Single Depth Image | Yida Wang, David Joseph Tan, Nassir Navab, Federico Tombari | 5704 |
98 | 10:30 | Moving Indoor: Unsupervised Video Depth Learning in Challenging Environments | Junsheng Zhou, Yuwang Wang, Kaihuai Qin, Wenjun Zeng | 1850 | |
99 | 10:30 | GraphX-Convolution for Point Cloud Deformation in 2D-to-3D Conversion | Anh-Duc Nguyen, Seonghwa Choi, Woojae Kim, Sanghoon Lee | 351 | |
100 | 10:30 | Holistic++ Scene Understanding: Single-View 3D Holistic Scene Parsing and Human Pose Estimation With Human-Object Interaction and Physical Commonsense | Yixin Chen, Siyuan Huang, Tao Yuan, Siyuan Qi, Yixin Zhu, Song-Chun Zhu | 3359 | |
Action & Video | 101 | 10:30 | MMAct: A Large-Scale Dataset for Cross Modal Human Action Understanding | Quan Kong, Ziming Wu, Ziwei Deng, Martin Klinkigt, Bin Tong, Tomokazu Murakami | 5961 |
102 | 10:30 | HACS: Human Action Clips and Segments Dataset for Recognition and Temporal Localization | Hang Zhao, Antonio Torralba, Lorenzo Torresani, Zhicheng Yan | 964 | |
103 | 10:30 | 3C-Net: Category Count and Center Loss for Weakly-Supervised Action Localization | Sanath Narayan, Hisham Cholakkal, Fahad Shahbaz Khan, Ling Shao | 4007 | |
104 | 10:30 | Grounded Human-Object Interaction Hotspots From Video | Tushar Nagarajan, Christoph Feichtenhofer, Kristen Grauman | 1638 | |
105 | 10:30 | Hallucinating IDT Descriptors and I3D Optical Flow Features for Action Recognition With CNNs | Lei Wang, Piotr Koniusz, Du Q. Huynh | 2979 | |
Computational Photography & Graphics | 106 | 10:30 | Learning to Paint With Model-Based Deep Reinforcement Learning | Zhewei Huang, Wen Heng, Shuchang Zhou | 1874 |
107 | 10:30 | Neural Re-Simulation for Generating Bounces in Single Images | Carlo Innamorati, Bryan Russell, Danny M. Kaufman, Niloy J. Mitra | 5023 | |
108 | 10:30 | Deep Appearance Maps | Maxim Maximov, Laura Leal-Taixé, Mario Fritz, Tobias Ritschel | 4021 | |
109 | 10:30 | GarNet: A Two-Stream Network for Fast and Accurate 3D Cloth Draping | Erhan Gundogdu, Victor Constantin, Amrollah Seifoddini, Minh Dang, Mathieu Salzmann, Pascal Fua | 701 | |
110 | 10:30 | Joint Embedding of 3D Scan and CAD Objects | Manuel Dahnert, Angela Dai, Leonidas J. Guibas, Matthias Nießner | 2732 | |
111 | 10:30 | CompoNet: Learning to Generate the Unseen by Part Synthesis and Composition | Nadav Schor, Oren Katzir, Hao Zhang, Daniel Cohen-Or | 1643 | |
112 | 10:30 | DDSL: Deep Differentiable Simplex Layer for Learning Geometric Signals | Chiyu "Max" Jiang, Dana Lansigan, Philip Marcus, Matthias Nießner | 3695 | |
113 | 10:30 | Composite Shape Modeling via Latent Space Factorization | Anastasia Dubrovina, Fei Xia, Panos Achlioptas, Mira Shalah, Raphaël Groscot, Leonidas J. Guibas | 827 | |
Low-Level & Optimization | 114 | 10:30 | EGNet: Edge Guidance Network for Salient Object Detection | Jia-Xing Zhao, Jiang-Jiang Liu, Deng-Ping Fan, Yang Cao, Jufeng Yang, Ming-Ming Cheng | 1596 |
115 | 10:30 | SID4VAM: A Benchmark Dataset With Synthetic Images for Visual Attention Modeling | David Berga, Xosé R. Fdez-Vidal, Xavier Otazu, Xosé M. Pardo | 1404 | |
116 | 10:30 | Two-Stream Action Recognition-Oriented Video Super-Resolution | Haochen Zhang, Dong Liu, Zhiwei Xiong | 5215 | |
117 | 10:30 | Where Is My Mirror? | Xin Yang, Haiyang Mei, Ke Xu, Xiaopeng Wei, Baocai Yin, Rynson W.H. Lau | 5844 | |
118 | 10:30 | Disentangled Image Matting | Shaofan Cai, Xiaoshuai Zhang, Haoqiang Fan, Haibin Huang, Jiangyu Liu, Jiaming Liu, Jiaying Liu, Jue Wang, Jian Sun | 1381 | |
119 | 10:30 | Guided Super-Resolution As Pixel-to-Pixel Transformation | Riccardo de Lutio, Stefano D'Aronco, Jan Dirk Wegner, Konrad Schindler | 6423 | |
120 | 10:30 | Deep Learning for Light Field Saliency Detection | Tiantian Wang, Yongri Piao, Xiao Li, Lihe Zhang, Huchuan Lu | 6622 | |
121 | 10:30 | Optimizing the F-Measure for Threshold-Free Salient Object Detection | Kai Zhao, Shanghua Gao, Wenguan Wang, Ming-Ming Cheng | 1406 | |
122 | 10:30 | Image Inpainting With Learnable Bidirectional Attention Maps | Chaohao Xie, Shaohui Liu, Chao Li, Ming-Ming Cheng, Wangmeng Zuo, Xiao Liu, Shilei Wen, Errui Ding | 3220 | |
123 | 10:30 | Joint Demosaicking and Denoising by Fine-Tuning of Bursts of Raw Images | Thibaud Ehret, Axel Davy, Pablo Arias, Gabriele Facciolo | 4702 | |
124 | 10:30 | DeblurGAN-v2: Deblurring (Orders-of-Magnitude) Faster and Better | Orest Kupyn, Tetiana Martyniuk, Junru Wu, Zhangyang Wang | 4861 | |
Language & Reasoning | 125 | 10:30 | Reflective Decoding Network for Image Captioning | Lei Ke, Wenjie Pei, Ruiyu Li, Xiaoyong Shen, Yu-Wing Tai | 3813 |
126 | 10:30 | Joint Optimization for Cooperative Image Captioning | Gilad Vered, Gal Oren, Yuval Atzmon, Gal Chechik | 4925 | |
127 | 10:30 | Watch, Listen and Tell: Multi-Modal Weakly Supervised Dense Event Captioning | Tanzila Rahman, Bicheng Xu, Leonid Sigal | 5184 | |
128 | 10:30 | Joint Syntax Representation Learning and Visual Cue Translation for Video Captioning | Jingyi Hou, Xinxiao Wu, Wentian Zhao, Jiebo Luo, Yunde Jia | 2017 | |
129 | 10:30 | Entangled Transformer for Image Captioning | Guang Li, Linchao Zhu, Ping Liu, Yi Yang | 2359 | |
130 | 10:30 | Shapeglot: Learning Language for Shape Differentiation | Panos Achlioptas, Judy Fan, Robert Hawkins, Noah Goodman, Leonidas J. Guibas | 2872 | |
131 | 10:30 | Nocaps: Novel Object Captioning at Scale | Harsh Agrawal, Karan Desai, Yufei Wang, Xinlei Chen, Rishabh Jain, Mark Johnson, Dhruv Batra, Devi Parikh, Stefan Lee, Peter Anderson | 31 | |
3D From Multiview & Sensors | 132 | 10:30 | Fully Convolutional Geometric Features | Christopher Choy, Jaesik Park, Vladlen Koltun | 3655 |
133 | 10:30 | Learning Local RGB-to-CAD Correspondences for Object Pose Estimation | Georgios Georgakis, Srikrishna Karanam, Ziyan Wu, Jana Košecká | 1967 | |
134 | 10:30 | Depth From Videos in the Wild: Unsupervised Monocular Depth Learning From Unknown Cameras | Ariel Gordon, Hanhan Li, Rico Jonschkowski, Anelia Angelova | 5444 | |
135 | 10:30 | OmniMVS: End-to-End Learning for Omnidirectional Stereo Matching | Changhee Won, Jongbin Ryu, Jongwoo Lim | 4237 | |
136 | 10:30 | On the Over-Smoothing Problem of CNN Based Disparity Estimation | Chuangrong Chen, Xiaozhi Chen, Hui Cheng | 1970 | |
137 | 10:30 | Spatial Correspondence With Generative Adversarial Network: Learning Depth From Monocular Videos | Zhenyao Wu, Xinyi Wu, Xiaoping Zhang, Song Wang, Lili Ju | 1062 | |
Image & Video Synthesis | 138 | 10:30 | Disentangling Propagation and Generation for Video Prediction | Hang Gao, Huazhe Xu, Qi-Zhi Cai, Ruth Wang, Fisher Yu, Trevor Darrell | 69 |
139 | 10:30 | Guided Image-to-Image Translation With Bi-Directional Feature Transformation | Badour AlBahar, Jia-Bin Huang | 409 | |
140 | 10:30 | Towards Multi-Pose Guided Virtual Try-On Network | Haoye Dong, Xiaodan Liang, Xiaohui Shen, Bochao Wang, Hanjiang Lai, Jia Zhu, Zhiting Hu, Jian Yin | 1574 | |
141 | 10:30 | Photorealistic Style Transfer via Wavelet Transforms | Jaejun Yoo, Youngjung Uh, Sanghyuk Chun, Byeongkyu Kang, Jung-Woo Ha | 3743 | |
142 | 10:30 | Personalized Fashion Design | Cong Yu, Yang Hu, Yan Chen, Bing Zeng | 4406 | |
143 | 10:30 | Tag2Pix: Line Art Colorization Using Text Tag With SECat and Changing Loss | Hyunsu Kim, Ho Young Jhoo, Eunhyeok Park, Sungjoo Yoo | 2830 | |
144 | 10:30 | Free-Form Video Inpainting With 3D Gated Convolution and Temporal PatchGAN | Ya-Liang Chang, Zhe Yu Liu, Kuan-Ying Lee, Winston Hsu | 1485 | |
Applications, Medical & Robotics | 145 | 10:30 | TextDragon: An End-to-End Framework for Arbitrary Shaped Text Spotting | Wei Feng, Wenhao He, Fei Yin, Xu-Yao Zhang, Cheng-Lin Liu | 727 |
146 | 10:30 | Chinese Street View Text: Large-Scale Chinese Text Reading With Partially Supervised Learning | Yipeng Sun, Jiaming Liu, Wei Liu, Junyu Han, Errui Ding, Jingtuo Liu | 2402 | |
147 | 10:30 | Deep Floor Plan Recognition Using a Multi-Task Network With Room-Boundary-Guided Attention | Zhiliang Zeng, Xianzhi Li, Ying Kin Yu, Chi-Wing Fu | 3762 | |
148 | 10:30 | GA-DAN: Geometry-Aware Domain Adaptation Network for Scene Text Detection and Recognition | Fangneng Zhan, Chuhui Xue, Shijian Lu | 187 | |
149 | 10:30 | Large-Scale Tag-Based Font Retrieval With Generative Feature Learning | Tianlang Chen, Zhaowen Wang, Ning Xu, Hailin Jin, Jiebo Luo | 3128 | |
150 | 10:30 | Convolutional Character Networks | Linjie Xing, Zhi Tian, Weilin Huang, Matthew R. Scott | 5903 | |
151 | 10:30 | Geometry Normalization Networks for Accurate Scene Text Detection | Youjiang Xu, Jiaqi Duan, Zhanghui Kuang, Xiaoyu Yue, Hongbin Sun, Yue Guan, Wayne Zhang | 4009 | |
152 | 10:30 | Symmetry-Constrained Rectification Network for Scene Text Recognition | Mingkun Yang, Yushuo Guan, Minghui Liao, Xin He, Kaigui Bian, Song Bai, Cong Yao, Xiang Bai | 258 | |
153 | 10:30 | Pushing the Frontiers of Unconstrained Crowd Counting: New Dataset and Benchmark Method | Vishwanath A. Sindagi, Rajeev Yasarla, Vishal M. Patel | 1097 |
Friday, November 1, 2019, 1330–1530 Oral 4.2A (Hall D1) Lourdes Agapito (Univ. College London), Minsu Cho (POSTECH) | |||||
---|---|---|---|---|---|
Session Title/Poster Group | Poster # | Presentation Time | Title | Author(s) | Paper ID |
Segmentation, Detection, 3D Scene Understanding | 1 | 13:30 | YOLACT: Real-Time Instance Segmentation [Video] | Daniel Bolya, Chong Zhou, Fanyi Xiao, Yong Jae Lee | 1384 |
2 | 13:35 | Expectation-Maximization Attention Networks for Semantic Segmentation [Video] | Xia Li, Zhisheng Zhong, Jianlong Wu, Yibo Yang, Zhouchen Lin, Hong Liu | 1583 | |
3 | 13:40 | Multi-Class Part Parsing With Joint Boundary-Semantic Awareness [Video] | Yifan Zhao, Jia Li, Yu Zhang, Yonghong Tian | 361 | |
4 | 13:48 | Explaining Neural Networks Semantically and Quantitatively [Video] | Runjin Chen, Hao Chen, Jie Ren, Ge Huang, Quanshi Zhang | 3492 | |
5 | 13:53 | PANet: Few-Shot Image Semantic Segmentation With Prototype Alignment [Video] | Kaixin Wang, Jun Hao Liew, Yingtian Zou, Daquan Zhou, Jiashi Feng | 3468 | |
6 | 13:58 | ShapeMask: Learning to Segment Novel Objects by Refining Shape Priors [Video] | Weicheng Kuo, Anelia Angelova, Jitendra Malik, Tsung-Yi Lin | 2768 | |
7 | 14:06 | Sequence Level Semantics Aggregation for Video Object Detection [Video] | Haiping Wu, Yuntao Chen, Naiyan Wang, Zhaoxiang Zhang | 2032 | |
8 | 14:11 | Video Object Segmentation Using Space-Time Memory Networks [Video] | Seoung Wug Oh, Joon-Young Lee, Ning Xu, Seon Joo Kim | 2838 | |
9 | 14:16 | Zero-Shot Video Object Segmentation via Attentive Graph Neural Networks [Video] | Wenguan Wang, Xiankai Lu, Jianbing Shen, David J. Crandall, Ling Shao | 2842 | |
10 | 14:24 | MeteorNet: Deep Learning on Dynamic 3D Point Cloud Sequences [Video] | Xingyu Liu, Mengyuan Yan, Jeannette Bohg | 1168 | |
11 | 14:29 | 3D Instance Segmentation via Multi-Task Metric Learning [Video] | Jean Lahoud, Bernard Ghanem, Marc Pollefeys, Martin R. Oswald | 5052 | |
12 | 14:34 | DeepGCNs: Can GCNs Go As Deep As CNNs? [Video] | Guohao Li, Matthias Müller, Ali Thabet, Bernard Ghanem | 153 | |
13 | 14:42 | Deep Hough Voting for 3D Object Detection in Point Clouds [Video] | Charles R. Qi, Or Litany, Kaiming He, Leonidas J. Guibas | 1861 | |
14 | 14:47 | M3D-RPN: Monocular 3D Region Proposal Network for Object Detection [Video] | Garrick Brazil, Xiaoming Liu | 6744 | |
15 | 14:52 | SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences [Video] | Jens Behley, Martin Garbade, Andres Milioto, Jan Quenzel, Sven Behnke, Cyrill Stachniss, Jürgen Gall | 2944 | |
16 | 15:00 | WoodScape: A Multi-Task, Multi-Camera Fisheye Dataset for Autonomous Driving [Video] | Senthil Yogamani, Ciarán Hughes, Jonathan Horgan, Ganesh Sistu, Padraig Varley, Derek O'Dea, Michal Uřičář, Stefan Milz, Martin Simon, Karl Amende, Christian Witt, Hazem Rashed, Sumanth Chennupati, Sanjaya Nayak, Saquib Mansoor, Xavier Perrotton, Patrick Pérez | 630 | |
17 | 15:05 | Scalable Place Recognition Under Appearance Change for Autonomous Driving [Video] | Anh-Dzung Doan, Yasir Latif, Tat-Jun Chin, Yu Liu, Thanh-Toan Do, Ian Reid | 3667 | |
18 | 15:10 | Exploring the Limitations of Behavior Cloning for Autonomous Driving [Video] | Felipe Codevilla, Eder Santana, Antonio M. López, Adrien Gaidon | 3693 | |
19 | 15:15 | Habitat: A Platform for Embodied AI Research [Video] | Manolis Savva, Abhishek Kadian, Oleksandr Maksymets, Yili Zhao, Erik Wijmans, Bhavana Jain, Julian Straub, Jia Liu, Vladlen Koltun, Jitendra Malik, Devi Parikh, Dhruv Batra | 4367 |
Friday, November 1, 2019, 1330–1530 Oral 4.2B (Hall D2) Victor Lempitsky (Samsung), Yu-Chiang Frank Wang (National Taiwan Univ.) | |||||
---|---|---|---|---|---|
Session Title/Poster Group | Poster # | Presentation Time | Title | Author(s) | Paper ID |
Face & Body Modeling | 20 | 13:30 | Towards Interpretable Face Recognition [Video] | Bangjie Yin, Luan Tran, Haoxiang Li, Xiaohui Shen, Xiaoming Liu | 1788 |
21 | 13:35 | Co-Mining: Deep Face Recognition With Noisy Labels [Video] | Xiaobo Wang, Shuo Wang, Jun Wang, Hailin Shi, Tao Mei | 1562 | |
22 | 13:40 | Few-Shot Adaptive Gaze Estimation [Video] | Seonwook Park, Shalini De Mello, Pavlo Molchanov, Umar Iqbal, Otmar Hilliges, Jan Kautz | 1112 | |
23 | 13:48 | Live Face De-Identification in Video [Video] | Oran Gafni, Lior Wolf, Yaniv Taigman | 1077 | |
24 | 13:53 | Face Video Deblurring Using 3D Facial Priors [Video] | Wenqi Ren, Jiaolong Yang, Senyou Deng, David Wipf, Xiaochun Cao, Xin Tong | 428 | |
25 | 13:58 | Semi-Supervised Monocular 3D Face Reconstruction With End-to-End Shape-Preserved Domain Transfer [Video] | Jingtan Piao, Chen Qian, Hongsheng Li | 1854 | |
26 | 14:06 | 3D Face Modeling From Diverse Raw Scan Data [Video] | Feng Liu, Luan Tran, Xiaoming Liu | 722 | |
27 | 14:11 | A Decoupled 3D Facial Shape Model by Adversarial Training [Video] | Victoria Fernández Abrevaya, Adnane Boukhayma, Stefanie Wuhrer, Edmond Boyer | 2899 | |
28 | 14:16 | Photo-Realistic Facial Details Synthesis From Single Image [Video] | Anpei Chen, Zhang Chen, Guli Zhang, Kenny Mitchell, Jingyi Yu | 1590 | |
29 | 14:24 | S2GAN: Share Aging Factors Across Ages and Share Aging Trends Among Individuals [Video] | Zhenliang He, Meina Kan, Shiguang Shan, Xilin Chen | 490 | |
30 | 14:29 | PuppetGAN: Cross-Domain Image Manipulation by Demonstration [Video] | Ben Usman, Nick Dufour, Kate Saenko, Chris Bregler | 502 | |
31 | 14:34 | Few-Shot Adversarial Learning of Realistic Neural Talking Head Models [Video] | Egor Zakharov, Aliaksandra Shysheya, Egor Burkov, Victor Lempitsky | 4563 | |
32 | 14:42 | Pose-Aware Multi-Level Feature Network for Human Object Interaction Detection [Video] | Bo Wan, Desen Zhou, Yongfei Liu, Rongjie Li, Xuming He | 3547 | |
33 | 14:47 | TRB: A Novel Triplet Representation for Understanding 2D Human Body [Video] | Haodong Duan, Kwan-Yee Lin, Sheng Jin, Wentao Liu, Chen Qian, Wanli Ouyang | 3786 | |
34 | 14:52 | Learning Trajectory Dependencies for Human Motion Prediction [Video] | Wei Mao, Miaomiao Liu, Mathieu Salzmann, Hongdong Li | 3937 | |
35 | 14:57 | Cross-Domain Adaptation for Animal Pose Estimation [Video] | Jinkun Cao, Hongyang Tang, Hao-Shu Fang, Xiaoyong Shen, Cewu Lu, Yu-Wing Tai | 1570 |
Friday, November 1, 2019, 1530–1800 Poster 4.2 (Hall B) | |||||
---|---|---|---|---|---|
Session Title/Poster Group | Poster # | Presentation Time | Title | Author(s) | Paper ID |
Recognition | 36 | 15:30 | NOTE-RCNN: NOise Tolerant Ensemble RCNN for Semi-Supervised Object Detection | Jiyang Gao, Jiang Wang, Shengyang Dai, Li-Jia Li, Ram Nevatia | 5454 |
37 | 15:30 | Unsupervised Out-of-Distribution Detection by Maximum Classifier Discrepancy | Qing Yu, Kiyoharu Aizawa | 5455 | |
38 | 15:30 | SBSGAN: Suppression of Inter-Domain Background Shift for Person Re-Identification | Yan Huang, Qiang Wu, JingSong Xu, Yi Zhong | 4485 | |
39 | 15:30 | Enriched Feature Guided Refinement Network for Object Detection | Jing Nie, Rao Muhammad Anwer, Hisham Cholakkal, Fahad Shahbaz Khan, Yanwei Pang, Ling Shao | 813 | |
40 | 15:30 | Deep Meta Metric Learning | Guangyi Chen, Tianren Zhang, Jiwen Lu, Jie Zhou | 1040 | |
41 | 15:30 | Discriminative Feature Transformation for Occluded Pedestrian Detection | Chunluan Zhou, Ming Yang, Junsong Yuan | 3834 | |
42 | 15:30 | Contextual Attention for Hand Detection in the Wild | Supreeth Narasimhaswamy, Zhengwei Wei, Yang Wang, Justin Zhang, Minh Hoai | 4971 | |
43 | 15:30 | Meta R-CNN: Towards General Solver for Instance-Level Low-Shot Learning | Xiaopeng Yan, Ziliang Chen, Anni Xu, Xiaoxi Wang, Xiaodan Liang, Liang Lin | 1535 | |
44 | 15:30 | Pyramid Graph Networks With Connection Attentions for Region-Based One-Shot Semantic Segmentation | Chi Zhang, Guosheng Lin, Fayao Liu, Jiushuang Guo, Qingyao Wu, Rui Yao | 3398 | |
45 | 15:30 | Presence-Only Geographical Priors for Fine-Grained Image Classification | Oisin Mac Aodha, Elijah Cole, Pietro Perona | 3674 | |
46 | 15:30 | POD: Practical Object Detection With Scale-Sensitive Network | Junran Peng, Ming Sun, Zhaoxiang Zhang, Tieniu Tan, Junjie Yan | 5684 | |
47 | 15:30 | Human Uncertainty Makes Classification More Robust | Joshua C. Peterson, Ruairidh M. Battleday, Thomas L. Griffiths, Olga Russakovsky | 6284 | |
48 | 15:30 | FCOS: Fully Convolutional One-Stage Object Detection | Zhi Tian, Chunhua Shen, Hao Chen, Tong He | 445 | |
49 | 15:30 | Self-Critical Attention Learning for Person Re-Identification | Guangyi Chen, Chunze Lin, Liangliang Ren, Jiwen Lu, Jie Zhou | 1046 | |
50 | 15:30 | Temporal Knowledge Propagation for Image-to-Video Person Re-Identification | Xinqian Gu, Bingpeng Ma, Hong Chang, Shiguang Shan, Xilin Chen | 1370 | |
51 | 15:30 | RepPoints: Point Set Representation for Object Detection | Ze Yang, Shaohui Liu, Han Hu, Liwei Wang, Stephen Lin | 3001 | |
52 | 15:30 | SegEQA: Video Segmentation Based Visual Attention for Embodied Question Answering | Haonan Luo, Guosheng Lin, Zichuan Liu, Fayao Liu, Zhenmin Tang, Yazhou Yao | 4526 | |
53 | 15:30 | No-Frills Human-Object Interaction Detection: Factorization, Layout Encodings, and Training Techniques | Tanmay Gupta, Alexander Schwing, Derek Hoiem | 2458 | |
54 | 15:30 | Cap2Det: Learning to Amplify Weak Caption Supervision for Object Detection | Keren Ye, Mingda Zhang, Adriana Kovashka, Wei Li, Danfeng Qin, Jesse Berent | 2638 | |
55 | 15:30 | No Fear of the Dark: Image Retrieval Under Varying Illumination Conditions | Tomas Jenicek, Ondřej Chum | 4874 | |
56 | 15:30 | Hierarchical Shot Detector | Jiale Cao, Yanwei Pang, Jungong Han, Xuelong Li | 2563 | |
57 | 15:30 | Few-Shot Learning With Global Class Representations | Aoxue Li, Tiange Luo, Tao Xiang, Weiran Huang, Liwei Wang | 2267 | |
58 | 15:30 | Better to Follow, Follow to Be Better: Towards Precise Supervision of Feature Super-Resolution for Small Object Detection | Junhyug Noh, Wonho Bae, Wonhee Lee, Jinhwan Seo, Gunhee Kim | 5127 | |
59 | 15:30 | Weakly Supervised Object Detection With Segmentation Collaboration | Xiaoyan Li, Meina Kan, Shiguang Shan, Xilin Chen | 855 | |
60 | 15:30 | AutoFocus: Efficient Multi-Scale Inference | Mahyar Najibi, Bharat Singh, Larry S. Davis | 2286 | |
61 | 15:30 | Leveraging Long-Range Temporal Relationships Between Proposals for Video Object Detection | Mykhailo Shvets, Wei Liu, Alexander C. Berg | 6430 | |
62 | 15:30 | Transferable Contrastive Network for Generalized Zero-Shot Learning | Huajie Jiang, Ruiping Wang, Shiguang Shan, Xilin Chen | 2323 | |
63 | 15:30 | Fast Point R-CNN | Yilun Chen, Shu Liu, Xiaoyong Shen, Jiaya Jia | 5863 | |
64 | 15:30 | Mesh R-CNN | Georgia Gkioxari, Jitendra Malik, Justin Johnson | 2473 | |
65 | 15:30 | Deep Supervised Hashing With Anchor Graph | Yudong Chen, Zhihui Lai, Yujuan Ding, Kaiyi Lin, Wai Keung Wong | 836 | |
66 | 15:30 | Detecting 11K Classes: Large Scale Object Detection Without Fine-Grained Bounding Boxes | Hao Yang, Hao Wu, Hao Chen | 3110 | |
67 | 15:30 | Re-ID Driven Localization Refinement for Person Search | Chuchu Han, Jiacheng Ye, Yunshan Zhong, Xin Tan, Chi Zhang, Changxin Gao, Nong Sang | 2024 | |
68 | 15:30 | Hierarchical Encoding of Sequential Data With Compact and Sub-Linear Storage Cost | Huu Le, Ming Xu, Tuan Hoang, Michael Milford | 3662 | |
69 | 15:30 | C-MIDN: Coupled Multiple Instance Detection Network With Segmentation Guidance for Weakly Supervised Object Detection | Yan Gao, Boxiao Liu, Nan Guo, Xiaochun Ye, Fang Wan, Haihang You, Dongrui Fan | 3723 | |
70 | 15:30 | Learning Feature-to-Feature Translator by Alternating Back-Propagation for Generative Zero-Shot Learning | Yizhe Zhu, Jianwen Xie, Bingchen Liu, Ahmed Elgammal | 1990 | |
71 | 15:30 | Deep Constrained Dominant Sets for Person Re-Identification | Leulseged Tesfaye Alemu, Marcello Pelillo, Mubarak Shah | 3272 | |
72 | 15:30 | Invariant Information Clustering for Unsupervised Image Classification and Segmentation | Xu Ji, João F. Henriques, Andrea Vedaldi | 3193 | |
Statistics, Physics, Theory & Datasets | 73 | 15:30 | Subspace Structure-Aware Spectral Clustering for Robust Subspace Clustering | Masataka Yamaguchi, Go Irie, Takahito Kawanishi, Kunio Kashino | 4366 |
74 | 15:30 | Order-Preserving Wasserstein Discriminant Analysis | Bing Su, Jiahuan Zhou, Ying Wu | 2886 | |
75 | 15:30 | LayoutVAE: Stochastic Scene Layout Generation From a Label Set | Akash Abdu Jyothi, Thibaut Durand, Jiawei He, Leonid Sigal, Greg Mori | 1245 | |
76 | 15:30 | Robust Variational Bayesian Point Set Registration | Jie Zhou, Xinke Ma, Li Liang, Yang Yang, Shijin Xu, Yuhe Liu, Sim-Heng Ong | 4022 | |
77 | 15:30 | Is an Affine Constraint Needed for Affine Subspace Clustering? | Chong You, Chun-Guang Li, Daniel P. Robinson, René Vidal | 4088 | |
78 | 15:30 | Meta-Learning to Detect Rare Objects | Yu-Xiong Wang, Deva Ramanan, Martial Hebert | 5223 | |
79 | 15:30 | New Convex Relaxations for MRF Inference With Unknown Graphs | Zhenhua Wang, Tong Liu, Qinfeng Shi, M. Pawan Kumar, Jianhua Zhang | 4928 | |
80 | 15:30 | Cluster Alignment With a Teacher for Unsupervised Domain Adaptation | Zhijie Deng, Yucen Luo, Jun Zhu | 2613 | |
81 | 15:30 | Analyzing the Variety Loss in the Context of Probabilistic Trajectory Prediction | Luca Anthony Thiede, Pratik Prabhanjan Brahma | 6578 | |
3D From Single View & RGBD | 82 | 15:30 | Deep Mesh Reconstruction From Single RGB Images via Topology Modification Networks | Junyi Pan, Xiaoguang Han, Weikai Chen, Jiapeng Tang, Kui Jia | 2601 |
83 | 15:30 | UprightNet: Geometry-Aware Camera Orientation Estimation From Single Images | Wenqi Xian, Zhengqi Li, Matthew Fisher, Jonathan Eisenmann, Eli Shechtman, Noah Snavely | 1442 | |
84 | 15:30 | Escaping Plato’s Cave: 3D Shape From Adversarial Rendering | Philipp Henzler, Niloy J. Mitra, Tobias Ritschel | 2235 | |
85 | 15:30 | Deep End-to-End Alignment and Refinement for Time-of-Flight RGB-D Module | Di Qiu, Jiahao Pang, Wenxiu Sun, Chengxi Yang | 3514 | |
86 | 15:30 | GEOBIT: A Geodesic-Based Binary Descriptor Invariant to Non-Rigid Deformations for RGB-D Images | Erickson R. Nascimento, Guilherme Potje, Renato Martins, Felipe Cadar, Mario F. M. Campos, Ruzena Bajcsy | 5114 | |
87 | 15:30 | CDTB: A Color and Depth Visual Object Tracking Dataset and Benchmark | Alan Lukežič, Ugur Kart, Jani Käpylä, Ahmed Durmush, Joni-Kristian Kämäräinen, Jiří Matas, Matej Kristan | 1743 | |
88 | 15:30 | Learning Joint 2D-3D Representations for Depth Completion | Yun Chen, Bin Yang, Ming Liang, Raquel Urtasun | 1104 | |
Face & Body | 89 | 15:30 | Make a Face: Towards Arbitrary High Fidelity Face Manipulation | Shengju Qian, Kwan-Yee Lin, Wayne Wu, Yangxiaokang Liu, Quan Wang, Fumin Shen, Chen Qian, Ran He | 782 |
90 | 15:30 | M2FPA: A Multi-Yaw Multi-Pitch High-Quality Dataset and Benchmark for Facial Pose Analysis | Peipei Li, Xiang Wu, Yibo Hu, Ran He, Zhenan Sun | 446 | |
91 | 15:30 | Fair Loss: Margin-Aware Reinforcement Learning for Deep Face Recognition | Bingyu Liu, Weihong Deng, Yaoyao Zhong, Mei Wang, Jiani Hu, Xunqiang Tao, Yaohai Huang | 3821 | |
92 | 15:30 | Face De-Occlusion Using 3D Morphable Model and Generative Adversarial Network | Xiaowei Yuan, In Kyu Park | 3750 | |
93 | 15:30 | Detecting Photoshopped Faces by Scripting Photoshop | Sheng-Yu Wang, Oliver Wang, Andrew Owens, Richard Zhang, Alexei A. Efros | 1648 | |
94 | 15:30 | Ego-Pose Estimation and Forecasting As Real-Time PD Control | Ye Yuan, Kris Kitani | 1096 | |
95 | 15:30 | End-to-End Learning for Graph Decomposition | Jie Song, Bjoern Andres, Michael J. Black, Otmar Hilliges, Siyu Tang | 2661 | |
96 | 15:30 | Laplace Landmark Localization | Joseph P. Robinson, Yuncheng Li, Ning Zhang, Yun Fu, Sergey Tulyakov | 3137 | |
97 | 15:30 | Through-Wall Human Mesh Recovery Using Radio Signals | Mingmin Zhao, Yingcheng Liu, Aniruddh Raghu, Tianhong Li, Hang Zhao, Antonio Torralba, Dina Katabi | 6700 | |
98 | 15:30 | Discriminatively Learned Convex Models for Set Based Face Recognition | Hakan Cevikalp, Golara Ghorban Dordinejad | 4079 | |
99 | 15:30 | Camera Distance-Aware Top-Down Approach for 3D Multi-Person Pose Estimation From a Single RGB Image | Gyeongsik Moon, Ju Yong Chang, Kyoung Mu Lee | 4267 | |
100 | 15:30 | Context-Aware Emotion Recognition Networks | Jiyoung Lee, Seungryong Kim, Sunok Kim, Jungin Park, Kwanghoon Sohn | 5700 | |
101 | 15:30 | Deep Head Pose Estimation Using Synthetic Images and Partial Adversarial Domain Adaption for Continuous Label Spaces | Felix Kuhnke, Jörn Ostermann | 6072 | |
Computational Photography & Graphics | 102 | 15:30 | Flare in Interference-Based Hyperspectral Cameras | Eden Sassoon, Yoav Y. Schechner, Tali Treibitz | 3464 |
103 | 15:30 | Computational Hyperspectral Imaging Based on Dimension-Discriminative Low-Rank Tensor Recovery | Shipeng Zhang, Lizhi Wang, Ying Fu, Xiaoming Zhong, Hua Huang | 1139 | |
104 | 15:30 | Deep Optics for Monocular Depth Estimation and 3D Object Detection | Julie Chang, Gordon Wetzstein | 1439 | |
105 | 15:30 | Physics-Based Rendering for Improving Robustness to Rain | Shirsendu Sukanta Halder, Jean-François Lalonde, Raoul de Charette | 6981 | |
106 | 15:30 | ARGAN: Attentive Recurrent Generative Adversarial Network for Shadow Detection and Removal | Bin Ding, Chengjiang Long, Ling Zhang, Chunxia Xiao | 4787 | |
107 | 15:30 | Deep Tensor ADMM-Net for Snapshot Compressive Imaging | Jiawei Ma, Xiao-Yang Liu, Zheng Shou, Xin Yuan | 375 | |
Low-Level & Optimization | 108 | 15:30 | Convex Relaxations for Consensus and Non-Minimal Problems in 3D Vision | Thomas Probst, Danda Pani Paudel, Ajad Chhatkuli, Luc Van Gool | 5062 |
109 | 15:30 | Pareto Meets Huber: Efficiently Avoiding Poor Minima in Robust Estimation | Christopher Zach, Guillaume Bourmaud | 3095 | |
110 | 15:30 | K-Best Transformation Synchronization | Yifan Sun, Jiacheng Zhuo, Arnav Mohan, Qixing Huang | 719 | |
111 | 15:30 | Parametric Majorization for Data-Driven Energy Minimization Methods | Jonas Geiping, Michael Moeller | 6258 | |
112 | 15:30 | A Bayesian Optimization Framework for Neural Network Compression | Xingchen Ma, Amal Rannen Triki, Maxim Berman, Christos Sagonas, Jacques Cali, Matthew B. Blaschko | 5845 | |
113 | 15:30 | HiPPI: Higher-Order Projected Power Iterations for Scalable Multi-Matching | Florian Bernard, Johan Thunberg, Paul Swoboda, Christian Theobalt | 691 | |
Language & Reasoning | 114 | 15:30 | Language-Conditioned Graph Networks for Relational Reasoning | Ronghang Hu, Anna Rohrbach, Trevor Darrell, Kate Saenko | 1119 |
115 | 15:30 | Tell, Draw, and Repeat: Generating and Modifying Images Based on Continual Linguistic Instruction | Alaaeldin El-Nouby, Shikhar Sharma, Hannes Schulz, Devon Hjelm, Layla El Asri, Samira Ebrahimi Kahou, Yoshua Bengio, Graham W. Taylor | 6105 | |
116 | 15:30 | Relation-Aware Graph Attention Network for Visual Question Answering | Linjie Li, Zhe Gan, Yu Cheng, Jingjing Liu | 3154 | |
117 | 15:30 | Unpaired Image Captioning via Scene Graph Alignments | Jiuxiang Gu, Shafiq Joty, Jianfei Cai, Handong Zhao, Xu Yang, Gang Wang | 330 | |
118 | 15:30 | Modeling Inter and Intra-Class Relations in the Triplet Loss for Zero-Shot Learning | Yannick Le Cacheux, Hervé Le Borgne, Michel Crucianu | 1023 | |
119 | 15:30 | Occlusion-Shared and Feature-Separated Network for Occlusion Relationship Reasoning | Rui Lu, Feng Xue, Menghan Zhou, Anlong Ming, Yu Zhou | 3402 | |
120 | 15:30 | Mixture-Kernel Graph Attention Network for Situation Recognition | Mohammed Suhail, Leonid Sigal | 2454 | |
121 | 15:30 | Learning Similarity Conditions Without Explicit Supervision | Reuben Tan, Mariya I. Vasileva, Kate Saenko, Bryan A. Plummer | 3618 | |
122 | 15:30 | Joint Prediction for Kinematic Trajectories in Vehicle-Pedestrian-Mixed Scenes | Huikun Bi, Zhong Fang, Tianlu Mao, Zhaoqi Wang, Zhigang Deng | 397 | |
123 | 15:30 | Learning to Caption Images Through a Lifetime by Asking Questions | Tingke Shen, Amlan Kar, Sanja Fidler | 2555 | |
124 | 15:30 | VrR-VG: Refocusing Visually-Relevant Relationships | Yuanzhi Liang, Yalong Bai, Wei Zhang, Xueming Qian, Li Zhu, Tao Mei | 2595 | |
3D From Multiview & Sensors | 125 | 15:30 | TAPA-MVS: Textureless-Aware PAtchMatch Multi-View Stereo | Andrea Romanoni, Matteo Matteucci | 2958 |
126 | 15:30 | U4D: Unsupervised 4D Dynamic Scene Understanding | Armin Mustafa, Chris Russell, Adrian Hilton | 1924 | |
127 | 15:30 | Hierarchical Point-Edge Interaction Network for Point Cloud Semantic Segmentation | Li Jiang, Hengshuang Zhao, Shu Liu, Xiaoyong Shen, Chi-Wing Fu, Jiaya Jia | 2836 | |
128 | 15:30 | Multi-Angle Point Cloud-VAE: Unsupervised Feature Learning for 3D Point Clouds From Multiple Angles by Joint Self-Reconstruction and Half-to-Half Prediction | Zhizhong Han, Xiyang Wang, Yu-Shen Liu, Matthias Zwicker | 3658 | |
129 | 15:30 | P-MVSNet: Learning Patch-Wise Matching Confidence Aggregation for Multi-View Stereo | Keyang Luo, Tao Guan, Lili Ju, Haipeng Huang, Yawei Luo | 1285 | |
Image & Video Synthesis | 130 | 15:30 | SME-Net: Sparse Motion Estimation for Parametric Video Prediction Through Reinforcement Learning | Yung-Han Ho, Chuan-Yuan Cho, Wen-Hsiao Peng, Guo-Lun Jin | 2683 |
131 | 15:30 | ClothFlow: A Flow-Based Model for Clothed Person Generation | Xintong Han, Xiaojun Hu, Weilin Huang, Matthew R. Scott | 2790 | |
132 | 15:30 | LADN: Local Adversarial Disentangling Network for Facial Makeup and De-Makeup | Qiao Gu, Guanzhi Wang, Mang Tik Chiu, Yu-Wing Tai, Chi-Keung Tang | 5321 | |
133 | 15:30 | Point-to-Point Video Generation | Tsun-Hsuan Wang, Yen-Chi Cheng, Chieh Hubert Lin, Hwann-Tzong Chen, Min Sun | 226 | |
134 | 15:30 | Semantics-Enhanced Adversarial Nets for Text-to-Image Synthesis | Hongchen Tan, Xiuping Liu, Xin Li, Yi Zhang, Baocai Yin | 3290 | |
135 | 15:30 | VTNFP: An Image-Based Virtual Try-On Network With Body and Clothing Feature Preservation | Ruiyun Yu, Xiaoqi Wang, Xiaohui Xie | 3665 | |
136 | 15:30 | Boundless: Generative Adversarial Networks for Image Extension | Piotr Teterwak, Aaron Sarna, Dilip Krishnan, Aaron Maschinot, David Belanger, Ce Liu, William T. Freeman | 1422 | |
137 | 15:30 | Image Synthesis From Reconfigurable Layout and Style | Wei Sun, Tianfu Wu | 178 | |
138 | 15:30 | Attribute Manipulation Generative Adversarial Networks for Fashion Images | Kenan E. Ak, Joo Hwee Lim, Jo Yew Tham, Ashraf A. Kassim | 4374 | |
139 | 15:30 | Few-Shot Unsupervised Image-to-Image Translation | Ming-Yu Liu, Xun Huang, Arun Mallya, Tero Karras, Timo Aila, Jaakko Lehtinen, Jan Kautz | 3116 | |
140 | 15:30 | Very Long Natural Scenery Image Prediction by Outpainting | Zongxin Yang, Jian Dong, Ping Liu, Yi Yang, Shuicheng Yan | 3554 | |
Applications, Medical & Robotics | 141 | 15:30 | Scaling Recurrent Models via Orthogonal Approximations in Tensor Trains | Ronak Mehta, Rudrasis Chakraborty, Yunyang Xiong, Vikas Singh | 2163 |
142 | 15:30 | A Deep Cybersickness Predictor Based on Brain Signal Analysis for Virtual Reality Contents | Jinwoo Kim, Woojae Kim, Heeseok Oh, Seongmin Lee, Sanghoon Lee | 4430 | |
143 | 15:30 | Learning With Unsure Data for Medical Image Diagnosis | Botong Wu, Xinwei Sun, Lingjing Hu, Yizhou Wang | 466 | |
144 | 15:30 | Recursive Cascaded Networks for Unsupervised Medical Image Registration | Shengyu Zhao, Yue Dong, Eric I-Chao Chang, Yan Xu | 4449 |