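This list currently covers ICCV 2019 and will later be updated to cover ICCV 2021.

Awesome-ICCV2019: a running list of all accepted papers (more than 1,000 ICCV 2019 papers added so far)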

Last updated: 2021/01/25

Reply "ICCV2019" to 【计算机视觉联盟】 (Computer Vision Alliance) to receive the Baidu Cloud download links for all papers.

Update log

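2019/07/26 * - Added 28 accepted papers from IIAI
2019/07/28 * - Added 11 ICCV 2019 papers from Megvii
2019/08/28 * - Added 31 oral papers
2019/08/29 * - Added 116 more ICCV 2019 papers
2019/08/29 * - Added 35 ICCV 2019 papers with open-source code
2019/09/12 * - Added ICCV best papers, 1998-2017
2019/10/26 * - Updated the Baidu Cloud links for all ICCV papers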

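About ICCV

ICCV, the IEEE International Conference on Computer Vision, is organized by the IEEE. Together with the Conference on Computer Vision and Pattern Recognition (CVPR) and the European Conference on Computer Vision (ECCV), it is regarded as one of the three top conferences in computer vision. It holds the highest rating in the Australian ICT conference ranking and in the China Computer Federation ranking, among others, and is highly regarded in the field. Unlike CVPR, which is held every year in the United States, and ECCV, which is held only in Europe, ICCV is held every two years at locations around the world. ICCV has a very low acceptance rate and is widely considered the most prestigious of the three. At the previous edition, 621 of the submitted papers were accepted, an acceptance rate of 28.9%; posters, spotlights, and orals accounted for 24.61%, 2.61%, and 2.09%, respectively.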

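ICCV General Chairs

One of this year's general chairs is Xiaoou Tang (汤晓鸥), head of the Department of Information Engineering at the Chinese University of Hong Kong, who is also associate director of the Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, and founder of SenseTime. The other three general chairs are Professor Kyoung Mu Lee of Seoul National University, Professor David Forsyth of the University of Illinois at Urbana-Champaign, and Professor Marc Pollefeys of ETH Zurich.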

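Venue

The final deadline for submitting supplementary material for this edition was March 29. The conference is held from October 27 to November 2, 2019, at the COEX convention center in Seoul, South Korea.

ICCV 2019, one of the three top computer vision conferences, has now announced its final paper decisions: 1,077 papers were accepted, an acceptance rate of 25.02%.

IDs of the accepted ICCV 2019 papers: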

24 17 25 30 31 33 37 41 45 49 59 60 69 84 91 93 102 105 110 126 141 153 157 159 171 175 176 178 184 187 213 226 229 238 242 245 247 251 258 281 285 294 302 320 330 350 351 354 356 361 367 375 376 380 382 383 385 397 405 406 407 409 421 426 428 445 446 450 456 464 466 479 490 491 496 502 504 507 508 520 531 539 568 579 582 585 596 609 620 622 630 634 636 668 669 673 678 680 691 701 706 711 715 719 720 722 727 732 733 735 737 742 751 754 756 759 767 768 770 774 778 782 791 797 800 806 809 811 813 818 827 832 836 838 839 845 855 862 868 876 877 879 888 890 892 899 900 902 904 905 909 912 917 920 921 940 943 959 964 965 976 981 989 1001 1005 1006 1011 1017 1020 1023 1031 1032 1039 1040 1042 1045 1046 1057 1062 1067 1077 1083 1092 1093 1096 1097 1098 1104 1105 1111 1112 1113 1119 1135 1139 1142 1148 1160 1163 1165 1166 1168 1174 1180 1182 1197 1200 1205 1206 1211 1215 1223 1233 1245 1249 1252 1272 1277 1285 1288 1291 1323 1330 1334 1335 1342 1343 1356 1370 1378 1381 1384 1390 1394 1395 1403 1404 1406 1411 1412 1417 1422 1426 1428 1434 1439 1442 1452 1455 1457 1463 1477 1479 1485 1488 1501 1517 1527 1535 1538 1542 1550 1551 1552 1562 1565 1570 1574 1581 1583 1585 1586 1590 1592 1596 1597 1616 1621 1624 1630 1638 1639 1642 1643 1647 1648 1650 1652 1656 1657 1667 1672 1675 1681 1693 1700 1705 1706 1714 1743 1746 1768 1772 1773 1774 1779 1785 1788 1805 1811 1819 1820 1823 1826 1827 1829 1844 1850 1854 1855 1859 1860 1861 1863 1865 1866 1870 1874 1879 1881 1882 1911 1917 1919 1924 1926 1933 1942 1943 1959 1960 1963 1967 1970 1971 1972 1982 1983 1984 1990 2005 2010 2012 2017 2024 2029 2032 2037 2040 2043 2055 2065 2070 2077 2097 2101 2115 2126 2127 2132 2134 2140 2148 2149 2155 2157 2160 2163 2169 2177 2179 2205 2206 2209 2214 2223 2230 2235 2240 2245 2246 2247 2248 2259 2266 2267 2272 2275 2277 2282 2284 2286 2288 2289 2290 2291 2303 2304 2312 2322 2323 2336 2337 2338 2339 2344 2353 2355 2359 2385 2390 2391 2392 2397 2402 2406 2413 2419 2420 2421 2436 2437 2441 2448 2450 
2454 2458 2470 2473 2478 2481 2490 2495 2498 2501 2511 2517 2521 2525 2531 2545 2547 2548 2551 2553 2555 2556 2557 2561 2563 2564 2571 2578 2580 2595 2601 2603 2607 2608 2609 2610 2613 2615 2619 2622 2633 2634 2637 2638 2642 2660 2661 2679 2683 2684 2690 2717 2725 2732 2739 2740 2768 2790 2792 2795 2796 2798 2799 2814 2820 2830 2833 2836 2838 2840 2842 2850 2855 2857 2862 2865 2872 2886 2899 2908 2912 2919 2927 2928 2939 2944 2957 2958 2962 2963 2964 2968 2979 2980 3001 3016 3034 3035 3036 3051 3058 3059 3060 3068 3072 3080 3095 3102 3104 3107 3110 3114 3116 3120 3123 3127 3128 3133 3136 3137 3139 3140 3141 3145 3151 3154 3164 3166 3172 3180 3185 3193 3197 3198 3203 3215 3220 3222 3233 3239 3242 3243 3246 3260 3272 3273 3280 3281 3286 3290 3293 3300 3315 3321 3326 3327 3339 3345 3346 3352 3359 3361 3372 3375 3378 3379 3380 3382 3391 3394 3398 3402 3403 3410 3419 3430 3435 3436 3438 3439 3443 3458 3462 3463 3464 3468 3476 3489 3492 3494 3496 3502 3505 3508 3510 3514 3518 3521 3523 3540 3544 3547 3548 3552 3554 3555 3556 3559 3571 3572 3589 3592 3593 3596 3609 3611 3618 3620 3622 3627 3632 3636 3638 3646 3652 3655 3658 3662 3665 3667 3670 3674 3676 3682 3693 3695 3700 3717 3718 3723 3729 3734 3735 3739 3740 3743 3749 3750 3758 3761 3762 3767 3768 3772 3786 3787 3788 3795 3807 3808 3813 3818 3821 3824 3832 3834 3838 3857 3860 3867 3869 3879 3882 3897 3919 3921 3923 3926 3932 3933 3937 3941 3942 3949 3964 3971 3987 3988 3992 3998 4006 4007 4009 4019 4021 4022 4024 4032 4033 4034 4042 4047 4057 4067 4075 4079 4085 4088 4090 4092 4093 4094 4097 4102 4105 4112 4113 4118 4121 4122 4124 4125 4130 4144 4151 4154 4159 4162 4164 4167 4168 4171 4176 4192 4194 4199 4211 4212 4217 4237 4245 4246 4248 4249 4253 4267 4275 4285 4289 4293 4305 4309 4311 4330 4341 4342 4343 4346 4365 4366 4367 4370 4374 4406 4410 4414 4428 4430 4434 4446 4449 4453 4481 4485 4500 4506 4509 4526 4530 4533 4534 4541 4549 4560 4562 4563 4576 4585 4599 4600 4602 4614 4618 4634 4647 4649 4660 4666 4672 4690 
4697 4701 4702 4712 4721 4737 4757 4765 4766 4768 4785 4787 4794 4798 4811 4825 4835 4846 4848 4851 4856 4861 4865 4870 4874 4881 4890 4901 4903 4910 4925 4928 4943 4946 4971 4996 5005 5008 5011 5016 5018 5023 5029 5051 5052 5053 5062 5073 5088 5099 5103 5105 5112 5114 5116 5127 5128 5129 5131 5135 5136 5148 5158 5161 5162 5164 5171 5172 5174 5180 5183 5184 5195 5196 5201 5215 5223 5235 5264 5269 5274 5280 5290 5292 5296 5301 5302 5314 5321 5323 5338 5344 5348 5370 5378 5384 5393 5412 5413 5417 5423 5437 5444 5454 5455 5457 5465 5519 5532 5540 5548 5576 5582 5594 5601 5626 5649 5651 5657 5662 5672 5683 5684 5696 5698 5700 5704 5705 5725 5728 5742 5752 5797 5801 5810 5819 5823 5827 5844 5845 5853 5863 5869 5880 5892 5903 5925 5927 5935 5948 5950 5952 5957 5961 5968 6009 6021 6026 6034 6035 6036 6072 6083 6105 6132 6174 6175 6178 6191 6204 6209 6215 6221 6232 6250 6258 6267 6284 6287 6289 6294 6296 6302 6328 6329 6352 6367 6372 6379 6385 6398 6400 6403 6404 6405 6410 6414 6423 6428 6430 6433 6467 6471 6480 6483 6496 6506 6512 6519 6521 6529 6532 6534 6554 6563 6568 6578 6579 6597 6602 6608 6622 6625 6640 6668 6691 6696 6700 6740 6744 6752 6780 6783 6829 6886 6887 6929 6944 6968 6978 6981

Inception Institute of Artificial Intelligence (IIAI, 起源人工智能研究院): 28 papers

IIAI homepage: www.inceptioniai.org/

Unsupervised Video Object Segmentation via Attentive Graph Neural Networks
DUAL-GLOWs: Conditional Flow-Based Generative Models for Inter-Modality Transfer in Brain Imaging
Unsupervised Graph Association for Person Re-identification
Relational Attention Network for Crowd Counting
Attentional Neural Fields for Crowd Counting
Learning Compositional Neural Information Fusion for Human Parsing
RANet: Ranking Attention Network for Fast Video Object Segmentation
Learning to Mask Visible Regions for Occluded Pedestrian Detection
Boosted Feature Guided Refinement Network for Single-Shot Detection
Deep Contextual Attention for Human-Object Interaction Detection
Learning the Model Update for Siamese Trackers
3C-Net: Category Count and Center Loss for Weakly-Supervised Action Localization
Learning Rich Features at High-Speed for Single-Shot Object Detection
Transductive learning for zero-shot object detection
Ground-to-aerial Image Geo-localization with a Hard Exemplar Reweighting Triplet Loss
Towards Bridging Semantic Gap to Improve Semantic Segmentation
Adversarial Defense by Restricting the Hidden Space of Deep Neural Networks
Motion Deblurring via Human-Aware Attention Network
Gaussian Affinity for Max-margin Class Imbalanced Learning
A Deep Step Pattern Representation for Multimodal Retinal Image Registration
SegEQA: Video Segmentation based Visual Attention for Embodied Question Answering
Reciprocal Multi-Layer Subspace Learning for Multi-View Clustering
Scoot: A Perceptual Metric for Facial Sketches
EGNet: Edge Guidance Network for Salient Object Detection
PointAE: Point Auto-encoder for 3D Statistical Shape and Texture Modelling
Understanding Human Gaze Communication by Spatio-temporal Graph Reasoning
Optimizing the F-measure for Threshold-free Salient Object Detection
SynDeMo: Synergistic Deep Feature Alignment for Joint Learning of Depth and Ego-Motion

Megvii Research (旷视研究院): 11 papers accepted to ICCV 2019

1. Objects365: A Large-scale, High-quality Dataset for Object Detection

2. ThunderNet: Towards Real-time Generic Object Detection

3. Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network

4. Semi-supervised Skin Detection by Network with Mutual Guidance

5. Semi-Supervised Video Salient Object Detection Using Pseudo-Labels

6. Disentangled Image Matting

7. Re-ID Driven Localization Refinement for Person Search

8. Vehicle Re-identification with Viewpoint-aware Metric Learning

9. MetaPruning: Meta Learning for Automatic Neural Network Channel Pruning

10. Symmetry-constrained Rectification Network for Scene Text Recognition

11. Learning to Paint with Model-based Deep Reinforcement Learning

2019 ICCV Oral

https://arxiv.org/abs/1908.00382

Interpolated Convolutional Networks for 3D Point Cloud Understanding

https://arxiv.org/abs/1908.04512

Memory-Based Neighbourhood Embedding for Visual Recognition

https://arxiv.org/abs/1908.04992

Learning Trajectory Dependencies for Human Motion Prediction

https://arxiv.org/abs/1908.05436

Domain Adaptation for Structured Output via Discriminative Patch Representations

https://arxiv.org/abs/1901.05427

Deep Non-Rigid Structure from Motion

https://arxiv.org/abs/1908.00052

Scalable Place Recognition Under Appearance Change for Autonomous Driving

https://arxiv.org/abs/1908.00178

Restoration of Non-rigidly Distorted Underwater Images using a Combination of Compressive Sensing and Local Polynomial Image Representations

https://arxiv.org/abs/1908.01940

Consensus Maximization Tree Search Revisited

https://arxiv.org/abs/1908.02021

Weakly Supervised Energy-Based Learning for Action Segmentation

Self-similarity Grouping: A Simple Unsupervised Cross Domain Adaptation Approach for Person Re-identification

https://arxiv.org/abs/1811.10144

Controllable Artistic Text Style Transfer via Shape-Matching GAN

https://arxiv.org/abs/1905.01354

Multi-Agent Reinforcement Learning Based Frame Sampling for Effective Untrimmed Video Recognition

https://arxiv.org/abs/1907.13369

Expectation-Maximization Attention Networks for Semantic Segmentation

https://arxiv.org/abs/1907.13426

VideoBERT: A Joint Model for Video and Language Representation Learning

https://arxiv.org/abs/1904.01766

CARAFE: Content-Aware ReAssembly of FEatures

https://arxiv.org/pdf/1905.02188.pdf

Habitat: A Platform for Embodied AI Research

https://arxiv.org/abs/1904.01201

Equivariant Multi-View Networks

https://arxiv.org/abs/1904.00993

PointFlow: 3D Point Cloud Generation with Continuous Normalizing Flows

https://arxiv.org/abs/1906.12320

Learnable Triangulation of Human Pose

https://arxiv.org/abs/1905.05754

Learning Implicit Generative Models by Matching Perceptual Features

https://arxiv.org/abs/1904.02762v1

COCO-GAN: Generation by Parts via Conditional Coordinating

https://arxiv.org/abs/1904.00284

SlowFast Networks for Video Recognition

https://arxiv.org/abs/1812.03982

Exploring Randomly Wired Neural Networks for Image Recognition

https://arxiv.org/abs/1904.01569

Can GCNs Go as Deep as CNNs?

https://arxiv.org/abs/1904.03751

Deep SR-ITM: Joint Learning of Super-resolution and Inverse Tone-Mapping for 4K UHD HDR Applications

https://arxiv.org/abs/1904.11176

Meta-Sim: Learning to Generate Synthetic Datasets

https://arxiv.org/abs/1904.11621

Deep Hough Voting for 3D Object Detection in Point Clouds

https://arxiv.org/abs/1904.09664

Variational Adversarial Active Learning

https://arxiv.org/abs/1904.00370

Towards Unconstrained End-to-End Text Spotting

https://arxiv.org/abs/1908.09231

Non-local Recurrent Neural Memory for Supervised Sequence Modeling

https://arxiv.org/abs/1908.09535

Stochastic Filter Groups for Multi-Task CNNs: Learning Specialist and Generalist Convolution Kernels

https://arxiv.org/abs/1908.09597

116 more ICCV 2019 papers

Similarity-Preserving Knowledge Distillation

https://arxiv.org/abs/1907.09682

GA-DAN: Geometry-Aware Domain Adaptation Network for Scene Text Detection and Recognition

https://arxiv.org/abs/1907.09653

Tell, Draw, and Repeat: Generating and modifying images based on continual linguistic instruction

https://arxiv.org/pdf/1811.09845.pdf

Semantic Adversarial Attacks: Parametric Transformations That Fool Deep Classifiers

https://arxiv.org/abs/1904.08489

nocaps: novel object captioning at scale

https://arxiv.org/abs/1812.08658

ThunderNet: Towards Real-time Generic Object Detection

https://arxiv.org/abs/1903.11752

Scene Graph Prediction with Limited Labels

https://arxiv.org/abs/1904.11622

Ego-Pose Estimation and Forecasting as Real-Time PD Control

https://arxiv.org/abs/1906.03173

The Trajectron: Probabilistic Multi-Agent Trajectory Modeling with Dynamic Spatiotemporal Graphs

https://arxiv.org/abs/1810.05993

End-to-End Learning of Representations for Asynchronous Event-Based Data

https://arxiv.org/abs/1904.08245

Efficient Learning on Point Clouds with Basis Point Sets

https://arxiv.org/abs/1908.09186

Dynamic Kernel Distillation for Efficient Pose Estimation in Videos

https://arxiv.org/abs/1908.09216

Single-Stage Multi-Person Pose Machines

https://arxiv.org/abs/1908.09220

Towards Unsupervised Image Captioning with Shared Multimodal Embeddings

https://arxiv.org/abs/1908.09317

advPattern: Physical-World Attacks on Deep Person Re-Identification via Adversarially Transformable Patterns

https://arxiv.org/abs/1908.09327

Shape-Aware Human Pose and Shape Reconstruction Using Multi-View Images

https://arxiv.org/abs/1908.09464

Relation Distillation Networks for Video Object Detection

https://arxiv.org/abs/1908.09511

Object-Driven Multi-Layer Scene Decomposition From a Single Image

https://arxiv.org/abs/1908.09521

Embarrassingly Simple Binary Representation Learning

https://arxiv.org/abs/1908.09573

Moulding Humans: Non-parametric 3D Human Shape Estimation from Single Images

https://arxiv.org/abs/1908.00439

Learning the Model Update for Siamese Trackers

https://arxiv.org/abs/1908.00855

Distilling Knowledge From a Deep Pose Regressor Network

https://arxiv.org/abs/1908.00858

Permutation-invariant Feature Restructuring for Correlation-aware Image Set-based Recognition

https://arxiv.org/abs/1908.01174

ARGAN: Attentive Recurrent Generative Adversarial Network for Shadow Detection and Removal

https://arxiv.org/abs/1908.01323

Pixel2Mesh++: Multi-View 3D Mesh Generation via Deformation

https://arxiv.org/abs/1908.01491

View N-gram Network for 3D Object Retrieval

https://arxiv.org/abs/1908.01958

Semi-supervised Skin Detection by Network with Mutual Guidance

https://arxiv.org/abs/1908.01977

Deep Self-Learning From Noisy Labels

https://arxiv.org/abs/1908.02160

Learning Aberrance Repressed Correlation Filters for Real-Time UAV Tracking

https://arxiv.org/abs/1908.02231

Symmetric Graph Convolutional Autoencoder for Unsupervised Graph Representation Learning

https://arxiv.org/abs/1908.02441

Expert Sample Consensus Applied to Camera Re-Localization

https://arxiv.org/abs/1908.02484

SpatialSense: An Adversarially Crowdsourced Benchmark for Spatial Relation Recognition

https://arxiv.org/abs/1908.02660

GP2C: Geometric Projection Parameter Consensus for Joint 3D Pose and Focal Length Estimation in the Wild

https://arxiv.org/abs/1908.02809

SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences

https://arxiv.org/abs/1904.01416

Multi-Angle Point Cloud-VAE: Unsupervised Feature Learning for 3D Point Clouds from Multiple Angles by Joint Self-Reconstruction and Half-to-Half Prediction

https://arxiv.org/abs/1907.12704

Orientation-aware Semantic Segmentation on Icosahedron Spheres

https://arxiv.org/abs/1907.12849

EMPNet: Neural Localisation and Mapping using Embedded Memory Points

https://arxiv.org/abs/1907.13268

SceneGraphNet: Neural Message Passing for 3D Indoor Scene Augmentation

https://arxiv.org/abs/1907.11308

On the Design of Black-box Adversarial Examples by Leveraging Gradient-free Optimization and Operator Splitting Method

https://arxiv.org/abs/1907.11684

Goal-Driven Sequential Data Abstraction

https://arxiv.org/abs/1907.12336

Recursive Cascaded Networks for Unsupervised Medical Image Registration

https://arxiv.org/abs/1907.12353

Learn to Scale: Generating Multipolar Normalized Density Map for Crowd Counting

https://arxiv.org/abs/1907.12428

HoloGAN: Unsupervised learning of 3D representations from natural images

https://arxiv.org/abs/1904.01326

MetaPruning: Meta Learning for Automatic Neural Network Channel Pruning

https://arxiv.org/abs/1903.10258

FrameNet: Learning Local Canonical Frames of 3D Surfaces from a Single RGB Image

https://arxiv.org/pdf/1903.12305.pdf

Face De-occlusion using 3D Morphable Model and Generative Adversarial

http://image.inha.ac.kr/paper/ICCV2019_Xaiowei.pdf

Deep Meta Learning for Real-Time Target-Aware Visual Tracking

https://arxiv.org/pdf/1712.09153.pdf

Switchable Whitening for Deep Representation Learning

https://arxiv.org/abs/1904.09739

Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks with Octave Convolution

https://arxiv.org/abs/1904.05049

Multi-layer Depth and Epipolar Feature Transformers for 3D Scene Reconstruction

https://arxiv.org/abs/1902.06729

Task2Vec: Task Embedding for Meta-Learning

https://arxiv.org/abs/1902.03545

ACE: Adapting to Changing Environments for Semantic Segmentation

https://arxiv.org/pdf/1904.06268.pdf

Few-shot Object Detection via Feature Reweighting

https://arxiv.org/pdf/1812.01866.pdf

Disentangling Propagation and Generation for Video Prediction

https://arxiv.org/pdf/1812.00452.pdf

An Empirical Study of Spatial Attention Mechanisms in Deep Networks

https://arxiv.org/pdf/1904.05873.pdf

Fashion++: Minimal Edits for Outfit Improvement

https://arxiv.org/pdf/1904.09261.pdf

Align2Ground: Weakly Supervised Phrase Grounding Guided by Image-Caption Alignment

https://arxiv.org/pdf/1903.11649.pdf

Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded

https://arxiv.org/pdf/1902.03751.pdf

SplitNet: Sim2Sim and Task2Task Transfer for Embodied Visual Navigation

https://arxiv.org/pdf/1905.07512.pdf

EM-Fusion: Dynamic Object-Level SLAM with Probabilistic Data Association

https://arxiv.org/abs/1904.11781

Texture Fields: Learning Texture Representations in Function Space

https://arxiv.org/abs/1905.07259

AMASS: Archive of Motion Capture as Surface Shapes

https://arxiv.org/abs/1904.03278

End-to-end Learning for Graph Decomposition

https://arxiv.org/pdf/1812.09737.pdf

Towards Multi-pose Guided Virtual Try-on Network

https://arxiv.org/abs/1902.11026

Learning to Reconstruct 3D Manhattan Wireframes from a Single Image

https://arxiv.org/abs/1905.07482

Coherent Semantic Attention for Image Inpainting

https://arxiv.org/abs/1905.12384

LayoutVAE: Stochastic Scene Layout Generation from a Label Set

https://arxiv.org/abs/1907.10719

Co-Evolutionary Compression for Unpaired Image Translation

https://arxiv.org/abs/1907.10804

Enhancing Adversarial Example Transferability with an Intermediate Level Attack

https://arxiv.org/abs/1907.10823

Simultaneous multi-view instance detection with learned geometric soft-constraints

https://arxiv.org/abs/1907.10892

Gated2Depth: Real-time Dense Lidar from Gated Images

https://www.cs.princeton.edu/~fheide/papers/Gated2Depth_preprint.pdf

Moment Matching for Multi-Source Domain Adaptation

https://arxiv.org/abs/1812.01754

Learning Compositional Representations for Few-Shot Recognition

https://sites.google.com/view/comprepr/home

Digging Into Self-Supervised Monocular Depth Estimation

https://arxiv.org/pdf/1806.01260.pdf

Deep Interpretable Non-Rigid Structure from Motion

https://arxiv.org/pdf/1902.10840.pdf

PRECOG: PREdiction Conditioned On Goals in Visual Multi-Agent Settings

https://arxiv.org/pdf/1905.01296.pdf

Lifelong GAN: Continual Learning for Conditional Image Generation

https://arxiv.org/abs/1907.10107

Cap2Det: Learning to Amplify Weak Caption Supervision for Object Detection

https://arxiv.org/abs/1907.10164

Towards Adversarially Robust Object Detection

https://arxiv.org/abs/1907.10310

6-DOF GraspNet: Variational Grasp Generation for Object Manipulation

https://arxiv.org/abs/1905.10520

Analyzing the Variety Loss in the Context of Probabilistic Trajectory Prediction

https://arxiv.org/abs/1907.10178

DAFL: Data-Free Learning of Student Networks

https://arxiv.org/abs/1904.01186

Multi-adversarial Faster-RCNN for Unrestricted Object Detection

https://arxiv.org/abs/1907.10343

Boosting Few-Shot Visual Learning with Self-Supervision

https://arxiv.org/abs/1906.05186

A Quaternion-based Certifiably Optimal Solution to the Wahba Problem with Outliers

https://arxiv.org/abs/1905.12536

Embodied Visual Recognition

Rethinking ImageNet Pre-training

https://arxiv.org/abs/1811.08883

TensorMask: A Foundation for Dense Object Segmentation

https://arxiv.org/abs/1903.12174

3D Point Cloud Learning for Large-scale Environment Analysis and Place Recognition

https://arxiv.org/abs/1812.07050

Selectivity or Invariance: Boundary-aware Salient Object Detection

https://arxiv.org/pdf/1812.10066.pdf

Creativity Inspired Zero-Shot Learning

https://arxiv.org/abs/1904.01109

HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips

https://arxiv.org/abs/1906.03327

Correlation Congruence for Knowledge Distillation

https://arxiv.org/abs/1904.01802

VATEX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research

https://arxiv.org/abs/1904.03493

Episodic Training for Domain Generalization

https://arxiv.org/abs/1902.00113

GarNet: A Two-stream Network for Fast and Accurate 3D Cloth Draping

https://arxiv.org/abs/1811.10983v2

Semi-supervised Domain Adaptation via Minimax Entropy

https://arxiv.org/abs/1904.06487

xR-EgoPose: Egocentric 3D Human Pose from an HMD Camera

https://arxiv.org/abs/1907.10045

Canonical Surface Mapping via Geometric Cycle Consistency

https://arxiv.org/abs/1907.10043

Incremental Class Discovery for Semantic Segmentation with RGBD Sensing

https://arxiv.org/abs/1907.10008

U4D: Unsupervised 4D Dynamic Scene Understanding

https://arxiv.org/abs/1907.09905

BMN: Boundary-Matching Network for Temporal Action Proposal Generation

https://arxiv.org/abs/1907.09702

SPGNet: Semantic Prediction Guidance for Scene Parsing

https://arxiv.org/abs/1908.09798

Larger Norm More Transferable: An Adaptive Feature Norm Approach for Unsupervised Domain Adaptation

https://arxiv.org/abs/1811.07456

DUP-Net: Denoiser and Upsampler Network for 3D Adversarial Point Clouds Defense

https://arxiv.org/abs/1812.11017

Closed-Form Optimal Two-View Triangulation Based on Angular Errors

https://arxiv.org/abs/1903.09115

Learning Combinatorial Embedding Networks for Deep Graph Matching

https://arxiv.org/abs/1904.00597

A Novel Unsupervised Camera-aware Domain Adaptation Framework for Person Re-identification

https://arxiv.org/abs/1904.03425

Remote Heart Rate Measurement from Highly Compressed Facial Videos: an End-to-end Deep Learning Solution with Video Enhancement

https://arxiv.org/abs/1907.11921

Symmetry-constrained Rectification Network for Scene Text Recognition

https://arxiv.org/abs/1908.01957

STM: SpatioTemporal and Motion Encoding for Action Recognition

https://arxiv.org/abs/1908.02486

Explicit Shape Encoding for Real-Time Instance Segmentation

https://arxiv.org/abs/1908.04067

Few-Shot Learning with Global Class Representations

https://arxiv.org/abs/1908.05257

Symmetric Cross Entropy for Robust Learning with Noisy Labels

https://arxiv.org/abs/1908.06112

Human Mesh Recovery from Monocular Images via a Skeleton-disentangled Representation

https://arxiv.org/abs/1908.07172

DADA: Depth-Aware Domain Adaptation in Semantic Segmentation

https://arxiv.org/abs/1904.01886

35 more ICCV 2019 papers with open-source code

Bidirectional One-Shot Unsupervised Domain Mapping

https://github.com/tomercohen11/BiOST

Joint Monocular 3D Detection and Tracking

https://arxiv.org/abs/1811.10742

https://github.com/ucbdrive/3d-vehicle-tracking

MonoLoco: Monocular 3D Pedestrian Localization and Uncertainty Estimation

https://arxiv.org/abs/1906.06059

https://github.com/vita-epfl/monoloco

Mask-ShadowGAN: Learning to Remove Shadows from Unpaired Data

https://github.com/xw-hu/Mask-ShadowGAN

Towards High-Resolution Salient Object Detection

https://arxiv.org/abs/1908.07274

https://github.com/yi94code/HRSOD

Confidence Regularized Self-Training

https://arxiv.org/abs/1908.09822

https://github.com/yzou2/CRST

Optimizing the F-measure for Threshold-free Salient Object Detection

http://data.kaizhao.net/publications/iccv2019fmeasure.pdf

https://github.com/zeakey/iccv2019-fmeasure

Perspective-Guided Convolution Networks for Crowd Counting

https://github.com/Zhaoyi-Yan/PGCNet

End-to-End Wireframe Parsing

https://arxiv.org/abs/1905.03246

https://github.com/zhou13/lcnn

Temporal Attentive Alignment for Large-Scale Video Domain Adaptation

https://arxiv.org/abs/1907.12743

http://github.com/cmhungsteve/TA3N

From Open Set to Closed Set: Counting Objects by Spatial Divide-and-Conquer

https://arxiv.org/abs/1908.06473

https://github.com/xhp-hust-2018-2011/S-DCNet

Free-form Video Inpainting with 3D Gated Convolution and Temporal PatchGAN

https://arxiv.org/abs/1904.10247

https://github.com/amjltc295/Free-Form-Video-Inpainting

What Would You Expect? Anticipating Egocentric Actions with Rolling-Unrolling LSTMs and Modality Attention

https://arxiv.org/pdf/1905.09035.pdf

https://github.com/antoninofurnari/rulstm

CompenNet++: End-to-end Full Projector Compensation

https://github.com/BingyaoHuang/CompenNet-plusplus

Pose-aware Dynamic Attention for Human Object Interaction Detection

https://github.com/bobwan1995/Pose-aware-Dynamic-Attention-for-Human-Object-Interaction-Detection

Temporally-Aggregating Spatial Encoder-Decoder for Video Saliency Detection

https://github.com/kylemin/TASED-Net

PU-GAN: a Point Cloud Upsampling Adversarial Network

https://arxiv.org/abs/1907.10844

https://github.com/liruihui/PU-GAN

A Closed-form Solution to Universal Style Transfer

https://arxiv.org/abs/1906.00668

https://github.com/lu-m13/OptimalStyleTransfer

Video Face Clustering with Unknown Number of Clusters

https://github.com/makarandtapaswi/BallClustering_ICCV2019

TSM: Temporal Shift Module for Efficient Video Understanding

https://arxiv.org/abs/1811.08383

https://github.com/mit-han-lab/temporal-shift-module

Camera Distance-aware Top-down Approach for 3D Multi-person Pose Estimation from a Single RGB Image

https://arxiv.org/abs/1907.11346

https://github.com/mks0601/3DMPPE_ROOTNET_RELEASE

3D-RelNet: Joint Object and Relational Network for 3D Prediction

https://arxiv.org/pdf/1906.02729.pdf

https://github.com/nileshkulkarni/relative3d

Few-shot Unsupervised Image-to-Image Translation

https://arxiv.org/abs/1905.01723

https://github.com/nvlabs/FUNIT/

Metric Learning with HORDE: High-Order Regularizer for Deep Embeddings

https://arxiv.org/abs/1908.02735

https://github.com/pierre-jacob/ICCV2019-Horde

Model Vulnerability to Distributional Shifts over Image Transformation Sets

https://arxiv.org/abs/1903.11900

https://github.com/ricvolpi/domain-shift-robustness

Language-Conditioned Graph Networks for Relational Reasoning

https://arxiv.org/abs/1905.04405

https://github.com/ronghanghu/lcgn

Domain Intersection and Domain Difference

https://github.com/sagiebenaim/DomainIntersectionDifference

Probabilistic Face Embeddings

https://arxiv.org/abs/1904.09658

https://github.com/seasonSH/Probabilistic-Face-Embeddings

Counting with Focus for Free

https://arxiv.org/abs/1903.12206

https://github.com/shizenglin/Counting-with-Focus-for-Free

CCNet: Criss-Cross Attention for Semantic Segmentation

https://arxiv.org/abs/1811.11721

https://github.com/speedinghzl/CCNet

ABD-Net: Attentive but Diverse Person Re-Identification

https://arxiv.org/abs/1908.01114

https://github.com/TAMU-VITA/ABD-Net

AutoGAN: Neural Architecture Search for Generative Adversarial Networks

https://github.com/TAMU-VITA/AutoGAN

SO-HandNet: Self-Organizing Network for 3D Hand Pose Estimation with Semi-supervised Learning

https://github.com/TerenceCYJ/SO-HandNet

Tex2Shape: Detailed Full Human Body Geometry from a Single Image

https://arxiv.org/abs/1904.08645

https://github.com/thmoa/tex2shape

FCOS: Fully Convolutional One-Stage Object Detection

https://arxiv.org/abs/1904.01355

https://github.com/tianzhi0549/FCOS/

ICCV Best Papers, 1998-2017

2017

Mask R-CNN
Kaiming He, Facebook AI Research; et al.
Georgia Gkioxari, Facebook AI Research
Piotr Dollar, Facebook AI Research
Ross Girshick, Facebook AI Research

2015

Deep Neural Decision Forests
Peter Kontschieder, Microsoft Research; et al.
Madalina Fiterau, Carnegie Mellon University
Antonio Criminisi, Microsoft Research
Samuel Rota Bulò, Microsoft Research

2013

From Large Scale Image Categorization to Entry-Level Categories
Vicente Ordonez, University of North Carolina at Chapel Hill; et al.
Jia Deng, Stanford University
Yejin Choi, Stony Brook University
Alexander Berg, University of North Carolina at Chapel Hill
Tamara Berg, University of North Carolina at Chapel Hill

2011

Relative Attributes
Devi Parikh, Toyota Technological Institute at Chicago
Kristen Grauman, University of Texas at Austin

2009

Discriminative models for multi-class object layout
Chaitanya Desai, University of California Irvine; et al.
Deva Ramanan, University of California Irvine
Charless Fowlkes, University of California Irvine

2007

Population Shape Regression From Random Design Data
Bradley Davis, University of North Carolina at Chapel Hill; et al.
P. Thomas Fletcher, University of Utah
Elizabeth Bullitt, University of North Carolina at Chapel Hill
Sarang Joshi, University of Utah

2005

Globally Optimal Estimates for Geometric Reconstruction Problems
Fredrik Kahl, Lund University
Didier Henrion, LAAS-CNRS

2003

Detecting Pedestrians using Patterns of Motion and Appearance
Paul Viola, Microsoft Research; et al.
Michael J. Jones, Mitsubishi Electric Research Laboratories
Daniel Snow, Mitsubishi Electric Research Laboratories

Image Parsing: Unifying Segmentation, Detection and Recognition
Zhuowen Tu, University of California Los Angeles; et al.
Xiangrong Chen, University of California Los Angeles
Alan L. Yuille, University of California Los Angeles
Song-Chun Zhu, University of California Los Angeles

Image-based Rendering using Image-based Priors
Andrew Fitzgibbon, University of Oxford; et al.
Yonatan Wexler, Weizmann Institute of Science
Andrew Zisserman, University of Oxford

2001

Probabilistic Tracking with Exemplars in a Metric Space
Kentaro Toyama & Andrew Blake, Microsoft Research

The Space of All Stereo Images
Steven Seitz, University of Washington

1999

Euclidean Reconstruction and Reprojection up to Subgroups
Yi Ma, University of California Berkeley; et al.
Stefano Soatto, Washington University in St. Louis
Jana Kosecka, University of California Berkeley
Shankar Sastry, University of California Berkeley

A Theory of Shape by Space Carving
Kiriakos Kutulakos, University of Rochester
Steven Seitz, Carnegie Mellon University

1998

Self-Calibration and Metric Reconstruction in spite of Varying and Unknown Internal Camera Paramet...
Marc Pollefeys, Katholieke Universiteit Leuven; et al.
Reinhard Koch, Katholieke Universiteit Leuven
Luc Van Gool, Katholieke Universiteit Leuven

The Problem of Degeneracy in Structure and Motion Recovery from Uncalibrated Image Sequences
Phil Torr, Microsoft Research; et al.
Andrew Fitzgibbon, University of Oxford
Andrew Zisserman, University of Oxford

Table version


Year	Title	Authors
2017	Mask R-CNN	Kaiming He, Facebook AI Research; et al.
2015	Deep Neural Decision Forests	Peter Kontschieder, Microsoft Research; et al.
2013	From Large Scale Image Categorization to Entry-Level Categories	Vicente Ordonez, University of North Carolina at Chapel Hill; et al.
2011	Relative Attributes	Devi Parikh, Toyota Technological Institute at Chicago; Kristen Grauman, University of Texas at Austin
2009	Discriminative models for multi-class object layout	Chaitanya Desai, University of California Irvine; et al.
2007	Population Shape Regression From Random Design Data	Bradley Davis, University of North Carolina at Chapel Hill; et al.
2005	Globally Optimal Estimates for Geometric Reconstruction Problems	Fredrik Kahl, Lund University; Didier Henrion, LAAS-CNRS
2003	Detecting Pedestrians using Patterns of Motion and Appearance	Paul Viola, Microsoft Research; et al.
2003	Image Parsing: Unifying Segmentation, Detection and Recognition	Zhuowen Tu, University of California Los Angeles; et al.
2003	Image-based Rendering using Image-based Priors	Andrew Fitzgibbon, University of Oxford; et al.
2001	Probabilistic Tracking with Exemplars in a Metric Space	Kentaro Toyama & Andrew Blake, Microsoft Research
2001	The Space of All Stereo Images	Steven Seitz, University of Washington
1999	Euclidean Reconstruction and Reprojection up to Subgroups	Yi Ma, University of California Berkeley; et al.
1999	A Theory of Shape by Space Carving	Kiriakos Kutulakos, University of Rochester; Steven Seitz, Carnegie Mellon University
1998	Self-Calibration and Metric Reconstruction in spite of Varying and Unknown Internal Camera Paramet...	Marc Pollefeys, Katholieke Universiteit Leuven; et al.
1998	The Problem of Degeneracy in Structure and Motion Recovery from Uncalibrated Image Sequences	Phil Torr, Microsoft Research; et al.


Sophia-11/Awesome-ICCV
