Pamitc



ICCV15 Main Conference Program

Sunday
(13 Dec.)
Monday
(14 Dec.)
Tuesday
(15 Dec.)
Wednesday
(16 Dec.)

Opening Ceremony
(8:30-8:45)

Oral Session 1A
(8:45-9:45)
Oral Session 2A
(8:30-10:00)
Oral Session 3A
(8:30-9:45)
Oral Session 4A
(8:30-9:45)
Poster Session 1A
(9:45-12:15)
Poster Session 3A
(9:45-12:15)
Poster Session 4A
(9:45-12:15)
 
Oral Session 2B
(10:30-12:00)
 
Plenary Session
Speaker: Stephen Boyd
(12:15-13:15)
Oral Session 3B
(12:15-13:15)
Oral Session 4B
(12:15-13:15)
     
Oral Session 2C
(13:30-15:00)
Poster Session 1B
(14:45-17:15)
Poster Session 3B
(14:45-17:15)
Poster Session 4B
(14:45-17:15)
Poster Session 2A
(15:00-17:30)
Oral Session 1B
(17:15-18:15)
Oral Session 3C
(17:15-18:45)
Oral Session 4C
(17:15-18:45)
Awards
(17:30-18:45)
Opening Reception
(18:45-20:30)
Closing Reception
(18:45-20:30)
PAMI TC Meeting
(20:30-TBD)
 
 
[8:45-09:45] Oral Session 1A - Vision and Language

  Ask Your Neurons: A Neural-Based Approach to Answering Questions About Images
  Segment-Phrase Table for Semantic Segmentation, Visual Entailment and Paraphrasing
  Aligning Books and Movies: Towards Story-Like Visual Explanations by Watching Movies and Reading Books
  Learning Query and Image Similarities With Ranking Canonical Correlation Analysis
 
 
[09:45-12:15] Poster Session 1A - Recognition, Low-Level Vision, and Biomedical Image Analysis

 1Learning to See by Moving
 2Object Detection Using Generalization and Efficiency Balanced Co-Occurrence Features
 3Mining And-Or Graphs for Graph Matching and Object Discovery
 4Pose Induction for Novel Object Categories
 5Dynamic Texture Recognition via Orthogonal Tensor Dictionary Learning
 6Convolutional Channel Features
 7Local Convolutional Features With Unsupervised Training for Image Retrieval
 8RIDE: Reversal Invariant Descriptor Enhancement
 9Discrete Tabu Search for Graph Matching
 10Discriminative Learning of Deep Convolutional Feature Point Descriptors
 11Amodal Completion and Size Constancy in Natural Scenes
 12Learning Where to Position Parts in 3D
 13Query Adaptive Similarity Measure for RGB-D Object Recognition
 14Listening With Your Eyes: Towards a Practical Visual Speech Recognition System Using Deep Boltzmann Machines
 15Cluster-Based Point Set Saliency
 16A Comprehensive Multi-Illuminant Dataset for Benchmarking of the Intrinsic Image Algorithms
 17PatchMatch-Based Automatic Lattice Detection for Near-Regular Textures
 18A Data-Driven Metric for Comprehensive Evaluation of Saliency Models
 19A Matrix Decomposition Perspective to Multiple Graph Matching
 20Fast and Effective L0 Gradient Minimization by Region Fusion
 21Generic Promotion of Diffusion-Based Salient Object Detection
 22Nighttime Haze Removal With Glow and Multiple Light Colors
 23Conformal and Low-Rank Sparse Representation for Image Restoration
 24Patch Group Based Nonlocal Self-Similarity Prior Learning for Image Denoising
 25Automatic Thumbnail Generation Based on Visual Representativeness and Foreground Recognizability
 26SALICON: Reducing the Semantic Gap in Saliency Prediction by Adapting Deep Neural Networks
 27A Novel Sparsity Measure for Tensor Recovery
 28Oriented Object Proposals
 29Learning Nonlinear Spectral Filters for Color Image Reconstruction
 30Beyond White: Ground Truth Colors for Color Constancy Correction
 31RGB-Guided Hyperspectral Image Upsampling
 32Projection Onto the Manifold of Elongated Structures for Accurate Extraction
 33Naive Bayes Super-Resolution Forest
 34POP Image Fusion - Derivative Domain Image Fusion Without Reintegration
 35Adaptive Spatial-Spectral Dictionary Learning for Hyperspectral Image Denoising
 36Fully Connected Guided Image Filtering
 37Segment Graph Based Image Filtering: Fast Structure-Preserving Smoothing
 38Deep Networks for Image Super-Resolution With Sparse Prior
 39Convolutional Color Constancy
 40Learning Ordinal Relationships for Mid-Level Vision
 41Thin Structure Estimation With Curvature Regularization
 42HARF: Hierarchy-Associated Rich Features for Salient Object Detection
 43Deep Colorization
 44Image Matting With KL-Divergence Based Sparse Sampling
 45Intrinsic Decomposition of Image Sequences From Local Temporal Variations
 46Low-Rank Tensor Approximation With Laplacian Scale Mixture Modeling for Multiframe Image Denoising
 47Learning Parametric Distributions for Image Super-Resolution: Where Patch Matching Meets Sparse Coding
 48Improving Image Restoration With Soft-Rounding
 49See the Difference: Direct Pre-Image Reconstruction and Pose Estimation by Differentiating HOG
 50An Efficient Statistical Method for Image Noise Level Estimation
 51Contour Detection and Characterization for Asynchronous Event Sensors
 52Class-Specific Image Deblurring
 53High-for-Low and Low-for-High: Efficient Boundary Detection From Deep Object Features and its Applications to High-Level Vision
 54Variational Depth Superresolution Using Example-Based Edge Representations
 55Conditioned Regression Models for Non-Blind Single Image Super-Resolution
 56Video Super-Resolution via Deep Draft-Ensemble Learning
 57Pan-Sharpening With a Hyper-Laplacian Penalty
 58Video Restoration Against Yin-Yang Phasing
 59Rolling Shutter Super-Resolution
 60Learning Large-Scale Automatic Image Colorization
 61Compression Artifacts Reduction by a Deep Convolutional Network
 62Multiple-Hypothesis Affine Region Estimation With Anisotropic LoG Filters
 63A Self-Paced Multiple-Instance Learning Framework for Co-Saliency Detection
 64External Patch Prior Guided Internal Clustering for Image Denoising
 66Illumination Robust Color Naming via Label Propagation
 67Unsupervised Cross-Modal Synthesis of Subject-Specific Scans
 68Learning to Boost Filamentary Structure Segmentation
 69Weakly-Supervised Structured Output Learning With Flexible and Latent Graphs Using High-Order Loss Functions
 70Efficient Classifier Training to Minimize False Merges in Electron Microscopy Segmentation
 71On Statistical Analysis of Neuroimages With Imperfect Registration
 72[From Oral 1A] Ask Your Neurons: A Neural-Based Approach to Answering Questions About Images
 73[From Oral 1A] Segment-Phrase Table for Semantic Segmentation, Visual Entailment and Paraphrasing
 74[From Oral 1A] Aligning Books and Movies: Towards Story-Like Visual Explanations by Watching Movies and Reading Books
 75[From Oral 1A] Learning Query and Image Similarities With Ranking Canonical Correlation Analysis
 
 
[12:15-13:15] Special Session 1A - Plenary Session

  Convex Optimization With Abstract Linear Operators
 
 
[14:45-17:15] Poster Session 1B - Recognition and 3D Computer Vision I

 1Building Dynamic Cloud Maps From the Ground Up
 2A Versatile Learning-Based 3D Temporal Tracker: Scalable, Robust, Online
 3Realtime Edge-Based Visual Odometry for a Monocular Camera
 4Fill and Transfer: A Simple Physics-Based Approach for Containability Reasoning
 5On Linear Structure From Motion for Light Field Cameras
 63D Object Reconstruction From Hand-Object Interactions
 7Minimal Solvers for 3D Geometry From Satellite Imagery
 8An Efficient Minimal Solution for Multi-Camera Motion
 9Learning Shape, Motion and Elastic Models in Force Space
 10A Versatile Scene Model With Differentiable Visibility Applied to Generative Pose Estimation
 11Semantic Pose Using Deep Networks Trained on Synthetic RGB-D
 12Exploiting High Level Scene Cues in Stereo Reconstruction
 13Point Triangulation Through Polyhedron Collapse Using the l∞ Norm
 14Optimizing the Viewing Graph for Structure-From-Motion
 15Intrinsic Scene Decomposition From RGB-D images
 163D Hand Pose Estimation Using Randomized Decision Forest With Segmentation Index Points
 17Accurate Camera Calibration Robust to Defocus Using a Smartphone
 18High Quality Structure From Small Motion for Rolling Shutter Cameras
 19Photogeometric Scene Flow for High-Detail Dynamic 3D Reconstruction
 20Blur-Aware Disparity Estimation From Defocus Stereo Images
 21Global Structure-From-Motion by Similarity Averaging
 22Massively Parallel Multiview Stereopsis by Surface Normal Diffusion
 23Variational PatchMatch MultiView Reconstruction and Refinement
 24As-Rigid-As-Possible Volumetric Shape-From-Template
 25General Dynamic Scene Reconstruction From Multiple View Video
 26The Joint Image Handbook
 27Direct, Dense, and Deformable: Template-Based Non-Rigid 3D Reconstruction From RGB Video
 28Single Image Pop-Up From Discriminatively Learned Parts
 29Learning Informative Edge Maps for Indoor Scene Layout Prediction
 30Multi-View Convolutional Neural Networks for 3D Shape Recognition
 31Learning Analysis-by-Synthesis for 6D Pose Estimation in RGB-D Images
 323D Surface Profilometry Using Phase Shifting of De Bruijn Pattern
 33A Deep Visual Correspondence Embedding Model for Stereo Matching Costs
 34Learning Concept Embeddings With Combined Human-Machine Expertise
 35Deep Multi-Patch Aggregation Network for Image Style, Aesthetics, and Quality Estimation
 36Towards Computational Baby Learning: A Weakly-Supervised Approach for Object Detection
 37Improving Image Classification With Location Context
 38HICO: A Benchmark for Recognizing Human-Object Interactions in Images
 39Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification
 40Continuous Pose Estimation With a Spatial Ensemble of Fisher Regressors
 41Adaptive Hashing for Fast Similarity Search
 42Single Image 3D Without a Single 3D Image
 43Cross-Domain Image Retrieval With a Dual Attribute-Aware Ranking Network
 44Attribute-Graph: A Graph Based Approach to Image Ranking
 45Contextual Action Recognition With R*CNN
 46What Makes an Object Memorable?
 47kNN Hashing With Factorized Neighborhood Representation
 48Multi-View Complementary Hash Tables for Nearest Neighbor Search
 49Scalable Person Re-Identification: A Benchmark
 50MMSS: Multi-Modal Sharable and Specific Feature Learning for RGB-D Object Recognition
 51Object Detection via a Multi-Region and Semantic Segmentation-Aware CNN Model
 52Neural Activation Constellations: Unsupervised Part Model Discovery With Convolutional Networks
 53Cascaded Sparse Spatial Bins for Efficient and Effective Generic Object Detection
 54Probabilistic Label Relation Graphs With Ising Models
 55Predicting Good Features for Image Geo-Localization Using Per-Bundle VLAD
 56Task-Driven Feature Pooling for Image Classification
 57Cutting Edge: Soft Correspondences in Multimodal Scene Parsing
 58One Shot Learning via Compositions of Meaningful Patches
 59FASText: Efficient Unconstrained Scene Text Detector
 60Multi-Scale Recognition With DAG-CNNs
 61Relaxed Multiple-Instance SVM With Application to Object Discovery
 62Im2Calories: Towards an Automated Mobile Vision Food Diary
 63LEWIS: Latent Embeddings for Word Images and their Semantics
 64Per-Sample Kernel Adaptation for Visual Recognition and Grouping
 65Fine-Grained Change Detection of Misaligned Scenes With Varied Illuminations
 66Aggregating Local Deep Features for Image Retrieval
 67Learning Deep Object Detectors From 3D Models
 68Harvesting Discriminative Meta Objects With Deep CNN Features for Scene Classification
 69Scalable Nonlinear Embeddings for Semantic Category-Based Image Retrieval
 70Person Re-Identification Ranking Optimisation by Discriminant Context Information Analysis
 71Unsupervised Generation of a Viewpoint Annotated Car Dataset From Videos
 72[From Oral 1B] Structured Indoor Modeling
 73[From Oral 1B] 3D Time-Lapse Reconstruction From Internet Photos
 74[From Oral 1B] Global, Dense Multiscale Reconstruction for a Billion Points
 75[From Oral 1B] On the Visibility of Point Clouds
 
 
[17:15-18:15] Oral Session 1B - 3D Vision

  Structured Indoor Modeling
  3D Time-Lapse Reconstruction From Internet Photos
  Global, Dense Multiscale Reconstruction for a Billion Points
  On the Visibility of Point Clouds
 
 
[8:30-10:00] Oral Session 2A - Segmentation, Edges and Saliency

  Weakly Supervised Graph Based Semantic Segmentation by Learning Communities of Image-Parts
  Piecewise Flat Embedding for Image Segmentation
  Semantic Image Segmentation via Deep Parsing Network
  Human Parsing With Contextualized Convolutional Neural Network
  Holistically-Nested Edge Detection
  Minimum Barrier Salient Object Detection at 80 FPS
 
 
[10:30-12:00] Oral Session 2B - Learning Representations and Attributes

  Learning Image Representations Tied to Ego-Motion
  Unsupervised Visual Representation Learning by Context Prediction
  Webly Supervised Learning of Convolutional Networks
  Fast R-CNN
  Bilinear CNN Models for Fine-Grained Visual Recognition
  Discovering the Spatial Extent of Relative Attributes
 
 
[13:30-15:00] Oral Session 2C - Statistical Methods and Learning

  Deep Neural Decision Forests
  Deep Fried Convnets
  Semantic Component Analysis
  Low-Rank Matrix Factorization Under General Mixture Noise Distributions
  Web-Scale Image Clustering Revisited
  Learning Discriminative Reconstructions for Unsupervised Outlier Removal
 
 
[15:00-17:30] Poster Session 2A - Optimization, Segmentation, and Recognition

 1Learning Deconvolution Network for Semantic Segmentation
 2Conditional Random Fields as Recurrent Neural Networks
 3The One Triangle Three Parallelograms Sampling Strategy and Its Application in Shape Regression
 4Boosting Object Proposals: From Pascal to COCO
 5Secrets of GrabCut and Kernel K-Means
 6Video Matting via Sparse and Low-Rank Representation
 7Joint Object and Part Segmentation Using Deep Learned Potentials
 8Low-Rank Tensor Constrained Multiview Subspace Clustering
 9BodyPrint: Pose Invariant 3D Shape Matching of Human Bodies
 10The Middle Child Problem: Revisiting Parametric Min-Cut and Seeds for Object Proposals
 11Contour Guided Hierarchical Model for Shape Matching
 12Robust Image Segmentation Using Contour-Guided Color Palettes
 13Joint Optimization of Segmentation and Color Clustering
 14BoxSup: Exploiting Bounding Boxes to Supervise Convolutional Networks for Semantic Segmentation
 15Detection and Segmentation of 2D Curved Reflection Symmetric Structures
 16Unsupervised Tube Extraction Using Transductive Learning and Dense Trajectories
 17Compositional Hierarchical Representation of Shape Manifolds for Classification of Non-Manifold Shapes
 18Shell PCA: Statistical Shape Modelling in Shell Space
 19Learning to Combine Mid-Level Cues for Object Proposal Generation
 20Enhancing Road Maps by Parsing Aerial Images Around the World
 21Probabilistic Appearance Models for Segmentation and Classification
 22A Randomized Ensemble Approach to Industrial CT Segmentation
 23Semi-Supervised Normalized Cuts for Image Segmentation
 24StereoSnakes: Contour Based Consistent Object Extraction For Stereo Images
 25Semantic Segmentation of RGBD Images With Mutex Constraints
 26Weakly- and Semi-Supervised Learning of a Deep Convolutional Network for Semantic Image Segmentation
 27Efficient Decomposition of Image and Mesh Graphs by Lifted Multicuts
 28Parsimonious Labeling
 29Volumetric Bias in Segmentation and Reconstruction: Secrets and Solutions
 30Entropy Minimization for Convex Relaxation Approaches
 31Adaptively Unified Semi-Supervised Dictionary Learning With Active Points
 32Constrained Convolutional Neural Networks for Weakly Supervised Segmentation
 33A Multiscale Variable-Grouping Framework for MRF Energy Minimization
 34Inferring M-Best Diverse Labelings in a Single One
 35Convolutional Sparse Coding for Image Super-Resolution
 36A Wavefront Marching Method for Solving the Eikonal Equation on Cartesian Grids
 37A Projection Free Method for Generalized Eigenvalue Problem With a Nonsmooth Regularizer
 38Optimizing Expected Intersection-Over-Union With Candidate-Constrained CRFs
 39Higher-Order Inference for Multi-Class Log-Supermodular Models
 40Depth-Based Hand Pose Estimation: Data, Methods, and Challenges
 41Adaptive Dither Voting for Robust Spatial Verification
 42Alternating Co-Quantization for Cross-Modal Hashing
 43Learning Deep Representation With Large-Scale Attributes
 44Deep Learning Strong Parts for Pedestrian Detection
 45Flowing ConvNets for Human Pose Estimation in Videos
 46Top Rank Supervised Binary Coding for Visual Search
 47BubbLeNet: Foveated Imaging for Visual Discovery
 48PQTable: Fast Exact Asymmetric Distance Neighbor Search for Product Quantization Using Hash Tables
 49Lending A Hand: Detecting Hands and Recognizing Activities in Complex Egocentric Interactions
 50Fast and Accurate Head Pose Estimation via Random Projection Forests
 51An MRF-Poselets Model for Detecting Highly Articulated Humans
 52Beyond Tree Structure Models: A New Occlusion Aware Graphical Model for Human Pose Estimation
 53Relaxing From Vocabulary: Robust Weakly-Supervised Deep Learning for Vocabulary-Free Image Tagging
 54Visual Phrases for Exemplar Face Detection
 55Spatial Semantic Regularisation for Large Scale Object Detection
 56Human Pose Estimation in Videos
 57Contour Box: Rejecting Object Proposals Without Explicit Closed Contours
 58[From Oral 2A] Weakly Supervised Graph Based Semantic Segmentation by Learning Communities of Image-Parts
 59[From Oral 2A] Piecewise Flat Embedding for Image Segmentation
 60[From Oral 2A] Semantic Image Segmentation via Deep Parsing Network
 61[From Oral 2A] Human Parsing With Contextualized Convolutional Neural Network
 62[From Oral 2A] Holistically-Nested Edge Detection
 63[From Oral 2A] Minimum Barrier Salient Object Detection at 80 FPS
 64[From Oral 2B] Learning Image Representations Tied to Ego-Motion
 65[From Oral 2B] Unsupervised Visual Representation Learning by Context Prediction
 66[From Oral 2B] Webly Supervised Learning of Convolutional Networks
 67[From Oral 2B] Fast R-CNN
 68[From Oral 2B] Bilinear CNN Models for Fine-Grained Visual Recognition
 69[From Oral 2B] Discovering the Spatial Extent of Relative Attributes
 70[From Oral 2C] Deep Neural Decision Forests
 71[From Oral 2C] Deep Fried Convnets
 72[From Oral 2C] Semantic Component Analysis
 73[From Oral 2C] Low-Rank Matrix Factorization Under General Mixture Noise Distributions
 74[From Oral 2C] Web-Scale Image Clustering Revisited
 75[From Oral 2C] Learning Discriminative Reconstructions for Unsupervised Outlier Removal
 
 
[8:30-09:45] Oral Session 3A - Registration, Alignment and Stereo

  Registering Images to Untextured Geometry Using Average Shading Gradients
  Robust Nonrigid Registration by Convex Optimization
  Robust and Optimal Sum-of-Squares-Based Point-to-Plane Registration of Image Sets and Structured Scenes
  MeshStereo: A Global Stereo Model With Mesh Alignment Regularization for View Interpolation
  CV-HAZOP: Introducing Test Data Validation for Computer Vision
 
 
[09:45-12:15] Poster Session 3A - Recognition and 3D Computer Vision II

 1Structure From Motion Using Structure-Less Resection
 2Joint Camera Clustering and Surface Segmentation for Large-Scale Multi-View Stereo
 3Higher-Order CRF Structural Segmentation of 3D Reconstructed Surfaces
 4Hyperpoints and Fine Vocabularies for Large-Scale Location Recognition
 5Globally Optimal 2D-3D Registration From Points or Lines Without Correspondences
 6The HCI Stereo Metrics: Geometry-Aware Performance Analysis of Stereo Algorithms
 7Merging the Unmatchable: Stitching Visually Disconnected SfM Models
 83D Fragment Reassembly Using Integrated Template Guidance and Fracture-Region Matching
 9Procedural Editing of 3D Building Point Clouds
 10Semantically-Aware Aerial Reconstruction From Multi-Modal Data
 11Guaranteed Outlier Removal for Rotation Search
 12Peeking Template Matching for Depth Extension
 13Deformable 3D Fusion: From Partial Dynamic 3D Observations to Complete 4D Models
 14Non-Parametric Structure-Based Calibration of Radially Symmetric Cameras
 15Exploiting Object Similarity in 3D Reconstruction
 16You Are Here: Mimicking the Human Thinking Process in Reading Floor-Plans
 17MAP Disparity Estimation Using Hidden Markov Trees
 18Wide Baseline Stereo Matching With Convex Bounded Distortion Constraints
 19Interactive Visual Hull Refinement for Specular and Transparent Object Surface Reconstruction
 20Hierarchical Higher-Order Regression Forest Fields: An Application to 3D Indoor Scene Labelling
 21Classical Scaling Revisited
 22Dense Continuous-Time Tracking and Mapping With Rolling Shutter RGB-D Cameras
 23Dense Image Registration and Deformable Surface Reconstruction in Presence of Occlusions and Minimal Texture
 25Reflection Modeling for Passive Stereo
 26Detailed Full-Body Reconstructions of Moving People From Monocular RGB-D Sequences
 27Efficient Solution to the Epipolar Geometry for Radially Distorted Cameras
 28Learning a Descriptor-Specific 3D Keypoint Detector
 29Component-Wise Modeling of Articulated Objects
 30A Collaborative Filtering Approach to Real-Time Hand Pose Estimation
 31On the Equivalence of Moving Entrance Pupil and Radial Distortion for Camera Calibration
 32A Linear Generalized Camera Calibration From Three Intersecting Reference Planes
 33Towards Pointless Structure From Motion: 3D Reconstruction and Camera Parameters From General 3D Curves
 34Attributed Grammars for Joint Estimation of Human Attributes, Part and Pose
 35Real-Time Pose Estimation Piggybacked on Object Detection
 36Understanding and Predicting Image Memorability at a Large Scale
 37Multiple Granularity Descriptors for Fine-Grained Categorization
 38Guiding the Long-Short Term Memory Model for Image Caption Generation
 39Just Noticeable Differences in Visual Attributes
 40VQA: Visual Question Answering
 41Localize Me Anywhere, Anytime: A Multi-Task Point-Retrieval Approach
 42Dense Optical Flow Prediction From a Static Image
 43Unsupervised Domain Adaptation for Zero-Shot Learning
 44Visual Madlibs: Fill in the Blank Description Generation and Question Answering
 45Actions and Attributes From Wholes and Parts
 46DeepBox: Learning Objectness With Convolutional Networks
 47Active Object Localization With Deep Reinforcement Learning
 48Scene-Domain Active Part Models for Object Representation
 49A Unified Multiplicative Framework for Attribute Learning
 50Contractive Rectifier Networks for Nonlinear Maximum Margin Classification
 51Augmenting Strong Supervision Using Web Data for Fine-Grained Categorization
 52Learning Like a Child: Fast Novel Visual Concept Learning From Sentence Descriptions of Images
 53Learning Common Sense Through Visual Abstraction
 54Domain Generalization for Object Recognition With Multi-Task Autoencoders
 55Square Localization for Efficient and Accurate Object Detection
 56Box Aggregation for Proposal Decimation: Last Mile of Object Detection
 57DeepProposal: Hunting Objects by Cascading Deep Convolutional Layers
 58Semantic Segmentation With Object Clique Potential
 59Automatic Concept Discovery From Parallel Text and Visual Corpora
 60Simpler Non-Parametric Methods Provide as Good or Better Results to Multiple-Instance Learning
 61Monocular Object Instance Segmentation and Depth Ordering With CNNs
 62Multimodal Convolutional Neural Networks for Matching Image and Sentence
 63Structural Kernel Learning for Large Scale Multiclass Object Co-Detection
 64Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models
 65Predicting Depth, Surface Normals and Semantic Labels With a Common Multi-Scale Convolutional Architecture
 66AttentionNet: Aggregating Weak Directions for Accurate Object Detection
 67Common Subspace for Model and Similarity: Phrase Learning for Caption Generation From Images
 68[From Oral 3A] Registering Images to Untextured Geometry Using Average Shading Gradients
 69[From Oral 3A] Robust Nonrigid Registration by Convex Optimization
 70[From Oral 3A] Robust and Optimal Sum-of-Squares-Based Point-to-Plane Registration of Image Sets and Structured Scenes
 71[From Oral 3A] MeshStereo: A Global Stereo Model With Mesh Alignment Regularization for View Interpolation
 72[From Oral 3A] CV-HAZOP: Introducing Test Data Validation for Computer Vision
 73[From Oral 3B] 3D-Assisted Feature Synthesis for Novel Views of an Object
 74[From Oral 3B] Lost Shopping! Monocular Localization in Large Indoor Spaces
 75[From Oral 3B] Camera Pose Voting for Large-Scale Image-Based Localization
 
 
[12:15:13:15] Oral Session 3B - 3D Representations for Recognition and Localization

  3D-Assisted Feature Synthesis for Novel Views of an Object
  Render for CNN: Viewpoint Estimation in Images Using CNNs Trained With Rendered 3D Model Views
  Lost Shopping! Monocular Localization in Large Indoor Spaces
  Camera Pose Voting for Large-Scale Image-Based Localization
 
 
[14:45-17:15] Poster Session 3B - Statistical Methods and Learning, Motion and Tracking, and Video Analysis I

 1MANTRA: Minimum Maximum Latent Structural SVM for Image Classification and Ranking
 2DeepDriving: Learning Affordance for Direct Perception in Autonomous Driving
 3Active Transfer Learning With Zero-Shot Priors: Reusing Past Datasets for Future Tasks
 4HD-CNN: Hierarchical Deep Convolutional Neural Networks for Large Scale Visual Recognition
 5Learning The Structure of Deep Convolutional Networks
 6FlowNet: Learning Optical Flow With Convolutional Networks
 7Learning Semi-Supervised Representation Towards a Unified Optimization Framework for Semi-Supervised Learning
 8Context-Guided Diffusion for Label Propagation on Graphs
 9Learning to Rank Based on Subsequences
 10Unsupervised Learning of Visual Representations Using Videos
 11A Nonparametric Bayesian Approach Toward Stacked Convolutional Independent Component Analysis
 12Robust Principal Component Analysis on Graphs
 13Projection Bank: From High-Dimensional Data to Medium-Length Binary Codes
 14Robust Optimization for Deep Regression
 15Multi-Class Multi-Annotator Active Learning With Robust Gaussian Process for Visual Recognition
 16Maximum-Margin Structured Learning With Deep Networks for 3D Human Pose Estimation
 17An Exploration of Parameter Redundancy in Deep Networks With Circulant Projections
 18Additive Nearest Neighbor Feature Maps
 19Understanding Deep Features With Computer-Generated Imagery
 20Interpolation on the Manifold of K Component GMMs
 21Context-Aware CNNs for Person Head Detection
 22Mode-Seeking on Hypergraphs for Robust Geometric Model Fitting
 23Highly-Expressive Spaces of Well-Behaved Transformations: Keeping It Simple
 24Entropy-Based Latent Structured Output Prediction
 25Fast Orthogonal Projection Based on Kronecker Product
 26PoseNet: A Convolutional Network for Real-Time 6-DOF Camera Relocalization
 27Predicting Multiple Structured Visual Interpretations
 28Look and Think Twice: Capturing Top-Down Visual Attention With Feedback Convolutional Neural Networks
 29Matrix Backpropagation for Deep Networks With Structured Layers
 30Introducing Geometry in Active Learning for Image Segmentation
 31Joint Fine-Tuning in Deep Neural Networks for Facial Expression Recognition
 32Direct Intrinsics: Learning Albedo-Shading Decomposition by Convolutional Regression
 33Face Flow
 34Discriminative Low-Rank Tracking
 35SOWP: Spatially Ordered and Weighted Patch Descriptor for Visual Tracking
 36Live Repetition Counting
 37Near-Online Multi-Target Tracking With Aggregated Local Flow Descriptor
 38Multi-Kernel Correlation Filter for Visual Tracking
 39Joint Probabilistic Data Association Revisited
 40Tracking-by-Segmentation With Online Gradient Boosting Decision Tree
 41Exploring Causal Relationships in Visual Object Tracking
 42Hierarchical Convolutional Features for Visual Tracking
 43Robust Non-Rigid Motion Tracking and Surface Reconstruction Using L0 Regularization
 44Online Object Tracking With Proposal Selection
 45Understanding and Diagnosing Visual Tracking Systems
 46Integrating Dashcam Views Through Inter-Video Mapping
 47Visual Tracking With Fully Convolutional Networks
 48Multiple Feature Fusion via Weighted Entropy for Visual Tracking
 49Pedestrian Travel Time Estimation in Crowded Scenes
 50Unsupervised Synchrony Discovery in Human Interaction
 51Efficient Video Segmentation Using Parametric Graph Partitioning
 52Learning to Track for Spatio-Temporal Action Localization
 53Unsupervised Object Discovery and Tracking in Video Collections
 54Car That Knows Before You Do: Anticipating Maneuvers via Learning Temporal Driving Models
 55Activity Auto-Completion: Predicting Human Activities From Partial Videos
 56Person Re-Identification With Correspondence Structure Learning
 57Adaptive Exponential Smoothing for Online Filtering of Pixel Prediction Maps
 58P-CNN: Pose-Based CNN Features for Action Recognition
 59Fully Connected Object Proposals for Video Segmentation
 60Video Segmentation With Just a Few Strokes
 61Actionness-Assisted Recognition of Actions
 62COUNT Forest: CO-Voting Uncertain Number of Targets Using Random Forest for Crowd Density Estimation
 63Multi-Cue Structure Preserving MRF for Unconstrained Video Segmentation
 64Motion Trajectory Segmentation via Minimum Cost Multicuts
 65Action Localization in Videos Through Context Walk
 66RGB-W: When Vision Meets Wireless
 67Action Detection by Implicit Intentional Motion Clustering
 68Simultaneous Foreground Detection and Classification With Hybrid Features
 69SpeDo: 6 DOF Ego-Motion Sensor Using Speckle Defocus Imaging
 70The Likelihood-Ratio Test and Efficient Robust Estimation
 71[From Oral 3B] Render for CNN: Viewpoint Estimation in Images Using CNNs Trained With Rendered 3D Model Views
 72[From Oral 3C] Training a Feedback Loop for Hand Pose Estimation
 73[From Oral 3C] Opening the Black Box: Hierarchical Sampling Optimization for Estimating Human Hand Pose
 74[From Oral 3C] Panoptic Studio: A Massively Multiview System for Social Motion Capture
 75[From Oral 3C] Where to Buy It: Matching Street Clothing Photos in Online Shops
 76[From Oral 3C] Multi-Task Recurrent Neural Network for Immediacy Prediction
 77[From Oral 3C] Learning Complexity-Aware Cascades for Deep Pedestrian Detection
 
 
[17:15-18:45] Oral Session 3C - Vision and People

  Training a Feedback Loop for Hand Pose Estimation
  Opening the Black Box: Hierarchical Sampling Optimization for Estimating Human Hand Pose
  Panoptic Studio: A Massively Multiview System for Social Motion Capture
  Where to Buy It: Matching Street Clothing Photos in Online Shops
  Multi-Task Recurrent Neural Network for Immediacy Prediction
  Learning Complexity-Aware Cascades for Deep Pedestrian Detection
 
 
[8:30-09:45] Oral Session 4A - Computational Photography and Image Enhancement

  Polarized 3D: High-Quality Depth Sensing With Polarization Cues
  Airborne Three-Dimensional Cloud Tomography
  Leave-One-Out Kernel Optimization for Shadow Detection
  Removing Rain From a Single Image via Discriminative Sparse Coding
  Mutual-Structure for Joint Filtering
 
 
[09:45-12:15] Poster Session 4A - Computational Photography, Face and Gesture, and Vision for X

 1Photometric Stereo in a Scattering Medium
 2Resolving Scale Ambiguity Via XSlit Aspect Ratio Analysis
 3Single-Shot Specular Surface Reconstruction With Gonio-Plenoptic Imaging
 4TransCut: Transparent Object Segmentation From a Light-Field Image
 5Depth Recovery From Light Field Using Focal Stack Symmetry
 6Depth Map Estimation and Colorization of Anaglyph Images Using Local Color Prior and Reverse Intensity Distribution
 7Learning Data-Driven Reflectance Priors for Intrinsic Image Decomposition
 8Photometric Stereo With Small Angular Variations
 9Occlusion-Aware Depth Estimation Using Light-Field Cameras
 10Oriented Light-Field Windows for Scene Flow
 11Extended Depth of Field Catadioptric Imaging Using Focal Sweep
 12Intrinsic Depth: Improving Depth Transfer With Intrinsic Images
 13Separating Fluorescent and Reflective Components by Using a Single Hyperspectral Image
 14Frequency-Based Environment Matting by Compressive Sensing
 15Complementary Sets of Shutter Sequences for Motion Deblurring
 16Hyperspectral Compressive Sensing Using Manifold-Structured Sparsity Prior
 17A Gaussian Process Latent Variable Model for BRDF Inference
 18Active One-Shot Scan for Wide Depth Range Using a Light Field Projector Based on Coded Aperture
 19Model-Based Tracking at 300Hz Using Raw Time-of-Flight Observations
 20Hyperspectral Super-Resolution by Coupled Spectral Unmixing
 21Depth Selective Camera: A Direct, On-Chip, Programmable Technique for Depth Selectivity in Photography
 22A Groupwise Multilinear Correspondence Optimization for 3D Faces
 23Selective Encoding for Recognizing Unreliably Localized Faces
 24Confidence Preserving Machine for Facial Action Unit Detection
 25Learning Social Relation Traits From Face Images
 26Robust Heart Rate Measurement From Video Using Select Random Patches
 27Robust Model-Based 3D Head Pose Estimation
 28Robust Facial Landmark Detection Under Significant Head Poses and Occlusion
 29Conditional Convolutional Neural Network for Modality-Aware Face Recognition
 30From Facial Parts Responses to Face Detection: A Deep Learning Approach
 31Efficient PSD Constrained Asymmetric Metric Learning for Person Re-Identification
 32Pose-Invariant 3D Face Alignment
 33From Emotions to Action Units With Hidden and Semi-Hidden-Task Learning
 34Automated Facial Trait Judgment and Election Outcome Prediction: Social Dimensions of Face
 35Simultaneous Local Binary Feature Learning and Encoding for Face Recognition
 36Deep Learning Face Attributes in the Wild
 37Multi-Task Learning With Low Rank Attribute Embedding for Person Re-Identification
 38Regressing a 3D Face Shape From a Single Image
 39Rendering of Eyes for Eye-Shape Registration and Gaze Estimation
 40Multi-Scale Learning for Low-Resolution Person Re-Identification
 41Learning to Transfer: Transferring Latent Task Structures and Its Application to Person-Specific Facial Action Unit Detection
 42Pairwise Conditional Random Forests for Facial Expression Recognition
 43Multi-Conditional Latent Variable Model for Joint Facial Action Unit Detection
 44Leveraging Datasets With Varying Annotations for Face Alignment via Deep Regression Network
 45A Spatio-Temporal Appearance Representation for Viceo-Based Pedestrian Re-Identification
 46Two Birds, One Stone: Jointly Learning Binary Code for Large-Scale Face Image Retrieval and Attributes Prediction
 47An Accurate Iris Segmentation Framework Under Relaxed Imaging Constraints Using Total Variation Model
 48Discriminative Pose-Free Descriptors for Face and Object Matching
 49Bi-Shifting Auto-Encoder for Unsupervised Domain Adaptation
 50Regressive Tree Structured Model for Facial Landmark Localization
 51Person Recognition in Personal Photo Collections
 52Robust Statistical Face Frontalization
 53PIEFA: Personalized Incremental and Ensemble Face Alignment
 54Understanding Everyday Hands in Action From RGB-D Images
 55Example-Based Modeling of Facial Texture From Deficient Data
 56Learning to Predict Saliency on Face Images
 57Group Membership Prediction
 58Extraction of Virtual Baselines From Distorted Document Images Using Curvilinear Projection
 59Robust RGB-D Odometry Using Point and Line Features
 60Learning a Discriminative Model for the Perception of Realism in Composite Images
 61What Makes Tom Hanks Look Like Tom Hanks
 62Wide-Area Image Geolocalization With Aerial Reference Imagery
 63Personalized Age Progression With Aging Dictionary
 64FaceDirector: Continuous Control of Facial Performance in Video
 65Synthesizing Illumination Mosaics From Internet Photo-Collections
 66Hot or Not: Exploring Correlations Between Appearance and Temperature
 67Self-Calibration of Optical Lenses
 68[From Oral 4A] Polarized 3D: High-Quality Depth Sensing With Polarization Cues
 69[From Oral 4A] Airborne Three-Dimensional Cloud Tomography
 70[From Oral 4A] Leave-One-Out Kernel Optimization for Shadow Detection
 71[From Oral 4A] Removing Rain From a Single Image via Discriminative Sparse Coding
 72[From Oral 4A] Mutual-Structure for Joint Filtering
 73[From Oral 4B] SPM-BP: Sped-up PatchMatch Belief Propagation for Continuous MRFs
 74[From Oral 4B] Flow Fields: Dense Correspondence Fields for Highly Accurate Large Displacement Optical Flow Estimation
 75[From Oral 4B] Dense Semantic Correspondence Where Every Pixel is a Classifier
 76[From Oral 4B] Multi-Image Matching via Fast Alternating Minimization
 
 
[12:15-13:15] Oral Session 4B - Motion and Correspondence

  SPM-BP: Sped-up PatchMatch Belief Propagation for Continuous MRFs
  Flow Fields: Dense Correspondence Fields for Highly Accurate Large Displacement Optical Flow Estimation
  Dense Semantic Correspondence Where Every Pixel is a Classifier
  Multi-Image Matching via Fast Alternating Minimization
 
 
[14:45-17:15] Poster Session 4B - Statistical Methods and Learning, Motion and Tracking, and Video Analysis II

 1Differential Recurrent Neural Networks for Action Recognition
 2Similarity Gaussian Process Latent Variable Model for Multi-Modal Data Analysis
 3Learning Ensembles of Potential Functions for Structured Prediction With Latent Variables
 4Simultaneous Deep Transfer Across Domains and Tasks
 5Low Dimensional Explicit Feature Maps
 6Unsupervised Learning of Spatiotemporally Coherent Metrics
 7Multi-Label Cross-Modal Retrieval
 8Improving Ferns Ensembles by Sparsifying and Quantising Posterior Probabilities
 9Beyond Gauss: Image-Set Matching on the Riemannian Manifold of PDFs
 10Unsupervised Domain Adaptation With Imbalanced Cross-Domain Data
 11Secrets of Matrix Factorization: Approximations, Numerics, Manifold Optimization and Random Restarts
 12Geometry-Aware Deep Transform
 13Learning Binary Codes for Maximum Inner Product Search
 14ML-MG: Multi-Label Learning With Missing Labels Using a Mixed Graph
 15Zero-Shot Learning via Semantic Similarity Embedding
 16Bayesian Model Adaptation for Crowd Counts
 17An NMF Perspective on Binary Hashing
 18Multi-View Domain Generalization for Visual Recognition
 19Infinite Feature Selection
 20Semi-Supervised Zero-Shot Classification With Label Representation Learning
 21A Supervised Low-Rank Method for Learning Invariant Subspaces
 22Recursive Fréchet Mean Computation on the Grassmannian and its Applications to Computer Vision
 23Multi-View Subspace Clustering
 24Predicting Deep Zero-Shot Convolutional Neural Networks Using Textual Descriptions
 25Structured Feature Selection
 26Conditional High-Order Boltzmann Machine: A Supervised Learning Model for Relation Learning
 27Learning Image and User Features for Recommendation in Social Networks
 28Dual-Feature Warping-Based Motion Model Estimation
 29An Adaptive Data Representation for Robust Point-Set Registration and Merging
 30Local Subspace Collaborative Tracking
 31Learning Spatially Regularized Correlation Filters for Visual Tracking
 33Unsupervised Trajectory Clustering via Adaptive Multi-Kernel-Based Shrinkage
 34TRIC-track: Tracking by Regression With Incrementally Learned Cascades
 35Recurrent Network Models for Human Dynamics
 36Contour Flow: Middle-Level Motion Estimation by Combining Motion Segmentation and Contour Alignment
 37FollowMe: Efficient Online Min-Cost Flow Tracking With Bounded Memory and Computation
 38Learning to Divide and Conquer for Online Multi-Target Tracking
 39Minimizing Human Effort in Interactive Tracking by Incremental Learning of Model Parameters
 40A Novel Representation of Parts for Accurate 3D Object Detection and Tracking in Monocular Images
 41Linearization to Nonlinear Learning for Visual Tracking
 42Self-Occlusions and Disocclusions in Causal Video Object Segmentation
 43Large Displacement 3D Scene Flow With Occlusion Reasoning
 44Co-Interest Person Detection From Multiple Wearable Camera Videos
 45Sparse Dynamic 3D Reconstruction From Unsynchronized Videos
 46Category-Blind Human Action Recognition: A Practical Recognition System
 47Temporal Subspace Clustering for Human Motion Segmentation
 48Weakly-Supervised Alignment of Video With Text
 49Learning Temporal Embeddings for Complex Video Analysis
 50Unsupervised Semantic Parsing of Video Collections
 51Learning Spatiotemporal Features With 3D Convolutional Networks
 52Temporal Perception and Prediction in Ego-Centric Video
 53Describing Videos by Exploiting Temporal Structure
 54Person Re-Identification With Discriminatively Trained Viewpoint Invariant Dictionaries
 55Storyline Representation of Egocentric Videos With an Applications to Story-Based Search
 56Sequence to Sequence – Video to Text
 57Context Aware Active Learning of Activity Recognition Models
 58Action Recognition by Hierarchical Mid-Level Action Elements
 59Selecting Relevant Web Trained Concepts for Automated Event Retrieval
 60Beyond Covariance: Feature Representation With Nonlinear Kernel Matrices
 61Multiresolution Hierarchy Co-Clustering for Semantic Segmentation in Sequences With Small Variations
 62Objects2action: Classifying and Localizing Actions Without Any Video Example
 63Human Action Recognition Using Factorized Spatio-Temporal Convolutional Networks
 64Bayesian Non-Parametric Inference for Manifold Based MoCap Representation
 65Semantic Video Entity Linking Based on Visual Content and Metadata
 66Love Thy Neighbors: Image Annotation by Exploiting Image Metadata
 67Unsupervised Extraction of Video Highlights Via Robust Recurrent Auto-Encoders
 68Learning Visual Clothing Style With Heterogeneous Dyadic Co-Occurrences
 69Text Flow: A Unified Text Detection System in Natural Scene Images
 70[From Oral 4C] Uncovering Interactions and Interactors: Joint Estimation of Head, Body Orientation and F-Formations From Surveillance Videos
 71[From Oral 4C] Generating Notifications for Missing Actions: Don't Forget to Turn the Lights Off!
 72[From Oral 4C] Partial Person Re-Identification
 73[From Oral 4C] Shape Interaction Matrix Revisited and Robustified: Efficient Subspace Clustering With Corrupted and Incomplete Data
 74[From Oral 4C] Multiple Hypothesis Tracking Revisited
 75[From Oral 4C] Learning to Track: Online Multi-Object Tracking by Decision Making
 
 
[17:15-18:45] Oral Session 4C - Video: Actions, Surveillance and Tracking

  Uncovering Interactions and Interactors: Joint Estimation of Head, Body Orientation and F-Formations From Surveillance Videos
  Generating Notifications for Missing Actions: Don't Forget to Turn the Lights Off!
  Partial Person Re-Identification
  Shape Interaction Matrix Revisited and Robustified: Efficient Subspace Clustering With Corrupted and Incomplete Data
  Multiple Hypothesis Tracking Revisited
  Learning to Track: Online Multi-Object Tracking by Decision Making