Visual Question Answering

Towards AI-Complete Question Answering: A Set of Prerequisite Toy Tasks

Published: 09 Oct 2015

Visualizing and Interpreting Convolutional Neural Network

Papers

Deconvolutional Networks

Visualizing and Understanding Convolutional Networks

Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps

Understanding Deep Image Representations by Inverting Them

deepViz: Visualizing Convolutional Neural Networks for Image Classification

Inverting Convolutional Networks with Convolutional Networks

Understanding Neural Networks Through Deep Visualization

Visualizing Higher-Layer Features of a Deep Network

Generative Modeling of Convolutional Neural Networks

Understanding Intra-Class Knowledge Inside CNN

Learning FRAME Models Using CNN Filters for Knowledge Visualization

Convergent Learning: Do different neural networks learn the same representations?

Visualizing and Understanding Deep Texture Representations

Visualizing Deep Convolutional Neural Networks Using Natural Pre-Images

An Interactive Node-Link Visualization of Convolutional Neural Networks

Learning Deep Features for Discriminative Localization

Multifaceted Feature Visualization: Uncovering the Different Types of Features Learned By Each Neuron in Deep Neural Networks

A New Method to Visualize Deep Neural Networks

A Taxonomy and Library for Visualizing Learned Features in Convolutional Neural Networks

VisualBackProp: visualizing CNNs for autonomous driving

VisualBackProp: efficient visualization of CNNs

Grad-CAM: Why did you say that? Visual Explanations from Deep Networks via Gradient-based Localization

Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization

Grad-CAM: Why did you say that?

Visualizing Residual Networks

Visualizing Deep Neural Network Decisions: Prediction Difference Analysis

ActiVis: Visual Exploration of Industry-Scale Deep Neural Network Models

Picasso: A Neural Network Visualizer

CNN Fixations: An unraveling approach to visualize the discriminative image regions

A Forward-Backward Approach for Visualizing Information Flow in Deep Networks

Using KL-divergence to focus Deep Visual Explanation

https://arxiv.org/abs/1711.06431

An Introduction to Deep Visual Explanation

Visual Explanation by Interpretation: Improving Visual Feedback Capabilities of Deep Neural Networks

https://arxiv.org/abs/1712.06302

Visualizing the Loss Landscape of Neural Nets

Visualizing Deep Similarity Networks

https://arxiv.org/abs/1901.00536

Interpreting Convolutional Neural Networks

Network Dissection: Quantifying Interpretability of Deep Visual Representations

Interpreting Deep Visual Representations via Network Dissection

https://arxiv.org/abs/1711.05611

Methods for Interpreting and Understanding Deep Neural Networks

SVCCA: Singular Vector Canonical Correlation Analysis for Deep Learning Dynamics and Interpretability

Towards Interpretable Deep Neural Networks by Leveraging Adversarial Examples

Interpretable Convolutional Neural Networks

https://arxiv.org/abs/1710.00935

Interpreting Convolutional Neural Networks Through Compression

Interpreting Deep Neural Networks

Interpreting CNNs via Decision Trees

https://arxiv.org/abs/1802.00121

Visual Interpretability for Deep Learning: a Survey

https://arxiv.org/abs/1802.00614

Interpreting Deep Classifier by Visual Distillation of Dark Knowledge

How convolutional neural network see the world - A survey of convolutional neural network visualization methods

Understanding Regularization to Visualize Convolutional Neural Networks

Deeper Interpretability of Deep Networks

Interpretable CNNs

https://arxiv.org/abs/1901.02413

Explaining AlphaGo: Interpreting Contextual Effects in Neural Networks

https://arxiv.org/abs/1901.02184

Interpretable BoW Networks for Adversarial Example Detection

https://arxiv.org/abs/1901.02229

Deep Features Analysis with Attention Networks

Understanding Neural Networks via Feature Visualization: A survey

Explaining Neural Networks via Perturbing Important Learned Features

https://arxiv.org/abs/1911.11081

Interpreting Adversarially Trained Convolutional Neural Networks

Projects

Interactive Deep Neural Net Hallucinations

torch-visbox

draw_convnet: Python script for illustrating Convolutional Neural Network (ConvNet)

Caffe prototxt visualization

Keras Visualization Toolkit

mNeuron: A Matlab Plugin to Visualize Neurons from Deep Models

cnnvis-pytorch

VisualDL

Blogs

“Visualizing GoogLeNet Classes”

http://auduno.com/post/125362849838/visualizing-googlenet-classes

Visualizing CNN architectures side by side with mxnet

How convolutional neural networks see the world: An exploration of convnet filters with Keras

Visualizing Deep Learning with t-SNE (Tutorial and Video)

Peeking inside Convnets

Visualizing Features from a Convolutional Neural Network

Visualizing Deep Neural Networks Classes and Features

http://ankivil.com/visualizing-deep-neural-networks-classes-and-features/

Visualizing parts of Convolutional Neural Networks using Keras and Cats

Visualizing convolutional neural networks

Tools

Topological Visualisation of a Convolutional Neural Network

http://terencebroad.com/convnetvis/vis.html

Visualization of Places-CNN and ImageNet CNN

Visualization of a feed forward Neural Network using MNIST dataset

CNNVis: Towards Better Analysis of Deep Convolutional Neural Networks

http://shixialiu.com/publications/cnnvis/demo/

Quiver: Interactive convnet features visualization for Keras

Netron

Published: 09 Oct 2015

Video Applications

Papers

Published: 09 Oct 2015

Unsupervised Learning

Restricted Boltzmann Machine (RBM)

Published: 09 Oct 2015

Transfer Learning

Papers

Published: 09 Oct 2015

Training Deep Neural Networks

Tutorials

Published: 09 Oct 2015

Tracking

Learning A Deep Compact Image Representation for Visual Tracking

Hierarchical Convolutional Features for Visual Tracking

Robust Visual Tracking via Convolutional Networks

Transferring Rich Feature Hierarchies for Robust Visual Tracking

Learning Multi-Domain Convolutional Neural Networks for Visual Tracking

RATM: Recurrent Attentive Tracking Model

Understanding and Diagnosing Visual Tracking Systems

Recurrently Target-Attending Tracking

Visual Tracking with Fully Convolutional Networks

Deep Tracking: Seeing Beyond Seeing Using Recurrent Neural Networks

Learning to Track at 100 FPS with Deep Regression Networks

Learning by tracking: Siamese CNN for robust target association

Fully-Convolutional Siamese Networks for Object Tracking

Hedged Deep Tracking

Spatially Supervised Recurrent Convolutional Neural Networks for Visual Object Tracking

Visual Tracking via Shallow and Deep Collaborative Model

Beyond Correlation Filters: Learning Continuous Convolution Operators for Visual Tracking

Unsupervised Learning from Continuous Video in a Scalable Predictive Recurrent Network

Modeling and Propagating CNNs in a Tree Structure for Visual Tracking

Robust Scale Adaptive Kernel Correlation Filter Tracker With Hierarchical Convolutional Features

Deep Tracking on the Move: Learning to Track the World from a Moving Vehicle using Recurrent Neural Networks

OTB Results: visual tracker benchmark results

Convolutional Regression for Visual Tracking

Semantic tracking: Single-target tracking with inter-supervised convolutional networks

SANet: Structure-Aware Network for Visual Tracking

ECO: Efficient Convolution Operators for Tracking

Dual Deep Network for Visual Tracking

Deep Motion Features for Visual Tracking

Globally Optimal Object Tracking with Fully Convolutional Networks

Robust and Real-time Deep Tracking Via Multi-Scale Domain Adaptation

Tracking The Untrackable: Learning To Track Multiple Cues with Long-Term Dependencies

Large Margin Object Tracking with Circulant Feature Maps

DCFNet: Discriminant Correlation Filters Network for Visual Tracking

End-to-end representation learning for Correlation Filter based tracking

Context-Aware Correlation Filter Tracking

Robust Multi-view Pedestrian Tracking Using Neural Networks

https://arxiv.org/abs/1704.06370

Re3: Real-Time Recurrent Regression Networks for Object Tracking

Robust Tracking Using Region Proposal Networks

https://arxiv.org/abs/1705.10447

Hierarchical Attentive Recurrent Tracking

Siamese Learning Visual Tracking: A Survey

https://arxiv.org/abs/1707.00569

Robust Visual Tracking via Hierarchical Convolutional Features

CREST: Convolutional Residual Learning for Visual Tracking

Learning Policies for Adaptive Tracking with Deep Feature Cascades

Recurrent Filter Learning for Visual Tracking

Correlation Filters with Weighted Convolution Responses

Semantic Texture for Robust Dense Tracking

https://arxiv.org/abs/1708.08844

Learning Multi-frame Visual Representation for Joint Detection and Tracking of Small Objects

Differentiating Objects by Motion: Joint Detection and Tracking of Small Flying Objects

https://arxiv.org/abs/1709.04666

Tracking Persons-of-Interest via Unsupervised Representation Adaptation

End-to-end Flow Correlation Tracking with Spatial-temporal Attention

https://arxiv.org/abs/1711.01124

UCT: Learning Unified Convolutional Networks for Real-time Visual Tracking

Pixel-wise object tracking

https://arxiv.org/abs/1711.07377

MAVOT: Memory-Augmented Video Object Tracking

https://arxiv.org/abs/1711.09414

Learning Hierarchical Features for Visual Object Tracking with Recursive Neural Networks

https://arxiv.org/abs/1801.02021

Parallel Tracking and Verifying

https://arxiv.org/abs/1801.10496

Saliency-Enhanced Robust Visual Tracking

https://arxiv.org/abs/1802.02783

A Twofold Siamese Network for Real-Time Object Tracking

Learning Dynamic Memory Networks for Object Tracking

https://arxiv.org/abs/1803.07268

Context-aware Deep Feature Compression for High-speed Visual Tracking

VITAL: VIsual Tracking via Adversarial Learning

Unveiling the Power of Deep Tracking

https://arxiv.org/abs/1804.06833

A Novel Low-cost FPGA-based Real-time Object Tracking System

MV-YOLO: Motion Vector-aided Tracking by Semantic Object Detection

https://arxiv.org/abs/1805.00107

Information-Maximizing Sampling to Promote Tracking-by-Detection

https://arxiv.org/abs/1806.02523

Instance Segmentation and Tracking with Cosine Embeddings and Recurrent Hourglass Networks

Stochastic Channel Decorrelation Network and Its Application to Visual Tracking

https://arxiv.org/abs/1807.01103

Fast Dynamic Convolutional Neural Networks for Visual Tracking

https://arxiv.org/abs/1807.03132

DeepTAM: Deep Tracking and Mapping

https://arxiv.org/abs/1808.01900

Distractor-aware Siamese Networks for Visual Object Tracking

Multi-Branch Siamese Networks with Online Selection for Object Tracking

Real-Time MDNet

Towards a Better Match in Siamese Network Based Visual Object Tracker

DensSiam: End-to-End Densely-Siamese Network with Self-Attention Model for Object Tracking

Deformable Object Tracking with Gated Fusion

https://arxiv.org/abs/1809.10417

Deep Attentive Tracking via Reciprocative Learning

Online Visual Robot Tracking and Identification using Deep LSTM Networks

  • intro: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Vancouver, Canada, 2017. IROS RoboCup Best Paper Award
  • arxiv: https://arxiv.org/abs/1810.04941

Detect or Track: Towards Cost-Effective Video Object Detection/Tracking

Deep Siamese Networks with Bayesian non-Parametrics for Video Object Tracking

https://arxiv.org/abs/1811.07386

Fast Online Object Tracking and Segmentation: A Unifying Approach

Siamese Cascaded Region Proposal Networks for Real-Time Visual Tracking

Handcrafted and Deep Trackers: A Review of Recent Object Tracking Approaches

https://arxiv.org/abs/1812.07368

SiamRPN++: Evolution of Siamese Visual Tracking with Very Deep Networks

https://arxiv.org/abs/1812.11703

Deeper and Wider Siamese Networks for Real-Time Visual Tracking

https://arxiv.org/abs/1901.01660

SiamVGG: Visual Tracking using Deeper Siamese Networks

https://arxiv.org/abs/1902.02804

TrackNet: Simultaneous Object Detection and Tracking and Its Application in Traffic Video Analysis

https://arxiv.org/abs/1902.01466

Target-Aware Deep Tracking

  • intro: CVPR 2019
  • intro: Harbin Institute of Technology & Shanghai Jiao Tong University & Tencent AI Lab & University of California & Google Cloud AI
  • arxiv: https://arxiv.org/abs/1904.01772

Unsupervised Deep Tracking

Generic Multiview Visual Tracking

https://arxiv.org/abs/1904.02553

SPM-Tracker: Series-Parallel Matching for Real-Time Visual Object Tracking

A Strong Feature Representation for Siamese Network Tracker

https://arxiv.org/abs/1907.07880

Visual Tracking via Dynamic Memory Networks

Multi-Adapter RGBT Tracking

Teacher-Students Knowledge Distillation for Siamese Trackers

https://arxiv.org/abs/1907.10586

Tell Me What to Track

Learning to Track Any Object

ROI Pooled Correlation Filters for Visual Tracking

D3S – A Discriminative Single Shot Segmentation Tracker

Visual Tracking by TridentAlign and Context Embedding

Transformer Tracking

Face Tracking

Mobile Face Tracking: A Survey and Benchmark

https://arxiv.org/abs/1805.09749

Multi-Object Tracking (MOT)

Simple Online and Realtime Tracking

Simple Online and Realtime Tracking with a Deep Association Metric

StrongSORT: Make DeepSORT Great Again

Observation-Centric SORT: Rethinking SORT for Robust Multi-Object Tracking

BoT-SORT: Robust Associations Multi-Pedestrian Tracking

Virtual Worlds as Proxy for Multi-Object Tracking Analysis

Multi-Class Multi-Object Tracking using Changing Point Detection

POI: Multiple Object Tracking with High Performance Detection and Appearance Feature

Multiple Object Tracking: A Literature Review

Deep Network Flow for Multi-Object Tracking

Online Multi-Object Tracking Using CNN-based Single Object Tracker with Spatial-Temporal Attention Mechanism

https://arxiv.org/abs/1708.02843

Recurrent Autoregressive Networks for Online Multi-Object Tracking

https://arxiv.org/abs/1711.02741

SOT for MOT

Multi-Target, Multi-Camera Tracking by Hierarchical Clustering: Recent Progress on DukeMTMC Project

https://arxiv.org/abs/1712.09531

Multiple Target Tracking by Learning Feature Representation and Distance Metric Jointly

https://arxiv.org/abs/1802.03252

Tracking Noisy Targets: A Review of Recent Object Tracking Approaches

https://arxiv.org/abs/1802.03098

Machine Learning Methods for Solving Assignment Problems in Multi-Target Tracking

Learning to Detect and Track Visible and Occluded Body Joints in a Virtual World

Features for Multi-Target Multi-Camera Tracking and Re-Identification

High Performance Visual Tracking with Siamese Region Proposal Network

Trajectory Factory: Tracklet Cleaving and Re-connection by Deep Siamese Bi-GRU for Multiple Object Tracking

Automatic Adaptation of Person Association for Multiview Tracking in Group Activities

Improving Online Multiple Object tracking with Deep Metric Learning

https://arxiv.org/abs/1806.07592

Tracklet Association Tracker: An End-to-End Learning-based Association Approach for Multi-Object Tracking

Multiple Object Tracking in Urban Traffic Scenes with a Multiclass Object Detector

Tracking by Animation: Unsupervised Learning of Multi-Object Attentive Trackers

https://arxiv.org/abs/1809.03137

Deep Affinity Network for Multiple Object Tracking

Exploit the Connectivity: Multi-Object Tracking with TrackletNet

https://arxiv.org/abs/1811.07258

Multi-Object Tracking with Multiple Cues and Switcher-Aware Classification

Online Multi-Object Tracking with Dual Matching Attention Networks

Online Multi-Object Tracking with Instance-Aware Tracker and Dynamic Model Refreshment

https://arxiv.org/abs/1902.08231

Tracking without bells and whistles

Spatial-Temporal Relation Networks for Multi-Object Tracking

Fooling Detection Alone is Not Enough: First Adversarial Attack against Multiple Object Tracking

State-aware Re-identification Feature for Multi-target Multi-camera Tracking

DeepMOT: A Differentiable Framework for Training Multiple Object Trackers

Graph Neural Based End-to-end Data Association Framework for Online Multiple-Object Tracking

End-to-End Learning Deep CRF models for Multi-Object Tracking

https://arxiv.org/abs/1907.12176

End-to-end Recurrent Multi-Object Tracking and Trajectory Prediction with Relational Reasoning

Robust Multi-Modality Multi-Object Tracking

Learning Multi-Object Tracking and Segmentation from Automatic Annotations

https://arxiv.org/abs/1912.02096

Learning a Neural Solver for Multiple Object Tracking

Multi-object Tracking via End-to-end Tracklet Searching and Ranking

Refinements in Motion and Appearance for Online Multi-Object Tracking

https://arxiv.org/abs/2003.07177

A Unified Object Motion and Affinity Model for Online Multi-Object Tracking

A Simple Baseline for Multi-Object Tracking

MOPT: Multi-Object Panoptic Tracking

SQE: a Self Quality Evaluation Metric for Parameters Optimization in Multi-Object Tracking

Multi-Object Tracking with Siamese Track-RCNN

TubeTK: Adopting Tubes to Track Multi-Object in a One-Step Training Model

Quasi-Dense Similarity Learning for Multiple Object Tracking

Simultaneous Detection and Tracking with Motion Modelling for Multiple Object Tracking

MAT: Motion-Aware Multi-Object Tracking

https://arxiv.org/abs/2009.04794

SAMOT: Switcher-Aware Multi-Object Tracking and Still Another MOT Measure

https://arxiv.org/abs/2009.10338

GCNNMatch: Graph Convolutional Neural Networks for Multi-Object Tracking via Sinkhorn Normalization

Rethinking the competition between detection and ReID in Multi-Object Tracking

GMOT-40: A Benchmark for Generic Multiple Object Tracking

Multi-object Tracking with a Hierarchical Single-branch Network

https://arxiv.org/abs/2101.01984

Discriminative Appearance Modeling with Multi-track Pooling for Real-time Multi-object Tracking

Learning a Proposal Classifier for Multiple Object Tracking

Track to Detect and Segment: An Online Multi-Object Tracker

Learnable Graph Matching: Incorporating Graph Partitioning with Deep Feature Learning for Multiple Object Tracking

Multiple Object Tracking with Correlation Learning

ByteTrack: Multi-Object Tracking by Associating Every Detection Box

SiamMOT: Siamese Multi-Object Tracking

Synthetic Data Are as Good as the Real for Association Knowledge Learning in Multi-object Tracking

Learning of Global Objective for Network Flow in Multi-Object Tracking

MeMOT: Multi-Object Tracking with Memory

TR-MOT: Multi-Object Tracking by Reference

Towards Grand Unification of Object Tracking

Tracking Every Thing in the Wild

Transformer

TransTrack: Multiple-Object Tracking with Transformer

TrackFormer: Multi-Object Tracking with Transformers

TransCenter: Transformers with Dense Queries for Multiple-Object Tracking

Looking Beyond Two Frames: End-to-End Multi-Object Tracking Using Spatial and Temporal Transformers

TransMOT: Spatial-Temporal Graph Transformer for Multiple Object Tracking

MOTR: End-to-End Multiple-Object Tracking with TRansformer

Global Tracking Transformers

Multiple People Tracking

Multi-Person Tracking by Multicut and Deep Matching

Joint Flow: Temporal Flow Fields for Multi Person Tracking

Multiple People Tracking by Lifted Multicut and Person Re-identification

Tracking by Prediction: A Deep Generative Model for Mutli-Person localisation and Tracking

Real-time Multiple People Tracking with Deeply Learned Candidate Selection and Person Re-Identification

Deep Person Re-identification for Probabilistic Data Association in Multiple Pedestrian Tracking

https://arxiv.org/abs/1810.08565

Multiple People Tracking Using Hierarchical Deep Tracklet Re-identification

https://arxiv.org/abs/1811.04091

Multi-person Articulated Tracking with Spatial and Temporal Embeddings

Instance-Aware Representation Learning and Association for Online Multi-Person Tracking

  • intro: Pattern Recognition
  • intro: Sun Yat-sen University & Guangdong University of Foreign Studies & Carnegie Mellon University & University of California & Guilin University of Electronic Technology & WINNER Technology
  • arxiv: https://arxiv.org/abs/1905.12409

Online Multiple Pedestrian Tracking using Deep Temporal Appearance Matching Association

Detecting Invisible People

MOTS

MOTS: Multi-Object Tracking and Segmentation

Segment as Points for Efficient Online Multi-Object Tracking and Segmentation

PointTrack++ for Effective Online Multi-Object Tracking and Segmentation

Prototypical Cross-Attention Networks for Multiple Object Tracking and Segmentation

Multi-Object Tracking and Segmentation with a Space-Time Memory Network

Multi-target multi-camera tracking (MTMCT)

Traffic-Aware Multi-Camera Tracking of Vehicles Based on ReID and Camera Link Model

3D MOT

A Baseline for 3D Multi-Object Tracking

Probabilistic 3D Multi-Object Tracking for Autonomous Driving

JRMOT: A Real-Time 3D Multi-Object Tracker and a New Large-Scale Dataset

Real-time 3D Deep Multi-Camera Tracking

P2B: Point-to-Box Network for 3D Object Tracking in Point Clouds

PnPNet: End-to-End Perception and Prediction with Tracking in the Loop

GNN3DMOT: Graph Neural Network for 3D Multi-Object Tracking with Multi-Feature Learning

1st Place Solutions for Waymo Open Dataset Challenges – 2D and 3D Tracking

Graph Neural Networks for 3D Multi-Object Tracking

Learnable Online Graph Representations for 3D Multi-Object Tracking

https://arxiv.org/abs/2104.11747

SimpleTrack: Understanding and Rethinking 3D Multi-object Tracking

Immortal Tracker: Tracklet Never Dies

Single Stage Joint Detection and Tracking

Bridging the Gap Between Detection and Tracking: A Unified Approach

Towards Real-Time Multi-Object Tracking

RetinaTrack: Online Single Stage Joint Detection and Tracking

Tracking Objects as Points

Fully Convolutional Online Tracking

Accurate Anchor Free Tracking

Ocean: Object-aware Anchor-free Tracking

Joint Detection and Multi-Object Tracking with Graph Neural Networks

Joint Multiple-Object Detection and Tracking

Chained-Tracker: Chaining Paired Attentive Regression Results for End-to-End Joint Multiple-Object Detection and Tracking

SMOT: Single-Shot Multi Object Tracking

https://arxiv.org/abs/2010.16031

DEFT: Detection Embeddings for Tracking

Global Correlation Network: End-to-End Joint Multi-Object Detection and Tracking

Tracking with Reinforcement Learning

Deep Reinforcement Learning for Visual Object Tracking in Videos

Visual Tracking by Reinforced Decision Making

End-to-end Active Object Tracking via Reinforcement Learning

https://arxiv.org/abs/1705.10561

Action-Decision Networks for Visual Tracking with Deep Reinforcement Learning

Tracking as Online Decision-Making: Learning a Policy from Streaming Videos with Reinforcement Learning

https://arxiv.org/abs/1707.04991

Detect to Track and Track to Detect

Projects

MMTracking

  • intro: OpenMMLab Video Perception Toolbox. It supports Single Object Tracking (SOT), Multiple Object Tracking (MOT), and Video Object Detection (VID) within a unified framework.
  • github: https://github.com/open-mmlab/mmtracking

Tensorflow_Object_Tracking_Video

Resources

Multi-Object-Tracking-Paper-List

Published: 09 Oct 2015

Super-Resolution

Papers

Published: 09 Oct 2015