Deep Learning Resources

Published: 09 Oct 2015 Category: deep_learning


Single-model on 224x224

Method top1 top5 Model Size Speed
ResNet-101 78.0% 94.0%    
ResNet-200 78.3% 94.2%    
ResNet-50 77.8%      
ResNet-101 79.6% 94.7%    

Single-model on 320×320 / 299×299

Method top1 top5 Model Size Speed
ResNet-200 79.9% 95.2%    
Inception-v3 78.8% 94.4%    
Inception-v4 80.0% 95.0%    
Inception-ResNet-v2 80.1% 95.1%    
ResNet-101 80.9% 95.6%    


ImageNet Classification with Deep Convolutional Neural Networks

Network In Network

Network In Network

Batch-normalized Maxout Network in Network

GoogLeNet (Inception V1)

Going Deeper with Convolutions

Building a deeper understanding of images


Very Deep Convolutional Networks for Large-Scale Image Recognition

Tensorflow VGG16 and VGG19


Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

ImageNet pre-trained models with batch normalization


Inception-V3 = Inception-V2 + BN-auxiliary (fully connected layer of the auxiliary classifier is also batch-normalized, not just the convolutions)

Rethinking the Inception Architecture for Computer Vision

Inception in TensorFlow

Train your own image classifier with Inception in TensorFlow

Notes on the TensorFlow Implementation of Inception v3

Training an InceptionV3-based image classifier with your own dataset

Inception-BN full for Caffe: Inception-BN ImageNet (21K classes) model for Caffe


Deep Residual Learning for Image Recognition

Third-party re-implementations

Training and investigating Residual Nets

resnet.torch: an updated version of fb.resnet.torch with many changes.

Highway Networks and Deep Residual Networks

Interpretating Deep Residual Learning Blocks as Locally Recurrent Connections

Lab41 Reading Group: Deep Residual Learning for Image Recognition

50-layer ResNet, trained on ImageNet, classifying webcam

Reproduced ResNet on CIFAR-10 and CIFAR-100 dataset.


Identity Mappings in Deep Residual Networks

Deep Residual Networks for Image Classification with Python + NumPy

Inception-V4 / Inception-ResNet-V2

Inception-V4, Inception-Resnet And The Impact Of Residual Connections On Learning

The inception-resnet-v2 models trained from scratch via torch

Inception v4 in Keras


Aggregated Residual Transformations for Deep Neural Networks

Residual Networks Variants

Resnet in Resnet: Generalizing Residual Architectures

Residual Networks are Exponential Ensembles of Relatively Shallow Networks

Wide Residual Networks

Residual Networks of Residual Networks: Multilevel Residual Networks

Multi-Residual Networks

Deep Pyramidal Residual Networks

Learning Identity Mappings with Residual Gates

Wider or Deeper: Revisiting the ResNet Model for Visual Recognition

Deep Pyramidal Residual Networks with Separated Stochastic Depth

Spatially Adaptive Computation Time for Residual Networks

ShaResNet: reducing residual network parameter number by sharing weights

Sharing Residual Units Through Collective Tensor Factorization in Deep Neural Networks

Residual Attention Network for Image Classification

Dilated Residual Networks

Dynamic Steerable Blocks in Deep Residual Networks

Learning Deep ResNet Blocks Sequentially using Boosting Theory

ImageNet Projects

Training an Object Classifier in Torch-7 on multiple GPUs over ImageNet

Deep Learning And Bayesian

Scalable Bayesian Optimization Using Deep Neural Networks

Bayesian Dark Knowledge

Memory-based Bayesian Reasoning with Deep Learning

Towards Bayesian Deep Learning: A Survey

Towards Bayesian Deep Learning: A Framework and Some Existing Methods

Bayesian Deep Learning: Neural Networks in PyMC3 estimated with Variational Inference

Bayesian Deep Learning Part II: Bridging PyMC3 and Lasagne to build a Hierarchical Neural Network

Deep Learning: A Bayesian Perspective

Semi-Supervised Learning

Semi-Supervised Learning with Graphs

Semi-Supervised Learning with Ladder Networks

Semi-supervised Feature Transfer: The Practical Benefit of Deep Learning Today?

Temporal Ensembling for Semi-Supervised Learning

Semi-supervised Knowledge Transfer for Deep Learning from Private Training Data

Infinite Variational Autoencoder for Semi-Supervised Learning

Transfer Learning

Discriminative Transfer Learning with Tree-based Priors

How transferable are features in deep neural networks?

Learning and Transferring Mid-Level Image Representations using Convolutional Neural Networks

Transferring Knowledge from a RNN to a DNN

Simultaneous Deep Transfer Across Domains and Tasks

Net2Net: Accelerating Learning via Knowledge Transfer

Transfer Learning from Deep Features for Remote Sensing and Poverty Mapping

A theoretical framework for deep transfer learning

Transfer learning using neon

Hyperparameter Transfer Learning through Surrogate Alignment for Efficient Deep Neural Network Training

What makes ImageNet good for transfer learning?

Fine-tuning a Keras model using Theano trained Neural Network & Introduction to Transfer Learning

Multi-source Transfer Learning with Convolutional Neural Networks for Lung Pattern Analysis

Borrowing Treasures from the Wealthy: Deep Transfer Learning through Selective Joint Fine-tuning

Multi-label Learning

CNN: Single-label to Multi-label

Deep Learning for Multi-label Classification

Predicting Unseen Labels using Label Hierarchies in Large-Scale Multi-label Learning

Learning with a Wasserstein Loss

From Softmax to Sparsemax: A Sparse Model of Attention and Multi-Label Classification

CNN-RNN: A Unified Framework for Multi-label Image Classification

Improving Multi-label Learning with Missing Labels by Structured Semantic Correlations

Extreme Multi-label Loss Functions for Recommendation, Tagging, Ranking & Other Missing Label Applications

Multi-Label Image Classification with Regional Latent Semantic Dependencies

Privileged Multi-label Learning

Multi-task Learning

Multitask Learning / Domain Adaptation

multi-task learning

Learning and Transferring Multi-task Deep Representation for Face Alignment

Multi-task learning of facial landmarks and expression

Multi-Task Deep Visual-Semantic Embedding for Video Thumbnail Selection

Learning Multiple Tasks with Deep Relationship Networks

Learning deep representation of multityped objects and tasks

Cross-stitch Networks for Multi-task Learning

Multi-Task Learning in Tensorflow (Part 1)

Deep Multi-Task Learning with Shared Memory

Learning to Push by Grasping: Using multiple tasks for effective learning

Identifying beneficial task relations for multi-task learning in deep neural networks

Multi-Task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics

One Model To Learn Them All

MultiModel: Multi-Task Machine Learning Across Domains

An Overview of Multi-Task Learning in Deep Neural Networks

Multi-modal Learning

Multimodal Deep Learning

Multimodal Convolutional Neural Networks for Matching Image and Sentence

A C++ library for Multimodal Deep Learning

Multimodal Learning for Image Captioning and Visual Question Answering

Multi modal retrieval and generation with deep distributed models

Learning Aligned Cross-Modal Representations from Weakly Aligned Data

Variational methods for Conditional Multimodal Deep Learning

Training and Evaluating Multimodal Word Embeddings with Large-scale Web Annotated Images

Deep Multi-Modal Image Correspondence Learning

Multimodal Deep Learning (D4L4 Deep Learning for Speech and Language UPC 2017)

Debugging Deep Learning

Some tips for debugging deep learning

Introduction to debugging neural networks

How to Visualize, Monitor and Debug Neural Network Learning

Learning from learning curves

Understanding CNN

Understanding the Effective Receptive Field in Deep Convolutional Neural Networks

Adversarial Examples of Deep Learning

Intriguing properties of neural networks

Deep Neural Networks are Easily Fooled: High Confidence Predictions for Unrecognizable Images

Explaining and Harnessing Adversarial Examples

Distributional Smoothing with Virtual Adversarial Training

Confusing Deep Convolution Networks by Relabelling

Exploring the Space of Adversarial Images

Learning with a Strong Adversary

Adversarial examples in the physical world

DeepFool: a simple and accurate method to fool deep neural networks

Adversarial Autoencoders

Understanding Adversarial Training: Increasing Local Stability of Neural Nets through Robust Optimization

(Deep Learning’s Deep Flaws)’s Deep Flaws (By Zachary Chase Lipton)

Deep Learning Adversarial Examples – Clarifying Misconceptions

Adversarial Machines: Fooling A.Is (and turn everyone into a Manga)

How to trick a neural network into thinking a panda is a vulture

Assessing Threat of Adversarial Examples on Deep Neural Networks

Safety Verification of Deep Neural Networks

Adversarial Machine Learning at Scale

Feature Squeezing: Detecting Adversarial Examples in Deep Neural Networks

Parseval Networks: Improving Robustness to Adversarial Examples

Towards Deep Learning Models Resistant to Adversarial Attacks

NO Need to Worry about Adversarial Examples in Object Detection in Autonomous Vehicles

Deep Learning Networks

PCANet: A Simple Deep Learning Baseline for Image Classification?

Convolutional Kernel Networks

Deeply-supervised Nets

FitNets: Hints for Thin Deep Nets

Striving for Simplicity: The All Convolutional Net

How these researchers tried something unconventional to come out with a smaller yet better Image Recognition.

Pointer Networks

Pointer Networks in TensorFlow (with sample code)

Rectified Factor Networks

Correlational Neural Networks

Diversity Networks

Competitive Multi-scale Convolution

A Unified Approach for Learning the Parameters of Sum-Product Networks (SPN)

Awesome Sum-Product Networks

Recombinator Networks: Learning Coarse-to-Fine Feature Aggregation

Dynamic Capacity Networks

Bitwise Neural Networks

Learning Discriminative Features via Label Consistent Neural Network

A Theory of Generative ConvNet

How to Train Deep Variational Autoencoders and Probabilistic Ladder Networks

Group Equivariant Convolutional Networks (G-CNNs)

Deep Spiking Networks

Low-rank passthrough neural networks

Single Image 3D Interpreter Network

Deeply-Fused Nets

SNN: Stacked Neural Networks

Universal Correspondence Network

Progressive Neural Networks

Holistic SparseCNN: Forging the Trident of Accuracy, Speed, and Size

Mollifying Networks

Domain Separation Networks

Local Binary Convolutional Neural Networks

Densely Connected Convolutional Networks

CliqueCNN: Deep Unsupervised Exemplar Learning

Convexified Convolutional Neural Networks

Multi-scale brain networks

Warped Convolutions: Efficient Invariance to Spatial Transformations

Input Convex Neural Networks



X-CNN: Cross-modal Convolutional Neural Networks for Sparse Datasets

Tensor Switching Networks

BranchyNet: Fast Inference via Early Exiting from Deep Neural Networks

Spectral Convolution Networks

DelugeNets: Deep Networks with Massive and Flexible Cross-layer Information Inflows

PolyNet: A Pursuit of Structural Diversity in Very Deep Networks

Weakly Supervised Cascaded Convolutional Networks

DeepSetNet: Predicting Sets with Deep Neural Networks

Steerable CNNs

Feedback Networks

Oriented Response Networks

OptNet: Differentiable Optimization as a Layer in Neural Networks

A fast and differentiable QP solver for PyTorch

Meta Networks

Deformable Convolutional Networks

Second-order Convolutional Neural Networks

Gabor Convolutional Networks

Deep Rotation Equivariant Network

Dense Transformer Networks

Deep Complex Networks

DiracNets: Training Very Deep Neural Networks Without Skip-Connections

Dual Path Networks

Primal-Dual Group Convolutions for Deep Neural Networks

Highway Networks

Highway Networks

Highway Networks with TensorFlow

Very Deep Learning with Highway Networks

Training Very Deep Networks

Spatial Transformer Networks

Spatial Transformer Networks

The power of Spatial Transformer Networks

Recurrent Spatial Transformer Networks

Deep Learning Paper Implementations: Spatial Transformer Networks - Part I


FractalNet: Ultra-Deep Neural Networks without Residuals

Graph Convolutional Networks

Learning Convolutional Neural Networks for Graphs

Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering

Semi-Supervised Classification with Graph Convolutional Networks

Graph Based Convolutional Neural Network

How powerful are Graph Convolutions? (review of Kipf & Welling, 2016)

Graph Convolutional Networks

DeepGraph: Graph Structure Predicts Network Growth

Deep Learning with Sets and Point Clouds

Deep Learning on Graphs

Robust Spatial Filtering with Graph Convolutional Neural Networks

Modeling Relational Data with Graph Convolutional Networks

Distance Metric Learning using Graph Convolutional Networks: Application to Functional Brain Networks

Deep Learning on Graphs with Graph Convolutional Networks

Deep Learning on Graphs with Keras

Deep Learning with Traditional Machine Learning Methods

Bag of Words (BoW)

Deep Learning Transcends the Bag of Words


Deep Boosting


Training Deep Neural Networks on Noisy Labels with Bootstrapping

Conditional Random Fields

DeepCRF: Neural Networks and CRFs for Sequence Labeling

Decision Tree

Deep Neural Decision Forests

Neural Network and Decision Tree

Decision Forests, Convolutional Networks and the Models in-Between

Dictionary Learning

Greedy Deep Dictionary Learning

Sparse Factorization Layers for Neural Networks with Limited Supervision

Fisher Vectors

Backpropagation Training for Fisher Vectors within Neural Networks

Gaussian Processes

Questions on Deep Gaussian Processes

Qs – Deep Gaussian Processes

Practical Learning of Deep Gaussian Processes via Random Fourier Features

Deep Learning with Gaussian Process

Doubly Stochastic Variational Inference for Deep Gaussian Processes


Unsupervised Neural Hidden Markov Models

Kernel Methods

Kernel Methods for Deep Learning

Deep Kernel Learning

Stochastic Variational Deep Kernel Learning

A Deep Learning Approach To Multiple Kernel Fusion


Large-scale Learning with SVM and Convolutional for Generic Object Categorization

Convolutional Neural Support Vector Machines:Hybrid Visual Pattern Classifiers for Multi-robot Systems

Deep Learning using Linear Support Vector Machines

Deep Support Vector Machines

Random Forest

Towards the effectiveness of Deep Convolutional Neural Network based Fast Random Forest Classifier

Deep Forest: Towards An Alternative to Deep Neural Networks

Forward Thinking: Building Deep Random Forests


Deep Markov Random Field for Image Modeling

Deep, Dense, and Low-Rank Gaussian Conditional Random Fields

Deep Probabilistic Programming with Edward

Deep Bayesian Active Learning with Image Data

Deep Robust Kalman Filter

Deep Learning and Robots

Robot Learning Manipulation Action Plans by “Watching” Unconstrained Videos from the World Wide Web

End-to-End Training of Deep Visuomotor Policies

Comment on Open AI’s Efforts to Robot Learning

The Curious Robot: Learning Visual Representations via Physical Interactions

How to build a robot that “sees” with $100 and TensorFlow

Deep Visual Foresight for Planning Robot Motion

Sim-to-Real Robot Learning from Pixels with Progressive Nets

Towards Lifelong Self-Supervision: A Deep Learning Direction for Robotics

A Differentiable Physics Engine for Deep Learning in Robotics

Deep-learning in Mobile Robotics - from Perception to Control Systems: A Survey on Why and Why not

Deep Robotic Learning

Deep Learning on Mobile Devices

Convolutional neural networks on the iPhone with VGGNet

TensorFlow for Mobile Poets

The Convolutional Neural Network(CNN) for Android

TensorFlow on Android

Experimenting with TensorFlow on Android frees AI from the prison of the supercomputer

Embedded Deep Learning

Embedded Deep Learning with NVIDIA Jetson

Deep Learning in Finance

Deep Learning in Finance

A Survey of Deep Learning Techniques Applied to Trading

Deep Learning and Long-Term Investing

Deep Learning in Trading

Research to Products: Machine & Human Intelligence in Finance

eep Neural Networks for Real-time Market Predictions

Deep Learning the Stock Market


Neural networks for algorithmic trading. Multivariate time series

Deep-Trading: Algorithmic trading with deep learning experiments

Neural networks for algorithmic trading. Multimodal and multitask deep learning

Deep Learning with Python in Finance - Singapore Python User Group

A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem

Deep Learning in Speech

Deep Speech 2: End-to-End Speech Recognition in English and Mandarin

End-to-end speech recognition with neon


WaveNet: A Generative Model for Raw Audio

A TensorFlow implementation of DeepMind’s WaveNet paper for text generation.

Fast Wavenet Generation Algorithm

Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind’s WaveNet and tensorflow

Wav2Letter: an End-to-End ConvNet-based Speech Recognition System

TristouNet: Triplet Loss for Speaker Turn Embedding

Speech Recognion and Deep Learning

Robust end-to-end deep audiovisual speech recognition

An Experimental Comparison of Deep Neural Networks for End-to-end Speech Recognition

Recurrent Deep Stacking Networks for Speech Recognition

Towards End-to-End Speech Recognition with Deep Convolutional Neural Networks

Deep Learning for Sound / Music


Suggesting Sounds for Images from Video Collections

Disney AI System Associates Images with Sounds

Convolutional Recurrent Neural Networks for Bird Audio Detection


Learning Features of Music from Scratch

DeepBach: a Steerable Model for Bach chorales generation

Deep Learning for Music

First International Workshop on Deep Learning and Music

Deep Learning on Games

TorchCraft: a Library for Machine Learning Research on Real-Time Strategy Games

BlizzCon 2016 DeepMind and StarCraft II Deep Learning Panel Transcript

DeepStack: Expert-Level Artificial Intelligence in No-Limit Poker

Gym StarCraft: StarCraft environment for OpenAI Gym, based on Facebook’s TorchCraft

  • intro: Gym StarCraft is an environment bundle for OpenAI Gym. It is based on Facebook’s TorchCraft, which is a bridge between Torch and StarCraft for AI research.
  • github:

Multiagent Bidirectionally-Coordinated Nets for Learning to Play StarCraft Combat Games

Learning Macromanagement in StarCraft from Replays using Deep Learning

Deep Learning in Medicine and Biology

Low Data Drug Discovery with One-shot Learning

Democratizing Drug Discovery with DeepChem

Introduction to Deep Learning in Medicine and Biology

Deep Learning for Alzheimer Diagnostics and Decision Support

DeepCancer: Detecting Cancer through Gene Expressions via Deep Generative Learning

Towards biologically plausible deep learning

Deep Learning and Its Applications to Machine Health Monitoring: A Survey

Generating Focussed Molecule Libraries for Drug Discovery with Recurrent Neural Networks

Deep Learning Applications in Medical Imaging

Dermatologist-level classification of skin cancer with deep neural networks

Deep Learning for Health Informatics

Deep Learning for Fashion

Convolutional Neural Networks for Fashion Classification and Object Detection

DeepFashion: Powering Robust Clothes Recognition and Retrieval with Rich Annotations

Deep Learning for Fast and Accurate Fashion Item Detection

Deep Learning at GILT

Working with Fashion Models

Fashion Forward: Forecasting Visual Style in Fashion

StreetStyle: Exploring world-wide clothing styles from millions of photos


Deep Learning’s Accuracy

Benchmarks for popular CNN models

Deep Learning Benchmarks



Reweighted Wake-Sleep

Probabilistic Backpropagation for Scalable Learning of Bayesian Neural Networks

Deeply-Supervised Nets

Deep learning

On the Expressive Power of Deep Learning: A Tensor Analysis

Understanding and Predicting Image Memorability at a Large Scale

A Survey: Time Travel in Deep Learning Space: An Introduction to Deep Learning Models and How Deep Learning Models Evolved from the Initial Ideas

Towards Open Set Deep Networks

Structured Prediction Energy Networks

A Mathematical Theory of Deep Convolutional Neural Networks for Feature Extraction

Deep Neural Networks predict Hierarchical Spatio-temporal Cortical Dynamics of Human Visual Object Recognition

A Mathematical Theory of Deep Convolutional Neural Networks for Feature Extraction

Recent Advances in Convolutional Neural Networks

Understanding Deep Convolutional Networks

DeepCare: A Deep Dynamic Memory Model for Predictive Medicine

Exploiting Cyclic Symmetry in Convolutional Neural Networks

Cross-dimensional Weighting for Aggregated Deep Convolutional Features

Understanding Visual Concepts with Continuation Learning

Learning Efficient Algorithms with Hierarchical Attentive Memory

DrMAD: Distilling Reverse-Mode Automatic Differentiation for Optimizing Hyperparameters of Deep Neural Networks

Do Deep Convolutional Nets Really Need to be Deep (Or Even Convolutional)?

Harnessing Deep Neural Networks with Logic Rules

Degrees of Freedom in Deep Neural Networks

Deep Networks with Stochastic Depth

LIFT: Learned Invariant Feature Transform

Bridging the Gaps Between Residual Learning, Recurrent Neural Networks and Visual Cortex

Understanding How Image Quality Affects Deep Neural Networks

Deep Embedding for Spatial Role Labeling

Unreasonable Effectiveness of Learning Neural Nets: Accessible States and Robust Ensembles

Learning Deep Representation for Imbalanced Classification

Newtonian Image Understanding: Unfolding the Dynamics of Objects in Static Images

DeepMath - Deep Sequence Models for Premise Selection

Convolutional Neural Networks Analyzed via Convolutional Sparse Coding

Systematic evaluation of CNN advances on the ImageNet

Why does deep and cheap learning work so well?

A scalable convolutional neural network for task-specified scenarios via knowledge distillation

Alternating Back-Propagation for Generator Network

A Novel Representation of Neural Networks

Optimization of Convolutional Neural Network using Microcanonical Annealing Algorithm

Uncertainty in Deep Learning

Deep Convolutional Neural Network Design Patterns

Extensions and Limitations of the Neural GPU

Neural Functional Programming

Deep Information Propagation

Compressed Learning: A Deep Neural Network Approach

A backward pass through a CNN using a generative model of its activations

Understanding deep learning requires rethinking generalization

Local minima in training of deep networks

Learning the Number of Neurons in Deep Networks

Survey of Expressivity in Deep Neural Networks

  • intro: Presented at NIPS 2016 Workshop on Interpretable Machine Learning in Complex Systems
  • intro: Google Brain & Cornell University & Stanford University
  • arxiv:

Designing Neural Network Architectures using Reinforcement Learning

Towards Robust Deep Neural Networks with BANG

Deep Quantization: Encoding Convolutional Activations with Deep Generative Model

A Probabilistic Theory of Deep Learning

A Probabilistic Framework for Deep Learning

Paying More Attention to Attention: Improving the Performance of Convolutional Neural Networks via Attention Transfer

Risk versus Uncertainty in Deep Learning: Bayes, Bootstrap and the Dangers of Dropout

Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer

Deep Network Guided Proof Search

Neural Architecture Search with Reinforcement Learning

PathNet: Evolution Channels Gradient Descent in Super Neural Networks

Reluplex: An Efficient SMT Solver for Verifying Deep Neural Networks

The Power of Sparsity in Convolutional Neural Networks

Learning across scales - A multiscale method for Convolution Neural Networks

Stacking-based Deep Neural Network: Deep Analytic Network on Convolutional Spectral Histogram Features

A Compositional Object-Based Approach to Learning Physical Dynamics

Genetic CNN

Deep Sets

  • intro: Amazon Web Services & CMU
  • keywords: statistic estimation, point cloud classification, set expansion, and image tagging
  • arxiv:

Multiscale Hierarchical Convolutional Networks

Deep Neural Networks Do Not Recognize Negative Images

Failures of Deep Learning

Multi-Scale Dense Convolutional Networks for Efficient Prediction

Scaling the Scattering Transform: Deep Hybrid Networks

Coordinating Filters for Faster Deep Neural Networks

Deep Learning is Robust to Massive Label Noise

Input Fast-Forwarding for Better Deep Learning

Deep Mutual Learning

Methods for Interpreting and Understanding Deep Neural Networks

Automated Problem Identification: Regression vs Classification via Evolutionary Deep Networks

Revisiting Unreasonable Effectiveness of Data in Deep Learning Era

Tutorials and Surveys

On the Origin of Deep Learning

Efficient Processing of Deep Neural Networks: A Tutorial and Survey

Dive Into CNN

Structured Receptive Fields in CNNs

How ConvNets model Non-linear Transformations

Separable Convolutions / Grouped Convolutions

Factorized Convolutional Neural Networks

Design of Efficient Convolutional Layers using Single Intra-channel Convolution, Topological Subdivisioning and Spatial “Bottleneck” Structure

Deep Learning with Separable Convolutions

Xception: Deep Learning with Depthwise Separable Convolutions

Towards a New Interpretation of Separable Convolutions


MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications

MobileNets: Open-Source Models for Efficient On-Device Vision

Google’s MobileNets on the iPhone

ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices


A biological gradient descent for prediction through a combination of STDP and homeostatic plasticity

An objective function for STDP

Towards a Biologically Plausible Backprop

Target Propagation

How Auto-Encoders Could Provide Credit Assignment in Deep Networks via Target Propagation

Difference Target Propagation

Zero Shot Learning

Learning a Deep Embedding Model for Zero-Shot Learning

Zero-Shot (Deep) Learning

Zero-shot learning experiments by deep learning.

Semantic Autoencoder for Zero-Shot Learning

One Shot Learning

One-shot Learning with Memory-Augmented Neural Networks

Matching Networks for One Shot Learning

Learning feed-forward one-shot learners [NIPS 2016] [VALSE seminar]

Generative Adversarial Residual Pairwise Networks for One Shot Learning

Few-Shot Learning

Optimization as a Model for Few-Shot Learning

Incremental Learning

iCaRL: Incremental Classifier and Representation Learning

Ensemble Deep Learning

Convolutional Neural Fabrics

Stochastic Multiple Choice Learning for Training Diverse Deep Ensembles

Snapshot Ensembles: Train 1, Get M for Free

Ensemble Deep Learning

Computer Vision

A Taxonomy of Deep Convolutional Neural Nets for Computer Vision

On the usability of deep networks for object-based image analysis

Learning Recursive Filters for Low-Level Vision via a Hybrid Neural Network

DSAC - Differentiable RANSAC for Camera Localization

Optical Flow

FlowNet: Learning Optical Flow with Convolutional Networks

FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks

Optical Flow Estimation using a Spatial Pyramid Network

Guided Optical Flow Learning

All-In-One Network

HyperFace: A Deep Multi-task Learning Framework for Face Detection, Landmark Localization, Pose Estimation, and Gender Recognition

UberNet: Training a `Universal’ Convolutional Neural Network for Low-, Mid-, and High-Level Vision using Diverse Datasets and Limited Memory

An All-In-One Convolutional Neural Network for Face Analysis

  • intro: simultaneous face detection, face alignment, pose estimation, gender recognition, smile detection, age estimation and face recognition
  • arxiv:

MultiNet: Real-time Joint Semantic Reasoning for Autonomous Driving


Top Deep Learning Projects

deepnet: Implementation of some deep learning algorithms

DeepNeuralClassifier(Julia): Deep neural network using rectified linear units to classify hand written digits from the MNIST dataset

Clarifai Node.js Demo

Deep Learning in Rust

Implementation of state-of-art models in Torch

Deep Learning (Python, C, C++, Java, Scala, Go)

deepmark: THE Deep Learning Benchmarks

Siamese Net

  • intro: “This package shows how to train a siamese network using Lasagne and Theano and includes network definitions for state-of-the-art networks including: DeepID, DeepID2, Chopra et. al, and Hani et. al. We also include one pre-trained model using a custom convolutional network.”
  • github:


Deep Learning algorithms with TensorFlow: Ready to use implementations of various Deep Learning algorithms using TensorFlow

Fast Multi-threaded VGG 19 Feature Extractor

Live demo of neural network classifying images

mojo cnn: c++ convolutional neural network

DeepHeart: Neural networks for monitoring cardiac data

Deep Water: Deep Learning in H2O using Native GPU Backends

Greentea LibDNN: Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL

Dracula: A spookily good Part of Speech Tagger optimized for Twitter

Trained image classification models for Keras

PyCNN: Cellular Neural Networks Image Processing Python Library

regl-cnn: Digit recognition with Convolutional Neural Networks in WebGL

dagstudio: Directed Acyclic Graph Studio with Javascript D3

NEUGO: Neural Networks in Go

gvnn: Neural Network Library for Geometric Computer Vision

DeepForge: A development environment for deep learning

Implementation of recent Deep Learning papers

GPU-accelerated Theano & Keras on Windows 10 native

Head Pose and Gaze Direction Estimation Using Convolutional Neural Networks

Intel(R) Math Kernel Library for Deep Neural Networks (Intel(R) MKL-DNN)

Deep CNN and RNN - Deep convolution/recurrent neural network project with TensorFlow

Experimental implementation of novel neural network structures

WaterNet: A convolutional neural network that identifies water in satellite images

Kur: Descriptive Deep Learning

Development of JavaScript-based deep learning platform and application to distributed training


Readings and Questions

What you wanted to know about AI

Epoch vs iteration when training neural networks

Questions to Ask When Applying Deep Learning

How can I know if Deep Learning works better for a specific problem than SVM or random forest?

What is the difference between deep learning and usual machine learning?


Awesome Deep Learning

Awesome-deep-vision: A curated list of deep learning resources for computer vision

Applied Deep Learning Resources: A collection of research articles, blog posts, slides and code snippets about deep learning in applied settings.

Deep Learning Libraries by Language

Deep Learning Resources

Deep Learning Resources

Turing Machine: musings on theory & code(DEEP LEARNING REVOLUTION, summer 2015, state of the art & topnotch links)

BICV Group: Biologically Inspired Computer Vision research group

Learning Deep Learning

Summaries and notes on Deep Learning research papers

Deep Learning Glossary

The Deep Learning Playbook

Deep Learning Study: Study of HeXA@UNIST in Preparation for Submission

Deep Learning Books

awesome-very-deep-learning: A curated list of papers and code about very deep neural networks (50+ layers)

Deep Learning Resources and Tutorials using Keras and Lasagne

Deep Learning: Definition, Resources, Comparison with Machine Learning

Awesome - Most Cited Deep Learning Papers

The most cited papers in computer vision and deep learning

deep learning papers: A place to collect papers that are related to deep learning and computational biology




DeepLearningBibliography: Bibliography for Publications about Deep Learning using GPU

Deep Learning Papers Reading Roadmap


Deep Learning and applications in Startups, CV, Text Mining, NLP

ml4a-guides - a collection of practical resources for working with machine learning software, including code and tutorials


21 Deep Learning Videos, Tutorials & Courses on Youtube from 2016

Awesome Deep learning papers and other resources


Summaries of machine learning papers

Awesome Deep Learning Resources

Arxiv Pages

Neural and Evolutionary Computing


Computer Vision and Pattern Recognition

Arxiv Sanity Preserver

Today’s Deep Learning


DNNGraph - A deep neural network model generation DSL in Haskell

Deep playground: an interactive visualization of neural networks, written in typescript using d3.js

Neural Network Package

  • intro: This package provides an easy and modular way to build and train simple or complex neural networks using Torch
  • github:

deepdish: Deep learning and data science tools from the University of Chicago deepdish: Serving Up Chicago-Style Deep Learning

AETROS CLI: Console application to manage deep neural network training in AETROS Trainer

Deep Learning Studio: Cloud platform for designing Deep Learning AI without programming

cuda-on-cl: Build NVIDIA® CUDA™ code for OpenCL™ 1.2 devices


Deep Learning

Fundamentals of Deep Learning: Designing Next-Generation Artificial Intelligence Algorithms

FIRST CONTACT WITH TENSORFLOW: Get started with with Deep Learning programming

Make Your Own Neural Network: IPython Neural Networks on a Raspberry Pi Zero


Neural Networks and Deep Learning

Deep Learning Reading List


Andrej Karpathy blog

Rodrigob’s github page

colah’s blog

What My Deep Model Doesn’t Know…

Christoph Feichtenhofer

Image recognition is not enough: As with language, photos need contextual intelligence

ResNets, HighwayNets, and DenseNets, Oh My!

The Frontiers of Memory and Attention in Deep Learning

Design Patterns for Deep Learning Architectures

Building a Deep Learning Powered GIF Search Engine

850k Images in 24 hours: Automating Deep Learning Dataset Creation

How six lines of code + SQL Server can bring Deep Learning to ANY App

Neural Network Architectures