Deep Learning Applications


Published: 09 Oct 2015

Optical Flow


Published: 09 Oct 2015


Object Detection

| Method | Backbone | Test size | VOC2007 | VOC2010 | VOC2012 | ILSVRC 2013 | MSCOCO 2015 | Speed |
|---|---|---|---|---|---|---|---|---|
| OverFeat | | | | | | 24.3% | | |
| R-CNN | AlexNet | | 58.5% | 53.7% | 53.3% | 31.4% | | |
| R-CNN | VGG16 | | 66.0% | | | | | |
| SPP_net | ZF-5 | | 54.2% | | | 31.84% | | |
| DeepID-Net | | | 64.1% | | | 50.3% | | |
| NoC | | | 73.3% | | 68.8% | | | |
| Fast-RCNN | VGG16 | | 70.0% | 68.8% | 68.4% | | 19.7%(@[0.5-0.95]), 35.9%(@0.5) | |
| MR-CNN | | | 78.2% | | 73.9% | | | |
| Faster-RCNN | VGG16 | | 78.8% | | 75.9% | | 21.9%(@[0.5-0.95]), 42.7%(@0.5) | 198ms |
| Faster-RCNN | ResNet101 | | 85.6% | | 83.8% | | 37.4%(@[0.5-0.95]), 59.0%(@0.5) | |
| YOLO | | | 63.4% | | 57.9% | | | 45 fps |
| YOLO | VGG-16 | | 66.4% | | | | | 21 fps |
| YOLOv2 | | 448x448 | 78.6% | | 73.4% | | 21.6%(@[0.5-0.95]), 44.0%(@0.5) | 40 fps |
| SSD | VGG16 | 300x300 | 77.2% | | 75.8% | | 25.1%(@[0.5-0.95]), 43.1%(@0.5) | 46 fps |
| SSD | VGG16 | 512x512 | 79.8% | | 78.5% | | 28.8%(@[0.5-0.95]), 48.5%(@0.5) | 19 fps |
| SSD | ResNet101 | 300x300 | | | | | 28.0%(@[0.5-0.95]) | 16 fps |
| SSD | ResNet101 | 512x512 | | | | | 31.2%(@[0.5-0.95]) | 8 fps |
| DSSD | ResNet101 | 300x300 | | | | | 28.0%(@[0.5-0.95]) | 8 fps |
| DSSD | ResNet101 | 500x500 | | | | | 33.2%(@[0.5-0.95]) | 6 fps |
| ION | | | 79.2% | | 76.4% | | | |
| CRAFT | | | 75.7% | | 71.3% | 48.5% | | |
| OHEM | | | 78.9% | | 76.3% | | 25.5%(@[0.5-0.95]), 45.9%(@0.5) | |
| R-FCN | ResNet50 | | 77.4% | | | | | 0.12sec(K40), 0.09sec(TitanX) |
| R-FCN | ResNet101 | | 79.5% | | | | | 0.17sec(K40), 0.12sec(TitanX) |
| R-FCN(ms train) | ResNet101 | | 83.6% | | 82.0% | | 31.5%(@[0.5-0.95]), 53.2%(@0.5) | |
| PVANet 9.0 | | | 84.9% | | 84.2% | | | 750ms(CPU), 46ms(TitanX) |
| RetinaNet | ResNet101-FPN | | | | | | | |
| Light-Head R-CNN | Xception* | 800/1200 | | | | | 31.5%@[0.5:0.95] | 95 fps |
| Light-Head R-CNN | Xception* | 700/1100 | | | | | 30.7%@[0.5:0.95] | 102 fps |
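
A note on reading the figures above: in the MSCOCO column, `@0.5` is average precision at an intersection-over-union (IoU) threshold of 0.5, and `@[0.5-0.95]` is the average over IoU thresholds from 0.50 to 0.95 in steps of 0.05. The Speed column mixes per-image latency (ms, sec) with throughput (fps). The Python sketch below only illustrates these two conventions; the helper names (`iou`, `thresholds_met`, `latency_to_fps`) and the `(x_min, y_min, x_max, y_max)` box layout are assumptions made for this example, not part of any benchmark toolkit.

```python
from typing import List, Tuple

# Assumed box layout for this sketch: (x_min, y_min, x_max, y_max).
Box = Tuple[float, float, float, float]

# IoU thresholds 0.50, 0.55, ..., 0.95 used by the averaged COCO metric.
COCO_IOU_THRESHOLDS: List[float] = [round(0.5 + 0.05 * i, 2) for i in range(10)]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def thresholds_met(pred: Box, gt: Box) -> List[float]:
    """IoU thresholds at which `pred` would count as a match for `gt`."""
    overlap = iou(pred, gt)
    return [t for t in COCO_IOU_THRESHOLDS if overlap >= t]


def latency_to_fps(seconds_per_image: float) -> float:
    """Convert a per-image latency in seconds to frames per second."""
    return 1.0 / seconds_per_image


if __name__ == "__main__":
    print(thresholds_met((0, 0, 10, 10), (0, 0, 10, 7.2)))
    print(latency_to_fps(0.198))  # the 198ms entry, expressed in fps
```

Converting latencies this way puts every row on one scale, but the timings were reported on different hardware (K40, TitanX, CPU), so the resulting numbers are still not directly comparable.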

Published: 09 Oct 2015

Object Counting

Published: 09 Oct 2015

Natural Language Processing


Practical Neural Networks for NLP

Structured Neural Networks for NLP: From Idea to Code

Understanding Deep Learning Models in NLP

Deep learning for natural language processing, Part 1

Neural Models

Unifying Visual-Semantic Embeddings with Multimodal Neural Language Models

Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks

Visualizing and Understanding Neural Models in NLP

Character-Aware Neural Language Models

Skip-Thought Vectors

A Primer on Neural Network Models for Natural Language Processing

Neural Variational Inference for Text Processing

Sequence to Sequence Learning

Generating Text with Deep Reinforcement Learning

MUSIO: A Deep Learning based Chatbot Getting Smarter


Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation

Neural Machine Translation by Jointly Learning to Align and Translate

Multi-Source Neural Translation

Multi-Way, Multilingual Neural Machine Translation with a Shared Attention Mechanism

Modeling Coverage for Neural Machine Translation

A Character-level Decoder without Explicit Segmentation for Neural Machine Translation

NEMATUS: Attention-based encoder-decoder model for neural machine translation

Variational Neural Machine Translation

Neural Network Translation Models for Grammatical Error Correction

Linguistic Input Features Improve Neural Machine Translation

Sequence-Level Knowledge Distillation

Neural Machine Translation: Breaking the Performance Plateau

Tips on Building Neural Machine Translation Systems

Semi-Supervised Learning for Neural Machine Translation

EUREKA-MangoNMT: A C++ toolkit for neural machine translation for CPU

Deep Character-Level Neural Machine Translation

Neural Machine Translation Implementations

Google’s Neural Machine Translation System: Bridging the Gap between Human and Machine Translation

Learning to Translate in Real-time with Neural Machine Translation

Is Neural Machine Translation Ready for Deployment? A Case Study on 30 Translation Directions

Fully Character-Level Neural Machine Translation without Explicit Segmentation

Navigational Instruction Generation as Inverse Reinforcement Learning with Neural Machine Translation

Neural Machine Translation in Linear Time

Neural Machine Translation with Reconstruction

A Convolutional Encoder Model for Neural Machine Translation

Toward Multilingual Neural Machine Translation with Universal Encoder and Decoder

MXNMT: MXNet based Neural Machine Translation

Doubly-Attentive Decoder for Multi-modal Neural Machine Translation

Massive Exploration of Neural Machine Translation Architectures

Depthwise Separable Convolutions for Neural Machine Translation

Deep Architectures for Neural Machine Translation

Marian: Fast Neural Machine Translation in C++



Extraction of Salient Sentences from Labelled Documents

A Neural Attention Model for Abstractive Sentence Summarization

A Convolutional Attention Network for Extreme Summarization of Source Code

Abstractive Text Summarization Using Sequence-to-Sequence RNNs and Beyond

textsum: Text summarization with TensorFlow

How to Run Text Summarization with TensorFlow

Reading Comprehension

Text Comprehension with the Attention Sum Reader Network

Text Understanding with the Attention Sum Reader Network

A Thorough Examination of the CNN/Daily Mail Reading Comprehension Task

Consensus Attention-based Neural Networks for Chinese Reading Comprehension

Separating Answers from Queries for Neural Reading Comprehension

Attention-over-Attention Neural Networks for Reading Comprehension

Teaching Machines to Read and Comprehend CNN News and Children Books using Torch

Reasoning with Memory Augmented Neural Networks for Language Comprehension

Bidirectional Attention Flow for Machine Comprehension

NewsQA: A Machine Comprehension Dataset

Gated-Attention Readers for Text Comprehension

Get To The Point: Summarization with Pointer-Generator Networks

Language Understanding

Recurrent Neural Networks with External Memory for Language Understanding

Neural Semantic Encoders

Neural Tree Indexers for Text Understanding

Better Text Understanding Through Image-To-Text Transfer

Text Classification

Convolutional Neural Networks for Sentence Classification

Recurrent Convolutional Neural Networks for Text Classification

Character-level Convolutional Networks for Text Classification

A C-LSTM Neural Network for Text Classification

Rationale-Augmented Convolutional Neural Networks for Text Classification

Text classification using DIGITS and Torch7

Recurrent Neural Network for Text Classification with Multi-Task Learning

Deep Multi-Task Learning with Shared Memory

Virtual Adversarial Training for Semi-Supervised Text Classification

Adversarial Training Methods for Semi-Supervised Text Classification

Sentence Convolution Code in Torch: Text classification using a convolutional neural network

Bag of Tricks for Efficient Text Classification

Actionable and Political Text Classification using Word Embeddings and LSTM

Implementing a CNN for Text Classification in TensorFlow

fancy-cnn: Multiparadigm Sequential Convolutional Neural Networks for text classification

Convolutional Neural Networks for Text Categorization: Shallow Word-level vs. Deep Character-level

Tweet Classification using RNN and CNN

Hierarchical Attention Networks for Document Classification

AC-BLSTM: Asymmetric Convolutional Bidirectional LSTM Networks for Text Classification

Generative and Discriminative Text Classification with Recurrent Neural Networks

Adversarial Multi-task Learning for Text Classification

Deep Text Classification Can be Fooled

Deep neural network framework for multi-label text classification

Multi-Task Label Embedding for Text Classification

Text Clustering

Self-Taught Convolutional Neural Networks for Short Text Clustering


Aligning Books and Movies: Towards Story-like Visual Explanations by Watching Movies and Reading Books


Visual Dialog

Papers, code and data from FAIR for various memory-augmented nets with application to text understanding and dialogue.

Neural Emoji Recommendation in Dialogue Systems

Memory Networks

Neural Turing Machines

Memory Networks

End-To-End Memory Networks

Reinforcement Learning Neural Turing Machines - Revised

Learning to Transduce with Unbounded Memory

How to Code and Understand DeepMind’s Neural Stack Machine

Ask Me Anything: Dynamic Memory Networks for Natural Language Processing

Ask Me Even More: Dynamic Memory Tensor Networks (Extended Model)

Structured Memory for Neural Turing Machines

Dynamic Memory Networks for Visual and Textual Question Answering

Neural GPUs Learn Algorithms

Hierarchical Memory Networks

Convolutional Residual Memory Networks

NTM-Lasagne: A Library for Neural Turing Machines in Lasagne

Evolving Neural Turing Machines for Reward-based Learning

Hierarchical Memory Networks for Answer Selection on Unknown Words

Gated End-to-End Memory Networks

Can Active Memory Replace Attention?

A Taxonomy for Neural Memory Networks


Globally Normalized Transition-Based Neural Networks

A Decomposable Attention Model for Natural Language Inference

Improving Recurrent Neural Networks For Sequence Labelling

Recurrent Memory Networks for Language Modeling

Tweet2Vec: Learning Tweet Embeddings Using Character-level CNN-LSTM Encoder-Decoder

Learning text representation using recurrent convolutional neural network with highway layers

Ask the GRU: Multi-task Learning for Deep Text Recommendations

From phonemes to images: levels of representation in a recurrent neural model of visually-grounded language learning

Visualizing Linguistic Shift

A Joint Many-Task Model: Growing a Neural Network for Multiple NLP Tasks

Deep Learning applied to NLP

Attention Is All You Need

Recent Trends in Deep Learning Based Natural Language Processing

HotFlip: White-Box Adversarial Examples for NLP

No Metrics Are Perfect: Adversarial Reward Learning for Visual Storytelling

Interesting Applications

Data-driven HR - Résumé Analysis Based on Natural Language Processing and Machine Learning

sk_p: a neural program corrector for MOOCs

Neural Generation of Regular Expressions from Natural Language with Minimal Domain Knowledge

emoji2vec: Learning Emoji Representations from their Description

Inside-Outside and Forward-Backward Algorithms Are Just Backprop (Tutorial Paper)

Cruciform: Solving Crosswords with Natural Language Processing

Smart Reply: Automated Response Suggestion for Email

Deep Learning for RegEx

Learning Python Code Suggestion with a Sparse Pointer Network

End-to-End Prediction of Buffer Overruns from Raw Source Code via Neural Memory Networks

Convolutional Sequence to Sequence Learning

DeepFix: Fixing Common C Language Errors by Deep Learning

Hierarchically-Attentive RNN for Album Summarization and Storytelling


TheanoLM - An Extensible Toolkit for Neural Network Language Modeling

NLP-Caffe: natural language processing with Caffe

DL4NLP: Deep Learning for Natural Language Processing

Combining CNN and RNN for spoken language identification

Character-Aware Neural Language Models: LSTM language model with CNN over characters in TensorFlow

Neural Relation Extraction with Selective Attention over Instances

deep-simplification: Text simplification using RNNs

lamtram: A toolkit for language and translation modeling using neural networks

Lango: Language Lego

Sequence-to-Sequence Learning with Attentional Neural Networks

harvardnlp code

Seq2seq: Sequence to Sequence Learning with Keras

debug seq2seq

Recurrent & convolutional neural network modules


Datasets for Natural Language Processing


How to read: Character level deep learning

Heavy Metal and Natural Language Processing

Sequence To Sequence Attention Models In PyCNN

Source Code Classification Using Deep Learning

My Process for Learning Natural Language Processing with Deep Learning

Convolutional Methods for Text


Word2Vec Tutorial - The Skip-Gram Model

Word2Vec Tutorial Part 2 - Negative Sampling

Word2Vec Resources

Demos - Deep Learning for Answering Questions about Images

Talks / Videos

Navigating Natural Language Using Reinforcement Learning


So, you need to understand language data? Open-source NLP software can help!

Curated list of resources on building bots

Notes for deep learning on NLP

Published: 09 Oct 2015

Neural Architecture Search


Published: 09 Oct 2015

LiDAR 3D Object Detection


Published: 09 Oct 2015