RNN and LSTM

intro: University of Electronic Science and Technology of China & Brown University & University of Utah & XJERA LABS PTE.LTD
arxiv: https://arxiv.org/abs/1712.05134

LSTMVis

Visual Analysis of Hidden State Dynamics in Recurrent Neural Networks

homepage: http://lstm.seas.harvard.edu/
demo: http://lstm.seas.harvard.edu/client/index.html
arxiv: https://arxiv.org/abs/1606.07461
github: https://github.com/HendrikStrobelt/LSTMVis

Recurrent Memory Array Structures

arxiv: https://arxiv.org/abs/1607.03085
github: https://github.com/krocki/ArrayLSTM

Recurrent Highway Networks

author: Julian Georg Zilly, Rupesh Kumar Srivastava, Jan Koutník, Jürgen Schmidhuber
arxiv: http://arxiv.org/abs/1607.03474
github(Tensorflow+Torch): https://github.com/julian121266/RecurrentHighwayNetworks/

DeepSoft: A vision for a deep model of software

arxiv: http://arxiv.org/abs/1608.00092

Recurrent Neural Networks With Limited Numerical Precision

arxiv: http://arxiv.org/abs/1608.06902

Hierarchical Multiscale Recurrent Neural Networks

LightRNN

LightRNN: Memory and Computation-Efficient Recurrent Neural Networks

intro: NIPS 2016
arxiv: https://arxiv.org/abs/1610.09893

Full-Capacity Unitary Recurrent Neural Networks

intro: NIPS 2016
arxiv: https://arxiv.org/abs/1611.00035
github: https://github.com/stwisdom/urnn

DeepCoder: Learning to Write Programs

arxiv: https://arxiv.org/abs/1611.01989

shuttleNet: A biologically-inspired RNN with loop connection and parameter sharing

arxiv: https://arxiv.org/abs/1611.05216

Tracking the World State with Recurrent Entity Networks

intro: Facebook AI Research
arxiv: https://arxiv.org/abs/1612.03969
github(Official): https://github.com/facebook/MemNN/tree/master/EntNet-babi

Robust LSTM-Autoencoders for Face De-Occlusion in the Wild

intro: National University of Singapore & Peking University
arxiv: https://arxiv.org/abs/1612.08534

Simplified Gating in Long Short-term Memory (LSTM) Recurrent Neural Networks

arxiv: https://arxiv.org/abs/1701.03441
github: https://github.com/jingweimo/Modified-LSTM

The Statistical Recurrent Unit

intro: CMU
arxiv: https://arxiv.org/abs/1703.00381

Factorization tricks for LSTM networks

intro: ICLR 2017 Workshop
arxiv: https://arxiv.org/abs/1703.10722
github: https://github.com/okuchaiev/f-lm

Bayesian Recurrent Neural Networks

intro: UC Berkeley
arxiv: https://arxiv.org/abs/1704.02798
github: https://github.com/mirceamironenco/BayesianRecurrentNN

Fast-Slow Recurrent Neural Networks

arxiv: https://arxiv.org/abs/1705.08639
github: https://github.com/amujika/Fast-Slow-LSTM

Visualizing LSTM decisions

arxiv: https://arxiv.org/abs/1705.08153

Recurrent Additive Networks

intro: University of Washington & Allen Institute for Artificial Intelligence
arxiv: https://arxiv.org/abs/1705.07393
paper: http://www.kentonl.com/pub/llz.2017.pdf
github(PyTorch): https://github.com/bheinzerling/ran

Recent Advances in Recurrent Neural Networks

intro: University of Toronto & University of Waterloo
arxiv: https://arxiv.org/abs/1801.01078

Grow and Prune Compact, Fast, and Accurate LSTMs

arxiv: https://arxiv.org/abs/1805.11797

Projects

NeuralTalk (Deprecated): a Python+numpy project for learning Multimodal Recurrent Neural Networks that describe images with sentences

github: https://github.com/karpathy/neuraltalk

NeuralTalk2: Efficient Image Captioning code in Torch, runs on GPU

github: https://github.com/karpathy/neuraltalk2

char-rnn in Blocks

github: https://github.com/johnarevalo/blocks-char-rnn

Project: pycaffe-recurrent

code: https://github.com/kuprel/pycaffe-recurrent/

Using neural networks for password cracking

torch-rnn: Efficient, reusable RNNs and LSTMs for torch

github: https://github.com/jcjohnson/torch-rnn

Deploying a model trained with GPU in Torch into JavaScript, for everyone to use

blog: http://testuggine.ninja/blog/torch-conversion
demo: http://testuggine.ninja/DRUMPF-9000/
github: https://github.com/Darktex/char-rnn

LSTM implementation on Caffe

github: https://github.com/junhyukoh/caffe-lstm

JNN: Java Neural Network Library

intro: C2W model, LSTM-based Language Model, LSTM-based Part-Of-Speech-Tagger Model
github: https://github.com/wlin12/JNN

LSTM-Autoencoder: Seq2Seq LSTM Autoencoder

github: https://github.com/cheng6076/LSTM-Autoencoder

RNN Language Model Variations

intro: Standard LSTM, Gated Feedback LSTM, 1D-Grid LSTM
github: https://github.com/cheng6076/mlm

keras-extra: Extra Layers for Keras to connect CNN with RNN

github: https://github.com/anayebi/keras-extra

Dynamic Vanilla RNN, GRU, LSTM, 2-layer Stacked LSTM with Tensorflow Higher Order Ops

github: https://github.com/KnHuq/Dynamic_RNN_Tensorflow

PRNN: A fast implementation of recurrent neural network layers in CUDA

intro: Baidu Research
blog: https://svail.github.io/persistent_rnns/
github: https://github.com/baidu-research/persistent-rnn

min-char-rnn: Minimal character-level language model with a Vanilla Recurrent Neural Network, in Python/numpy

github: https://github.com/weixsong/min-char-rnn

rnn: Recurrent Neural Network library for Torch7’s nn

github: https://github.com/Element-Research/rnn

word-rnn-tensorflow: Multi-layer Recurrent Neural Networks (LSTM, RNN) for word-level language models in Python using TensorFlow

github: https://github.com/hunkim/word-rnn-tensorflow

tf-char-rnn: Tensorflow implementation of char-rnn

github: https://github.com/shagunsodhani/tf-char-rnn

translit-rnn: Automatic transliteration with LSTM

tf_lstm.py: Simple implementation of LSTM in Tensorflow in 50 lines (+ 130 lines of data generation and comments)

gist: https://gist.github.com/nivwusquorum/b18ce332bde37e156034e5d3f60f8a23

Handwriting generating with RNN

github: https://github.com/Arn-O/kadenze-deep-creative-apps/blob/master/final-project/glyphs-rnn.ipynb

RecNet - Recurrent Neural Network Framework

github: https://github.com/joergfranke/recnet

Blogs

Survey on Attention-based Models Applied in NLP

http://yanran.li/peppypapers/2015/10/07/survey-attention-model-1.html

Survey on Advanced Attention-based Models

http://yanran.li/peppypapers/2015/10/07/survey-attention-model-2.html

Online Representation Learning in Recurrent Neural Language Models

http://www.marekrei.com/blog/online-representation-learning-in-recurrent-neural-language-models/

Fun with Recurrent Neural Nets: One More Dive into CNTK and TensorFlow

http://esciencegroup.com/2016/03/04/fun-with-recurrent-neural-nets-one-more-dive-into-cntk-and-tensorflow/

Materials to understand LSTM

https://medium.com/@shiyan/materials-to-understand-lstm-34387d6454c1#.4mt3bzoau

Understanding LSTM and its diagrams

:star::star::star::star::star:

Persistent RNNs: 30 times faster RNN layers at small mini-batch sizes

Persistent RNNs: Stashing Recurrent Weights On-Chip

intro: Greg Diamos, Baidu Silicon Valley AI Lab
paper: http://jmlr.org/proceedings/papers/v48/diamos16.pdf
blog: http://svail.github.io/persistent_rnns/
slides: http://on-demand.gputechconf.com/gtc/2016/presentation/s6673-greg-diamos-persisten-rnns.pdf

All of Recurrent Neural Networks

https://medium.com/@jianqiangma/all-about-recurrent-neural-networks-9e5ae2936f6e#.q4s02elqg

Rolling and Unrolling RNNs

https://shapeofdata.wordpress.com/2016/04/27/rolling-and-unrolling-rnns/

Sequence prediction using recurrent neural networks (LSTM) with TensorFlow: LSTM regression using TensorFlow

LSTMs

blog: https://shapeofdata.wordpress.com/2016/06/04/lstms/

Machines and Magic: Teaching Computers to Write Harry Potter

Crash Course in Recurrent Neural Networks for Deep Learning

http://machinelearningmastery.com/crash-course-recurrent-neural-networks-deep-learning/

Understanding Stateful LSTM Recurrent Neural Networks in Python with Keras

http://machinelearningmastery.com/understanding-stateful-lstm-recurrent-neural-networks-python-keras/

Recurrent Neural Networks in Tensorflow

Written Memories: Understanding, Deriving and Extending the LSTM

http://r2rt.com/written-memories-understanding-deriving-and-extending-the-lstm.html

Attention and Augmented Recurrent Neural Networks

blog: http://distill.pub/2016/augmented-rnns/
github: https://github.com/distillpub/post--augmented-rnns

Interpreting and Visualizing Neural Networks for Text Processing

https://civisanalytics.com/blog/data-science/2016/09/22/neural-network-visualization/

A simple design pattern for recurrent deep learning in TensorFlow

RNN Spelling Correction: To crack a nut with a sledgehammer

blog: https://medium.com/@yaoyaowd/rnn-spelling-correction-to-crack-a-nut-with-a-sledgehammer-7f5aa442c08c#.mc2ycyfda

Recurrent Neural Network Gradients, and Lessons Learned Therein

blog: http://willwolf.io/en/2016/10/13/recurrent-neural-network-gradients-and-lessons-learned-therein/

A noob’s guide to implementing RNN-LSTM using Tensorflow

http://monik.in/a-noobs-guide-to-implementing-rnn-lstm-using-tensorflow/

Non-Zero Initial States for Recurrent Neural Networks

blog: http://r2rt.com/non-zero-initial-states-for-recurrent-neural-networks.html

Interpreting neurons in an LSTM network

http://yerevann.github.io/2017/06/27/interpreting-neurons-in-an-LSTM-network/

Optimizing RNN (Baidu Silicon Valley AI Lab)

Optimizing RNN performance

blog: http://svail.github.io/rnn_perf/

Optimizing RNNs with Differentiable Graphs

Resources

Awesome Recurrent Neural Networks - A curated list of resources dedicated to RNN

homepage: http://jiwonkim.org/awesome-rnn/
github: https://github.com/kjw0612/awesome-rnn

Jürgen Schmidhuber’s page on Recurrent Neural Networks

http://people.idsia.ch/~juergen/rnn.html

Reading and Questions

Are there any recurrent convolutional neural network implementations out there?

reddit: https://www.reddit.com/r/MachineLearning/comments/4chu3y/are_there_any_recurrent_convolutional_neural/
