This doesn't implement a two-layer LSTM. The code only shows bias = 2, and that doesn't work: it does not add a second layer. As written, the code only implements a one-layer LSTM or GRU.
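For comparison, here is a minimal sketch of what stacking two LSTM layers means, in plain Python with no framework. It is not the code under discussion. All names (`make_layer`, `lstm_step`, `run_stacked_lstm`), the sizes, and the random initialization are assumptions made up for illustration. The point is that a second layer needs its own weights and its own state, and takes the first layer's hidden output as its input.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def make_layer(input_size, hidden_size, rng):
    # Hypothetical helper: one weight row per gate unit. The four gates
    # (input, forget, candidate, output) each have hidden_size rows, and every
    # row spans the concatenated [layer input, previous hidden state] vector.
    cols = input_size + hidden_size
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(cols)]
               for _ in range(4 * hidden_size)]
    bias = [0.0] * (4 * hidden_size)
    return {"W": weights, "b": bias, "hidden_size": hidden_size}


def lstm_step(layer, x, h_prev, c_prev):
    # One time step of a single LSTM layer.
    n = layer["hidden_size"]
    z = x + h_prev  # list concatenation: [input, previous hidden state]
    pre = [sum(w * v for w, v in zip(row, z)) + b
           for row, b in zip(layer["W"], layer["b"])]
    i = [sigmoid(p) for p in pre[0:n]]            # input gate
    f = [sigmoid(p) for p in pre[n:2 * n]]        # forget gate
    g = [math.tanh(p) for p in pre[2 * n:3 * n]]  # candidate cell values
    o = [sigmoid(p) for p in pre[3 * n:4 * n]]    # output gate
    c = [f[k] * c_prev[k] + i[k] * g[k] for k in range(n)]
    h = [o[k] * math.tanh(c[k]) for k in range(n)]
    return h, c


def run_stacked_lstm(layers, sequence):
    # Each layer keeps its own (h, c) state, starting at zero.
    states = [([0.0] * layer["hidden_size"], [0.0] * layer["hidden_size"])
              for layer in layers]
    outputs = []
    for x in sequence:
        layer_input = x
        for idx, layer in enumerate(layers):
            h, c = lstm_step(layer, layer_input, *states[idx])
            states[idx] = (h, c)
            layer_input = h  # the next layer consumes this layer's hidden output
        outputs.append(layer_input)
    return outputs, states


rng = random.Random(0)
# Two layers: the second layer's input size equals the first layer's hidden size.
layers = [make_layer(3, 4, rng), make_layer(4, 2, rng)]
sequence = [[rng.uniform(-1.0, 1.0) for _ in range(3)] for _ in range(5)]
outputs, states = run_stacked_lstm(layers, sequence)
```

With a single entry in `layers` this reduces to a one-layer LSTM. Changing a bias value alone never creates the second set of weights and states shown here.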