过好每一秒

2020-06-12 09:23 已编辑微博_广告算法_算法工程师

关注

Tensorflow实现的深度NLP模型集锦

本文转自：

https://www.toutiao.com/a6685688607191073294/

本文约2000字，建议阅读5分钟。

本文收集整理了一批基于Tensorflow实现的深度学习/机器学习的深度NLP模型。

收集整理了一批基于Tensorflow实现的深度学习/机器学习的深度NLP模型。

基于Tensorflow的自然语言处理模型，为自然语言处理问题收集机器学习和Tensorflow深度学习模型，100%Jupeyter NoteBooks且内部代码极为简洁。

资源整理自网络，源地址：

https://github.com/huseinzol05

Text classification
Chatbot
Neural Machine Translation
Embedded
Entity-Tagging
POS-Tagging
Dependency-Parser
Question-Answers
Supervised Summarization
Unsupervised Summarization
Stemming
Generator
Language detection
OCR (optical character recognition)
Speech to Text
Text to Speech
Text Similarity
Miscellaneous
Attention

目标

原始的实现稍微有点复杂，对于初学者来说有点难。所以我尝试将其中大部分内容简化，同时，还有很多论文的内容亟待实现，一步一步来。

内容

文本分类：

链接：

https://github.com/huseinzol05/NLP-Models-Tensorflow/tree/master/text-classification

1. Basic cell RNN

2. Bidirectional RNN

3. LSTM cell RNN

4. GRU cell RNN

5. LSTM RNN + Conv2D

6. K-max Conv1d

7. LSTM RNN + Conv1D + Highway

8. LSTM RNN with Attention

9. Neural Turing Machine

10. Seq2Seq

11. Bidirectional Transformers

12. Dynamic Memory Network

13. Residual Network using Atrous CNN + Bahdanau Attention

14. Transformer-XL

完整列表包含(66 notebooks)

聊天机器人：

链接：

https://github.com/huseinzol05/NLP-Models-Tensorflow/tree/master/chatbot

1. Seq2Seq-manual

2. Seq2Seq-API Greedy

3. Bidirectional Seq2Seq-manual

4. Bidirectional Seq2Seq-API Greedy

5. Bidirectional Seq2Seq-manual + backward Bahdanau + forward Luong

6. Bidirectional Seq2Seq-API + backward Bahdanau + forward Luong + Stack Bahdanau Luong Attention + Beam Decoder

7. Bytenet

8. Capsule layers + LSTM Seq2Seq-API + Luong Attention + Beam Decoder

9. End-to-End Memory Network

10. Attention is All you need

11. Transformer-XL + LSTM

12. GPT-2 + LSTM

完整列表包含(51 notebooks)

机器翻译（英语到越南语）：

链接：

https://github.com/huseinzol05/NLP-ModelsTensorflow/tree/master/neural-machine-translation

1. Seq2Seq-manual

2. Seq2Seq-API Greedy

3. Bidirectional Seq2Seq-manual

4. Bidirectional Seq2Seq-API Greedy

5. Bidirectional Seq2Seq-manual + backward Bahdanau + forward Luong

6. Bidirectional Seq2Seq-API + backward Bahdanau + forward Luong + Stack Bahdanau Luong Attention + Beam Decoder

7. Bytenet

8. Capsule layers + LSTM Seq2Seq-API + Luong Attention + Beam Decoder

9. End-to-End Memory Network

10. Attention is All you need

完整列表包含(49 notebooks)

词向量：

链接：

https://github.com/huseinzol05/NLP-Models-Tensorflow/tree/master/embedded

1. Word Vector using CBOW sample softmax

2. Word Vector using CBOW noise contrastive estimation

3. Word Vector using skipgram sample softmax

4. Word Vector using skipgram noise contrastive estimation

5. Lda2Vec Tensorflow

6. Supervised Embedded

7. Triplet-loss + LSTM

8. LSTM Auto-Encoder

9. Batch-All Triplet-loss LSTM

10. Fast-text

11. ELMO (biLM)

词性标注：

链接：

https://github.com/huseinzol05/NLP-Models-Tensorflow/tree/master/pos-tagging

1. Bidirectional RNN + Bahdanau Attention + CRF

2. Bidirectional RNN + Luong Attention + CRF

3. Bidirectional RNN + CRF

实体识别：

链接：

https://github.com/huseinzol05/NLP-Models-Tensorflow/tree/master/entity-tagging

1. Bidirectional RNN + Bahdanau Attention + CRF

2. Bidirectional RNN + Luong Attention + CRF

3. Bidirectional RNN + CRF

4. Char Ngrams + Bidirectional RNN + Bahdanau Attention + CRF

5. Char Ngrams + Residual Network + Bahdanau Attention + CRF

依存分析：

链接：

https://github.com/huseinzol05/NLP-ModelsTensorflow/tree/master/dependency-parser

1. Bidirectional RNN + Bahdanau Attention + CRF

2. Bidirectional RNN + Luong Attention + CRF

3. Residual Network + Bahdanau Attention + CRF

4. Residual Network + Bahdanau Attention + Char Embedded + CRF

问答：

链接：

https://github.com/huseinzol05/NLP-Models-Tensorflow/tree/master/question-answer

1. End-to-End Memory Network + Basic cell

2. End-to-End Memory Network + GRU cell

3. End-to-End Memory Network + LSTM cell

词干抽取：

链接：

https://github.com/huseinzol05/NLP-Models-Tensorflow/tree/master/stemming

1. LSTM + Seq2Seq + Beam

2. GRU + Seq2Seq + Beam

3. LSTM + BiRNN + Seq2Seq + Beam

4. GRU + BiRNN + Seq2Seq + Beam

5. DNC + Seq2Seq + Greedy

有监督摘要抽取：

链接：

https://github.com/huseinzol05/NLP-Models-Tensorflow/tree/master/summarization

1. LSTM Seq2Seq using topic modelling

2. LSTM Seq2Seq + Luong Attention using topic modelling

3. LSTM Seq2Seq + Beam Decoder using topic modelling

4. LSTM Bidirectional + Luong Attention + Beam Decoder using topic modelling

5. LSTM Seq2Seq + Luong Attention + Pointer Generator

6. Bytenet

无监督摘要抽取:

链接：

https://github.com/huseinzol05/NLP-Models-Tensorflow/tree/master/unsupervised-summarization

1. Skip-thought Vector (unsupervised)

2. Residual Network using Atrous CNN (unsupervised)

3. Residual Network using Atrous CNN + Bahdanau Attention (unsupervised)

OCR (字符识别):

链接：

https://github.com/huseinzol05/NLP-Models-Tensorflow/tree/master/ocr

1. CNN + LSTM RNN

语音识别:

链接：

https://github.com/huseinzol05/NLP-Models-Tensorflow/tree/master/speech-to-text

1. Tacotron

2. Bidirectional RNN + Greedy CTC

3. Bidirectional RNN + Beam CTC

4. Seq2Seq + Bahdanau Attention + Beam CTC

5. Seq2Seq + Luong Attention + Beam CTC

6. Bidirectional RNN + Attention + Beam CTC

7. Wavenet

语音合成:

链接：

https://github.com/huseinzol05/NLP-Models-Tensorflow/tree/master/text-to-speech

1. Tacotron

2. Wavenet

3. Seq2Seq + Luong Attention

4. Seq2Seq + Bahdanau Attention

生成器:

链接：

https://github.com/huseinzol05/NLP-Models-Tensorflow/tree/master/generator

1. Character-wise RNN + LSTM

2. Character-wise RNN + Beam search

3. Character-wise RNN + LSTM + Embedding

4. Word-wise RNN + LSTM

5. Word-wise RNN + LSTM + Embedding

6. Character-wise + Seq2Seq + GRU

7. Word-wise + Seq2Seq + GRU

8. Character-wise RNN + LSTM + Bahdanau Attention

9. Character-wise RNN + LSTM + Luong Attention

语言检测:

链接：

https://github.com/huseinzol05/NLP-Models-Tensorflow/tree/master/language-detection

1. Fast-text Char N-Grams

文本相似性:

链接：

https://github.com/huseinzol05/NLP-Models-Tensorflow/tree/master/text-similarity

1. Character wise similarity + LSTM + Bidirectional

2. Word wise similarity + LSTM + Bidirectional

3. Character wise similarity Triplet loss + LSTM

4. Word wise similarity Triplet loss + LSTM

注意力机制:

链接：

https://github.com/huseinzol05/NLP-Models-Tensorflow/tree/master/attention

1. Bahdanau

2. Luong

3. Hierarchical

4. Additive

5. Soft

6. Attention-over-Attention

7. Bahdanau API

8. Luong API

其他:

链接：

https://github.com/huseinzol05/NLP-Models-Tensorflow/tree/master/misc

1. Attention heatmap on Bahdanau Attention

2. Attention heatmap on Luong Attention

非深度学习：

链接：

https://github.com/huseinzol05/NLP-Models-Tensorflow/tree/master/not-deep-learning

1. Markov chatbot

2. Decomposition summarization (3 notebooks)

编辑：王菁

校对：林亦霖

— 完 —

全部评论

推荐最新楼层

11-27 12:43

已编辑

门头沟学院 C++

大家目前秋招的情况怎么样了

截止到目前为止，大家是否已经结束掉秋招，拿到了什么offer，或者还在继续秋招，欢迎在评论区留言

阿杰阿杰：拿了offer，患上了间断刷牛客的瘾，比刷抖音还严重

点赞评论收藏

11-29 18:33

CVTE_运营管理_HR

拿完offer再挑

目前池子里已经有很多同学泡着了，各个学校的三方也已经陆续开放，但是并不代表着签完三方就结束了，很多同学还会有违约考虑国企央企，所以10月到11月之间对于大部分同学来讲，还是没办法完全放松（已经拿到满意的offer的另说） ，还没有投的同学最后都投上吧，春招hc都不一定有秋招中期的剩下的hc多，直入主题，大家都关注的问题cvte加不加班？加还是加的，但是看部门，软硬件基本都是8点钟走其他的一些文职部门我看下班还是六点准时的而且不是强制，你事情做完可以走的，全天不打卡工资待遇怎么样？待遇还是挺不错，校招普遍都是10k以上，目前我见过最高的是18k的。福利待遇有哪些？五险一金啥的不用说了。住房公积...

CVTE公司福利 270人发布

点赞评论收藏

10-31 19:24

门头沟学院硬件开发

大疆寄

太丑陋了。

耀孝女：就是你排序挂了

点赞评论收藏

10-25 14:31

长沙民政职业技术学院前端工程师

一个offer都没有麻了啊

求大佬们给点建议，孩子听劝😭

Bug压路：老哥看得出来你是想多展示一些项目，但好像一般最多两个就够了😂页数一般一页，多的也就2页；这些项目应该是比较同质化的，和评论区其他大佬一样，我也觉得应该展示一些最拿手的（质量>数量）😁😁😁专业技能部分也可以稍微精简一些

点赞评论收藏

不愿透露姓名的神秘牛友

11-29 12:19

pdd offer沟通会

要开奖了，期待ing

拼多多求职进展汇总

点赞评论收藏

全站热榜

正在热议

# 如果你有一天可以担任公司的CEO，你会做哪三件事？ #

# 如果公司给你放一天假，你会怎么度过？ #

# 参加完秋招的机械人，还参加春招吗？ #

20165次浏览 240人参与

# 我的上岸简历长这样 #

452111次浏览 8090人参与