natural language processing
https://mp.weixin.qq.com/s/0oc0OLPCpO4io-alxWwfRw 自然语言处理常见数据集、论文最全整理分享 https://liuhuanyong.github.io/ 资源及语料
https://github.com/jikexueyuanwiki/tensorflow-zh/blob/master/SOURCE/tutorials/word2vec.md 在本教程我们来看一下Mikolov et al中提到的word2vec模型。
The Natural Language Decathlon: A Multitask Challenge for NLP https://github.com/salesforce/decaNLP Organized Resources for Deep Learning in Natural Language Processing https://github.com/astorfi/Deep-Learning-NLP
https://github.com/NTMC-Community/awaresome-neural-models-for-semantic-match A curated list of papers dedicated to neural text (semantic) matching. https://github.com/NTMC-Community/MatchZoo MatchZoo is a toolkit for text matching. It was developed to facilitate the designing, comparing, and sharing of deep text matching models.
https://github.com/Kyubyong/nlp_tasks Natural Language Processing Tasks and References
http://tangra.cs.yale.edu/newaan/ 耶鲁大学发布自然语言处理资源引擎TutorialBank: 让NLP学习不再困难 https://github.com/howie6879/mlhub123 机器学习&深度学习网站资源汇总(Machine Learning Resources) https://www.mlhub123.com/
Tracking Progress in Natural Language Processing https://github.com/sebastianruder/NLP-progress
Notes on deep learning for NLP https://github.com/Tixierae/deep_learning_NLP
benchmark toolkit for Natural Language Generation (NLG) in spoken dialogue system application domains. https://github.com/shawnwun/RNNLG
CoNaLa: The Code/Natural Language Challenge https://conala-corpus.github.io/
自然语言处理实验(sougou数据集),TF-IDF,文本分类、聚类,word2vec训练词向量、文档摘要、情感识别、关系抽取。 https://github.com/Roshanson/TextInfoExp
Statistical NLG for spoken dialogue systems https://github.com/UFAL-DSG/tgen
ava API for Natural Language Generation. Originally developed by Ehud Reiter at the University of Aberdeen’s Department of Computing Science and co-founder of Arria NLG. This git repo is the official SimpleNLG version. https://github.com/simplenlg/simplenlg
#summarization A curated list of resources dedicated to text summarization https://github.com/mathsyouth/awesome-text-summarization
Tutorial on Abstractive Text Summarization https://nlgsummer.github.io/slides/Advaith_Siddharthan-Introduction_to_Summarisation.pdf
Automatic Web Article Summarizer https://github.com/jjangsangy/ExplainToMe
Automatic Keyword Extraction for Text Summarization: A Survey https://arxiv.org/abs/1704.03242
NLP for Microblog Summarization https://www.aminer.cn/conf/airs2016
Text Summarization Techniques: A Brief Survey https://arxiv.org/abs/1707.02268
A Survey on Neural Network-Based Summarization Methods https://arxiv.org/abs/1804.04589
Official version of TextTeaser. https://github.com/IndigoResearch/textteaser
A Survey on Automatic Text Summarization http://www.cs.cmu.edu/~nasmith/LS2/das-martins.07.pdf
中文近义词工具包 https://github.com/huyingxi/Synonyms
DeepDive is a system to extract value from dark data. Like dark matter, dark data is the great mass of data buried in text, tables, figures, and images, which lacks structure and so is essentially unprocessable by existing software. http://deepdive.stanford.edu/index.html https://github.com/HazyResearch/deepdive https://hazyresearch.github.io/snorkel/
Well tested & Multi-language evaluation framework for text summarization. https://github.com/chakki-works/sumeval
The software used to extract structured data from Wikipedia https://github.com/dbpedia/extraction-framework
A Lightweight Chinese Natural Language Processing Toolkit,提供中文分词, 中文词性标注, 文本纠错,文本转拼音,情感分析... https://github.com/SeanLee97/xmnlp
结巴中文分词 https://github.com/fxsjy/jieba
Python library for processing Chinese text https://github.com/isnowfy/snownlp
Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more. https://textblob.readthedocs.io/ https://github.com/sloria/TextBlob
Simple Solution for Multi-Criteria Chinese Word Segmentation http://www.hankcs.com/nlp/segment/mul… https://github.com/hankcs/multi-criteria-cws https://arxiv.org/abs/1712.02856
A Chinese Nature Language Toolkit https://github.com/rockyzhengwu/FoolNLTK
同义词表,反义词表,否定词表 https://github.com/guotong1988/chinese_dictionary
Implementation of Word Embedding-based Antonym Detection using Thesauri and Distributional Information in NAACL2015 https://github.com/tticoin/AntonymDetection
Python Module to get Meanings, Synonyms and what not for a given word https://vocabulary.readthedocs.io/en/… https://github.com/tasdikrahman/vocabulary#wordnet-comparison
Unsupervised Morphology Induction Using Word Embeddings http://aclweb.org/anthology/N/N15/N15-1186.pdf
13-车万翔-句法语义分析及其应用
15-万小军-文本自动摘要技术
NLPCC2017示例代码以及数据描述 https://github.com/FudanNLP/nlpcc2017_news_headline_categorization
Text Summarization http://summarization.com/
NLP中自动生产文摘(auto text summarization) https://weibo.com/ttarticle/p/show?id=2309404162011079564441
收集2017年文本摘要相关的paper。 http://www.paperweekly.site/collections/347/papers
LCSTS: A Large Scale Chinese Short Text Summarization Dataset http://cn.arxiv.org/pdf/1506.05865
Deep Learning and applications in Startups, CV, Text Mining, NLP https://github.com/lipiji/app-dl
Code for training a Neural Open IE model (NAACL2018) https://github.com/gabrielStanovsky/supervised-oie
Topic model reading list http://bigml.cs.tsinghua.edu.cn/~jianfei/lda-reading.html
lecture notes for probabilistic topic models using ipython notebook https://github.com/dongwookim-ml/topic-model-lecture-note
Topic Modeling Bibliography http://qpleple.com/bib/
Explore text classification methods in NLP with deep learning
Refer
https://github.com/dennybritz/cnn-text-classification-tf (http://www.wildml.com/2015/12/implementing-a-cnn-for-text-classification-in-tensorflow/)
https://github.com/yoonkim/CNN_sentence
https://github.com/chenyuntc/PyTorchText :1st Place Solution for Zhihu Machine Learning Challenge . Implementation of various text-classification models.(知乎看山杯第一名解决方案) https://biendata.com/competition/zhihu/
https://www.zhihu.com/question/58863937
https://github.com/richliao/textClassifier :Text classifier for Hierarchical Attention Networks for Document Classification
https://github.com/yudake/porn_fiction_classify 利用 文本卷积神经网络 (TextCNN)训练的文章分类模型,检测是否为色情文章。
http://nadbordrozd.github.io/blog/2017/08/12/looking-for-the-text-top-model/ Looking for the Text Top Model
Paper
https://github.com/ying-wen/nn_text_representation Neural networks for text representation
https://github.com/dennybritz/deeplearning-papernotes Summaries and notes on Deep Learning research papers
http://www.aclweb.org/anthology/D14-1181
https://arxiv.org/abs/1510.03820
https://arxiv.org/pdf/1708.04729.pdf
https://arxiv.org/pdf/1804.02063.pdf (Few-Shot Text Classification with Pre-Trained Word Embeddings and a Human in the Loop)
https://arxiv.org/pdf/1801.06287.pdf What Does a TextCNN Learn?
https://arxiv.org/pdf/1708.04729.pdf Deconvolutional Paragraph Representation Learning
https://arxiv.org/pdf/1707.02919.pdf A Brief Survey of Text Mining: Classification, Clustering and extrachtion techniques
https://arxiv.org/pdf/1709.08716.pdf DOC: Deep Open Classification of Text Documents
https://arxiv.org/pdf/1709.08267.pdf HDLTex: Hierarchical Deep Learning for Text Classification Tool
https://github.com/dennybritz/bella bella is a tool that helps managing, labeling and evaluating natural language datasets.
https://arxiv.org/pdf/1801.06146.pdf Fine-tuned Language Models for Text Classification