greysun/NLP

NLP

natural language processing

https://mp.weixin.qq.com/s/0oc0OLPCpO4io-alxWwfRw A comprehensive collection of common NLP datasets and papers

https://liuhuanyong.github.io/ Resources and corpora

https://github.com/jikexueyuanwiki/tensorflow-zh/blob/master/SOURCE/tutorials/word2vec.md This tutorial looks at the word2vec model described by Mikolov et al.

The Natural Language Decathlon: A Multitask Challenge for NLP https://github.com/salesforce/decaNLP

Organized Resources for Deep Learning in Natural Language Processing https://github.com/astorfi/Deep-Learning-NLP

https://github.com/NTMC-Community/awaresome-neural-models-for-semantic-match A curated list of papers dedicated to neural text (semantic) matching.

https://github.com/NTMC-Community/MatchZoo MatchZoo is a toolkit for text matching. It was developed to facilitate the designing, comparing, and sharing of deep text matching models.

https://github.com/Kyubyong/nlp_tasks Natural Language Processing Tasks and References

http://tangra.cs.yale.edu/newaan/ Yale University releases TutorialBank, an NLP resource search engine that makes learning NLP easier

https://github.com/howie6879/mlhub123 A collection of machine learning and deep learning website resources (Machine Learning Resources) https://www.mlhub123.com/

Tracking Progress in Natural Language Processing https://github.com/sebastianruder/NLP-progress

Notes on deep learning for NLP https://github.com/Tixierae/deep_learning_NLP

Benchmark toolkit for Natural Language Generation (NLG) in spoken dialogue system application domains. https://github.com/shawnwun/RNNLG

CoNaLa: The Code/Natural Language Challenge https://conala-corpus.github.io/

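NLP experiments (Sogou dataset): TF-IDF, text classification, clustering, training word vectors with word2vec, document summarization, sentiment recognition, and relation extraction. https://github.com/Roshanson/TextInfoExp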

Statistical NLG for spoken dialogue systems https://github.com/UFAL-DSG/tgen

Java API for Natural Language Generation. Originally developed by Ehud Reiter at the University of Aberdeen’s Department of Computing Science and co-founder of Arria NLG. This git repo is the official SimpleNLG version. https://github.com/simplenlg/simplenlg

#summarization

A curated list of resources dedicated to text summarization https://github.com/mathsyouth/awesome-text-summarization

Tutorial on Abstractive Text Summarization https://nlgsummer.github.io/slides/Advaith_Siddharthan-Introduction_to_Summarisation.pdf

Automatic Web Article Summarizer https://github.com/jjangsangy/ExplainToMe

Automatic Keyword Extraction for Text Summarization: A Survey https://arxiv.org/abs/1704.03242

NLP for Microblog Summarization https://www.aminer.cn/conf/airs2016

Text Summarization Techniques: A Brief Survey https://arxiv.org/abs/1707.02268

A Survey on Neural Network-Based Summarization Methods https://arxiv.org/abs/1804.04589

Official version of TextTeaser. https://github.com/IndigoResearch/textteaser

A Survey on Automatic Text Summarization http://www.cs.cmu.edu/~nasmith/LS2/das-martins.07.pdf

Chinese synonyms toolkit https://github.com/huyingxi/Synonyms

DeepDive is a system to extract value from dark data. Like dark matter, dark data is the great mass of data buried in text, tables, figures, and images, which lacks structure and so is essentially unprocessable by existing software. http://deepdive.stanford.edu/index.html https://github.com/HazyResearch/deepdive https://hazyresearch.github.io/snorkel/

Well tested & Multi-language evaluation framework for text summarization. https://github.com/chakki-works/sumeval

The software used to extract structured data from Wikipedia https://github.com/dbpedia/extraction-framework

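A Lightweight Chinese Natural Language Processing Toolkit providing Chinese word segmentation, Chinese part-of-speech tagging, text correction, text-to-pinyin conversion, sentiment analysis... https://github.com/SeanLee97/xmnlp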

Jieba Chinese word segmentation https://github.com/fxsjy/jieba

Python library for processing Chinese text https://github.com/isnowfy/snownlp

Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more. https://textblob.readthedocs.io/ https://github.com/sloria/TextBlob

Simple Solution for Multi-Criteria Chinese Word Segmentation http://www.hankcs.com/nlp/segment/mul… https://github.com/hankcs/multi-criteria-cws https://arxiv.org/abs/1712.02856

A Chinese Natural Language Toolkit https://github.com/rockyzhengwu/FoolNLTK

Synonym, antonym, and negation word lists https://github.com/guotong1988/chinese_dictionary

Implementation of Word Embedding-based Antonym Detection using Thesauri and Distributional Information in NAACL2015 https://github.com/tticoin/AntonymDetection

Python Module to get Meanings, Synonyms and what not for a given word https://vocabulary.readthedocs.io/en/… https://github.com/tasdikrahman/vocabulary#wordnet-comparison

Unsupervised Morphology Induction Using Word Embeddings http://aclweb.org/anthology/N/N15/N15-1186.pdf

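13-Wanxiang Che (车万翔)-Syntactic and Semantic Analysis and Its Applications
15-Xiaojun Wan (万小军)-Automatic Text Summarization Techniques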

NLPCC2017 sample code and data description https://github.com/FudanNLP/nlpcc2017_news_headline_categorization

Text Summarization http://summarization.com/

Automatic text summarization in NLP https://weibo.com/ttarticle/p/show?id=2309404162011079564441

A collection of 2017 papers on text summarization. http://www.paperweekly.site/collections/347/papers

LCSTS: A Large Scale Chinese Short Text Summarization Dataset http://cn.arxiv.org/pdf/1506.05865

Deep Learning and applications in Startups, CV, Text Mining, NLP https://github.com/lipiji/app-dl

Code for training a Neural Open IE model (NAACL2018) https://github.com/gabrielStanovsky/supervised-oie

Topic model reading list http://bigml.cs.tsinghua.edu.cn/~jianfei/lda-reading.html

Lecture notes for probabilistic topic models using IPython notebooks https://github.com/dongwookim-ml/topic-model-lecture-note

Topic Modeling Bibliography http://qpleple.com/bib/

Text-Classification

Explore text classification methods in NLP with deep learning

Refer

https://github.com/brightmart/text_classification

https://github.com/dennybritz/cnn-text-classification-tf  (http://www.wildml.com/2015/12/implementing-a-cnn-for-text-classification-in-tensorflow/)

https://github.com/yoonkim/CNN_sentence

https://github.com/chenyuntc/PyTorchText : 1st Place Solution for Zhihu Machine Learning Challenge. Implementation of various text-classification models (first-place solution for the Zhihu Kanshan Cup). https://biendata.com/competition/zhihu/

https://github.com/gaussic/text-classification-cnn-rnn

https://www.zhihu.com/question/58863937

https://github.com/richliao/textClassifier : Text classifier for Hierarchical Attention Networks for Document Classification

https://github.com/yudake/porn_fiction_classify An article classification model trained with a text convolutional neural network (TextCNN) to detect whether an article is pornographic.

http://nadbordrozd.github.io/blog/2017/08/12/looking-for-the-text-top-model/ Looking for the Text Top Model

Paper

https://github.com/ying-wen/nn_text_representation Neural networks for text representation

https://github.com/dennybritz/deeplearning-papernotes Summaries and notes on Deep Learning research papers

http://www.aclweb.org/anthology/D14-1181

https://arxiv.org/abs/1510.03820

https://arxiv.org/pdf/1804.02063.pdf  (Few-Shot Text Classification with Pre-Trained Word Embeddings and a Human in the Loop)

https://arxiv.org/pdf/1801.06287.pdf What Does a TextCNN Learn?

https://arxiv.org/pdf/1708.04729.pdf Deconvolutional Paragraph Representation Learning

https://arxiv.org/pdf/1707.02919.pdf A Brief Survey of Text Mining: Classification, Clustering and Extraction Techniques

https://arxiv.org/pdf/1709.08716.pdf DOC: Deep Open Classification of Text Documents

https://arxiv.org/pdf/1709.08267.pdf HDLTex: Hierarchical Deep Learning for Text Classification

Tool

https://github.com/dennybritz/bella bella is a tool that helps managing, labeling and evaluating natural language datasets.

https://arxiv.org/pdf/1801.06146.pdf Fine-tuned Language Models for Text Classification
