Stars
OCR Annotations from Amazon Textract for Industry Documents Library
A curated list of resources for Document Understanding (DU) topic
My best practice of training large dataset using PyTorch.
Reformer, the efficient Transformer, in Pytorch
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
code for EMNLP 2019 paper Text Summarization with Pretrained Encoders
hopefully I can continuously develop the project.
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
LeetCode Solutions: A Record of My Problem Solving Journey.( leetcode题解,记录自己的leetcode解题之路。)
h324yang / aut
Forked from archivesunleashed/autThe Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Anserini is a Lucene toolkit for reproducible information retrieval research
A curated list of network embedding techniques.
💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
Python implementation of Local Outlier Factor algorithm.
A Python implementation of the Hoeffding Tree algorithm.