Skip to content
View h324yang's full-sized avatar
  • Thomson Reuters
  • Waterloo, ON

Organizations

@castorini

Block or report h324yang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

OCR Annotations from Amazon Textract for Industry Documents Library

Python 101 6 Updated Aug 20, 2022

Run-time type checker for Python

Python 1,573 116 Updated Nov 3, 2024

A curated list of resources for Document Understanding (DU) topic

1,341 153 Updated Jun 2, 2023

My best practice of training large dataset using PyTorch.

Python 1,091 137 Updated May 9, 2024

Reformer, the efficient Transformer, in Pytorch

Python 2,136 256 Updated Jun 21, 2023

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 137,183 27,461 Updated Jan 5, 2025

code for EMNLP 2019 paper Text Summarization with Pretrained Encoders

Python 1,289 464 Updated Jul 25, 2024

hopefully I can continuously develop the project.

Python 29 6 Updated Dec 16, 2022

The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.

Scala 139 33 Updated Feb 27, 2024

Hybrid Multi-Aspect Alignment Networks

Python 37 1 Updated Nov 2, 2020

LeetCode Solutions: A Record of My Problem Solving Journey.( leetcode题解,记录自己的leetcode解题之路。)

JavaScript 54,907 9,469 Updated Dec 10, 2024

The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.

Scala 1 Updated Jul 4, 2019

Anserini is a Lucene toolkit for reproducible information retrieval research

Java 1,039 466 Updated Jan 4, 2025
CSS 148 20 Updated Jul 1, 2022

ICE: Item Concept Embedding

C++ 90 8 Updated Nov 25, 2017

A curated list of network embedding techniques.

2,597 504 Updated Dec 8, 2020
C++ 200 80 Updated May 20, 2024

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

Python 19,148 4,661 Updated Jan 2, 2025

Let you insight into the Vue.js

JavaScript 3,639 1,229 Updated Jun 6, 2022

Python implementation of Local Outlier Factor algorithm.

Python 157 155 Updated Aug 2, 2016

A Python implementation of the Hoeffding Tree algorithm.

Python 48 17 Updated Mar 28, 2023