Stars
A lightweight WebAssembly runtime that is fast, secure, and standards-compliant
Benchmarking Legal Knowledge of Large Language Models
Scalable Meta-Evaluation of LLMs as Evaluators
Tools for merging pretrained large language models.
ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models
Open standard for machine learning interoperability
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
[COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios
Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)
👨💻 An awesome and curated list of best code-LLM for research.
LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities.
Self-evaluating interview for AI coders