Stars
A repo showcasing the use of MCTS with LLMs to solve GSM8K problems
Zhejiang University School of Software graduate thesis LaTeX template (unofficial), summer 2021
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
[EMNLP 2024 Findings] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs.
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Data creation, training and eval scripts for the IRCoder paper
Data and Code for Program of Thoughts (TMLR 2023)
⏰ Collaboratively track deadlines of conferences recommended by CCF (Website, Python CLI, WeChat Applet) / If you find it useful, please star this project, thanks~
Official repo for EMNLP 2023 paper "Explain-then-Translate: An Analysis on Improving Program Translation with Self-generated Explanations"
[EMNLP 2024] To Forget or Not? Towards Practical Knowledge Unlearning for Large Language Models
Some excellent or inspiring research works on the topic of curriculum learning
CodeSage: Code Representation Learning At Scale (ICLR 2024)
IDEAL: Influence-Driven Selective Annotations Empower In-Context Learners in Large Language Models
Artifact for ASE 2023 paper "On the Evaluation of Neural Code Translation: Taxonomy and Benchmark"
Artifact repository for the paper "Lost in Translation: A Study of Bugs Introduced by Large Language Models while Translating Code", In Proceedings of The 46th IEEE/ACM International Conference on …
[EMNLP 2023] CodeTransOcean: A Comprehensive Multilingual Benchmark for Code Translation
OpenChat: Advancing Open-source Language Models with Imperfect Data
LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities.
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
DeepSeek Coder: Let the Code Write Itself
OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophisticated proprietary systems like the GPT-4 Code Interpreter. …
[TMLR] A curated list of language modeling research for code (and other software engineering activities), plus related datasets.
Rigorous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024
[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
Code for cross-domain segmentation tasks that can plot t-SNE visualizations of domains and classes