Yuu David Jinnai (jinnaiyuu)

Artificial Intelligence, Planning, Reinforcement Learning, and Text Generation

50 followers · 9 following

Pinned

CyberAgentAILab/annotation-efficient-po (Public)

Code of "Annotation-Efficient Preference Optimization for Language Model Alignment"

Python · 4 stars · 1 fork
CyberAgentAILab/regularized-bon (Public)

Code of "Regularized Best-of-N Sampling to Mitigate Reward Hacking for Language Model Alignment" (2024)

Python · 8 stars
CyberAgentAILab/model-based-mbr (Public)

Code of "Model-Based Minimum Bayes Risk Decoding for Text Generation" (2024)

Jupyter Notebook · 5 stars
CyberAgentAILab/diverse-mbr (Public)

Code of "Generating Diverse and High-Quality Texts by Minimum Bayes Risk Decoding" (2024)

Python · 2 stars
CyberAgentAILab/adaptive-mbr (Public)

Code of "Hyperparameter-Free Approach for Faster Minimum Bayes Risk Decoding" (2024)

Python · 2 stars · 1 fork
search-ja (Public)

Introduction to Heuristic Search

TeX · 18 stars · 4 forks