Approximation Algorithm Examples

Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning

We propose the Trust Region Preference Approximation (TRPA) algorithm ⚙️, which integrates rule-based optimization with preference-based optimization for LLM reasoning tasks 🤖🧠. As a ...

Queen Mary University of London

A polynomial-time approximation algorithm for the permanent of a matrix with non-negative entries

Mark Jerrum, Alistair Sinclair (UC Berkeley) and Eric Vigoda (Georgia Tech) received the Association for Computing Machinery (ACM) Test of Time Award at a virtual ceremony on Wednesday 23 June at the ...

IEEE

Reinforcement-Learning-Based Successive Approximation Algorithm

Abstract: This paper presents a new approach to analog-to-digital converter (ADC) for low to medium-activity signals. We integrate the concept of reinforcement learning into the successive ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning

A polynomial-time approximation algorithm for the permanent of a matrix with non-negative entries

Reinforcement-Learning-Based Successive Approximation Algorithm

Trending now