Abstract: The widespread use of large language models (LLMs) has brought about security risks, including biases, discrimination, and ethical concerns. Reinforcement Learning from Human Feedback (RLHF) ...
Safety is one of the critical challenges in autonomous driving. Recent works address safety by implementing a safe reinforcement learning (safe RL) mechanism. However, most approaches ...