arxiv:2501.00192
Shiyu Zhao
xiaofeng-94
AI & ML interests
None yet
Recent Activity
upvoted a paper
about 2 months ago
EPO: Entropy-regularized Policy Optimization for LLM Agents
Reinforcement Learning
authored a paper
11 months ago
MLLM-as-a-Judge for Image Safety without Human Labeling
upvoted a paper
about 1 year ago
ReferEverything: Towards Segmenting Everything We Can Speak of in Videos
Organizations
None yet