Eureka Beaker - Search News

News

MM-EUREKA: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning

We present MM-Eureka-Qwen, a multimodal reasoning model that successfully extends large-scale rule-based reinforcement learning (RL) to multimodal reasoning. Compared to the previous version of ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Feedback

News

Trending now