OpenMOSS-Team/DiRL-8B-Instruct
8B
ā¢
Updated
ā¢
32
ā¢
9
LLM
Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs
SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models