arXiv論文紹介
トップ
アーカイブ
RSS
アーカイブ
過去に紹介したarXiv論文の一覧
過去記事一覧
全271件中 181-190件を表示
Rubrics as Rewards: Reinforcement Learning Beyond Verifiable Domains
2507.17746v1
•
2025-07-25
•
cs.LG, cs.AI
Yume: An Interactive World Generation Model
2507.17744v1
•
2025-07-25
•
cs.AI, cs.CV, cs.LG
Yume: An Interactive World Generation Model
2507.17744v1
•
2025-07-25
•
cs.AI, cs.CV, cs.LG
Megrez2 Technical Report
2507.17728v1
•
2025-07-25
•
cs.CL, cs.AI
ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning
2507.16815v1
•
2025-07-24
•
cs.CV, cs.AI, cs.LG
MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning
2507.16812v1
•
2025-07-24
•
cs.AI, cs.LG
MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning
2507.16812v1
•
2025-07-24
•
cs.AI, cs.LG
Beyond Binary Rewards: Training LMs to Reason about Their Uncertainty
2507.16806v1
•
2025-07-24
•
cs.LG, cs.AI
Beyond Binary Rewards: Training LMs to Reason about Their Uncertainty
2507.16806v1
•
2025-07-24
•
cs.LG, cs.AI
Diffusion Beats Autoregressive in Data-Constrained Settings
2507.15857v1
•
2025-07-23
•
cs.LG, cs.AI
← 前へ
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
次へ →