arXiv Paper Introductions
Archive
List of previously introduced arXiv papers
Past articles
Showing 21-30 of 267 entries
Large Language Model Hacking: Quantifying the Hidden Risks of Using LLMs for Text Annotation
2509.08825 • 2025-09-12 • cs.CL, cs.AI
Merge-of-Thought Distillation
2509.08814 • 2025-09-12 • cs.LG, cs.AI
Evaluating LLMs Without Oracle Feedback: Agentic Annotation Evaluation Through Unsupervised Consistency Signals
2509.08809 • 2025-09-12 • cs.CL, cs.LG
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning
2509.07980v1 • 2025-09-11 • cs.LG, cs.AI
Visual Representation Alignment for Multimodal Large Language Models
2509.07979v1 • 2025-09-11 • cs.CV, cs.AI
Customizing the Inductive Biases of Softmax Attention using Structured Matrices
2509.07963v1 • 2025-09-11 • cs.LG, cs.AI
CAViAR: Critic-Augmented Video Agentic Reasoning
2509.07680v1 • 2025-09-11 • cs.CV, cs.AI
Deep Reactive Policy: Learning Reactive Manipulator Motion Planning for Dynamic Environments
2509.06953v1 • 2025-09-10 • cs.RO, cs.AI
F1: A Vision-Language-Action Model Bridging Understanding and Generation to Actions
2509.06951v1 • 2025-09-10 • cs.CV, cs.AI, cs.LG
Beyond Two-Stage Training: Cooperative SFT and RL for LLM Reasoning
2509.06948v1 • 2025-09-10 • cs.CL, cs.LG, cs.AI