Enables VideoLLMs to perform complex logical reasoning in parallel with video playback, avoiding the latency of standard test-time scaling.
New Capability arxiv | Mar 13
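The core idea — reasoning interleaved with playback instead of one long chain-of-thought after the video ends — can be sketched as a streaming loop. This is a toy illustration, not the paper's method: `encode`, `think_step`, and the fixed per-frame step budget are all placeholder assumptions.

```python
def streaming_reason(frame_stream, encode, think_step, steps_per_frame=2):
    """Toy sketch: as each frame arrives, run a small fixed number of
    reasoning steps on the state so far, so per-frame latency stays bounded
    regardless of video length. `encode` and `think_step` are hypothetical
    stand-ins, not any real model API."""
    state = []
    thoughts = []
    for frame in frame_stream:
        state.append(encode(frame))
        # Bounded per-frame compute is what keeps reasoning concurrent
        # with playback rather than deferred to the end.
        for _ in range(steps_per_frame):
            thoughts.append(think_step(list(state), thoughts))
    return thoughts

# Usage with trivial stand-ins for the encoder and reasoning step:
out = streaming_reason(range(3), encode=lambda f: f * 10,
                       think_step=lambda s, t: f"step{len(t)}:saw{len(s)}")
print(out)
```

By the final frame, six reasoning steps have already been taken, whereas a standard test-time-scaling approach would only begin its chain-of-thought at that point.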
An open foundation model for humanoid robots that achieves high performance using only 30 hours of real-world robot data by pre-training on egocentric human videos.
Open Release arxiv | Mar 13
A unified streaming visual backbone that performs perception, 3D reconstruction, and robotic action simultaneously from a continuous video stream.
New Capability arxiv | Mar 13
Introduces adaptive video tokenization that allocates tokens based on scene complexity, reducing token usage by 24% while improving reconstruction quality.
Efficiency Breakthrough arxiv | Mar 13
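Complexity-based allocation can be illustrated with a minimal sketch: split a fixed token budget across frames in proportion to a simple complexity proxy. The proxy (pixel variance) and the proportional allocator here are assumptions for illustration; the paper's actual tokenizer is learned.

```python
import numpy as np

def adaptive_token_budget(frames, total_tokens=256):
    """Toy sketch: allocate tokens per frame proportionally to pixel
    variance, a crude stand-in for learned scene-complexity scoring."""
    complexity = np.array([f.var() for f in frames], dtype=float)
    if complexity.sum() == 0:
        # Degenerate case: all frames identical, split evenly.
        return np.full(len(frames), total_tokens // len(frames))
    weights = complexity / complexity.sum()
    budget = np.floor(weights * total_tokens).astype(int)
    # Give rounding leftovers to the most complex frame.
    budget[np.argmax(weights)] += total_tokens - budget.sum()
    return budget

rng = np.random.default_rng(0)
# Three synthetic frames of increasing complexity (noise scale).
frames = [rng.normal(0, s, (32, 32)) for s in (0.1, 1.0, 2.0)]
print(adaptive_token_budget(frames))
```

A flat tokenizer would give each frame the same share; here the near-static frame receives almost none, which is the intuition behind the reported token savings.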
Demonstrates that the stochasticity inherent in standard regularized model training (such as the random data splits used in cross-validation) can serve as a 'free' and effective exploration strategy for contextual bandits.
Paradigm Shift arxiv | Mar 13
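The idea that training randomness alone can drive exploration can be sketched with a bootstrapped-Thompson-style loop: each round, every arm's reward model is refit on a random subsample of its history, and the agent acts greedily on those noisy fits. The ridge model, 80% subsample rate, and toy reward setup are all illustrative assumptions, not the paper's construction.

```python
import numpy as np

def bandit_with_training_noise(contexts, reward_fn, n_arms=3, rounds=200, seed=0):
    """Toy sketch: greedy play on per-arm ridge regressions, each refit on
    a random 80% subsample of that arm's history; the subsampling noise is
    the only source of exploration."""
    rng = np.random.default_rng(seed)
    hist_x = [[] for _ in range(n_arms)]
    hist_r = [[] for _ in range(n_arms)]
    total = 0.0
    for t in range(rounds):
        x = contexts[t % len(contexts)]
        scores = []
        for a in range(n_arms):
            if len(hist_r[a]) < 2:
                scores.append(np.inf)  # cold start: try each arm briefly
                continue
            idx = rng.choice(len(hist_r[a]),
                             size=max(1, int(0.8 * len(hist_r[a]))),
                             replace=False)
            X = np.array(hist_x[a])[idx]
            r = np.array(hist_r[a])[idx]
            # Ridge fit on the subsample; resampling jitters the estimate.
            w = np.linalg.solve(X.T @ X + 0.1 * np.eye(X.shape[1]), X.T @ r)
            scores.append(float(x @ w))
        a = int(np.argmax(scores))
        r = reward_fn(x, a)
        hist_x[a].append(x)
        hist_r[a].append(r)
        total += r
    return total / rounds

# Toy linear environment: the best arm depends on the context.
contexts = [np.array([1.0, c]) for c in (-1.0, 0.0, 1.0)]
true_w = [np.array([0.2, 0.0]), np.array([0.0, 1.0]), np.array([0.0, -1.0])]
avg = bandit_with_training_noise(contexts, lambda x, a: float(x @ true_w[a]))
print(avg)
```

No epsilon-greedy or posterior-sampling machinery is added; the randomness already present in refitting does the exploring, which is the 'free' part of the claim.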