Open Release
57 papers
OpenSanctions Pairs releases a massive benchmark for entity matching, proving that local LLMs can now match production rule-based systems in high-stakes compliance tasks.
AI & ML arxiv | Mar 13
Tiny Aya is a 3.35B parameter multilingual model that achieves state-of-the-art results across 70 languages, challenging the need for massive scale in global AI.
AI & ML arxiv | Mar 13
Introduces the first billion-scale SAR vision foundation model and a massive unified benchmark for all-weather geospatial semantic segmentation.
AI & ML arxiv | Mar 13
An open foundation model for humanoid robots that achieves high performance using only 30 hours of real-world robot data by pre-training on egocentric human videos.
AI & ML arxiv | Mar 13
Surg-R1 is a specialized surgical reasoning model released alongside the largest surgical Chain-of-Thought dataset (320,000 pairs).
AI & ML arxiv | Mar 16
Releases Feynman, an agentic pipeline and 100k-sample dataset for generating high-quality, knowledge-rich diagrams with grounded captions.
AI & ML arxiv | Mar 16
Introduces the largest-ever multi-modal CAD dataset with 10 million annotations for 1 million models to enable geometric deep learning on BRep data.
AI & ML arxiv | Mar 16
Introduces a unified evaluation harness for Vision-Language-Action (VLA) models that standardizes disparate protocols and exposes hidden flaws in published SOTA models.
AI & ML arxiv | Mar 17
Releases an 11-billion example dataset and model (RealVLG-R1) for unified real-world visual-language grounding and robotic manipulation.
AI & ML arxiv | Mar 17
Releases a million-scale human preference dataset (29M pairs) specifically for text-to-image editing tasks.
AI & ML arxiv | Mar 17
Tagarela releases 8,972 hours of high-quality Portuguese podcast audio, rivaling the scale of GigaSpeech for English.
AI & ML arxiv | Mar 17
Democratizes the development of 'Deep Search' agents by open-sourcing the specialized training data and trajectory synthesis methods.
AI & ML arxiv | Mar 17
Kamino is a massively parallel GPU physics solver that natively supports complex kinematic loops and multi-body systems.
AI & ML arxiv | Mar 18
IQuest-Coder-V1 introduces a series of high-performance code models including a unique 'Loop' variant with a recurrent mechanism for efficiency.
AI & ML arxiv | Mar 18
SurgΣ is a massive open-source release of 5.98M multimodal conversations and foundation models for surgical intelligence.
AI & ML arxiv | Mar 18
The first dedicated foundation model for electrodermal activity (EDA) data, released alongside the largest public dataset for physiological signal modeling.
AI & ML arxiv | Mar 19
Democratizes dexterous robot data collection by enabling high-fidelity 21-DoF teleoperation using only a standard smartphone.
AI & ML arxiv | Mar 19
Introduces FineViT and a 450M local caption dataset to solve the 'coarse perception' bottleneck in current CLIP-based encoders.
AI & ML arxiv | Mar 19
SpecForge provides an open-source framework and high-quality draft models (SpecBundle) to make speculative decoding production-ready.
AI & ML arxiv | Mar 20
OpenT2M is a massive open-source motion dataset (2,800+ hours) that addresses data starvation in text-to-motion generation.
AI & ML arxiv | Mar 20
An open release of a multilingual embedding family (80M to 14B) covering 200+ languages and ranking first on 11 MTEB benchmarks.
AI & ML arxiv | Mar 20
Releases an offline search-and-browse pipeline with 97K long-horizon trajectories for training 'Deep Research' agents.
AI & ML arxiv | Mar 24
AgentComm-Bench is the first benchmark to stress-test cooperative embodied AI under realistic wireless impairments like packet loss and bandwidth collapse.
AI & ML arxiv | Mar 24
ScaleEdit-12M is the largest open-source image editing dataset, democratizing high-quality, instruction-based editing data previously limited to proprietary models.
AI & ML arxiv | Mar 24
An open-source family of language models for Kazakh that outperforms much larger multilingual models by using a language-specific tokenizer.
AI & ML arxiv | Mar 24
CLT-Forge democratizes mechanistic interpretability by providing an end-to-end library for training Cross-Layer Transcoders and generating feature attribution graphs.
AI & ML arxiv | Mar 24
LongCat-Flash-Prover is a 560B MoE model that sets a new SOTA for open-weights formal reasoning, achieving a 97.1% pass rate on MiniF2F-Test.
AI & ML arxiv | Mar 24
Open-sources a high-fidelity foundation model that jointly generates synchronized video and audio using a unified single-stream Transformer.
AI & ML arxiv | Mar 24
Releases the first large-scale family of learned sparse retrieval (LSR) models specialized for code (up to 8B parameters).
AI & ML arxiv | Mar 24
Releases the hardware design and training environment for MEVIUS2, an open-source, Spot-scale quadruped robot.
AI & ML arxiv | Mar 24
An open foundation suite for universal dexterous robot control trained on over 50k trajectories across eight different robotic hand architectures.
AI & ML arxiv | Mar 24
Introduces the first high-performing open-source metric for per-sample AI music quality evaluation.
AI & ML arxiv | Mar 25
Provides a massive 2.5M image-to-TikZ dataset and the first instruction-augmented dataset for geometric visual reasoning.
AI & ML arxiv | Mar 25
Berta is an open-source, production-proven AI clinical scribe that reduces operating costs by up to 95% compared to commercial alternatives.
AI & ML arxiv | Mar 26
BioVITA releases a massive multimodal biological dataset of 3.6M image-audio-text samples covering 14,000 species.
AI & ML arxiv | Mar 26
Releases a high-quality, 92K-sentence parallel dataset for Hindi-Sanskrit translation focusing on contemporary and spoken language.
AI & ML arxiv | Mar 26
Releases 55 hours of continuous 30fps expert human computer-use videos to address the 'missing ingredient' for desktop automation agents.
AI & ML arxiv | Mar 26
VFIG enables high-fidelity conversion of rasterized technical figures into editable, scalable SVGs using a new 66K-pair dataset.
AI & ML arxiv | Mar 26
Releases weights for LEMON, a foundation model for single-cell nuclear morphology trained on millions of pathology images.
AI & ML arxiv | Mar 30
The first large-scale benchmark for LLM agents based on years of authentic, cross-domain user behavioral data rather than synthetic personas.
AI & ML arxiv | Mar 30
Releases DataFlex, a unified open-source framework for data-centric dynamic training (selection, mixture, and reweighting) for LLMs.
AI & ML arxiv | Mar 30
Releases Ruka-v2, a fully open-source, 13-DOF tendon-driven humanoid hand with wrist and finger abduction, buildable for under $1,300.
AI & ML arxiv | Mar 30
Releases a massive 117k-instruction dataset and a language-conditioned world model framework for visual navigation.
AI & ML arxiv | Mar 31
Releases ROSClaw, a model-agnostic executive layer that allows any foundation model to control any ROS 2 robot through standardized capability discovery and safety envelopes.
AI & ML arxiv | Mar 31
Releases ChartNet, a high-quality multimodal dataset for chart understanding with 1.5 million samples spanning 24 chart types.
AI & ML arxiv | Mar 31
Introduces MeteoCap-3B, a billion-scale meteorological dataset with expert captions and a spectral-aware diffusion model for weather time-series generation.
AI & ML arxiv | Mar 31
A fully open industrial-scale pretraining project releasing 8T tokens of processed data, a 3B model, and 200+ controlled pretraining ablations.
AI & ML arxiv | Mar 31
The first self-supervised, domain-agnostic model for LiDAR ground segmentation, eliminating the need for per-sensor manual labeling.
AI & ML arxiv | Mar 31
A modular, JAX-based framework and taxonomy for Reinforcement Learning with Diffusion and Flow policies.
AI & ML arxiv | Mar 31
Kuaishou releases KAT-Coder-V2, an agentic coding model achieving state-of-the-art results on SWE-bench Verified through a 'Specialize-then-Unify' paradigm.
AI & ML arxiv | Mar 31
A unified, open-source framework that converts complex post-training quantization workflows into a single-line, hardware-aware pipeline.
AI & ML arxiv | Apr 1
A massive multimodal release for 10 low-resource African languages, cutting state-of-the-art word error rates (WER) by up to 61% in relative terms.
AI & ML arxiv | Apr 1
A massive 270K-sample multi-view video corpus specifically for embodied AI agents in complex retail environments.
AI & ML arxiv | Apr 1
Independently reproduces OpenAI's gpt-oss-20b scores by reverse-engineering undisclosed tool-calling formats and agent harnesses.
AI & ML arxiv | Apr 2
OmniVoice is an open-source TTS model scaling to over 600 languages using a novel diffusion language model architecture.
AI & ML arxiv | Apr 2
Releases the GPT-NL Public Corpus, the largest permissively licensed (CC-BY) Dutch-first dataset for LLM pre-training.
AI & ML arxiv | Apr 2
Delivers a state-of-the-art universal phone recognition model across 100+ languages with full open-source release.
AI & ML arxiv | Apr 2