Open Release

57 papers

OpenSanctions Pairs releases a massive benchmark for entity matching, proving that local LLMs can now match production rule-based systems in high-stakes compliance tasks.

AI & ML arxiv | Mar 13

Tiny Aya is a 3.35B parameter multilingual model that achieves state-of-the-art results across 70 languages, challenging the need for massive scale in global AI.

AI & ML arxiv | Mar 13

Introduces the first billion-scale SAR vision foundation model and a massive unified benchmark for all-weather geospatial semantic segmentation.

AI & ML arxiv | Mar 13

An open foundation model for humanoid robots that achieves high performance using only 30 hours of real-world robot data by pre-training on egocentric human videos.

AI & ML arxiv | Mar 13

Surg-R1 is a specialized surgical reasoning model released alongside the largest surgical Chain-of-Thought dataset (320,000 pairs).

AI & ML arxiv | Mar 16

Releases Feynman, an agentic pipeline and 100k-sample dataset for generating high-quality, knowledge-rich diagrams with grounded captions.

AI & ML arxiv | Mar 16

Introduces the largest-ever multi-modal CAD dataset with 10 million annotations for 1 million models to enable geometric deep learning on BRep data.

AI & ML arxiv | Mar 16

Introduces a unified evaluation harness for Vision-Language-Action (VLA) models that standardizes disparate protocols and exposes hidden flaws in published SOTA models.

AI & ML arxiv | Mar 17

Releases an 11-billion example dataset and model (RealVLG-R1) for unified real-world visual-language grounding and robotic manipulation.

AI & ML arxiv | Mar 17

Releases a million-scale human preference dataset (29M pairs) specifically for text-to-image editing tasks.

AI & ML arxiv | Mar 17

Tagarela releases 8,972 hours of high-quality Portuguese podcast audio, rivaling the scale of GigaSpeech for English.

AI & ML arxiv | Mar 17

Democratizes the development of 'Deep Search' agents by open-sourcing the specialized training data and trajectory synthesis methods.

AI & ML arxiv | Mar 17

Kamino is a massively parallel GPU physics solver that natively supports complex kinematic loops and multi-body systems.

AI & ML arxiv | Mar 18

IQuest-Coder-V1 introduces a series of high-performance code models including a unique 'Loop' variant with a recurrent mechanism for efficiency.

AI & ML arxiv | Mar 18

SurgΣ is a massive open-source release of 5.98M multimodal conversations and foundation models for surgical intelligence.

AI & ML arxiv | Mar 18

The first dedicated foundation model for electrodermal activity (EDA) data, released alongside the largest public dataset for physiological signal modeling.

AI & ML arxiv | Mar 19

Democratizes dexterous robot data collection by enabling high-fidelity 21-DoF teleoperation using only a standard smartphone.

AI & ML arxiv | Mar 19

Introduces FineViT and a 450M local caption dataset to solve the 'coarse perception' bottleneck in current CLIP-based encoders.

AI & ML arxiv | Mar 19

SpecForge provides an open-source framework and high-quality draft models (SpecBundle) to make speculative decoding production-ready.

AI & ML arxiv | Mar 20

OpenT2M is a massive open-source motion dataset (2,800+ hours) that addresses the data starvation in text-to-motion generation.

AI & ML arxiv | Mar 20

An open release of a multilingual embedding family (80M to 14B) covering 200+ languages and ranking first on 11 MTEB benchmarks.

AI & ML arxiv | Mar 20

Releases an offline search-and-browse pipeline with 97K long-horizon trajectories for training 'Deep Research' agents.

AI & ML arxiv | Mar 24

AgentComm-Bench is the first benchmark to stress-test cooperative embodied AI under realistic wireless impairments like packet loss and bandwidth collapse.

AI & ML arxiv | Mar 24

ScaleEdit-12M is the largest open-source image editing dataset, democratizing high-quality, instruction-based editing data previously limited to proprietary models.

AI & ML arxiv | Mar 24

An open-source family of language models for Kazakh that outperforms much larger multilingual models by using a language-specific tokenizer.

AI & ML arxiv | Mar 24

CLT-Forge democratizes mechanistic interpretability by providing an end-to-end library for training Cross-Layer Transcoders and generating feature attribution graphs.

AI & ML arxiv | Mar 24

LongCat-Flash-Prover is a 560B MoE model that sets a new SOTA for open-weights formal reasoning, achieving a 97.1% pass rate on MiniF2F-Test.

AI & ML arxiv | Mar 24

Open-sources a high-fidelity foundation model that jointly generates synchronized video and audio using a unified single-stream Transformer.

AI & ML arxiv | Mar 24

Releases the first large-scale family of learned sparse retrieval (LSR) models specialized for code (up to 8B parameters).

AI & ML arxiv | Mar 24

Releases the hardware design and training environment for MEVIUS2, an open-source, Spot-scale quadruped robot.

AI & ML arxiv | Mar 24

An open foundation suite for universal dexterous robot control trained on over 50k trajectories across eight different robotic hand architectures.

AI & ML arxiv | Mar 24

Introduces the first high-performing open-source metric for per-sample AI music quality evaluation.

AI & ML arxiv | Mar 25

Provides a massive 2.5M image-to-TikZ dataset and the first instruction-augmented dataset for geometric visual reasoning.

AI & ML arxiv | Mar 25

Berta is an open-source, production-proven AI clinical scribe that reduces operating costs by up to 95% compared to commercial alternatives.

AI & ML arxiv | Mar 26

BioVITA releases a massive multimodal biological dataset of 3.6M image-audio-text samples covering 14,000 species.

AI & ML arxiv | Mar 26

Releases a high-quality, 92K-sentence parallel dataset for Hindi-Sanskrit translation focusing on contemporary and spoken language.

AI & ML arxiv | Mar 26

Releases 55 hours of continuous 30fps expert human computer-use videos to address the 'missing ingredient' for desktop automation agents.

AI & ML arxiv | Mar 26

VFIG enables high-fidelity conversion of rasterized technical figures into editable, scalable SVGs using a new 66K-pair dataset.

AI & ML arxiv | Mar 26

Releases weights for LEMON, a foundation model for single-cell nuclear morphology trained on millions of pathology images.

AI & ML arxiv | Mar 30

The first large-scale benchmark for LLM agents based on years of authentic, cross-domain user behavioral data rather than synthetic personas.

AI & ML arxiv | Mar 30

Releases DataFlex, a unified open-source framework for data-centric dynamic training (selection, mixture, and reweighting) for LLMs.

AI & ML arxiv | Mar 30

Releases Ruka-v2, a fully open-source, 13-DOF tendon-driven humanoid hand with wrist and finger abduction buildable for under $1,300.

AI & ML arxiv | Mar 30

Releases a massive 117k-instruction dataset and a language-conditioned world model framework for visual navigation.

AI & ML arxiv | Mar 31

Releases ROSClaw, a model-agnostic executive layer that allows any foundation model to control any ROS 2 robot through standardized capability discovery and safety envelopes.

AI & ML arxiv | Mar 31

Releases ChartNet, a million-scale, high-quality multimodal dataset for chart understanding spanning 24 chart types and 1.5 million samples.

AI & ML arxiv | Mar 31

Introduces MeteoCap-3B, a billion-scale meteorological dataset with expert captions and a spectral-aware diffusion model for weather time-series generation.

AI & ML arxiv | Mar 31

A fully open industrial-scale pretraining project releasing 8T tokens of processed data, a 3B model, and 200+ controlled pretraining ablations.

AI & ML arxiv | Mar 31

The first self-supervised, domain-agnostic model for LiDAR ground segmentation, eliminating the need for per-sensor manual labeling.

AI & ML arxiv | Mar 31

A modular, JAX-based framework and taxonomy for Reinforcement Learning with Diffusion and Flow policies.

AI & ML arxiv | Mar 31

Kuaishou releases KAT-Coder-V2, an agentic coding model achieving state-of-the-art results on SWE-bench Verified through a 'Specialize-then-Unify' paradigm.

AI & ML arxiv | Mar 31

A unified, open-source framework that converts complex post-training quantization workflows into a single-line, hardware-aware pipeline.

AI & ML arxiv | Apr 1

A massive multimodal release for 10 low-resource African languages, reducing SOTA Word Error Rates (WER) by up to 61% relative.

AI & ML arxiv | Apr 1

A massive 270K-sample multi-view video corpus specifically for embodied AI agents in complex retail environments.

AI & ML arxiv | Apr 1

Independently reproduces OpenAI's gpt-oss-20b scores by reverse-engineering undisclosed tool-calling formats and agent harnesses.

AI & ML arxiv | Apr 2

OmniVoice is an open-source TTS model scaling to over 600 languages using a novel diffusion language model architecture.

AI & ML arxiv | Apr 2

Releases the GPT-NL Public Corpus, the largest permissively licensed (CC-BY) Dutch-first dataset for LLM pre-training.

AI & ML arxiv | Apr 2

Delivers a state-of-the-art universal phone recognition model across 100+ languages with full open-source release.

AI & ML arxiv | Apr 2