SeriesFusion
Science, curated & edited by AI

AI & Machine Learning

2,557 papers  ·  Page 42 of 52

Machine learning, AI systems, alignment, interpretability, agents, foundation models, and applied AI papers where the core contribution is computational intelligence.

Breaks Assumption
Exposes 'hidden clones' in VLM ensembles, where models from the same family share correlated errors that naive voting mechanisms fail to detect.
Mar 19
New Capability
Proposes REAL, a Reinforcement Learning framework tailored for regression and ordinal scoring rather than simple binary accuracy.
Mar 19
New Capability
Introduces a framework for LLM agents to autonomously evolve their policies and skill libraries during system idle time without retraining downtime.
Mar 19
Efficiency Breakthrough
A backbone-agnostic denoising objective that allows small GNNs to outperform large models pretrained on much larger supervised datasets in physical sciences.
Mar 19
Paradigm Shift
Achieves high-performance online continual learning without the massive memory overhead of traditional experience replay buffers.
Mar 19
Breaks Assumption
Internal activation probing detects LLM 'rationalization' more reliably than monitoring the model's own Chain-of-Thought (CoT).
Mar 19
Efficiency Breakthrough
A dynamic data pruning framework that cuts dense retriever training time by 50% while actually improving retrieval accuracy.
Mar 19
New Capability
Automates the generation of synthetic machine learning challenges to train agents that can genuinely learn research skills from doing.
Mar 19
Breaks Assumption
Alignment processes induce a 'normative bias' that makes LLMs worse at predicting real human behavior in strategic scenarios.
Mar 19
New Capability
Enables reliable, training-free emotion steering in speech-generative audio models via direct manipulation of specific emotion-sensitive neurons.
Mar 19
Paradigm Shift
A formal, graph-native memory architecture that treats agent memory as a versioned asset, dramatically outperforming Gemini 2.5 Pro on complex recall.
Mar 19
New Capability
A framework to quantify and fix 'task steerability,' the common failure of robots to respond to new instructions while mid-task.
Mar 19
Efficiency Breakthrough
Achieves up to a 1,000x gain in RLHF data efficiency by using information-directed exploration and epistemic neural networks.
Mar 19
Efficiency Breakthrough
Introduces a reward framework that reduces LLM reasoning verbosity by optimizing for 'Information Density' via entropy reduction per step.
Mar 19
Paradigm Shift
Shifts retrieval from static contrastive vector alignment to dynamic reasoning trajectories using a generative model (T1) and GRPO.
Mar 19
Breaks Assumption
Identifies that reasoning-induced safety failures occur *during* Chain-of-Thought and proposes a shift to 'decide-then-reason' architectures.
Mar 19
Efficiency Breakthrough
Generates 9 million grid points of 3D spatiotemporal physical fields in seconds, a 10,000x speedup over traditional physics simulations.
Mar 19
New Capability
Proposes a world model that jointly generates appearance and binocular geometry using an epipolar-aware attention mechanism.
Mar 19
Open Release
Introduces FineViT and a 450M local caption dataset to solve the 'coarse perception' bottleneck in current CLIP-based encoders.
Mar 19
Paradigm Shift
Provides a sheaf-theoretic proof that local causal consistency in generative models does not guarantee global counterfactual coherence.
Mar 19
Efficiency Breakthrough
Replaces quadratic self-attention with $O(N \log N)$ phase-native coupling for time-series, enabling massive context windows.
Mar 19
New Capability
Introduces a paradigm for vision-language navigation that uses ubiquitously available semantic floor plans as global spatial priors.
Mar 19
New Capability
Embeds invisible, agent-specific 'watermarks' into token distributions to enable forensic attribution and topology reconstruction in multi-agent systems.
Mar 19
Efficiency Breakthrough
Achieves an 80% reduction in Chain-of-Thought (CoT) tokens while slightly increasing reasoning accuracy.
Mar 19
Efficiency Breakthrough
Extends LLM context from 32K to 128K by teaching models to selectively skip global attention for ~80% of tokens.
Mar 19
New Capability
Reduces hallucinations by teaching models 'epistemological humility'—the ability to admit they don't know something—using synthetic non-existent terms.
Mar 19
Breaks Assumption
Develops a zero-watermarking framework that survives AI editing by leveraging invariant relations between image patches.
Mar 19
Paradigm Shift
Unifies large-scale search, recommendation, and reasoning into a single self-contained LLM by treating item IDs as a distinct modality.
Mar 19
Scaling Insight
Video fine-tuning consistently degrades static image understanding in multimodal LLMs, revealing a zero-sum trade-off between spatial and temporal capabilities.
Mar 19
New Capability
Introduces a Prompt-Free Universal Region Proposal Network (PF-RPN) that identifies objects in any domain without needing text or image exemplars.
Mar 19
New Capability
FrescoDiffusion enables coherent, 4K image-to-video generation using a training-free, tiled diffusion method with precomputed latent priors.
Mar 19
Efficiency Breakthrough
Knowledge-Aware Active Learning (KA2L) uses latent space probing to identify what an LLM doesn't know and generates targeted synthetic questions.
Mar 19
Breaks Assumption
Dense retrieval architectures are fundamentally flawed at detecting negation and contradictions due to 'Semantic Collapse' in vector space.
Mar 19
Paradigm Shift
Edit-As-Act reframes 3D scene editing as a goal-regressive planning problem using symbolic action languages rather than purely generative pixel manipulation.
Mar 19
Breaks Assumption
ARES demonstrates high-fidelity data reconstruction from large Federated Learning batches without requiring any architectural modifications to the model.
Mar 19
Scaling Insight
Mechanistic probing reveals a directional asymmetry in how LLMs encode hierarchy: hypernymy is redundant and resilient, while hyponymy is fragile and compact.
Mar 19
Efficiency Breakthrough
S-VGGT introduces structure-aware subscene decomposition to break the quadratic scaling bottleneck of 3D foundation models.
Mar 19
New Capability
Introduces a framework to generate complex, non-linear environments with mathematically guaranteed ground-truth optimal policies for RL benchmarking.
Mar 19
Efficiency Breakthrough
DSS-GAN is the first generative adversarial network to use a Mamba (State Space Model) backbone for high-quality image synthesis.
Mar 19
New Capability
VectorWorld enables stable, real-time 1km+ closed-loop world model rollouts for autonomous driving using diffusion flow on vector graphs.
Mar 19
New Capability
REAL achieves extreme quadruped parkour agility that is robust even to a 1-meter visual blind zone.
Mar 19
Breaks Assumption
FINER discovers that MLLMs are highly prone to hallucination when images contain fine-grained mismatches co-occurring with real elements.
Mar 19
Efficiency Breakthrough
Synthetic videos of simple geometric shapes are more effective than massive real-world datasets for teaching video-language models fundamental temporal reasoning.
Mar 19
New Capability
Lifting 2D features into a volumetric representation for robot manipulation policies yields a 14.8% success rate improvement by solving the 2D-3D spatial reasoning mismatch.
Mar 19
Paradigm Shift
A new self-refining surrogate framework enables neural models to simulate complex dynamical systems over arbitrarily long horizons without the standard failure mode of compounding error.
Mar 19
Breaks Assumption
Massive activation outliers in Transformers are an adaptive response to 'gradient sinks' during training, rather than just an inference-time quirk.
Mar 19
Paradigm Shift
The 'consensus trap' in label-free RL—where models reinforce their own systematic errors—can be broken by co-evolving the model in alternating generator and verifier roles.
Mar 19
Breaks Assumption
In-context memory for LLMs is fundamentally unreliable due to compaction loss and goal drift, but structured 'Knowledge Objects' provide a 252x cheaper and 100% accurate alternative.
Mar 19
Efficiency Breakthrough
Anomaly detection can be performed directly using a primary model's internal neuron output ranges, eliminating the need for expensive external AD models.
Mar 19
Efficiency Breakthrough
Truncated backpropagation for video decoding reduces the memory cost of fine-tuning video diffusion models from linear to constant.
Mar 19