AI & Machine Learning

2,557 papers · Page 23 of 52

Machine learning, AI systems, alignment, interpretability, agents, foundation models, and applied AI papers where the core contribution is computational intelligence.

Filter by category: Paradigm Challenge Breaks Assumption First Ever Nature Is Weird Practical Magic Cosmic Scale Life Origin Open Release Efficiency Leap New Capability Scaling Insight

Researchers have designed a new internet protocol specifically for a 10-node colony network spanning Earth, the Moon, and Mars.

Practical Magic

Everyday 5G cell towers can be repurposed as a massive radar system capable of tracking drones hidden in urban noise.

Nature Is Weird

AI voice assistants can be tricked into 'hearing' voices and events that never actually happened with near-perfect accuracy.

Practical Magic

Future wireless signals could be boosted by walls that physically shift and morph their shape to bounce waves toward your phone.

Paradigm Challenge

Researchers have mapped out all 19.3 million chords the human hand can play on a piano to reveal why some sound 'clear' and others 'muddy.'

Interfaces LLMs with Wikidata-scale graphs for multi-hop reasoning without any retraining of the model or the query executor.

A unified, open-source framework that converts complex post-training quantization workflows into a single-line, hardware-aware pipeline.

Efficiency Breakthrough

Decouples data mixture ratio selection from continual pre-training by optimizing distribution vectors post-hoc with 15-35x lower compute cost.

Achieves an 80x improvement in stable generation length for occupancy world models, enabling 4km+ autonomous driving simulations from a single frame.

Replaces the heuristic constant momentum (0.9) with a parameter-free, physics-inspired schedule that speeds up convergence by nearly 2x.

Leverages model reprogramming as an 'active signal amplifier' to proactively audit privacy leakage in LLMs and Diffusion models.

Efficiency Breakthrough

Combines differentiable optimization with exact ILP solvers to achieve a 10x performance gain in solving NP-hard combinatorial scheduling problems.

Proposes a mathematical framework where 'spectral gaps' in parameter updates control phase transitions like grokking and loss plateaus.

Breaks Assumption

Large-scale experiments reveal that self-organizing LLM agents spontaneously outperform manually designed hierarchical structures by 14%.

Efficiency Breakthrough

A fabricated 16nm SoC that performs real-time 3D occupancy mapping under 6 mW, reducing query energy by over 80%.

Proposes a neuroscience-grounded memory architecture that makes interactions cheaper and more accurate with experience, rather than relying on expanding context windows.

Breaks Assumption

Reveals that parallel translated data is surprisingly unnecessary for creating aligned multilingual representations in LLMs.

Breaks Assumption

Discovers that pretraining Implicit Neural Representations (INRs) on structured $1/f^\alpha$ noise performs as well as data-driven initialization.

Introduces DASES, a framework that replaces passive validation with active 'falsification' to ensure scientific models learn actual mechanisms rather than just winning benchmarks.

Efficiency Breakthrough

Generates complete, simulatable analog circuits in milliseconds, outperforming search-based methods by over 600x.

Breaks Assumption

Demonstrates that integer multiplication is not a long-range dependency problem, and that current architectures like Transformers and Mamba are fundamentally using the wrong 'computational spacetime.'

Efficiency Breakthrough

Introduces PolarQuant, a quantization method that uses Hadamard rotation to make LLM weights near-lossless at 5-bit without calibration data.

Breaks Assumption

Demonstrates that the 'modality gap' in CLIP-style models is a feature that can be exploited to increase robustness without retraining.

Achieves a +48pp accuracy gain in agents using a non-parametric online learning framework that reuses procedural plans without updating model weights.

Efficiency Breakthrough

Scales curvature-aware bilevel optimization to BERT-sized models using KFAC, significantly outperforming standard gradient unrolling.

Switches the training objective from hard Next-Token Prediction to predicting 'concepts' (sets of semantically related tokens).

Breaks Assumption

Challenges the assumption that architecture and loss are the primary levers for neural simulators by proving the 'carried state' design is the dominant bottleneck.

Proves that LLM agent capability (pass@1) and reliability (consistency) diverge systematically, with frontier models often having the highest 'meltdown' rates.

Introduces a way for diffusion models to generate a single, sharp 'mental average' of a concept rather than blurry pixel-wise averages.

A massive multimodal release for 10 low-resource African languages, reducing SOTA Word Error Rates (WER) by up to 61% relative.

Efficiency Breakthrough

Enables infinite-length video understanding on a single consumer GPU (RTX 3090) through a training-free visual memory mechanism.

Learns stable, interpretable Koopman generators for nonlinear PDEs from trajectory data alone without any physics supervision.

A massive 270K-sample multi-view video corpus specifically for embodied AI agents in complex retail environments.

Introduces a scalable reinforcement learning framework that enables high-fidelity control of a whole-body human musculoskeletal system with over 700 muscles.

Proposes 'Nomad', an exploration-first agent architecture that autonomously discovers insights in data without being limited by human prompts or questions.

Breaks Assumption

Reveals that many massive LLM benchmarks provide highly redundant information, with major leaderboards often containing only ~2 independent axes of measurement.

Provides a robust solution for anti-aliasing in Feed-forward Gaussian Splatting, enabling high-fidelity rendering across varying sampling rates and resolutions.

Breaks Assumption

Uses token-level perplexity analysis to prove that LLMs rely on simple heuristics rather than the linguistic reasoning they appear to exhibit on standard benchmarks.

Breaks Assumption

Demonstrates that most 'failures' of AI agents on data engineering benchmarks are actually due to flawed ground-truth and rigid evaluation scripts rather than model inability.

Enables precise Camera-LiDAR extrinsic calibration even under massive initial misalignments that typically break automated calibration systems.

Shows that VLMs can overcome deep-seated perceptual biases and optical illusions by using image manipulation tools rather than more training data.

Efficiency Breakthrough

Obtain epistemic and aleatoric uncertainty from a single forward-backward pass of an unmodified pretrained LLM.

The first prior-fitted foundation model for survival analysis that enables zero-shot time-to-event predictions on tabular data.

Breaks Assumption

Mathematical proof that cosine similarity between label representations (unembeddings) in softmax classifiers is fundamentally uninformative.

Efficiency Breakthrough

A vector-wise sparse attention mechanism that accelerates long-context video inference by 2.6x with zero loss in accuracy.

A novel neural primitive based on metriplectic dynamics that outperforms Transformers in data efficiency and generalization.

Breaks Assumption

A debunking of the idea that single-vector embedding failures are primarily due to low dimensionality.

Efficiency Breakthrough

A unified quantization and runtime framework for deploying multiple LoRA-adapted generative models on edge devices simultaneously.

Breaks Assumption

A diagnostic revealing that over 50% of video understanding benchmark samples can be solved without any video or temporal context.

Efficiency Breakthrough

A 1D continuous image tokenizer that uses semantic masking to achieve a 64x reduction in token usage without sacrificing generation fidelity.