§ BEAT
Research
Piper Compiler Eliminates Hand-Coding for Distributed Training
Single Linear Layer Outperforms 1M-Parameter Gate in MTP Speedup Test
Real EHR Benchmark Exposes Limits of LLMs in Clinical Action
AHA-WAM achieves 4.59× faster robot control by decoupling Diffusion Transformers
FASE Cuts Hallucination Detection to 333x Speed
New DRPO Method Fixes Long-Tail Vocabulary Collapse in LLM RL
FASE Cuts Hallucination Detection Cost to 0.3% of Rivals
SIGA Speeds Coding Agents on Scientific Simulators by 36×
Echo-Memory Shows World Models Fail the Revisit Test
Waterloo researchers cut uncertainty quantification cost 99.7% with FASE
EvalCards Schema Exposes Systematic AI Benchmark Metadata Gaps
Perplexity Agentic AI Cuts Task Time 87 Percent in Production Study
64 Percent of Audio-Text Conflicts in AI Models Are Fixable
Router Matching 50 Retries with 10 Samples Cuts LLM Test-Time Compute
StreamMA Cuts Multi-Agent Reasoning Latency 26.9×
Alibaba Open-Sources Skill-RM for Unified LLM Reward Evaluation
Vendor-Diverse Judge Panels Eliminate Bias in Language Model Evaluations
LLMs Can Induce Hidden Rules, but Procedural Execution Remains Uncracked
AdaCodec cuts video-token load by 7× with predictive encoding
SafeSteer cuts alignment tax by targeting sparse safety tokens
Output Format Drives Faster Accuracy Loss Than Domain Shift in Multimodal LLMs
SubFit Maintains 84.6% Accuracy While Pruning LLM Layers at 25% Sparsity
Claude Code Spent 58% of Sessions Optimizing a Broken Architecture
Robot Manipulation Accuracy Jumps 22.5% With Motion-Aware Encoder
Linear Inverse Problems Don't Protect Against Diffusion Hallucination
HullFT Method Cuts Test-Time Finetuning Latency Versus SIFT
GPIC Open-Source Dataset Displaces ImageNet-1K as Standard Training Corpus
Vision-Language Models Show No Advantage in Text-Only Alignment
Omega-QVLA Cuts Robot Vision Model Memory by 71% Without Retraining