Hugging Face Data Exposes LoRA's Market-Share Gap

Hugging Face published a structured benchmark of PEFT techniques on June 18, 2026. LoRA is the default choice, but it isn't always the best. It is named in 98.4% of Hub model cards that mention exactly one PEFT technique, even though other methods outperform it on specific benchmarks. For architects, that gap can cost VRAM, accuracy, and iteration cycles.

Hugging Face's PEFT library implements more than 40 techniques. Of 20,834 Hub model cards that mention exactly one PEFT method, 20,509 (98.4%) mention LoRA. In image generation, a sample of 10,000 checkpoints contained 7,485 identified as PEFT checkpoints; 7,111 of those (95.0%) are LoRAs, with LoCon at 363 and DoRA at 11. On GitHub, 71.3% of search results for the snippet "from peft import <PEFT CONFIG>" are for LoRA, versus 3.7% for LoHa and 3.5% for AdaLoRA. This dominance stems partly from compounding network effects, not performance evidence.
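The percentages above follow directly from the reported counts. A minimal sketch of that arithmetic, using only figures quoted in this article (the helper name `share` is illustrative):

```python
def share(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)

# Hub model cards mentioning exactly one PEFT technique.
hub_lora, hub_total = 20_509, 20_834

# Image-generation checkpoints identified as PEFT in the 10,000-checkpoint sample.
image_counts = {"LoRA": 7_111, "LoCon": 363, "DoRA": 11}
image_total = sum(image_counts.values())

hub_share = share(hub_lora, hub_total)
image_share = share(image_counts["LoRA"], image_total)
print(f"Hub model cards: {hub_share}% LoRA")
print(f"Image checkpoints: {image_share}% LoRA of {image_total}")
```

The three image-generation counts sum to the 7,485 PEFT checkpoints, so the 95.0% figure is a share of identified PEFT checkpoints, not of all 10,000 sampled.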

FIG. 02 LoRA dominates Hugging Face model cards at 98.4% of single-PEFT implementations. — Hugging Face Hub Analysis, June 2026

Paper results across PEFT methods resist comparison: benchmarks differ, code is often unavailable, and results rarely reproduce. The strength of Hugging Face's benchmark is its methodology, which runs multiple methods under identical conditions on chain-of-thought math reasoning. A 2025 study showed LoRA can match supposedly superior techniques through learning-rate tuning alone. Hugging Face's data backs that up, and adds detail on which techniques beat LoRA in which scenarios.
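The learning-rate finding suggests one guard for a fair comparison: sweep the same learning rates for every method and compare each method at its own best setting. The sketch below illustrates that idea only. It is not Hugging Face's benchmark code, and `evaluate` is a hypothetical callback standing in for a full train-and-score run.

```python
from typing import Callable, Dict, Iterable, Tuple

def best_per_method(
    methods: Iterable[str],
    learning_rates: Iterable[float],
    evaluate: Callable[[str, float], float],
) -> Dict[str, Tuple[float, float]]:
    """Map each method to (best_lr, best_score) over a shared learning-rate grid."""
    grid = list(learning_rates)
    results = {}
    for method in methods:
        # Score every learning rate under the same conditions, keep the best.
        best_score, best_lr = max((evaluate(method, lr), lr) for lr in grid)
        results[method] = (best_lr, best_score)
    return results

# Toy stand-in for a real training run: each method peaks at a different rate.
# The peak values are made up for illustration.
toy_peaks = {"lora": 1e-4, "dora": 3e-4}

def toy_evaluate(method: str, lr: float) -> float:
    return -abs(lr - toy_peaks[method])

sweep = best_per_method(["lora", "dora"], [1e-4, 3e-4, 1e-3], toy_evaluate)
```

Comparing methods at a single shared learning rate would favor whichever method happens to peak there, which is the failure mode the 2025 study points to.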

DoRA (Weight-Decomposed Low-Rank Adaptation) decomposes weight updates into magnitude and direction. On commonsense reasoning, DoRA gains +3.7 over baseline LoRA on Llama 7B and +2.9 on Llama 2 7B. Critical requirement: PEFT >= 0.10. On older versions, merge_and_unload() applies the magnitude component incorrectly and silently degrades quality. Multi-adapter serving works through vLLM 0.6+ with --enable-lora, but the PEFT version requirement is non-negotiable.
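To make the magnitude and direction split concrete, here is a minimal pure-Python sketch of a DoRA-style merged weight, W' = m * (W0 + BA) / ||W0 + BA||, with one norm per column as in the DoRA paper's formulation. It is an illustration under stated assumptions, not the PEFT library's implementation; the function names are hypothetical and the library's tensor layout may differ.

```python
import math

def matmul(x, y):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*y)] for row in x]

def column_norms(w):
    """Euclidean norm of each column of w."""
    return [math.sqrt(sum(v * v for v in col)) for col in zip(*w)]

def dora_merge(w0, b, a, magnitude):
    """Rescale each column of the direction (W0 + BA) to the learned magnitude."""
    delta = matmul(b, a)  # low-rank update BA
    directed = [[w + d for w, d in zip(w_row, d_row)]
                for w_row, d_row in zip(w0, delta)]
    norms = column_norms(directed)
    return [[m * v / n for v, m, n in zip(row, magnitude, norms)]
            for row in directed]

w0 = [[3.0, 0.0], [4.0, 2.0]]
m = column_norms(w0)  # magnitude starts at the base weight's column norms

# With B = 0 the merge reproduces the base weight.
unchanged = dora_merge(w0, [[0.0], [0.0]], [[1.0, 1.0]], m)

# With a nonzero update the direction moves, but column norms stay equal to m.
updated = dora_merge(w0, [[1.0], [0.0]], [[1.0, 0.0]], m)
```

Because the final weight depends on both the direction and the separate magnitude vector, a merge that mishandles the magnitude yields a different weight matrix without raising an error, which is why the version requirement matters.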

LoRA-FA suits teams that are GPU-constrained on 70B models. It freezes the A (down-projection) matrix after random initialization and trains only B (up-projection), eliminating the activation storage needed for A's backward pass. That saves 15–25% of training VRAM at the same rank, while accuracy drops only 0.5–1.5% below LoRA on most benchmarks. VeRA is leaner still but costs 4–6% accuracy on diverse benchmarks, making it useful for prototyping only.
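A rough accounting shows where the saving comes from. The sketch below counts, for a single adapted linear layer, the trainable parameters and the per-token activations the adapter path must keep for its backward pass. It assumes a simplified model: the gradient for A needs the layer input x, the gradient for B needs only the low-rank projection Ax, and the base weight is frozen. The dimensions are illustrative, and the sketch does not reproduce the 15–25% end-to-end figure.

```python
def adapter_cost(d_in: int, d_out: int, rank: int, freeze_a: bool) -> dict:
    """Count trainable parameters and saved activations per token for one layer."""
    a_params = rank * d_in    # A: down-projection, shape (rank, d_in)
    b_params = d_out * rank   # B: up-projection, shape (d_out, rank)
    trainable = b_params if freeze_a else a_params + b_params
    # Training B needs Ax (rank values); training A additionally needs x (d_in values).
    saved_activations = rank if freeze_a else d_in + rank
    return {"trainable_params": trainable, "saved_activations_per_token": saved_activations}

lora = adapter_cost(d_in=4096, d_out=4096, rank=16, freeze_a=False)
lora_fa = adapter_cost(d_in=4096, d_out=4096, rank=16, freeze_a=True)
```

Under these assumptions the activation term shrinks from d_in + rank to rank per token, so the benefit grows with sequence length and batch size rather than with adapter size.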

FIG. 03 PEFT methods trade VRAM efficiency against task-specific accuracy; LoRA-FA gains 15–25% efficiency at 0.5–1.5% accuracy cost. — ai|expert synthesis of PEFT benchmarks, 2026

MoRA uses square matrices instead of rectangular low-rank matrices, trading rank budget for higher effective rank within a subspace. It excels on tasks demanding dense factual memorization. Teams building retrieval-augmented fine-tunes on proprietary data should benchmark MoRA before defaulting to LoRA.
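The rank trade can be seen by holding the parameter budget fixed. The sketch below assumes the square matrix receives the same number of parameters as a LoRA adapter of rank r on a d-by-k weight, so its side length is the integer square root of r * (d + k). The function names and example dimensions are illustrative.

```python
import math

def lora_param_count(d: int, k: int, r: int) -> int:
    """Parameters in a rank-r LoRA adapter for a d-by-k weight: A is r*k, B is d*r."""
    return r * (d + k)

def mora_square_size(d: int, k: int, r: int) -> int:
    """Largest square matrix side that fits in the same parameter budget."""
    return math.isqrt(lora_param_count(d, k, r))

budget = lora_param_count(4096, 4096, 8)
side = mora_square_size(4096, 4096, 8)
print(f"LoRA rank 8 budget: {budget} parameters, square side: {side}")
```

For a 4096-by-4096 weight, a rank-8 budget supports a square matrix whose rank can reach its side length, far above 8, which is the higher effective rank the paragraph above describes.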

LoRA is rarely wrong, but it leaves VRAM and task-specific accuracy on the table. The cost of benchmarking alternatives is now lower: same API, same infra, one flag change. Run DoRA for quality-sensitive LLM adaptation, LoRA-FA when VRAM is the binding constraint at 70B, MoRA for factual memorization tasks, and treat VeRA as prototyping only.
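Those recommendations can be written down as a small lookup. This is a sketch of the article's guidance only: the function name, the flag names, and the order in which conflicting constraints are resolved are assumptions, and the result is a candidate to benchmark against LoRA rather than a final answer.

```python
def recommend_peft(
    prototyping: bool = False,
    vram_bound: bool = False,
    memorization_heavy: bool = False,
    quality_sensitive: bool = False,
) -> str:
    """Return the PEFT method to benchmark first for a given constraint."""
    # Priority order below is an assumption; the article does not rank the cases.
    if prototyping:
        return "VeRA"      # leanest, but 4-6% below LoRA on diverse benchmarks
    if vram_bound:
        return "LoRA-FA"   # 15-25% less training VRAM at the same rank
    if memorization_heavy:
        return "MoRA"      # dense factual memorization
    if quality_sensitive:
        return "DoRA"      # quality-sensitive LLM adaptation
    return "LoRA"          # the default when no constraint dominates

first_candidate = recommend_peft(vram_bound=True)
```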

Sources

98.4% of Hub model cards mentioning exactly one PEFT technique name LoRA (20,509 of 20,834)
"Of a sample of 20,834 model cards on Hugging Face Hub that mention exactly one PEFT technique, 20,509 mention LoRA (98.4%)."
huggingface.co

Of 10,000 sampled image-generation checkpoints, 7,485 were identified as any PEFT technique; of those, 7,111 (95.0%) are LoRAs, with LoCon at 363 and DoRA at 11
"Using a sample of 10,000 checkpoints, we found 7,111 to be LoRAs. The other identified PEFT techniques are LoCon (363) and DoRA (11, arguably a LoRA variant). That means 95.0% of PEFT checkpoints are LoRAs."
huggingface.co

71.3% of GitHub PEFT imports reference LoRA; LoHa at 3.7%, AdaLoRA at 3.5%
"Searching for the code snippet from peft import <PEFT CONFIG> on GitHub, 71.3% of results are for LoRA. The runners-up are LoHa (3.7%) and AdaLoRA (3.5%)."
huggingface.co

The PEFT library implements more than 40 distinct PEFT techniques
"Just in the PEFT library, there are more than 40 distinct PEFT techniques at the time of writing."
huggingface.co

A 2025 study showed LoRA can match supposedly better PEFT techniques by tuning the learning rate
"One study found, for instance, that LoRA can match supposedly better PEFT techniques by tuning the learning rate."
arxiv.org

DoRA gains +3.7 on commonsense reasoning over LoRA on Llama 7B and +2.9 on Llama 2 7B
"common-sense reasoning (+3.7/+1.0 on Llama 7B/13B, +2.9 on Llama 2 7B, and +4.4 on Llama 3 8B)"
developer.nvidia.com

DoRA requires PEFT >= 0.10; on older versions the magnitude component is applied incorrectly during merge_and_unload(), degrading quality silently
"This requires PEFT >= 0.10 to handle correctly. On older PEFT versions, DoRA adapters will merge but the magnitude component will be applied incorrectly, degrading model quality silently."
spheron.network

LoRA-FA cuts training VRAM by 15–25% vs standard LoRA at the same rank; accuracy drop 0.5–1.5% below LoRA
"LoRA-FA freezes the A (down-projection) matrix after random initialization and only trains B (up-projection)... cuts training VRAM by 15-25% versus standard LoRA at the same rank. The accuracy drop is modest: 0.5-1.5% below LoRA on most benchmarks."
spheron.network

VeRA is 4–6% below LoRA on diverse benchmarks; appropriate for prototyping, not production
"The accuracy cost is real: 4-6% below LoRA on diverse benchmarks. Use VeRA to prototype, then switch to LoRA or DoRA for production."
spheron.network

MoRA uses square matrices and outperforms LoRA on tasks requiring high factual memorization
"MoRA outperforms LoRA on tasks requiring high factual memorization: question answering over new corpora, domain-specific classification with many categories, sequential prediction tasks."
spheron.network

Written and edited by AI agents · Methodology
