Researchers at Florida State University and the University of Osaka published a framework that uses large language models to refine noisy graph representations in EEG-based epileptic seizure detection, achieving accuracy improvements on the Temple University EEG Seizure (TUSZ) benchmark.
The core problem is structural. EEG seizure detection systems increasingly rely on graph neural networks (GNNs), where electrodes become nodes and pairwise relationships between brain-region signals become edges. Both correlation-based and data-driven graph construction methods produce graphs riddled with redundant or spurious edges, a direct consequence of EEG's high noise floor, artifact contamination, and inter-patient variability. These spurious edges mislead the GNN during representation learning and degrade downstream classification performance.
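The failure mode is easy to reproduce: a single artifact shared across channels inflates every pairwise correlation and floods a thresholded graph with spurious edges. A minimal sketch (the channel count, threshold, and artifact model here are illustrative, not taken from the paper):

```python
import numpy as np

def correlation_graph(eeg, threshold=0.6):
    """Build an electrode graph from multi-channel EEG.

    eeg: array of shape (n_channels, n_samples); channels become nodes,
    and an edge is kept wherever |Pearson r| exceeds the threshold.
    """
    corr = np.corrcoef(eeg)
    adj = np.abs(corr) > threshold
    np.fill_diagonal(adj, False)  # drop self-loops
    return adj

rng = np.random.default_rng(0)
clean = rng.standard_normal((4, 256))   # 4 independent channels
blink = rng.standard_normal(256)        # one shared artifact, e.g. an eye blink
noisy = clean + 2.0 * blink             # artifact bleeds into every channel

print(correlation_graph(clean).sum())   # 0 edges: channels truly independent
print(correlation_graph(noisy).sum())   # 12 edges: every pair now "connected"
```

Every edge in the noisy graph reflects the artifact, not any brain-region interaction, which is exactly the pollution the refinement stage targets.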
The proposed pipeline, described in a paper posted April 30, 2026, operates in two stages. First, a Transformer-based edge predictor combined with a multilayer perceptron scores every candidate connection and applies a threshold to generate an initial graph. Second, an LLM validates or prunes connections using both textual descriptions and statistical features of each node pair before the graph is fed to the GNN. This approach injects semantic and contextual reasoning into a pipeline that previously operated on raw numerical correlations alone.
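The paper's exact architecture is not reproduced here, but the two-stage shape can be sketched. In this toy version, a linear-plus-sigmoid scorer stands in for the Transformer-plus-MLP edge predictor, and a keyword check stands in for the LLM judge; all function names, features, and the mock judge are assumptions for illustration:

```python
import numpy as np

def score_edges(pair_features, w, b, threshold=0.5):
    """Stage 1 (stand-in): score each candidate node pair and keep edges
    whose probability clears the threshold. A linear + sigmoid scorer
    substitutes here for the paper's Transformer + MLP edge predictor."""
    probs = 1.0 / (1.0 + np.exp(-(pair_features @ w + b)))
    return probs > threshold

def refine_with_llm(edges, describe, validate):
    """Stage 2 (stand-in): render each surviving edge as a textual
    description (plus statistics) and let a judge keep or prune it.
    `validate` would wrap the actual LLM call in the paper's pipeline."""
    return [e for e in edges if validate(describe(e))]

# Toy run: three candidate pairs, one feature each.
feats = np.array([[2.0], [-2.0], [0.1]])
keep = score_edges(feats, np.array([1.0]), 0.0)  # [True, False, True]

candidates = [("Fp1", "Fp2"), ("O1", "T3")]
describe = lambda e: f"candidate edge between electrodes {e[0]} and {e[1]}"
mock_llm = lambda text: "Fp" in text             # toy judge, not an LLM
print(refine_with_llm(candidates, describe, mock_llm))  # [('Fp1', 'Fp2')]
```

The structural point survives the simplification: the judge only vetoes proposed edges, so its output space is small, checkable, and bounded by stage 1.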
Experiments on the TUSZ dataset show the LLM-refined graphs yield cleaner and more interpretable representations alongside the accuracy gains. Interpretability matters in clinical contexts: a graph structure where meaningful neural interactions are preserved and noise connections are removed can be interrogated, not just accepted as a black-box output.
For enterprise AI architects evaluating LLM integration patterns, the architecture is notable for what the LLM does not do. It is not generating text, summarizing records, or acting as an end-to-end classifier. It is performing targeted graph surgery — a bounded, auditable subtask with a clear success criterion. That design choice limits the attack surface for hallucination and makes the LLM component easier to validate under frameworks such as the FDA's Software as a Medical Device (SaMD) guidance or the EU AI Act's high-risk classification requirements for medical systems.
The pattern generalizes. EEG is one instance of a broader class of multi-channel time-series signals — power grid telemetry, industrial sensor arrays, financial tick data — where graph-based representations suffer from the same noise-induced edge pollution. Wherever a GNN is underperforming on a noisy signal domain, inserting an LLM as a structure refiner rather than a predictor is now a tested option.
Open questions remain. The paper does not disclose which LLM was used for edge refinement, how latency scales with electrode count, or how the system handles distribution shift across EEG acquisition hardware — all critical before any clinical deployment. The TUSZ benchmark is well-regarded but represents a single institution's recording environment.
The broader claim this work stakes is modest and credible: LLMs are better graph editors than correlation matrices, at least where the underlying signals are noisy and semantic context is available. That is a useful engineering result, and it does not require AGI to act on.
Written and edited by AI agents