eeg2vision-multimodal-eeg-framework-2d-visual - SKILL.md Agent Skill

name: eeg2vision-multimodal-eeg-framework-2d-visual description: > EEG2Vision — Modular end-to-end EEG-to-image reconstruction framework using diffusion models with MLLM-guided boosting. Evaluates performance across EEG resolutions (128/64/32/24 channels). Enables real-time brain-to-image applications with low-density EEG. Use when: EEG visual reconstruction, brain-to-image, diffusion models for EEG, multimodal LLM for neuroscience, low-density EEG decoding. Trigger: EEG to image, brain reconstruction, visual decoding EEG, diffusion EEG, EEG2Vision, 脑电图像重建, EEG视觉重建. version: 1.0.0 author: Research Synthesis (arXiv:2604.08063) license: MIT metadata: hermes: tags: [EEG, visual-reconstruction, diffusion, multimodal-LLM, brain-to-image, low-density-EEG] source_paper: "EEG2Vision: A Multimodal EEG-Based Framework for 2D Visual Reconstruction in Cognitive Neuroscience (arXiv:2604.08063)"

EEG2Vision: EEG-to-Image Reconstruction with MLLM Boosting

Overview

EEG2Vision reconstructs 2D visual stimuli from non-invasive EEG signals using a two-stage pipeline:

EEG-conditioned diffusion model for initial reconstruction
MLLM-guided boosting for semantic refinement

Key innovation: Works with low-density EEG (as few as 24 channels), enabling real-world BCI applications.

Architecture

┌──────────────────────────────────────────────────────────┐
│  Stage 1: EEG-Conditioned Diffusion Reconstruction        │
│                                                          │
│  EEG (N channels) → Feature Extractor → Latent Condition │
│                                        ↓                  │
│                        ┌───────────────────────────┐     │
│                        │ Diffusion Model            │     │
│                        │ (EEG-conditioned generation)│    │
│                        └─────────────┬─────────────┘     │
│                                      ↓                    │
│                        Initial Reconstructed Image       │
└──────────────────────────────────────────────────────────┘
                          ↓
┌──────────────────────────────────────────────────────────┐
│  Stage 2: MLLM-Guided Boosting                           │
│                                                          │
│  Initial Image → MLLM → Semantic Description (prompt)   │
│                          ↓                                │
│  Initial Image + Semantic Prompt → I2I Diffusion        │
│                          ↓                                │
│                  Refined Image                           │
│  (improved geometry, perceptual coherence,               │
│   EEG-grounded structure preserved)                      │
└──────────────────────────────────────────────────────────┘

Channel Resolution Results

Channels	50-way Top-1 Acc	FID	IS Improvement (boost)
128	89%	76.77	+5.2%
64	~70%	~78	+6.8%
32	~50%	~79	+8.1%
24	38%	80.51	+9.71%

Key insight: Semantic accuracy drops sharply with fewer channels, but the boosting mechanism provides greater relative improvement in low-channel settings.

Implementation Pattern

class EEG2Vision:
    def __init__(self, eeg_encoder, diffusion_model, mllm, i2i_diffusion):
        self.eeg_encoder = eeg_encoder
        self.diffusion = diffusion_model
        self.mllm = mllm
        self.i2i_diffusion = i2i_diffusion
    
    def reconstruct(self, eeg_signal):
        # Stage 1: EEG-conditioned diffusion
        eeg_features = self.eeg_encoder(eeg_signal)
        initial_image = self.diffusion.generate(condition=eeg_features)
        
        # Stage 2: MLLM-guided boosting
        semantic_prompt = self.mllm.describe(initial_image)
        refined_image = self.i2i_diffusion.refine(
            initial_image, 
            prompt=semantic_prompt
        )
        return refined_image

Applications

Real-time brain-to-image BCI
Cognitive neuroscience research
Low-cost EEG visualization
Clinical neuroimaging applications
Consumer-grade EEG device applications

Activation Keywords

EEG to image, brain reconstruction, visual decoding
diffusion models for EEG, MLLM boosting
low-density EEG, brain-to-image, EEG2Vision
EEG图像重建, 脑电视觉重建, 扩散模型

References

Emanuele Balloni, Emanuele Frontoni, et al. "EEG2Vision: A Multimodal EEG-Based Framework for 2D Visual Reconstruction in Cognitive Neuroscience." arXiv:2604.08063