fmri-gesture-reconstruction - SKILL.md Agent Skill

name: fmri-gesture-reconstruction description: "fMRI2GES: Dual brain decoding alignment framework for co-speech gesture reconstruction from fMRI signals. Maps brain responses to external stimuli and decodes gestural behavior through dual-alignment brain decoding. Activation: fMRI gesture reconstruction, brain-to-gesture decoding, fMRI2GES, co-speech gesture brain decoding, dual brain decoding alignment."

fMRI Gesture Reconstruction (fMRI2GES)

Dual brain decoding alignment framework for reconstructing co-speech gestures from fMRI signals, advancing brain-to-behavior decoding beyond traditional speech/motor paradigms.

Metadata

Source: arXiv:2512.01189
Authors: Chunzheng Zhu, Jialin Shao, Jianxin Lin, Yijun Wang, Jing Wang, Jinhui Tang
Published: 2025-11-30
Categories: Not specified in abstract

Core Methodology

Key Innovation

Extends brain decoding beyond speech/text reconstruction to gesture reconstruction — decoding the physical gestural behavior that accompanies speech from fMRI signals. Uses dual alignment strategy to improve decoding fidelity.

Technical Framework

Dual Brain Decoding Alignment
- Aligns fMRI signals to both gesture kinematics and speech features
- Two-stage alignment: brain-to-gesture + brain-to-speech cross-modal mapping
- Leverages shared neural representations between speech and gesture production
Gesture Reconstruction Pipeline
- Input: fMRI time series during speech-with-gesture tasks
- Intermediate: Aligned latent representations
- Output: Reconstructed gesture sequences (motion trajectories)
Cross-Modal Learning
- Joint training on gesture and speech decoding objectives
- Shared encoder captures multimodal neural representations
- Separate decoders for gesture kinematics and speech features

Applications

Brain-computer interfaces for gesture communication
Understanding neural basis of co-speech gesture production
Neurorehabilitation for speech-gesture coordination deficits
Multimodal brain decoding research

Implementation Guide

Prerequisites

fMRI data with gesture annotation
Motion capture or gesture tracking data
Deep learning framework for sequence modeling

Step-by-Step

Collect fMRI data during natural speech-with-gesture tasks
Annotate gesture sequences (kinematics, timing, type)
Build dual-alignment encoder for fMRI-to-gesture mapping
Train with joint gesture + speech decoding objectives
Evaluate gesture reconstruction quality against ground truth
Analyze neural regions contributing to gesture decoding

Pitfalls

fMRI temporal resolution limits gesture reconstruction precision
Requires synchronized fMRI and motion capture data
Gesture annotation is labor-intensive
Cross-modal alignment may be sensitive to individual variability
Limited generalizability across subjects without adaptation

Related Skills

brain-dit-fmri-foundation-model
brain-to-speech-prosody-feature-engineering
eeg2vision-multimodal-eeg-framework-2d-visual
visual-imagery-decoding-fmri