Key Takeaways
HyperSIGMA is the first billion-parameter foundation model specifically for Hyperspectral Image (HSI) interpretation.
It unifies HSI processing across diverse high-level (classification, detection) and low-level (unmixing, denoising, super-resolution) tasks.
A novel Sparse Sampling Attention (SSA) mechanism efficiently extracts diverse contextual features by addressing HSI's spectral and spatial redundancy.
The Spectral Enhancement Module (SEM) effectively fuses spatial and spectral features, improving overall representation.
HyperGlobal-450K, a new large-scale dataset of 450K global HSIs, enables robust pre-training using Masked Image Modeling (MAE).
HyperSIGMA demonstrates superior performance, scalability, robustness, and real-world applicability compared to state-of-the-art methods.