Links
Contact Info.
  • Address:天津市西青区宾水西道399号天津工业大学化学与化工学院化学工程与工艺系6D518
  • Zip:300387
  • Tel:022-83955663
  • Fax:022-83955663
  • Email:bianxihui@163.com
Current Location :> Home > Publications > Text
A review on representative sample subset selection methods for multivariate modelling
writer:Xihui Bian, Wenbo Yang, Zhang Kexin, Zhang Qiang, Liu Peng, Weilu Tian*, Geert van Kollenburg*
keywords:Representative sample subset, Sample partitioning, Multivariate modelling, Chemometrics
source:期刊
specific source:Chemometrics and Intelligent Laboratory Systems, 2025, 265, 105493
Issue time:2025年
Chemometric analysis of complex systems often involves large datasets. Efficiently managing these datasets requires careful sample subset selection, encompassing two key tasks: selecting a representative sample subset for initial analysis and partitioning data for model calibration and validation. This review provides a comprehensive overview of 28 sample subset selection methods developed within the chemometrics field. For the first time, we classify these methods into seven distinct categories based on their underlying principles: sampling-based, distance-based, clustering-inspired, experimental design-inspired, variable selection-inspired, outlier detection-inspired, and preprocessing-inspired approaches. We systematically discuss the principles, advantages, disadvantages, and typical applications of each method. This consolidation serves as a valuable resource for researchers, facilitating the informed selection of appropriate sample subset selection strategies prior to multivariate calibration or chemical pattern recognition tasks.