Publication: Evaluating A New Adaptive Group Lasso Imputation Technique For Handling Missing Values In Compositional Data
No Thumbnail Available
Date
2024-08
Authors
Tian, Ying
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
Pie chart is a widely used statistical chart to represent the proportions of
various components in a certain entity. The shares of data in a pie chart, also known
as compositional data, consist of non-negative values, containing only relative
information. However, in many real-life domains, a substantial amount of missing
values is often collected. The complexity of compositional data with missing values
renders traditional estimation methods inadequate. In this thesis, a compositional
data imputation method designed based on LASSO is proposed combining group
LASSO and adaptive LASSO analysis methods. The estimation effects of highdimensional
and low-dimensional compositional data with missing values are
compared through simulation studies and case analyses under different missing rates,
dimensions, and correlation coefficients. Considering the impact of outliers on the
accuracy of estimation, both simulation and case analysis are conducted to compare
the proposed algorithm against four existing methods. The experimental results
demonstrate that the proposed adaptive group LASSO method produces a better
imputation performance, MSE, MADE, RMSE and NRMSE increased by up to
26.6% at selected missing rates. Future work analyses the effect of imputation under
continuous missing rates, MAR missing mechanism and more model evaluation
criteria.
Description
Keywords
Lasso Imputation , Compositional Data