The Modified Group Method of Data Handling Adaptation for Constructing a Multivariate Regression Given by a Redundant Representation with a Significant Impact of a Random Factor

PDF (1340KB), PP.115-124

Views: 0 Downloads: 0

Author(s)

Alexander Pavlov 1,* Kateryna Lishchuk 1 Maxim Holovchenko 1 Mykyta Kyselov 1 Cennuo Hu 2

1. National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute”, Kyiv 03056, Ukraine

2. Department of Computer Science, College of Science, Purdue University, West Lafayette, IN 47907, USA

* Corresponding author.

DOI: https://doi.org/10.5815/ijem.2026.03.08

Received: 30 Mar. 2026 / Revised: 22 Apr. 2026 / Accepted: 15 May 2026 / Published: 8 Jun. 2026

Index Terms

Group Method of Data Handling, Multivariate Regression, Synthetic Method, Decomposition Method, Regularity Criterion, Random Factor, Regression Model

Abstract

A Modified Group Method of Data Handling (MGMDH) is a component of a synthetic method of constructing multivariate polynomial regression given by a redundant representation. The MGMDH is used to construct multivariate linear regression given by a redundant representation in the case when a decomposition method, which is also a component of the synthetic method, allowed to estimate with a given accuracy the values of unknown coefficients for nonlinear terms of a multivariate polynomial regression. As statistical studies have shown, the MGMDH efficiently finds the correct structure of a multivariate linear regression when the realizations of a random factor in the tests are an order of magnitude smaller than the modules of the corresponding values of the regression to be determined. Only in this case, the use of the regularity criterion in the MGMDH almost always allows finding the correct structure of a multivariate linear regression given by a redundant representation. In this paper, the MGMDH is adapted for the case when during the tests the modules of the random factor implementations and the values of the regression to be determined take values of the same order, which significantly increases the efficiency of using the MGMDH for constructing multivariate linear regressions given by redundant representations. It is obvious that the adapted MGMDH for multivariate linear regressions given by redundant representations presented in this work is easily transformed using the standardization operation for the general problem of constructing multivariate regression given by a redundant representation in the case when the unknown coefficients are linear in the regression structure.

Cite This Paper

Alexander Pavlov, Kateryna Lishchuk, Maxim Holovchenko, Mykyta Kyselov, Cennuo Hu, "The Modified Group Method of Data Handling Adaptation for Constructing a Multivariate Regression Given by a Redundant Representation with a Significant Impact of a Random Factor", International Journal of Engineering and Manufacturing (IJEM), Vol.16, No.3, pp.115-124, 2026. DOI:10.5815/ijem.2026.03.08

Reference

[1]N. Roustaei, “Application and interpretation of linear-regression analysis,” Medical Hypothesis Discovery & Innovation in Ophthalmology, vol. 13, no. 3, pp. 151–159, Oct. 2024, doi: https://doi.org/10.51329/mehdiophthal1506.
[2]J. Liu et al., “A new time-varying coefficient regression approach for analyzing infectious disease data,” Scientific Reports, vol. 13, no. 1, Sep. 2023, doi: https://doi.org/10.1038/s41598-023-41551-1.
[3]C. Gao, “Robust regression via mutivariate regression depth,” Bernoulli, vol. 26, no. 2, pp. 1139–1170, May 2020, doi: https://doi.org/10.3150/19-bej1144.
[4]P. Haghighat, D. Gándara, L. Kang, and H. Anahideh, “Fair Multivariate Adaptive Regression Splines for Ensuring Equity and Transparency,” Proceedings of the AAAI Conference on Artificial Intelligence, vol. 38, no. 20, pp. 22076–22086, Mar. 2024, doi: https://doi.org/10.1609/aaai.v38i20.30211.
[5]Z. Zhou, J. Ji, Y. Wang, Z. Zhu, and J. Chen, “Hybrid regression model via multivariate adaptive regression spline and online sequential extreme learning machine and its application in vision servo system,” International Journal of Advanced Robotic Systems, vol. 19, no. 3, May 2022, doi: https://doi.org/10.1177/17298806221108603.
[6]A. W. Omer, Asst. Prof. Dr. Bekhal S. Sedeeq, and Ali, “A proposed hybrid method for Multivariate Linear Regression Model and Multivariate Wavelets (Simulation study),” Mitanni Journal of Humanitarian Sciences, vol. 5, no. 1, pp. 112–124, 2024, doi: https://doi.org/10.25156/ptjhss.v5n1y2024.pp112-124.
[7]A. T. Abdulrahman and N. S. Alshammari, “Factor analysis and regression analysis to find out the influencing factors that led to the countries’ debt crisis,” Advances and Applications in Statistics, vol. 78, pp. 1–16, Jun. 2022, doi: https://doi.org/10.17654/0972361722047.
[8]P. Hünermund and B. Louw, “On the nuisance of control variables in causal regression analysis,” Organizational Research Methods, vol. 28, no. 1, Dec. 2023, doi: https://doi.org/10.1177/10944281231219274.
[9]S. Szentpéteri and B. C. Csáji, “Finite-Sample Identification of Linear Regression Models With Residual-Permuted Sums,” IEEE Control Systems Letters, vol. 8, pp. 1523–1528, 2024, doi: https://doi.org/10.1109/lcsys.2024.3412856.
[10]B. Nie et al., “A novel regression method: Partial least distance square regression methodology,” Chemometrics and Intelligent Laboratory Systems, vol. 237, p. 104827, Apr. 2023, doi: https://doi.org/10.1016/j.chemolab.2023.104827.
[11]A. Pavlov, M. Holovchenko, and V. Drozd, “An adaptive method for building a multivariate regression,” Bulletin of National Technical University “KhPI”. Series: System Analysis, Control and Information Technologies, no. 1 (11), pp. 3–8, Jul. 2024, doi: https://doi.org/10.20998/2079-0023.2024.01.01.
[12]A. Pavlov and A. Kushch, “Zadacha liniinoi rehresii z liniino zalezhnymy koefitsiientamy,” [Formulation of a linear regression problem with linearly dependent coefficients], Adaptive Systems of Automatic Control, vol. 2, no. 47, pp. 147–157, Sep. 2025, doi: https://doi.org/10.20535/1560-8956.47.2025.340201.
[13]R. Ragunath and R. Rathipriya, “Forecasting Agriculture Commodity Price Trend using Novel Competitive Ensemble Regression Model,” International Journal of Information Technology and Computer Science (IJITCS), vol. 17, no. 3, pp. 97–105, Jun. 2025, doi: https://doi.org/10.5815/ijitcs.2025.03.07.
[14]D. K. Singh and N. Rawat, “Decoding Optimization Algorithms for Convolutional Neural Networks in Time Series Regression Tasks,” International Journal of Information Technology and Computer Science (IJITCS), vol. 15, no. 6, pp. 37–49, Dec. 2023, doi: https://doi.org/10.5815/ijitcs.2023.06.04.
[15]A. Mondal, R. T. Goswami, and S. Sen, “A New Framework of Honeypots Network Security Using Linear Regression Decision Algorithm,” International Journal of Wireless and Microwave Technologies (IJWMT), vol. 13, no. 6, pp. 23–31, Nov. 2023, doi: https://doi.org/10.5815/ijwmt.2023.06.03.
[16]M. Ye, “Birth Rate Study of Henan Province Based on Ridge Regression Model,” International Journal of Mathematical Sciences and Computing (IJMSC), vol. 9, no. 4, pp. 10–19, Nov. 2023, doi: https://doi.org/10.5815/ijmsc.2023.04.02.
[17]A. W. Wambua and G. M. Wambugu, “A Comparative Analysis of Bat and Genetic Algorithms for Test Case Prioritization in Regression Testing,” International Journal of Intelligent Systems and Applications (IJISA), vol. 15, no. 1, pp. 13–21, Feb. 2023, doi: https://doi.org/10.5815/ijisa.2023.01.02.
[18]M.-H. Guo, X. Lü, and Y.-X. Jin, “Extraction and reconstruction of variable-coefficient governing equations using Res-KAN integrating sparse regression,” Physica D: Nonlinear Phenomena, vol. 481, p. 134689, Nov. 2025, doi: https://doi.org/10.1016/j.physd.2025.134689.
[19]Moritz Flaschel et al., “Automated discovery of interpretable hyperelastic material models for human brain tissue with EUCLID,” Journal of the Mechanics and Physics of Solids, vol. 180, p. 105404, Aug. 2023, doi: https://doi.org/10.1016/j.jmps.2023.105404.
[20]M. Mehrad, A. Ramezanzadeh, Mahdi Bajolvand, and Mohammad Reza Hajsaeedi, “Estimating shear wave velocity in carbonate reservoirs from petrophysical logs using intelligent algorithms,” Journal of Petroleum Science and Engineering, vol. 212, p. 110254, May 2022, doi: https://doi.org/10.1016/j.petrol.2022.110254.
[21]P. Su, G. Tarr, S. Muller, and S. Wang, “CR-Lasso: Robust cellwise regularized sparse regression,” Computational Statistics & Data Analysis, vol. 197, p. 107971, Sep. 2024, doi: https://doi.org/10.1016/j.csda.2024.107971.
[22]M. Huang, H. Chen, Y. Mi, C. Luo, S.-J. Horng, and T. Li, “Joint sparse latent representation learning and dual manifold regularization for unsupervised feature selection,” Knowledge-Based Systems, vol. 282, p. 111105, Dec. 2023, doi: https://doi.org/10.1016/j.knosys.2023.111105.
[23]A. Pavlov, M. Holovchenko, I. Mukha, K. Lishchuk, and V. Drozd, “A Modified Method and an Architecture of a Software for a Multivariate Polynomial Regression Building Based on the Results of a Conditional Active Experiment,” Lecture notes on data engineering and communications technologies, pp. 207–222, Jan. 2023, doi: https://doi.org/10.1007/978-3-031-36118-0_19.
[24]A. Pavlov, M. Holovchenko, and V. Drozd, “Efficiency substantiation for a synthetical method of constructing a multivariate polynomial regression given by a redundant representation,” Bulletin of National Technical University KhPI Series System Analysis Control and Information Technologies, no. 1 (9), pp. 3–9, Jul. 2023, doi: https://doi.org/10.20998/2079-0023.2023.01.01.
[25]A. Pavlov and A. Kushch, “Algorithms for constructing a regression linear with respect to unknown coefficients on a limited amount of experimental data,” Bulletin of National Technical University KhPI Series System Analysis Control and Information Technologies, no. 2 (14), pp. 8–15, Dec. 2025, doi: https://doi.org/10.20998/2079-0023.2025.02.02.