Doctoral Dissertations

Date of Award

8-1998

Degree Type

Dissertation

Degree Name

Doctor of Philosophy

Major

Management Science

Major Professor

Hamparsum Bozdogan

Committee Members

Mary Leitnaker, Kenneth Gilbert, John Neter

Abstract

A new approach to statistical modeling is studied which integrates the information-based statistical criteria into Ivakhnenko's (1966) polynomial theory, also known as the Group Method of Data Handling (GMDH). The GMDH algorithm was introduced as a hierarchical algorithm which constructs a mathematical model of a complex system by the composition of lower-order polynomials of the form: Z = A + Bx + Cy + Dx2 + Ey2 + Fxy. Under the assumptions that there exists a random error component additive with the above deterministic component, the polynomials generated at each level of the GMDH algorithm are statistically modeled as regression-type equations of the following form: y = β0 + β1x1 + β2x2 + β3x21 + β4x22 + β5x1x2 + ε.

In this research, the assumption of independence of error terms is relaxed by the derivation of information-based model selection criteria for the GMDH algorithm. These criteria inherently guard against multicollinearity by incorporating a measure of the complexity of each model being evaluated. In addition to deriving the information-based criteria for the GMDH, the consistency of the criteria when used within the GMDH algorithm is established. While consistency is a desirable asymptotic property for model evaluation, it does not, by itself, guarantee that the criteria are effective in terms of model selection. Thus, Monte Carlo experimentation is used to study the aptness (i.e., the ability of the model to describe the system of interest) of the model identified by the GMDH algorithm using information-based criteria.

At the end of the GMDH algorithm, the "best approximating" model is identified. In generations two and beyond, the models are written in terms of the previous generation's polynomials. Hence, the chosen model is typically not written in terms of the original set of predictor variables. Even with the original GMDH algorithm, existing computer programs do not provide a means of backtracing (i.e., back-substituting) to the original variables. The revised algorithm (using the information-based model evaluation criteria) is coded in MATLAB®. A backtracing algorithm is introduced and designed for the GMDH hierarchical tree-algorithm to determine the "best" fitting model in terms of the original variables using a symbolic mathematical language by borrowing on the rules of algebraic manipulation.

Files over 3MB may be slow to open. For best results, right-click and select "save as..."

Share

COinS