R：HTMCGLM软件包中多个响应回归模型的假设测试

论文标题

R：HTMCGLM软件包中多个响应回归模型的假设测试

Hypothesis tests for multiple responses regression models in R: The htmcglm Package

论文作者

de Freitas, Lineu Alberto Cavazani, Bonat, Wagner Hugo

论文摘要

本文介绍了用于执行有关多元协方差概括性线性模型（MCGLMS）的回归和分散参数的假设测试的R软件包HTMCGLM。 MCGLM为正常和非正常的多元数据分析以及广泛的相关结构提供了一般的统计建模框架。提出的软件包考虑了WALD统计数据以执行一般假设测试，并构建了量身定制的ANOVA，MANOVAS和多重比较测试。包装的目的是提供工具来改善回归和分散参数的解释。我们通过测试回归系数来评估协变量对响应变量的影响。同样，我们对分散系数进行测试，以评估研究单元之间的相关性。在数据观测值相互关联的情况下，例如纵向，时代序列，空间和重复测量研究。 HTMCGLM软件包提供了一个用户友好的接口，用于执行MANOVA，例如测试以及MCGLM类模型的多元假设测试。我们描述了包装实现，并通过分析两个数据集对其进行了说明。第一个涉及大豆产量的实验；该问题具有不同类型的三个响应变量（连续，计数和二项式）和三个解释性变量（水的量，受精和块）。第二个数据集解决了一个问题，其中响应是狩猎动物的纵向双变量计数。所使用的解释变量是动物的狩猎方法和性别。通过这些示例，我们能够说明几项测试，其中该提案被证明可用于评估回归和分散参数，包括依赖或独立观察的问题。

This article describes the R package htmcglm implemented for performing hypothesis tests on regression and dispersion parameters of multivariate covariance generalized linear models (McGLMs). McGLMs provide a general statistical modeling framework for normal and non-normal multivariate data analysis along with a wide range of correlation structures. The proposed package considers the Wald statistics to perform general hypothesis tests and build tailored ANOVAs, MANOVAs and multiple comparison tests. The goal of the package is to provide tools to improve the interpretation of regression and dispersion parameters. We assess the effects of the covariates on the response variables by testing the regression coefficients. Similarly, we perform tests on the dispersion coefficients in order to assess the correlation between study units. It could be of interest in situations where the data observations are correlated with each other, such as in longitudinal, times series, spatial and repeated measures studies. The htmcglm package provides a user friendly interface to perform MANOVA like tests as well as multivariate hypothesis tests for models of the mcglm class. We describe the package implementation and illustrate it through the analysis of two data sets. The first deals with an experiment on soybean yield; the problem has three response variables of different types (continuous, counting and binomial) and three explanatory variables (amount of water, fertilization and block). The second dataset addresses a problem where responses are longitudinal bivariate counts of hunting animals; the explanatory variables used are the hunting method and sex of the animal. With these examples we were able to illustrate several tests in which the proposal proves to be useful for the evaluation of regression and dispersion parameters both in problems with dependent or independent observations.

下载PDF全文

下载文献需遵守相关版权规定

论文标题