论文标题
美国家庭食品支出的异质性:纵向分位数回归的证据
Heterogeneity in Food Expenditure amongst US families: Evidence from Longitudinal Quantile Regression
论文作者
论文摘要
关于食品支出的经验研究主要基于横截面数据,对于基于纵向(或面板)数据的一些研究,重点一直放在条件平均值上。虽然前者无法通过跨时间的观测值对依赖关系进行建模,但后者无法查看粮食支出与协变量(例如收入,教育等)之间在下层(或上层)分位数之间的关系,这是对决策者感兴趣的。本文使用纵向数据的平均回归和分位数回归模型分析了总食品(TF),家里的食物(FAH)和外国食物(FAFH)的支出,以研究经济衰退和各种人口统计学,社会经济和地理因素的影响。数据取自《收入动力学》小组研究(PSID),由2001年至2015年之间观察到的2174个家庭(美国)组成。结果表明,头部的年龄和教育,家庭收入,女性领导家庭,婚姻状况和经济衰退是所有三种粮食支出的重要决定因素。配偶教育,家庭规模和某些区域指标对于TF和FAH的支出很重要,但对于FAFH而言并不重要。分位数分析揭示了所有类型的食物支出的协变量效应中的相当异质性,而这些模型无法捕获有条件平均值的模型。该研究结束时表明,对同一家庭单位的跨时间进行观察之间的有条件依赖性对于减少/避免异质性偏差和更好的模型拟合至关重要。
Empirical studies on food expenditure are largely based on cross-section data and for a few studies based on longitudinal (or panel) data the focus has been on the conditional mean. While the former, by construction, cannot model the dependencies between observations across time, the latter cannot look at the relationship between food expenditure and covariates (such as income, education, etc.) at lower (or upper) quantiles, which are of interest to policymakers. This paper analyzes expenditures on total food (TF), food at home (FAH), and food away from home (FAFH) using mean regression and quantile regression models for longitudinal data to examine the impact of economic recession and various demographic, socioeconomic, and geographic factors. The data is taken from the Panel Study of Income Dynamics (PSID) and comprises of 2174 families in the United States (US) observed between 2001-2015. Results indicate that age and education of the head, family income, female headed family, marital status, and economic recession are important determinants for all three types of food expenditure. Spouse education, family size, and some regional indicators are important for expenditures on TF and FAH, but not for FAFH. Quantile analysis reveals considerable heterogeneity in the covariate effects for all types of food expenditure, which cannot be captured by models focused on conditional mean. The study ends by showing that modeling conditional dependence between observations across time for the same family unit is crucial to reducing/avoiding heterogeneity bias and better model fitting.