TY - JOUR
T1 - Spatial hedonic modelling adjusted for preferential sampling
AU - Paci, Lucia
AU - Gelfand, Alan E.
AU - Beamonte, María Asunción
AU - Gargallo, Pilar
AU - Salvador, Manuel
PY - 2020
Y1 - 2020
N2 - Hedonic models are widely used to predict selling prices of properties. Originally, they were proposed as simple spatial regressions, i.e. a spatially referenced response regressed on spatially referenced predictors. Subsequently, spatial random effects were introduced to serve as surrogates for unmeasured or unobservable predictors and were shown to provide better out-of-sample prediction. However, what has been ignored in the literature is the fact that the locations (and times) of the sales are random and, in fact, are an observation of a random point pattern. Here, we first consider whether there is stochastic dependence between the point pattern of locations and the set of responses. If so, a second question is whether incorporating a log-intensity for the point pattern of locations in the hedonic modelling enables improvement in the prediction of selling price. We connect this problem to what is referred to as preferential sampling. Through model comparison we illuminate the role of the point pattern data in the prediction of selling price. Using two different years of property sales from Zaragoza, Spain, we employ both the full database as well as an intentionally biased subset to elaborate this story.
AB - Hedonic models are widely used to predict selling prices of properties. Originally, they were proposed as simple spatial regressions, i.e. a spatially referenced response regressed on spatially referenced predictors. Subsequently, spatial random effects were introduced to serve as surrogates for unmeasured or unobservable predictors and were shown to provide better out-of-sample prediction. However, what has been ignored in the literature is the fact that the locations (and times) of the sales are random and, in fact, are an observation of a random point pattern. Here, we first consider whether there is stochastic dependence between the point pattern of locations and the set of responses. If so, a second question is whether incorporating a log-intensity for the point pattern of locations in the hedonic modelling enables improvement in the prediction of selling price. We connect this problem to what is referred to as preferential sampling. Through model comparison we illuminate the role of the point pattern data in the prediction of selling price. Using two different years of property sales from Zaragoza, Spain, we employ both the full database as well as an intentionally biased subset to elaborate this story.
KW - Bayesian inference
KW - Log-Gaussian Cox process
KW - Markov chain Monte Carlo sampling
KW - Nearest neighbour Gaussian process
KW - Real estate transactions
KW - Shared process models
KW - Bayesian inference
KW - Log-Gaussian Cox process
KW - Markov chain Monte Carlo sampling
KW - Nearest neighbour Gaussian process
KW - Real estate transactions
KW - Shared process models
UR - http://hdl.handle.net/10807/146654
UR - http://onlinelibrary.wiley.com/journal/10.1111/(issn)1467-985x
U2 - 10.1111/rssa.12489
DO - 10.1111/rssa.12489
M3 - Article
SN - 0964-1998
VL - 183
SP - 169
EP - 192
JO - Journal of the Royal Statistical Society. Series A: Statistics in Society
JF - Journal of the Royal Statistical Society. Series A: Statistics in Society
ER -