首页> 外文期刊>Independent Journal of Management & Production >A roadmap to determine the important factors of the house value: a case study by using actual price registration data of Taipei housing transactions
【24h】

A roadmap to determine the important factors of the house value: a case study by using actual price registration data of Taipei housing transactions

机译:确定房屋价值重要因素的路线图:以台北房屋交易的实际价格注册数据为例的案例研究

获取原文
           

摘要

While many studies have applied data mining techniques to judge housing prices, few have decoded the important attributes or prioritized them simultaneously. This paper aims to utilize five data mining techniques to discover the important attributes for three major types of real estate in Taipei city. The datasets, involving a total of 22,480 transactions, were publicly available from the Taiwan Actual Price Registration from July 2013 to August 2015. The five models are decision trees, random forests, model trees, artificial neural networks and multiple regression. The criteria used to measure the forecasting accuracy are MAPE, R2, RMSE, MAE and COR. The model with the best performance for all houses is the Model Tree with a MAPE value of 27.59. As for apartments, the best is Random Forests. Artificial Neural Networks perform best for suites and buildings with elevators. Different housing types need different models. Furthermore, the attributes importance helps us to conclude the really critical attributes, which include the floor area, administrative districts, parking area and land area, and their rankings. This variable ranking and selection procedure proposed by this research can also be adopted to improve the prediction efficiency for most big data applications other than the housing transactions.
机译:尽管许多研究已使用数据挖掘技术来判断房价,但很少有人对重要属性进行解码或同时对它们进行优先排序。本文旨在利用五种数据挖掘技术来发现台北市三种主要房地产类型的重要属性。这些数据集共计22,480笔交易,可从2013年7月至2015年8月从台湾实际价格登记处公开获得。这五个模型是决策树,随机森林,模型树,人工神经网络和多元回归。用于衡量预测准确性的标准是MAPE,R2,RMSE,MAE和COR。所有房屋中性能最佳的模型是MAPE值为27.59的模型树。至于公寓,最好的是随机森林。人工神经网络最适合带电梯的套房和建筑物。不同的房屋类型需要不同的型号。此外,重要性属性有助于我们得出真正关键的属性,包括建筑面积,行政区,停车区和土地面积及其排名。这项研究提出的变量排序和选择程序也可以用来提高房屋交易以外的大多数大数据应用的预测效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号