线性回归模型的多离群点检测方法及节能应用

Multiple Outlier Detection Method for Linear Regression Model and Its Energy Conservation Application

  • 摘要: 办公设备能耗可使用线性回归模型进行描述,为检测该类能耗数据的异常点,提出了基于最小截平方和估计的单链接层次聚类多离群点检测算法.首先将该方法在不同类型的典型数据集中进行验证,证明算法具有优良的性能.而后将算法应用于办公设备能耗数据集,通过实验表明算法处理淹没问题及掩盖问题的能力优于其它算法.其不但能够正确确定异常能耗数据,还能够直观地给出各个异常能耗数据的离群程度,管理者可据此制定正确合理的能源管理方案,最终达到节能的目的.

     

    Abstract: The energy consumption of the office equipment can be described by linear regression model. For the model, a multiple outlier detection algorithm based on single-link hierarchical clustering and LTS (least trimmed squares) estimator is proposed. This method is validated in different types of typical data sets. And the results prove that it has excellent performance. Then it is applied to office equipment energy consumption data sets. The experiments show that it has better ability to deal with the masking and swamping problems than other algorithms. Also, the method can not only correctly identify the outliers but also provide abnormal degree of outliers. The managers can develop the reasonable energy management solutions and achieve the purpose of energy saving.

     

/

返回文章
返回