CHEN Gang, GUO Xiaomei. Oversampling Algorithm for Imbalanced Datasets Based on Time Series Model[J]. INFORMATION AND CONTROL, 2021, 50(5): 522-530. DOI: 10.13976/j.cnki.xk.2021.0551
Citation: CHEN Gang, GUO Xiaomei. Oversampling Algorithm for Imbalanced Datasets Based on Time Series Model[J]. INFORMATION AND CONTROL, 2021, 50(5): 522-530. DOI: 10.13976/j.cnki.xk.2021.0551

Oversampling Algorithm for Imbalanced Datasets Based on Time Series Model

  • This study proposes an oversampling algorithm based on a time series model to address the rebalancing problem of imbalanced data. First, a method of converting deterministic data into random data is proposed through which minority data are converted into time series. Second, a stationarity test is performed on time series transformed from the minority class, and stationary processing is carried out. Third, the stationary series is fitted to obtain a suitable time series model and forecast the minority class. In this way, the datasets are balanced. Lastly, six datasets are selected from UCI and KEEL repositories, and the proposed algorithm is compared with other common oversampling algorithms. A decision tree classifier is utilized to perform classification experiments. Evaluation indicators are used to examine the results of classification experiments. The results show the effectiveness of the proposed algorithm.
  • loading

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return