Prediction of distillation column temperature using machine learning and data preprocessing

Yechan Lee, Yeongryeol Choi, Hyungtae Cho, Junghwan Kim

Research output: Contribution to journalArticlepeer-review


A distillation column, which is a main facility of the chemical process, separates the desired product from a mixture by using the difference of boiling points. The distillation process requires the optimization and the prediction of operation because it consumes much energy. The target process of this study is difficult to operate efficiently because the composition of feed flow is not steady according to the supplier. To deal with this problem, we could develop a data-driven model to predict operating conditions. However, data preprocessing is essential to improve the predictive performance of the model because the raw data contains outlier and noise. In this study, after optimizing the predictive model based long-short term memory (LSTM) and Random forest (RF), we used a low-pass filter and one-class support vector machine for data preprocessing and compared predictive performance according to the method and range of the preprocessing. The performance of the predictive model and the effect of the preprocessing is compared by using R2and RMSE. In the case of LSTM, R2increased from 0.791 to 0.977 by 23.5%, and RMSE decreased from 0.132 to 0.029 by 78.0%. In the case of RF, R2increased from 0.767 to 0.938 by 22.3%, and RMSE decreased from 0.140 to 0.050 by 64.3%.

Original languageEnglish
Pages (from-to)191-199
Number of pages9
JournalKorean Chemical Engineering Research
Issue number2
Publication statusPublished - 2021 May

Bibliographical note

Publisher Copyright:
© 2021 Korean Institute of Chemical Engineers. All rights reserved.

All Science Journal Classification (ASJC) codes

  • Chemical Engineering(all)


Dive into the research topics of 'Prediction of distillation column temperature using machine learning and data preprocessing'. Together they form a unique fingerprint.

Cite this