A Data-based Predictive Model for Distillation Column of Bio-based 2,3-Butanediol

Yeongryeol Choi, Hyundo Park, Jiwon Roh, Jongkoo Lim, In su Han, Duk Ki Kim, Sangjun Jeon, Hee Geun Nam, Il Moon, Hyungtae Cho, Junghwan Kim

Research output: Chapter in Book/Report/Conference proceedingChapter

1 Citation (Scopus)


In bio-based chemical processes, stable and efficient process control is challenging because the feedstock composition made by microbial fermentation is uncontrollable. Many researchers have developed data-driven models that predict the operating condition using a correlation between the process data. Since the performance of a data-driven model depends on the quality of the data, data preprocessing techniques such as noise removal and outlier detection are generally considered to enhance the model performance. Thus, research to establish an appropriate data preprocessing method and range is required to increase the performance of the model. In this study, we developed a data-based prediction model for the bio-based 2,3-butanediol distillation column. We prepared sixty train datasets by applying noise removal and outlier detection with different methods and ranges on the raw data and confirmed the correlation between the data using Pearson's correlation coefficient. We considered two kinds of feature selection depending on data preprocessing and compared the performance (R2 and normalized RMSE (NRMSE)) and reproducibility. The case in which the features were selected by only noise removal showed a similar coefficient value to that of the raw data; therefore, it had a higher performance than the case which applied outlier detection. The best performance provided R2 and NRMSE values of 0.962 and 0.045, respectively, and it could be enhanced by hyperparameter tuning. Based on this result, we plan to develop a robust predictive model that features adaptable real-time prediction and control throughout the entire process time and period.

Original languageEnglish
Title of host publicationComputer Aided Chemical Engineering
PublisherElsevier B.V.
Number of pages7
Publication statusPublished - 2021 Jan

Publication series

NameComputer Aided Chemical Engineering
ISSN (Print)1570-7946

Bibliographical note

Funding Information:
This study was conducted with the support of the Korea Institute of Industrial Technology as “Development of hybrid model and software to optimization of ash removal system in recovery boiler for power generation (EE-20-0014)” and “Development of AI Platform Technology for Smart Chemical Process (JH-21-0005)”.

Publisher Copyright:
© 2021 Elsevier B.V.

All Science Journal Classification (ASJC) codes

  • Chemical Engineering(all)
  • Computer Science Applications


Dive into the research topics of 'A Data-based Predictive Model for Distillation Column of Bio-based 2,3-Butanediol'. Together they form a unique fingerprint.

Cite this