Skip to main navigation Skip to search Skip to main content

기계학습을 이용한 한지의 특성 예측 모델링 (Part 2) - 랜덤 포레스트를 이용한 카보닐기 함량 예측과 변수 중요도 분석 -

Translated title of the contribution: Predictive Modeling of Korean Traditional Paper Characteristics Using Machine Learning Approaches (Part 2): Prediction of Carbonyl Content and Analysis of Variable Importance Using Random Forest
  • Kang Jae Kim*
  • , Jin Ho Kim
  • , Geunyong Park
  • , Myung Joon Jeong
  • *Corresponding author for this work
  • Kyungpook National University

Research output: Contribution to journalJournal articlepeer-review

Abstract

This paper introduces a random forest regression model trained with infrared spectral data to predict the carbonyl content of Hanji, a traditional Korean paper. The random forest model demonstrated excellent performance in carbonyl content prediction, surpassing the results obtained from the partial least squares model. To optimize the infrared spectral range for prediction, the spectral range was restricted from the entire range of 4000-400 cm-1to the narrower range of 1800-1200 cm-1, known for its suitability in characterizing paper properties. This limitation enhanced the coefficients of determination of the model, increasing it from 0.921 to 0.937. A permutation variable importance measure was then applied to identify the key spectral regions contributing to carbonyl content prediction. The analysis pinpointed the 1650-1350 cm-1range as a crucial region for accurate predictions. Subsequently, a new prediction model was built using data exclusively from this important region, yielding remarkably improved coefficients of determination of 0.960 and 0.965 for the raw and second derivative spectra, respectively. These findings affirm the validity and significance of the critical region identified by the permutation variable importance measure. The predictive performance of the established models is valid within the range of 7.2 to 29.4 μmol/g of carbonyl content in the training set.

Translated title of the contributionPredictive Modeling of Korean Traditional Paper Characteristics Using Machine Learning Approaches (Part 2): Prediction of Carbonyl Content and Analysis of Variable Importance Using Random Forest
Original languageKorean
Pages (from-to)13-23
Number of pages11
JournalPalpu Chongi Gisul/Journal of Korea Technical Association of the Pulp and Paper Industry
Volume55
Issue number5
DOIs
StatePublished - 2023

Keywords

  • Carbonyl content
  • Hanji
  • infrared spectroscopy
  • machine learning
  • permutation importance
  • random forest
  • regression

Quacquarelli Symonds(QS) Subject Topics

  • Materials Science
  • Engineering - Electrical & Electronic
  • Chemistry

Fingerprint

Dive into the research topics of 'Predictive Modeling of Korean Traditional Paper Characteristics Using Machine Learning Approaches (Part 2): Prediction of Carbonyl Content and Analysis of Variable Importance Using Random Forest'. Together they form a unique fingerprint.

Cite this