A Basic Oxygen Furnace (BOF) for steel making is a complex industrial process that is difficult to monitor due to the harsh environment, so the collected production data is very limited given the process complexity. Also, such production data has a low degree of variability. An accurate machine learning (ML) model for predicting production outcome requires both large and varied data, so utilising data from multiple BOFs will allow for more capable ML models, since both the amount and variability of data increases. Data collection setups for different BOFs are different, such that data sets are not compatible to directly join for ML training. Our approach is to let a neural network benefit from these collection differences in a joint training model. We present a neural network-based approach that simultaneously and jointly co-trains on several data sets. Our novelty is that the first network layer finds an internal representation of each individual BOF, while the other layers use this representation to concurrently learn a common BOF model. Our evaluation shows that the prediction accuracy of the common model increases compared to separate models trained on individual furnaces’ data sets. It is clear that multiple data sets can be utilised this way to increase model accuracy for better production prediction performance. For the industry, this means that the amount of available data for model training increases and thereby more capable ML models can be trained when having access to multiple data sets describing the same or similar manufacturing processes.
© 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.