Kim, J., Obregon, J., Park, H., & Jung, J. Y. (2024). Multi-step photovoltaic power forecasting using transformer and recurrent neural networks. Renewable and Sustainable Energy Reviews, 200.
Affordable and clean energy is an important UN sustainable development goal. Solar energy is more difficult to control than fossil fuels, highlighting the need for accurate solar power forecasts. This study develops three variants of the transformer networks, called PVTransNet, for a multi-step day-ahead photovoltaic power forecasting. The transformer networks use historical solar power generation, weather observation, weather forecast and solar geometry data as input to effectively predict next-day hourly power generation. In particular, the third variant model combines long short-term memory (LSTM) to transformer networks to supplement weather forecasts from the weather station. The combined model, PVTransNet-EDR, outperformed individual LSTM and other transformer models in the experiments conducted on data from two photovoltaic power plants. The model performed 48.3 % better, in mean absolute error, than simple LSTM in the power forecasting. Accurate solar power forecasting model is expected to be utilized for efficient energy storage and microgrid management, effective energy supply policy, and optimal plant location selection.
Obregon, J., & Jung, J. Y. (2024). Rule-based visualization of faulty process conditions in the die-casting manufacturing. Journal of Intelligent Manufacturing, 35(2), 521–537.
Die-casting is a popular manufacturing process that produces precise metal parts with excellent dimensional accuracy and smooth cast surfaces. Recently die-casting process condition data can be acquired to be used as input for machine learning techniques for fault detection. The rapid development of complex and accurate machine learning algorithms, such as tree ensembles and deep learning, allows the accurate detection of faulty products. However, interpreting and explaining black-box models is crucial in the die-casting industry because the predictions provided by the machine learning solution can be adopted in practice only after understanding the internal decision mechanism of the model. To solve this problem, rule extraction methods generate simple rule-based predictive models from complex tree ensembles. Nevertheless, rulesets may contain numerous complex rules with redundant conditions, and the standard structure of rulesets does not clearly show the hierarchical relationships and frequent interactions among their elements. For this reason, in this study, a visualization tool based on formal concept analysis, called RuleLat (Rule Lattice), is proposed, which generates simple visual representations of rule-based classifiers. The generated models depict the hierarchical relationships of interactions among conditions, rules, and predicted classes in a modified concept lattice that is easy to analyze and understand. To demonstrate the applicability of the proposed method, a case study using real-world manufacturing data collected from a die-casting company in Korea is presented. RuleLat is adopted as a tool for interpretable machine learning, and the process conditions of three types of defects (porosity, material, and imprint) are analyzed and discussed.
Obregon, J., Han, Y. R., Ho, C. W., Mouraliraman, D., Lee, C. W., & Jung, J. Y. (2023). Convolutional autoencoder-based SOH estimation of lithium-ion batteries using electrochemical impedance spectroscopy. Journal of Energy Storage, 60.
The advancement of consumer electronics and electric vehicles requires heavy use of energy sources, particularly in the form of rechargeable batteries. Although lithium-ion batteries (LiBs) enable the use of such technologies owing to their high energy and power densities, estimating the state-of-health (SOH) of such batteries remains a challenge because of the various environmental operational conditions that affect the charging and discharging cycles of LiBs. In this study, we explore an approach that uses a convolutional autoencoder (CAE) for overcomplete feature extraction from electrochemical impedance spectroscopy (EIS) data. Subsequently, the extracted latent data representation is fed into a deep neural network (DNN) for battery capacity retention and SOH estimation. The proposed end-to-end deep learning-based architecture is called CAE-DNN. To prove the effectiveness of the proposed architecture, we conducted a series of experiments using a public dataset involving EIS spectra collected from fully charged LiBs cycled at different temperatures. The experimental results were compared with those of existing state-of-the-art methods, and with other classic machine learning methods. The results demonstrate that the proposed architecture extracts useful features in an unsupervised manner and estimates the SOH of LiBs more accurately than other baseline estimation methods.
Obregon, J., & Jung, J. Y. (2023). RuleCOSI+: Rule extraction for interpreting classification tree ensembles. Information Fusion, 89, 355–381.
Despite the advent of novel neural network architectures, tree-based ensemble algorithms such as random forests and gradient boosting machines still prevail in many practical machine learning problems in manufacturing, financial, and medical domains. However, tree ensembles have the limitation that the internal decision mechanisms of complex models are difficult to understand. Therefore, we present a post-hoc interpretation approach for classification tree ensembles. The proposed method, RuleCOSI+, extracts simple rules from tree ensembles by greedily combining and simplifying their base trees. Compared with its previous version, RuleCOSI, this new version can be applied to both bagging (e.g., random forest, RF) and boosting ensembles (e.g., gradient boosting machines, GBM) and run much faster for ensembles with hundreds of trees. To assess the performance and applicability of the method, empirical experiments were conducted using two bagging algorithms and four gradient boosting algorithms over 33 datasets. RuleCOSI+ could generate the best classification rulesets in terms of F-measure together with RuleFit for RF and GBM models of the datasets among five ensemble simplification algorithms, but the rulesets of RuleCOSI+ had, on average, less than half the size of those of RuleFit. Moreover, RuleCOSI+ had the best antecedent uniqueness rate (“UNIQ”) among the five algorithms, and had also ranked high in the number of rules (“NRULES”) and the rule reduction rate (“REDU”). In addition, the proposed method could reduce generalization errors in the simplified rulesets to obtain, on average, slightly better classification errors than original models of two bagging and three gradient boosting algorithms except CATBoost.
Obregon, J., Hong, J., & Jung, J.-Y. (2021). Rule-based explanations based on ensemble machine learning for detecting sink mark defects in the injection moulding process. Journal of Manufacturing Systems, 60, 392–405.
Manufacturing quality control (QC) in plastic injection moulding is of the upmost importance since almost one third of plastic products are manufactured via the injection moulding process. Moreover, smart manufacturing technologies are enabling the generation of huge amounts of data in production lines. This data can be used for predicting the quality of manufactured plastic products using machine learning methods, allowing companies to save costs and improve their production efficiency. However, high-performance machine learning models are usually too complicated to be understood by human intuition. Therefore, we have introduced a rule-based explanations (RBE) framework that combines several machine learning interpretation methods to help to understand the decision mechanisms of accurate and complex predictive models – specifically tree ensemble models. These generated rules can be used to visually and easily understand the main factors that affect the quality in the manufacturing process. To demonstrate the applicability of RBE, we present two experiments with real industrial data gathered from a plastic injection moulding machine in a Singapore model factory. The collected datasets contain condition data for several manufacturing processes as well as the QC results for sink mark defects in the production of small plastic products. The experiments revealed that it is possible to extract meaningful explanations in the form of simple decision rules that are enhanced with partial dependence plots and feature importance rankings for a better understanding of the underlying mechanisms and data relationships of accurate tree ensembles.
Obregon, J., Kim, A., & Jung, J.-Y. (2019). RuleCOSI: Combination and simplification of production rules from boosted decision trees for imbalanced classification. Expert Systems with Applications, 126.
Obregon, J., Song, M., & Jung, J. Y. (2019). InfoFlow: Mining Information Flow Based on User Community in Social Networking Services. IEEE Access, 7, 48024–48036.
Online social networking services (SNSs) have emerged rapidly and have become huge data sources for social network analysis. The spread of the content generated by users is crucial in SNS, but there is only a handful of research works on information diffusion and, more precisely, information diffusion flow. In this paper, we propose a novel method to discover information diffusion processes from SNS data. The method starts preprocessing the SNS data using a user-centric algorithm of community detection based on modularity maximization with the purpose of reducing the complexity of the noisy data. After that, the InfoFlow miner generates information diffusion flow models among the user communities discovered from the data. The algorithm is an extension of a traditional process discovery technique called the Flexible Heuristics miner, but the visualization ability of the generated process model is improved with a new measure called response weight, which effectively captures and represents the interactions among communities. An experiment with Facebook data was conducted, and information flow among user communities was visualized. Additionally, a quality assessment of the models was carried out to demonstrate the effectiveness of the method. The final constructed models allowed us to identify useful information such as how the information flows between communities and information disseminators and receptors within communities.
Kim, K., Obregon, J., & Jung, J.-Y. (2014). Analyzing information flow and context for Facebook fan pages. IEICE Transactions on Information and Systems, E97-D(4).
Kim, M., Han, Y.-S., Obregon, J., & Jung, J.-Y. (2024). Discovering Dispatching Rules in a Semiconductor Fab Using Interpretable Machine Learning. International Conference on Flexible Automation and Intelligent Manufacturing, 91–97. https://link.springer.com/chapter/10.1007/978-3-031-74482-2_11
Recent studies have been conducted in the application of machine learning (ML)-based dispatching methods. Unfortunately, the internal dispatching behavior of such ML-based models is difficult to interpret. Therefore, this study transforms the ML-based model to a...
Smedt, J. D., Broucke, S. K. L. M. V., Obregon, J., Kim, A., Jung, J.-Y., & Vanthienen, J. (2017). Decision mining in a broader context: An overview of the current landscape and future directions. Business Process Management Workshops, 281.
Kim, A., Obregon, J., & Jung, J.-Y. (2014). Constructing decision trees from process logs for performer recommendation. Business Process Management Workshops, 171 171 LN.
Obregon, J., Kim, A., & Jung, J.-Y. (2013). DTminer: A tool for decision making based on historical process data. Asia Pacific Business Process Management, 159.
Obregon, J., & Jung, J. Y. (2022). Explanation of ensemble models. In Human-Centered Artificial Intelligence: Research and Applications (pp. 51–72). Academic Press; .
Ensemble learning is a type of machine learning, typically supervised learning, that combines the decisions of multiple individual models to improve the classification or regression accuracy. Since their introduction 2 decades ago, ensemble models have been widely used not only in academia but also in practical applications, particularly in data science competitions such as Kaggle because they excel with tabular and structured data. However, understanding the decision mechanisms of such large models is a challenge. In the explainable artificial intelligence (XAI) literature, several studies have attempted to make ensemble models more transparent. Although deep learning has recently attracted significant attention in XAI research, it is still important to introduce methods that help explain ensemble models. This chapter introduces the problem of explaining the ensemble models in detail. Two of the most popular ensemble approaches, bagging and boosting, are first introduced, and then the main factors that make ensemble models difficult to explain are discussed. Later, a taxonomy of ensemble interpretation methods and some representative techniques for each category is also presented.