Keerthi Sree Marrapu - AI Product Engineer

1. The Business Challenge: The High Cost of Impurity

In steel manufacturing, controlling the silicon (Si) impurity in hot metal is critical. High silicon content leads to significant operational inefficiencies, including increased consumption of expensive resources like oxygen and lime, and higher production costs.

The goal for the IISCo plant was to move from reactive adjustments to proactive control. I was tasked with developing a predictive model to forecast silicon content, enabling the plant to optimize processes and reduce costs.

A diagram of a blast furnace used in steel making. — The complex environment of a blast furnace.

2. Understanding the Domain: The Metallurgy Behind the Model

Before diving into the data, the first step was to understand the complex chemical and physical processes inside a blast furnace. My research focused on answering a critical question: why is low silicon a priority? I found that high silicon directly increases operational costs, because it requires more oxygen and lime and creates excess slag. This domain knowledge was essential for identifying the most influential variables in the dataset.

A diagram showing why low silicon is preferred in hot metal.

3. The Process: From Raw Data to a Predictive Engine

Data Matching, Cleaning & Feature Engineering

The first major hurdle was handling the raw plant data. Input variables took 330-380 minutes to affect the final output, so each silicon measurement had to be paired with input readings taken roughly 5.5 to 6.3 hours earlier. I developed Python scripts to create uniform timestamps, match inputs with their corresponding outputs across that lag, and systematically clean the dataset. The result was a consistent, time-aligned dataset ready for modeling.
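To make the lag matching concrete, here is a minimal sketch using only the Python standard library. It is not the plant script: the function name `match_inputs_to_outputs`, the choice to average all input readings inside the lag window, and the sample readings are assumptions made for illustration. Only the 330-380 minute window comes from the text above.

```python
from bisect import bisect_left, bisect_right
from datetime import datetime, timedelta
from statistics import mean

# Lag window from the text: inputs affect the output 330-380 minutes later.
LAG_MIN = timedelta(minutes=330)
LAG_MAX = timedelta(minutes=380)


def match_inputs_to_outputs(inputs, outputs):
    """Pair each output sample with the mean of the inputs in its lag window.

    inputs:  list of (timestamp, value) tuples, sorted by timestamp
    outputs: list of (timestamp, silicon_pct) tuples
    Returns a list of (output_timestamp, mean_input, silicon_pct) rows.
    Outputs with no input reading inside the window are skipped.
    """
    times = [t for t, _ in inputs]
    rows = []
    for out_time, silicon in outputs:
        # Inputs taken between 380 and 330 minutes before the output sample.
        lo = bisect_left(times, out_time - LAG_MAX)
        hi = bisect_right(times, out_time - LAG_MIN)
        window = [v for _, v in inputs[lo:hi]]
        if window:
            rows.append((out_time, mean(window), silicon))
    return rows


start = datetime(2020, 1, 1, 0, 0)
# Hypothetical input readings every 10 minutes; each value is its reading index.
inputs = [(start + timedelta(minutes=10 * i), float(i)) for i in range(60)]
# Hypothetical silicon measurement taken 400 minutes after the first reading.
outputs = [(start + timedelta(minutes=400), 0.62)]

matched = match_inputs_to_outputs(inputs, outputs)
print(matched)
```

Averaging over the window is only one option. Taking the single reading nearest the midpoint of the window, or keeping several lagged readings as separate features, would fit the same structure.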

Diagram showing the time lag between input and output variables

Python code snippet for data cleaning and timestamp matching

Table showing data entries before and after cleaning

Model Benchmarking

With a clean dataset, I benchmarked several machine learning algorithms to find the most effective model for this regression task. My evaluation included Partial Least Squares (PLS) Regression, Random Forest, XGBoost, and an Artificial Neural Network (ANN). This methodical approach ensured that the final model choice was backed by comparative performance data.
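The comparison can be pictured as a small harness that fits every candidate on the same training split and scores it on the same held-out split. The sketch below is an assumption-laden illustration, not the project code: `MeanBaseline` and `OneFeatureLinear` are hypothetical stand-ins for the real candidates, the chronological 80/20 split is assumed, and the data is synthetic.

```python
from statistics import mean


def mean_absolute_error(actual, predicted):
    """Average absolute difference between actual and predicted values."""
    return mean(abs(a - p) for a, p in zip(actual, predicted))


class MeanBaseline:
    """Hypothetical stand-in model: always predicts the training mean."""

    def fit(self, X, y):
        self.value_ = mean(y)
        return self

    def predict(self, X):
        return [self.value_ for _ in X]


class OneFeatureLinear:
    """Hypothetical stand-in model: least-squares line on the first feature."""

    def fit(self, X, y):
        xs = [row[0] for row in X]
        mx, my = mean(xs), mean(y)
        sxx = sum((x - mx) ** 2 for x in xs)
        sxy = sum((x - mx) * (t - my) for x, t in zip(xs, y))
        self.slope_ = sxy / sxx
        self.intercept_ = my - self.slope_ * mx
        return self

    def predict(self, X):
        return [self.intercept_ + self.slope_ * row[0] for row in X]


def benchmark(models, X, y, train_fraction=0.8):
    """Fit each model on the earliest rows and score it on the latest rows.

    The chronological split is an assumption; it avoids training on samples
    that come after the ones being predicted.
    """
    cut = int(len(X) * train_fraction)
    X_train, X_test = X[:cut], X[cut:]
    y_train, y_test = y[:cut], y[cut:]
    scores = {}
    for name, model in models.items():
        model.fit(X_train, y_train)
        scores[name] = mean_absolute_error(y_test, model.predict(X_test))
    return scores


# Synthetic data: one feature with an exactly linear target.
X = [[float(i)] for i in range(10)]
y = [0.1 * i + 0.2 for i in range(10)]

results = benchmark({"mean_baseline": MeanBaseline(), "linear": OneFeatureLinear()}, X, y)
best = min(results, key=results.get)
print(results, best)
```

Any estimator that follows the same `fit`/`predict` convention can be added to the `models` dictionary, which keeps every candidate on identical data and an identical metric.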

4. The Outcome: An 89% Accurate Prediction

The final Artificial Neural Network model, built with Keras, consistently delivered the best performance, significantly outperforming other benchmarked models as shown in the comparison.

Trained on 49 distinct process parameters, the model achieved an **accuracy of 89.67%** (based on Mean Absolute Error). This provides a powerful tool for the plant's process control team to proactively optimize raw material mix, reduce resource consumption, and enhance blast furnace efficiency.
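The text does not state how the accuracy figure was derived from Mean Absolute Error. One common convention, shown here purely as an assumption, is to normalize MAE by the mean of the actual values and subtract the result from 100%. The function name `accuracy_from_mae` and the sample values are hypothetical and are not project data.

```python
from statistics import mean


def accuracy_from_mae(actual, predicted):
    """Return 100 * (1 - MAE / mean(actual)).

    This is an assumed convention, not the formula confirmed by the source.
    """
    mae = mean(abs(a - p) for a, p in zip(actual, predicted))
    return 100.0 * (1.0 - mae / mean(actual))


# Hypothetical silicon percentages, for illustration only.
actual = [0.5, 0.6, 0.7, 0.8]
predicted = [0.55, 0.55, 0.75, 0.75]

accuracy = accuracy_from_mae(actual, predicted)
print(round(accuracy, 2))
```

Under this convention the score depends on the scale of the target, so it is best read alongside the raw MAE in silicon percentage points.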

A table comparing the accuracy of Random Forest, XGBoost, and Artificial Neural Networks. — The ANN model showed superior performance.

A scatter plot showing the high correlation between the model's predicted and actual silicon values. — Predicted vs. Actual values for the final ANN model.

AI-driven Silicon Prediction for Steel Manufacturing

Skills & Technologies

Core ML & Data Science

Algorithms & Technologies