A Reinforcement Learning Motivated Algorithm for Process Optimization
Research output: Contribution to journal › Article › Research › peer-review
Standard
In: Periodica Polytechnica Civil Engineering, Vol. 63.2019, No. 4, 18.12.2019, p. 961-970.
Research output: Contribution to journal › Article › Research › peer-review
Harvard
APA
Vancouver
Author
Bibtex - Download
}
RIS (suitable for import to EndNote) - Download
TY - JOUR
T1 - A Reinforcement Learning Motivated Algorithm for Process Optimization
AU - Ábrahám, Ábrahám
AU - Auer, Peter
AU - Dósa, György
AU - Dulai, Tibor
AU - Werner-Stark, Ágnes
PY - 2019/12/18
Y1 - 2019/12/18
N2 - In process scheduling problems there are several processes and resources. Any process consists of several tasks, and there may be precedence constraints among them. In our paper we consider a special case, where the precedence constraints form short disjoint (directed) paths. This model occurs frequently in practice, but as far as we know it is considered very rarely in the literature. The goal is to find a good resource allocation (schedule) to minimize the makespan. The problem is known to be strongly NP-hard, and such hard problems are often solved by heuristic methods. We found only one paper which is closely related to our topic, this paper proposes the heuristic method HH. We propose a new heuristic called QLM which is inspired by reinforcement learning methods from the area of machine learning. As we did not find appropriate benchmark problems for the investigated model. We have created such inputs and we have made exhaustive comparisons, comparing the results of HH and QLM, and an exact solver using CPLEX. We note that a heuristic method can give a "near optimal" solution very fast while an exact solver provides the optimal solution, but it may need a huge amount of time to find it. In our computational evaluation we experienced that our heuristic is more effective than HH and finds the optimal solution in many cases and very fast.
AB - In process scheduling problems there are several processes and resources. Any process consists of several tasks, and there may be precedence constraints among them. In our paper we consider a special case, where the precedence constraints form short disjoint (directed) paths. This model occurs frequently in practice, but as far as we know it is considered very rarely in the literature. The goal is to find a good resource allocation (schedule) to minimize the makespan. The problem is known to be strongly NP-hard, and such hard problems are often solved by heuristic methods. We found only one paper which is closely related to our topic, this paper proposes the heuristic method HH. We propose a new heuristic called QLM which is inspired by reinforcement learning methods from the area of machine learning. As we did not find appropriate benchmark problems for the investigated model. We have created such inputs and we have made exhaustive comparisons, comparing the results of HH and QLM, and an exact solver using CPLEX. We note that a heuristic method can give a "near optimal" solution very fast while an exact solver provides the optimal solution, but it may need a huge amount of time to find it. In our computational evaluation we experienced that our heuristic is more effective than HH and finds the optimal solution in many cases and very fast.
UR - http://www.scopus.com/inward/record.url?scp=85078578441&partnerID=8YFLogxK
U2 - 10.3311/PPci.14295
DO - 10.3311/PPci.14295
M3 - Article
VL - 63.2019
SP - 961
EP - 970
JO - Periodica Polytechnica Civil Engineering
JF - Periodica Polytechnica Civil Engineering
IS - 4
ER -