Normalized Cut with Reinforcement Learning in Constrained Action Space
Authors:
Qize Jiang,
Linsey Pang,
Alice Gatti,
Mahima Aggarwal,
Giovanna Vantini,
Xiaosong Ma,
Weiwei Sun,
Sanjay Chawla
Abstract:
Reinforcement learning (RL) has emerged as an important paradigm for solving combinatorial optimization problems, primarily because it can learn heuristics that generalize across problem instances. However, integrating external knowledge that steers solutions towards domain-appropriate outcomes remains an extremely challenging task. In this paper, we propose the first RL solution that uses constrained action spaces to guide the normalized cut problem towards pre-defined template instances. Using transportation networks as an example domain, we create a Wedge and Ring Transformer that produces graph partitions shaped as wedges and rings, which are likely to be closer to natural optimal partitions. Our approach is nonetheless general, as it is based on principles that carry over to other domains.
Submitted 23 May, 2025; v1 submitted 20 May, 2025;
originally announced May 2025.
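For readers unfamiliar with the objective named in the abstract above, here is a minimal sketch of how the normalized cut of a given partition is commonly scored: for each part, the weight of edges leaving it divided by the part's volume (sum of weighted degrees), summed over parts. This is the standard textbook definition, not the paper's RL method. The function name `normalized_cut`, the edge-list input format, and the toy graph are assumptions made for illustration.

```python
from collections import defaultdict

def normalized_cut(edges, labels):
    """Score a partition of an undirected weighted graph.

    edges  -- iterable of (u, v, weight) tuples (hypothetical input format)
    labels -- dict mapping each node to its part id
    Returns the sum over parts of cut(part) / vol(part).
    """
    cut = defaultdict(float)  # weight of edges crossing out of each part
    vol = defaultdict(float)  # sum of weighted degrees inside each part
    for u, v, w in edges:
        pu, pv = labels[u], labels[v]
        vol[pu] += w
        vol[pv] += w
        if pu != pv:
            cut[pu] += w
            cut[pv] += w
    # Parts with zero volume (no incident edges) are skipped.
    return sum(cut[p] / vol[p] for p in vol if vol[p] > 0)

# Toy graph (illustrative only): two triangles joined by a single edge.
edges = [(0, 1, 1.0), (1, 2, 1.0), (0, 2, 1.0),
         (3, 4, 1.0), (4, 5, 1.0), (3, 5, 1.0),
         (2, 3, 1.0)]
labels = {0: 0, 1: 0, 2: 0, 3: 1, 4: 1, 5: 1}
score = normalized_cut(edges, labels)
```

Splitting the toy graph at the bridging edge gives each part a cut of 1 and a volume of 7, so the score is 2/7; a partition that cuts through a triangle would score higher.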
Collective wind farm operation based on a predictive model increases utility-scale energy production
Authors:
Michael F. Howland,
Jesus Bas Quesada,
Juan Jose Pena Martinez,
Felipe Palou Larranaga,
Neeraj Yadav,
Jasvipul S. Chawla,
Varun Sivaram,
John O. Dabiri
Abstract:
Wind turbines located in wind farms are operated to maximize only their own power production. Individual operation results in wake losses that reduce farm energy. In this study, we operate a wind turbine array collectively to maximize total array production through wake steering. The selection of the farm control strategy relies on the optimization of computationally efficient flow models. We develop a physics-based, data-assisted flow control model to predict the optimal control strategy. In contrast to previous studies, we first design and implement a multi-month field experiment at a utility-scale wind farm to validate the model over a range of control strategies, most of which are suboptimal. The flow control model is able to predict the optimal yaw misalignment angles for the array within +/- 5 degrees for most wind directions (11-32% power gains). Using the validated model, we design a control protocol which, in a second multi-month experiment and for the wind directions of interest, increases the energy production of the farm by 2.7% for wind speeds between 6 and 8 m/s and by 1.0% across all wind speeds. The developed and validated predictive model can enable a wider adoption of collective wind farm operation.
Submitted 26 January, 2022;
originally announced February 2022.