A parallel rollout algorithm for wildfire suppression

Mauro Montenegro; Roberto López; Rolando Menchaca-Méndez; Emanuel Becerra; Ricardo Menchaca-Méndez

doi:10.1007/978-3-030-62554-2_18

A parallel rollout algorithm for wildfire suppression

Mauro Montenegro, Roberto López, Rolando Menchaca-Méndez, Emanuel Becerra, Ricardo Menchaca-Méndez

Producción científica: Capítulo del libro/informe/acta de congreso › Contribución a la conferencia › revisión exhaustiva

Resumen

In this paper, we formulate the problem of optimal Wildfire Suppression as an infinite horizon Decision Process (DP) problem where an agent (e.g., a robotic firefighter) decides which areas to intervene to extinguish the fire. The dynamics of the wildfire is modeled by a cellular automaton whose state at time k is defined as a bi-dimensional grid $$x:k$$ where each cell in this grid describes the state of a rectangular geographic region of the wildland. The proposed algorithm, which is based on a non-parametric reinforcement learning (RL) methodology, computes optimized control policies that determine the agent’s actions that minimize a cost function that aims to preserve most of the cells with trees. From a given state $$x:k$$, the proposed algorithms employs rollout to take advantage of heuristic solutions to approximate, in polynomial-time, the future cost function. Two different heuristics approaches were applied: A corrective-based model that only takes into account surrounding burning cells, and a predictive strategy that calculates a coefficient-based metric over nearby trees and empty cells. We implemented a parallel sampler using CUDA to simulate the trajectories that rollout generates. This parallel implementation allows us to increase the number of lookahead steps without incurring in large computing times. Our experimental results show that the rollout strategy outperforms the base heuristics and that effectively suppresses wildfires.

Idioma original	Inglés
Título de la publicación alojada	Telematics and Computing - 9th International Congress, WITCOM 2020, Proceedings
Editores	Miguel Félix Mata-Rivera, Roberto Zagal-Flores, Cristian Barria-Huidobro
Editorial	Springer Science and Business Media Deutschland GmbH
Páginas	244-255
Número de páginas	12
ISBN (versión impresa)	9783030625535
DOI	https://doi.org/10.1007/978-3-030-62554-2_18
Estado	Publicada - 2020
Publicado de forma externa	Sí
Evento	9th International Congress on Telematics and Computing, WITCOM 2020 - Puerto Vallarta, México Duración: 2 nov. 2020 → 6 nov. 2020

Serie de la publicación

Nombre	Communications in Computer and Information Science
Volumen	1280
ISSN (versión impresa)	1865-0929
ISSN (versión digital)	1865-0937

Conferencia

Conferencia	9th International Congress on Telematics and Computing, WITCOM 2020
País/Territorio	México
Ciudad	Puerto Vallarta
Período	2/11/20 → 6/11/20

Acceder al documento

10.1007/978-3-030-62554-2_18

Otros archivos y enlaces

Enlace a la publicación en Scopus

Citar esto

Montenegro, M., López, R., Menchaca-Méndez, R., Becerra, E., & Menchaca-Méndez, R. (2020). A parallel rollout algorithm for wildfire suppression. En M. F. Mata-Rivera, R. Zagal-Flores, & C. Barria-Huidobro (Eds.), Telematics and Computing - 9th International Congress, WITCOM 2020, Proceedings (pp. 244-255). (Communications in Computer and Information Science; Vol. 1280). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-62554-2_18

Montenegro, Mauro ; López, Roberto ; Menchaca-Méndez, Rolando et al. / A parallel rollout algorithm for wildfire suppression. Telematics and Computing - 9th International Congress, WITCOM 2020, Proceedings. editor / Miguel Félix Mata-Rivera ; Roberto Zagal-Flores ; Cristian Barria-Huidobro. Springer Science and Business Media Deutschland GmbH, 2020. pp. 244-255 (Communications in Computer and Information Science).

@inproceedings{a9f022edec4f49b18960e674c7780293,

title = "A parallel rollout algorithm for wildfire suppression",

abstract = "In this paper, we formulate the problem of optimal Wildfire Suppression as an infinite horizon Decision Process (DP) problem where an agent (e.g., a robotic firefighter) decides which areas to intervene to extinguish the fire. The dynamics of the wildfire is modeled by a cellular automaton whose state at time k is defined as a bi-dimensional grid $$x:k$$ where each cell in this grid describes the state of a rectangular geographic region of the wildland. The proposed algorithm, which is based on a non-parametric reinforcement learning (RL) methodology, computes optimized control policies that determine the agent{\textquoteright}s actions that minimize a cost function that aims to preserve most of the cells with trees. From a given state $$x:k$$, the proposed algorithms employs rollout to take advantage of heuristic solutions to approximate, in polynomial-time, the future cost function. Two different heuristics approaches were applied: A corrective-based model that only takes into account surrounding burning cells, and a predictive strategy that calculates a coefficient-based metric over nearby trees and empty cells. We implemented a parallel sampler using CUDA to simulate the trajectories that rollout generates. This parallel implementation allows us to increase the number of lookahead steps without incurring in large computing times. Our experimental results show that the rollout strategy outperforms the base heuristics and that effectively suppresses wildfires.",

author = "Mauro Montenegro and Roberto L{\'o}pez and Rolando Menchaca-M{\'e}ndez and Emanuel Becerra and Ricardo Menchaca-M{\'e}ndez",

note = "Publisher Copyright: {\textcopyright} 2020, Springer Nature Switzerland AG.; 9th International Congress on Telematics and Computing, WITCOM 2020 ; Conference date: 02-11-2020 Through 06-11-2020",

year = "2020",

doi = "10.1007/978-3-030-62554-2_18",

language = "Ingl{\'e}s",

isbn = "9783030625535",

series = "Communications in Computer and Information Science",

publisher = "Springer Science and Business Media Deutschland GmbH",

pages = "244--255",

editor = "Mata-Rivera, {Miguel F{\'e}lix} and Roberto Zagal-Flores and Cristian Barria-Huidobro",

booktitle = "Telematics and Computing - 9th International Congress, WITCOM 2020, Proceedings",

address = "Alemania",

}

Montenegro, M, López, R, Menchaca-Méndez, R, Becerra, E & Menchaca-Méndez, R 2020, A parallel rollout algorithm for wildfire suppression. En MF Mata-Rivera, R Zagal-Flores & C Barria-Huidobro (eds.), Telematics and Computing - 9th International Congress, WITCOM 2020, Proceedings. Communications in Computer and Information Science, vol. 1280, Springer Science and Business Media Deutschland GmbH, pp. 244-255, 9th International Congress on Telematics and Computing, WITCOM 2020, Puerto Vallarta, México, 2/11/20. https://doi.org/10.1007/978-3-030-62554-2_18

A parallel rollout algorithm for wildfire suppression. / Montenegro, Mauro; López, Roberto; Menchaca-Méndez, Rolando et al.
Telematics and Computing - 9th International Congress, WITCOM 2020, Proceedings. ed. / Miguel Félix Mata-Rivera; Roberto Zagal-Flores; Cristian Barria-Huidobro. Springer Science and Business Media Deutschland GmbH, 2020. p. 244-255 (Communications in Computer and Information Science; Vol. 1280).

Producción científica: Capítulo del libro/informe/acta de congreso › Contribución a la conferencia › revisión exhaustiva

TY - GEN

T1 - A parallel rollout algorithm for wildfire suppression

AU - Montenegro, Mauro

AU - López, Roberto

AU - Menchaca-Méndez, Rolando

AU - Becerra, Emanuel

AU - Menchaca-Méndez, Ricardo

PY - 2020

Y1 - 2020

N2 - In this paper, we formulate the problem of optimal Wildfire Suppression as an infinite horizon Decision Process (DP) problem where an agent (e.g., a robotic firefighter) decides which areas to intervene to extinguish the fire. The dynamics of the wildfire is modeled by a cellular automaton whose state at time k is defined as a bi-dimensional grid $$x:k$$ where each cell in this grid describes the state of a rectangular geographic region of the wildland. The proposed algorithm, which is based on a non-parametric reinforcement learning (RL) methodology, computes optimized control policies that determine the agent’s actions that minimize a cost function that aims to preserve most of the cells with trees. From a given state $$x:k$$, the proposed algorithms employs rollout to take advantage of heuristic solutions to approximate, in polynomial-time, the future cost function. Two different heuristics approaches were applied: A corrective-based model that only takes into account surrounding burning cells, and a predictive strategy that calculates a coefficient-based metric over nearby trees and empty cells. We implemented a parallel sampler using CUDA to simulate the trajectories that rollout generates. This parallel implementation allows us to increase the number of lookahead steps without incurring in large computing times. Our experimental results show that the rollout strategy outperforms the base heuristics and that effectively suppresses wildfires.

AB - In this paper, we formulate the problem of optimal Wildfire Suppression as an infinite horizon Decision Process (DP) problem where an agent (e.g., a robotic firefighter) decides which areas to intervene to extinguish the fire. The dynamics of the wildfire is modeled by a cellular automaton whose state at time k is defined as a bi-dimensional grid $$x:k$$ where each cell in this grid describes the state of a rectangular geographic region of the wildland. The proposed algorithm, which is based on a non-parametric reinforcement learning (RL) methodology, computes optimized control policies that determine the agent’s actions that minimize a cost function that aims to preserve most of the cells with trees. From a given state $$x:k$$, the proposed algorithms employs rollout to take advantage of heuristic solutions to approximate, in polynomial-time, the future cost function. Two different heuristics approaches were applied: A corrective-based model that only takes into account surrounding burning cells, and a predictive strategy that calculates a coefficient-based metric over nearby trees and empty cells. We implemented a parallel sampler using CUDA to simulate the trajectories that rollout generates. This parallel implementation allows us to increase the number of lookahead steps without incurring in large computing times. Our experimental results show that the rollout strategy outperforms the base heuristics and that effectively suppresses wildfires.

UR - http://www.scopus.com/inward/record.url?scp=85096615013&partnerID=8YFLogxK

U2 - 10.1007/978-3-030-62554-2_18

DO - 10.1007/978-3-030-62554-2_18

M3 - Contribución a la conferencia

AN - SCOPUS:85096615013

SN - 9783030625535

T3 - Communications in Computer and Information Science

SP - 244

EP - 255

BT - Telematics and Computing - 9th International Congress, WITCOM 2020, Proceedings

A2 - Mata-Rivera, Miguel Félix

A2 - Zagal-Flores, Roberto

A2 - Barria-Huidobro, Cristian

PB - Springer Science and Business Media Deutschland GmbH

T2 - 9th International Congress on Telematics and Computing, WITCOM 2020

Y2 - 2 November 2020 through 6 November 2020

ER -

Montenegro M, López R, Menchaca-Méndez R, Becerra E, Menchaca-Méndez R. A parallel rollout algorithm for wildfire suppression. En Mata-Rivera MF, Zagal-Flores R, Barria-Huidobro C, editores, Telematics and Computing - 9th International Congress, WITCOM 2020, Proceedings. Springer Science and Business Media Deutschland GmbH. 2020. p. 244-255. (Communications in Computer and Information Science). doi: 10.1007/978-3-030-62554-2_18

A parallel rollout algorithm for wildfire suppression

Resumen

Serie de la publicación

Conferencia

Acceder al documento

Otros archivos y enlaces

Huella

Citar esto