TY - JOUR
T1 - A Geo-Social Characterization of Health Impact from Air Pollution in Mexico Valley
AU - Zagal Flores, Roberto
AU - Claramunt, Christophe
AU - Mata Rivera, Miguel Felix
AU - Garay Jiménez, Laura Ivoone
AU - Jiménez Hernández, Hugo
AU - Herrera Navarro, Ana Marcela
AU - Argüelles Cruz, Amadeo José
N1 - Publisher Copyright:
© 2022 Roberto Zagal Flores et al.
PY - 2022
Y1 - 2022
N2 - The impact of the air pollution phenomenon has been long studied, but most often with a fragmented approach, without closely looking at the relationship between different components that characterize it, such as sensor-based data, health data from institutional databases, and data on how it is perceived by human beings in social media. The research developed in this study introduces an integrated methodological framework that analyses sensor data on air pollution distributed in space and time, combined with health data and social data narratives that reflect how different communities perceive this phenomenon in space and time; exploring how these different heterogeneous sources can be combined to better understand the impact of air pollution phenomena at the large-city level in the Valley of Mexico. We introduce a Spatio-temporal data integration and mining framework that aims to discover trends and insights regarding the distribution of the impact of an air pollution phenomenon in terms of human health and perception. The main peculiarity of our methodological framework is the integration of different large data sources by combining a series of methods: NLP (topic modeling), data mining (data cubes, unsupervised learning, and clustering), and GIS capabilities (spatial interpolation, choropleth maps) that together provide a better understanding of the quantitative and qualitative patterns emerging at a different spatial scale and temporal granularity. Overall, this shows how social data, when combined with quantitative data, can provide a better understanding of the impact of a given phenomenon, such as air pollution.
AB - The impact of the air pollution phenomenon has been long studied, but most often with a fragmented approach, without closely looking at the relationship between different components that characterize it, such as sensor-based data, health data from institutional databases, and data on how it is perceived by human beings in social media. The research developed in this study introduces an integrated methodological framework that analyses sensor data on air pollution distributed in space and time, combined with health data and social data narratives that reflect how different communities perceive this phenomenon in space and time; exploring how these different heterogeneous sources can be combined to better understand the impact of air pollution phenomena at the large-city level in the Valley of Mexico. We introduce a Spatio-temporal data integration and mining framework that aims to discover trends and insights regarding the distribution of the impact of an air pollution phenomenon in terms of human health and perception. The main peculiarity of our methodological framework is the integration of different large data sources by combining a series of methods: NLP (topic modeling), data mining (data cubes, unsupervised learning, and clustering), and GIS capabilities (spatial interpolation, choropleth maps) that together provide a better understanding of the quantitative and qualitative patterns emerging at a different spatial scale and temporal granularity. Overall, this shows how social data, when combined with quantitative data, can provide a better understanding of the impact of a given phenomenon, such as air pollution.
UR - http://www.scopus.com/inward/record.url?scp=85138092683&partnerID=8YFLogxK
U2 - 10.1155/2022/5562317
DO - 10.1155/2022/5562317
M3 - Artículo
AN - SCOPUS:85138092683
SN - 1574-017X
VL - 2022
JO - Mobile Information Systems
JF - Mobile Information Systems
M1 - 5562317
ER -