-
Using Vision Language Models for Safety Hazard Identification in Construction
Authors:
Muhammad Adil,
Gaang Lee,
Vicente A. Gonzalez,
Qipei Mei
Abstract:
Safety hazard identification and prevention are the key elements of proactive safety management. Previous research has extensively explored the applications of computer vision to automatically identify hazards from image clips collected from construction sites. However, these methods struggle to identify context-specific hazards, as they focus on detecting predefined individual entities without un…
▽ More
Safety hazard identification and prevention are the key elements of proactive safety management. Previous research has extensively explored the applications of computer vision to automatically identify hazards from image clips collected from construction sites. However, these methods struggle to identify context-specific hazards, as they focus on detecting predefined individual entities without understanding their spatial relationships and interactions. Furthermore, their limited adaptability to varying construction site guidelines and conditions hinders their generalization across different projects. These limitations reduce their ability to assess hazards in complex construction environments and adaptability to unseen risks, leading to potential safety gaps. To address these challenges, we proposed and experimentally validated a Vision Language Model (VLM)-based framework for the identification of construction hazards. The framework incorporates a prompt engineering module that structures safety guidelines into contextual queries, allowing VLM to process visual information and generate hazard assessments aligned with the regulation guide. Within this framework, we evaluated state-of-the-art VLMs, including GPT-4o, Gemini, Llama 3.2, and InternVL2, using a custom dataset of 1100 construction site images. Experimental results show that GPT-4o and Gemini 1.5 Pro outperformed alternatives and displayed promising BERTScore of 0.906 and 0.888 respectively, highlighting their ability to identify both general and context-specific hazards. However, processing times remain a significant challenge, impacting real-time feasibility. These findings offer insights into the practical deployment of VLMs for construction site hazard detection, thereby contributing to the enhancement of proactive safety management.
△ Less
Submitted 12 April, 2025;
originally announced April 2025.
-
Coarse-to-Fine Learning for Multi-Pipette Localisation in Robot-Assisted In Vivo Patch-Clamp
Authors:
Lan Wei,
Gema Vera Gonzalez,
Phatsimo Kgwarae,
Alexander Timms,
Denis Zahorovsky,
Simon Schultz,
Dandan Zhang
Abstract:
In vivo image-guided multi-pipette patch-clamp is essential for studying cellular interactions and network dynamics in neuroscience. However, current procedures mainly rely on manual expertise, which limits accessibility and scalability. Robotic automation presents a promising solution, but achieving precise real-time detection of multiple pipettes remains a challenge. Existing methods focus on ex…
▽ More
In vivo image-guided multi-pipette patch-clamp is essential for studying cellular interactions and network dynamics in neuroscience. However, current procedures mainly rely on manual expertise, which limits accessibility and scalability. Robotic automation presents a promising solution, but achieving precise real-time detection of multiple pipettes remains a challenge. Existing methods focus on ex vivo experiments or single pipette use, making them inadequate for in vivo multi-pipette scenarios. To address these challenges, we propose a heatmap-augmented coarse-to-fine learning technique to facilitate multi-pipette real-time localisation for robot-assisted in vivo patch-clamp. More specifically, we introduce a Generative Adversarial Network (GAN)-based module to remove background noise and enhance pipette visibility. We then introduce a two-stage Transformer model that starts with predicting the coarse heatmap of the pipette tips, followed by the fine-grained coordination regression module for precise tip localisation. To ensure robust training, we use the Hungarian algorithm for optimal matching between the predicted and actual locations of tips. Experimental results demonstrate that our method achieved > 98% accuracy within 10 μm, and > 89% accuracy within 5 μm for the localisation of multi-pipette tips. The average MSE is 2.52 μm.
△ Less
Submitted 31 March, 2025;
originally announced April 2025.
-
The $Z$-Curve as an $n$-Dimensional Hypersphere: Properties and Analysis
Authors:
Diego Vazquez Gonzalez,
Hsing-Kuo Pao
Abstract:
In this research, we introduce an algorithm that produces what appears to be a new mathematical object as a consequence of projecting the \( n \)-dimensional \( Z \)-curve onto an \( n \)-dimensional sphere. The first part presents the algorithm that enables this transformation, and the second part focuses on studying its properties.
In this research, we introduce an algorithm that produces what appears to be a new mathematical object as a consequence of projecting the \( n \)-dimensional \( Z \)-curve onto an \( n \)-dimensional sphere. The first part presents the algorithm that enables this transformation, and the second part focuses on studying its properties.
△ Less
Submitted 4 November, 2024; v1 submitted 6 October, 2024;
originally announced October 2024.
-
A Connector for Integrating NGSI-LD Data into Open Data Portals
Authors:
Laura Martín,
Jorge Lanza,
Víctor González,
Juan Ramón Santana,
Pablo Sotres,
Luis Sánchez
Abstract:
Nowadays, there are plenty of data sources generating massive amounts of information that, combined with novel data analytics frameworks, are meant to support optimisation in many application domains. Nonetheless, there are still shortcomings in terms of data discoverability, accessibility and interoperability. Open Data portals have emerged as a shift towards openness and discoverability. However…
▽ More
Nowadays, there are plenty of data sources generating massive amounts of information that, combined with novel data analytics frameworks, are meant to support optimisation in many application domains. Nonetheless, there are still shortcomings in terms of data discoverability, accessibility and interoperability. Open Data portals have emerged as a shift towards openness and discoverability. However, they do not impose any condition to the data itself, just stipulate how datasets have to be described. Alternatively, the NGSI-LD standard pursues harmonisation in terms of data modelling and accessibility. This paper presents a solution that bridges these two domains (i.e., Open Data portals and NGSI-LD-based data) in order to keep benefiting from the structured description of datasets offered by Open Data portals, while ensuring the interoperability provided by the NGSI-LD standard. Our solution aggregates the data into coherent datasets and generate high-quality descriptions, ensuring comprehensiveness, interoperability and accessibility. The proposed solution has been validated through a real-world implementation that exposes IoT data in NGSI-LD format through the European Data Portal (EDP). Moreover, the results from the Metadata Quality Assessment that the EDP implements, show that the datasets' descriptions generated achieve excellent ranking in terms of the Findability, Accessibility, Interoperability and Reusability (FAIR) data principles.
△ Less
Submitted 6 March, 2024;
originally announced March 2024.
-
Revisiting Micro and Macro Expressions in Computer Graphics Characters
Authors:
Rubens Montanha,
Giovana Raupp,
Vitoria Gonzalez,
Yanny Partichelli,
André Bins,
Marcos Ferreira,
Victor Araujo,
Soraia Musse
Abstract:
This paper presents the reproduction of two studies focused on the perception of micro and macro expressions of Virtual Humans (VHs) generated by Computer Graphics (CG), first described in 2014 and replicated in 2021. The 2014 study referred to a VH realistic, whereas, in 2021, it referred to a VH cartoon. In our work, we replicate the study by using a realistic CG character. Our main goals are to…
▽ More
This paper presents the reproduction of two studies focused on the perception of micro and macro expressions of Virtual Humans (VHs) generated by Computer Graphics (CG), first described in 2014 and replicated in 2021. The 2014 study referred to a VH realistic, whereas, in 2021, it referred to a VH cartoon. In our work, we replicate the study by using a realistic CG character. Our main goals are to compare the perceptions of micro and macro expressions between levels of realism (2021 cartoon versus 2023 realistic) and between realistic characters in different periods (i.e., 2014 versus 2023). In one of our results, people more easily recognized micro expressions in realistic VHs than in a cartoon VH. In another result, we show that the participants' perception was similar for both micro and macro expressions in 2014 and 2023.
△ Less
Submitted 6 December, 2023;
originally announced December 2023.
-
ZeST-NeRF: Using temporal aggregation for Zero-Shot Temporal NeRFs
Authors:
Violeta Menéndez González,
Andrew Gilbert,
Graeme Phillipson,
Stephen Jolly,
Simon Hadfield
Abstract:
In the field of media production, video editing techniques play a pivotal role. Recent approaches have had great success at performing novel view image synthesis of static scenes. But adding temporal information adds an extra layer of complexity. Previous models have focused on implicitly representing static and dynamic scenes using NeRF. These models achieve impressive results but are costly at t…
▽ More
In the field of media production, video editing techniques play a pivotal role. Recent approaches have had great success at performing novel view image synthesis of static scenes. But adding temporal information adds an extra layer of complexity. Previous models have focused on implicitly representing static and dynamic scenes using NeRF. These models achieve impressive results but are costly at training and inference time. They overfit an MLP to describe the scene implicitly as a function of position. This paper proposes ZeST-NeRF, a new approach that can produce temporal NeRFs for new scenes without retraining. We can accurately reconstruct novel views using multi-view synthesis techniques and scene flow-field estimation, trained only with unrelated scenes. We demonstrate how existing state-of-the-art approaches from a range of fields cannot adequately solve this new task and demonstrate the efficacy of our solution. The resulting network improves quantitatively by 15% and produces significantly better visual results.
△ Less
Submitted 30 November, 2023;
originally announced November 2023.
-
SVS: Adversarial refinement for sparse novel view synthesis
Authors:
Violeta Menéndez González,
Andrew Gilbert,
Graeme Phillipson,
Stephen Jolly,
Simon Hadfield
Abstract:
This paper proposes Sparse View Synthesis. This is a view synthesis problem where the number of reference views is limited, and the baseline between target and reference view is significant. Under these conditions, current radiance field methods fail catastrophically due to inescapable artifacts such 3D floating blobs, blurring and structural duplication, whenever the number of reference views is…
▽ More
This paper proposes Sparse View Synthesis. This is a view synthesis problem where the number of reference views is limited, and the baseline between target and reference view is significant. Under these conditions, current radiance field methods fail catastrophically due to inescapable artifacts such 3D floating blobs, blurring and structural duplication, whenever the number of reference views is limited, or the target view diverges significantly from the reference views.
Advances in network architecture and loss regularisation are unable to satisfactorily remove these artifacts. The occlusions within the scene ensure that the true contents of these regions is simply not available to the model. In this work, we instead focus on hallucinating plausible scene contents within such regions. To this end we unify radiance field models with adversarial learning and perceptual losses. The resulting system provides up to 60% improvement in perceptual accuracy compared to current state-of-the-art radiance field models on this problem.
△ Less
Submitted 14 November, 2022;
originally announced November 2022.
-
SaiNet: Stereo aware inpainting behind objects with generative networks
Authors:
Violeta Menéndez González,
Andrew Gilbert,
Graeme Phillipson,
Stephen Jolly,
Simon Hadfield
Abstract:
In this work, we present an end-to-end network for stereo-consistent image inpainting with the objective of inpainting large missing regions behind objects. The proposed model consists of an edge-guided UNet-like network using Partial Convolutions. We enforce multi-view stereo consistency by introducing a disparity loss. More importantly, we develop a training scheme where the model is learned fro…
▽ More
In this work, we present an end-to-end network for stereo-consistent image inpainting with the objective of inpainting large missing regions behind objects. The proposed model consists of an edge-guided UNet-like network using Partial Convolutions. We enforce multi-view stereo consistency by introducing a disparity loss. More importantly, we develop a training scheme where the model is learned from realistic stereo masks representing object occlusions, instead of the more common random masks. The technique is trained in a supervised way. Our evaluation shows competitive results compared to previous state-of-the-art techniques.
△ Less
Submitted 14 May, 2022;
originally announced May 2022.
-
Understanding and Assessment of Mission-Centric Key Cyber Terrains for joint Military Operations
Authors:
Álvaro Luis Martínez,
Jorge Maestre Vidal,
Victor A. Villagrá González
Abstract:
Since the cyberspace consolidated as fifth warfare dimension, the different actors of the defense sector began an arms race toward achieving cyber superiority, on which research, academic and industrial stakeholders contribute from a dual vision, mostly linked to a large and heterogeneous heritage of developments and adoption of civilian cybersecurity capabilities. In this context, augmenting the…
▽ More
Since the cyberspace consolidated as fifth warfare dimension, the different actors of the defense sector began an arms race toward achieving cyber superiority, on which research, academic and industrial stakeholders contribute from a dual vision, mostly linked to a large and heterogeneous heritage of developments and adoption of civilian cybersecurity capabilities. In this context, augmenting the conscious of the context and warfare environment, risks and impacts of cyber threats on kinetic actuations became a critical rule-changer that military decision-makers are considering. A major challenge on acquiring mission-centric Cyber Situational Awareness (CSA) is the dynamic inference and assessment of the vertical propagations from situations that occurred at the mission supportive Information and Communications Technologies (ICT), up to their relevance at military tactical, operational and strategical views. In order to contribute on acquiring CSA, this paper addresses a major gap in the cyber defence state-of-the-art: the dynamic identification of Key Cyber Terrains (KCT) on a mission-centric context. Accordingly, the proposed KCT identification approach explores the dependency degrees among tasks and assets defined by commanders as part of the assessment criteria. These are correlated with the discoveries on the operational network and the asset vulnerabilities identified thorough the supported mission development. The proposal is presented as a reference model that reveals key aspects for mission-centric KCT analysis and supports its enforcement and further enforcement by including an illustrative application case.
△ Less
Submitted 12 November, 2021;
originally announced November 2021.
-
On the Interaction of Belief Bias and Explanations
Authors:
Ana Valeria Gonzalez,
Anna Rogers,
Anders Søgaard
Abstract:
A myriad of explainability methods have been proposed in recent years, but there is little consensus on how to evaluate them. While automatic metrics allow for quick benchmarking, it isn't clear how such metrics reflect human interaction with explanations. Human evaluation is of paramount importance, but previous protocols fail to account for belief biases affecting human performance, which may le…
▽ More
A myriad of explainability methods have been proposed in recent years, but there is little consensus on how to evaluate them. While automatic metrics allow for quick benchmarking, it isn't clear how such metrics reflect human interaction with explanations. Human evaluation is of paramount importance, but previous protocols fail to account for belief biases affecting human performance, which may lead to misleading conclusions. We provide an overview of belief bias, its role in human evaluation, and ideas for NLP practitioners on how to account for it. For two experimental paradigms, we present a case study of gradient-based explainability introducing simple ways to account for humans' prior beliefs: models of varying quality and adversarial examples. We show that conclusions about the highest performing methods change when introducing such controls, pointing to the importance of accounting for belief bias in evaluation.
△ Less
Submitted 29 June, 2021;
originally announced June 2021.
-
Does injecting linguistic structure into language models lead to better alignment with brain recordings?
Authors:
Mostafa Abdou,
Ana Valeria Gonzalez,
Mariya Toneva,
Daniel Hershcovich,
Anders Søgaard
Abstract:
Neuroscientists evaluate deep neural networks for natural language processing as possible candidate models for how language is processed in the brain. These models are often trained without explicit linguistic supervision, but have been shown to learn some linguistic structure in the absence of such supervision (Manning et al., 2020), potentially questioning the relevance of symbolic linguistic th…
▽ More
Neuroscientists evaluate deep neural networks for natural language processing as possible candidate models for how language is processed in the brain. These models are often trained without explicit linguistic supervision, but have been shown to learn some linguistic structure in the absence of such supervision (Manning et al., 2020), potentially questioning the relevance of symbolic linguistic theories in modeling such cognitive processes (Warstadt and Bowman, 2020). We evaluate across two fMRI datasets whether language models align better with brain recordings, if their attention is biased by annotations from syntactic or semantic formalisms. Using structure from dependency or minimal recursion semantic annotations, we find alignments improve significantly for one of the datasets. For another dataset, we see more mixed results. We present an extensive analysis of these results. Our proposed approach enables the evaluation of more targeted hypotheses about the composition of meaning in the brain, expanding the range of possible scientific inferences a neuroscientist could make, and opens up new opportunities for cross-pollination between computational neuroscience and linguistics.
△ Less
Submitted 29 January, 2021;
originally announced January 2021.
-
Human Evaluation of Spoken vs. Visual Explanations for Open-Domain QA
Authors:
Ana Valeria Gonzalez,
Gagan Bansal,
Angela Fan,
Robin Jia,
Yashar Mehdad,
Srinivasan Iyer
Abstract:
While research on explaining predictions of open-domain QA systems (ODQA) to users is gaining momentum, most works have failed to evaluate the extent to which explanations improve user trust. While few works evaluate explanations using user studies, they employ settings that may deviate from the end-user's usage in-the-wild: ODQA is most ubiquitous in voice-assistants, yet current research only ev…
▽ More
While research on explaining predictions of open-domain QA systems (ODQA) to users is gaining momentum, most works have failed to evaluate the extent to which explanations improve user trust. While few works evaluate explanations using user studies, they employ settings that may deviate from the end-user's usage in-the-wild: ODQA is most ubiquitous in voice-assistants, yet current research only evaluates explanations using a visual display, and may erroneously extrapolate conclusions about the most performant explanations to other modalities. To alleviate these issues, we conduct user studies that measure whether explanations help users correctly decide when to accept or reject an ODQA system's answer. Unlike prior work, we control for explanation modality, e.g., whether they are communicated to users through a spoken or visual interface, and contrast effectiveness across modalities. Our results show that explanations derived from retrieved evidence passages can outperform strong baselines (calibrated confidence) across modalities but the best explanation strategy in fact changes with the modality. We show common failure cases of current explanations, emphasize end-to-end evaluation of explanations, and caution against evaluating them in proxy modalities that are different from deployment.
△ Less
Submitted 30 December, 2020;
originally announced December 2020.
-
Type B Reflexivization as an Unambiguous Testbed for Multilingual Multi-Task Gender Bias
Authors:
Ana Valeria Gonzalez,
Maria Barrett,
Rasmus Hvingelby,
Kellie Webster,
Anders Søgaard
Abstract:
The one-sided focus on English in previous studies of gender bias in NLP misses out on opportunities in other languages: English challenge datasets such as GAP and WinoGender highlight model preferences that are "hallucinatory", e.g., disambiguating gender-ambiguous occurrences of 'doctor' as male doctors. We show that for languages with type B reflexivization, e.g., Swedish and Russian, we can co…
▽ More
The one-sided focus on English in previous studies of gender bias in NLP misses out on opportunities in other languages: English challenge datasets such as GAP and WinoGender highlight model preferences that are "hallucinatory", e.g., disambiguating gender-ambiguous occurrences of 'doctor' as male doctors. We show that for languages with type B reflexivization, e.g., Swedish and Russian, we can construct multi-task challenge datasets for detecting gender bias that lead to unambiguously wrong model predictions: In these languages, the direct translation of 'the doctor removed his mask' is not ambiguous between a coreferential reading and a disjoint reading. Instead, the coreferential reading requires a non-gendered pronoun, and the gendered, possessive pronouns are anti-reflexive. We present a multilingual, multi-task challenge dataset, which spans four languages and four NLP tasks and focuses only on this phenomenon. We find evidence for gender bias across all task-language combinations and correlate model bias with national labor market statistics.
△ Less
Submitted 28 September, 2020; v1 submitted 24 September, 2020;
originally announced September 2020.
-
Non-linearity identification for construction workers' personality-safety behaviour predictive relationship using neural network and linear regression modelling
Authors:
Yifan Gao,
Vicente A. Gonzalez,
Tak Wing Yiu,
Guillermo Cabrera-Guerrerod
Abstract:
The prediction of workers' safety behaviour can help identify vulnerable workers who intend to undertake unsafe behaviours and be useful in the design of management practices to minimise the occurrence of accidents. The latest literature has evidenced that there is within-population diversity that leads people's intended safety behaviours in the workplace, which are found to vary among individuals…
▽ More
The prediction of workers' safety behaviour can help identify vulnerable workers who intend to undertake unsafe behaviours and be useful in the design of management practices to minimise the occurrence of accidents. The latest literature has evidenced that there is within-population diversity that leads people's intended safety behaviours in the workplace, which are found to vary among individuals as a function of their personality traits. In this study, an innovative forecasting model, which employs neural network algorithms, is developed to numerically simulate the predictive relationship between construction workers' personality traits and their intended safety behaviour. The data-driven nature of neural network enabled a reliable estimate of the relationship, which allowed this research to find that a nonlinear effect exists in the relationship. This research has practical implications. The neural network developed is shown to have highly satisfactory prediction accuracy and is thereby potentially useful for assisting project decision-makers to assess how prone workers are to carry out unsafe behaviours in the workplace.
△ Less
Submitted 26 August, 2020; v1 submitted 11 December, 2019;
originally announced December 2019.
-
Retrieval-based Goal-Oriented Dialogue Generation
Authors:
Ana Valeria Gonzalez,
Isabelle Augenstein,
Anders Søgaard
Abstract:
Most research on dialogue has focused either on dialogue generation for openended chit chat or on state tracking for goal-directed dialogue. In this work, we explore a hybrid approach to goal-oriented dialogue generation that combines retrieval from past history with a hierarchical, neural encoder-decoder architecture. We evaluate this approach in the customer support domain using the Multiwoz dat…
▽ More
Most research on dialogue has focused either on dialogue generation for openended chit chat or on state tracking for goal-directed dialogue. In this work, we explore a hybrid approach to goal-oriented dialogue generation that combines retrieval from past history with a hierarchical, neural encoder-decoder architecture. We evaluate this approach in the customer support domain using the Multiwoz dataset (Budzianowski et al., 2018). We show that adding this retrieval step to a hierarchical, neural encoder-decoder architecture leads to significant improvements, including responses that are rated more appropriate and fluent by human evaluators. Finally, we compare our retrieval-based model to various semantically conditioned models explicitly using past dialog act information, and find that our proposed model is competitive with the current state of the art (Chen et al., 2019), while not requiring explicit labels about past machine acts.
△ Less
Submitted 30 September, 2019;
originally announced September 2019.
-
Domain Transfer in Dialogue Systems without Turn-Level Supervision
Authors:
Joachim Bingel,
Victor Petrén Bach Hansen,
Ana Valeria Gonzalez,
Paweł Budzianowski,
Isabelle Augenstein,
Anders Søgaard
Abstract:
Task oriented dialogue systems rely heavily on specialized dialogue state tracking (DST) modules for dynamically predicting user intent throughout the conversation. State-of-the-art DST models are typically trained in a supervised manner from manual annotations at the turn level. However, these annotations are costly to obtain, which makes it difficult to create accurate dialogue systems for new d…
▽ More
Task oriented dialogue systems rely heavily on specialized dialogue state tracking (DST) modules for dynamically predicting user intent throughout the conversation. State-of-the-art DST models are typically trained in a supervised manner from manual annotations at the turn level. However, these annotations are costly to obtain, which makes it difficult to create accurate dialogue systems for new domains. To address these limitations, we propose a method, based on reinforcement learning, for transferring DST models to new domains without turn-level supervision. Across several domains, our experiments show that this method quickly adapts off-the-shelf models to new domains and performs on par with models trained with turn-level supervision. We also show our method can improve models trained using turn-level supervision by subsequent fine-tuning optimization toward dialog-level rewards.
△ Less
Submitted 16 September, 2019;
originally announced September 2019.
-
Rewarding Coreference Resolvers for Being Consistent with World Knowledge
Authors:
Rahul Aralikatte,
Heather Lent,
Ana Valeria Gonzalez,
Daniel Hershcovich,
Chen Qiu,
Anders Sandholm,
Michael Ringaard,
Anders Søgaard
Abstract:
Unresolved coreference is a bottleneck for relation extraction, and high-quality coreference resolvers may produce an output that makes it a lot easier to extract knowledge triples. We show how to improve coreference resolvers by forwarding their input to a relation extraction system and reward the resolvers for producing triples that are found in knowledge bases. Since relation extraction systems…
▽ More
Unresolved coreference is a bottleneck for relation extraction, and high-quality coreference resolvers may produce an output that makes it a lot easier to extract knowledge triples. We show how to improve coreference resolvers by forwarding their input to a relation extraction system and reward the resolvers for producing triples that are found in knowledge bases. Since relation extraction systems can rely on different forms of supervision and be biased in different ways, we obtain the best performance, improving over the state of the art, using multi-task reinforcement learning.
△ Less
Submitted 11 November, 2019; v1 submitted 5 September, 2019;
originally announced September 2019.
-
An Immersive Virtual Reality Serious Game to Enhance Earthquake Behavioral Responses and Post-earthquake Evacuation Preparedness in Buildings
Authors:
Zhenan Feng,
Vicente A. González,
Robert Amor,
Michael Spearpoint,
Jared Thomas,
Rafael Sacks,
Ruggiero Lovreglio,
Guillermo Cabrera-Guerrero
Abstract:
Enhancing the earthquake behavioral responses and post-earthquake evacuation preparedness of building occupants is beneficial to increasing their chances of survival and reducing casualties after the main shock of an earthquake. Traditionally, training approaches such as seminars, posters, videos or drills are applied to enhance preparedness. However, they are not highly engaging and have limited…
▽ More
Enhancing the earthquake behavioral responses and post-earthquake evacuation preparedness of building occupants is beneficial to increasing their chances of survival and reducing casualties after the main shock of an earthquake. Traditionally, training approaches such as seminars, posters, videos or drills are applied to enhance preparedness. However, they are not highly engaging and have limited sensory capabilities to mimic life-threatening scenarios for the purpose of training potential participants. Immersive Virtual Reality (IVR) and Serious Games (SG) as innovative digital technologies can be used to create training tools to overcome these limitations. In this study, we propose an IVR SG-based training system to improve earthquake behavioral responses and post-earthquake evacuation preparedness. Auckland City Hospital was chosen as a case study to test our IVR SG training system. A set of learning outcomes based on best evacuation practice has been identified and embedded into several training scenarios of the IVR SG. Hospital staff (healthcare and administrative professionals) and visitors were recruited as participants to be exposed to these training scenarios. Participants' preparedness has been measured along two dimensions: 1) Knowledge about best evacuation practice; 2) Self-efficacy in dealing with earthquake emergencies. Assessment results showed that there was a significant knowledge and self-efficacy increase after the training. And participants acknowledged that it was easy and engaging to learn best evacuation practice knowledge through the IVR SG training system.
△ Less
Submitted 27 May, 2019;
originally announced May 2019.
-
Rapid 3D Reconstruction of Indoor Environments to Generate Virtual Reality Serious Games Scenarios
Authors:
Zhenan Feng,
Vicente A. González,
Ling Ma,
Mustafa M. A. Al-Adhami,
Claudio Mourgues
Abstract:
Virtual Reality (VR) for Serious Games (SGs) is attracting increasing attention for training applications due to its potential to provide significantly enhanced learning to users. Some examples of the application of VR for SGs are complex training evacuation problems such as indoor earthquake evacuation or fire evacuation. The indoor 3D geometry of existing buildings can largely influence evacuees…
▽ More
Virtual Reality (VR) for Serious Games (SGs) is attracting increasing attention for training applications due to its potential to provide significantly enhanced learning to users. Some examples of the application of VR for SGs are complex training evacuation problems such as indoor earthquake evacuation or fire evacuation. The indoor 3D geometry of existing buildings can largely influence evacuees' behaviour, being instrumental in the design of VR SGs storylines and simulation scenarios. The VR scenarios of existing buildings can be generated from drawings and models. However, these data may not reflect the 'as-is' state of the indoor environment and may not be suitable to reflect dynamic changes of the system (e.g. Earthquakes), resulting in excessive development efforts to design credible and meaningful user experience. This paper explores several workflows for the rapid and effective reconstruction of 3D indoor environments of existing buildings that are suitable for earthquake simulations. These workflows start from Building Information Modelling (BIM), laser scanning and 360-degree panoramas. We evaluated the feasibility and efficiency of different approaches by using an earthquake-based case study developed for VR SGs.
△ Less
Submitted 4 December, 2018;
originally announced December 2018.
-
The Effectiveness of Traditional Tools and Computer-Aided Technologies for Health and Safety Training in the Construction Sector: A Systematic Review
Authors:
Yifan Gao,
Vicente Gonzalez,
Tak Wing Yiu
Abstract:
For workers, the exposure to on-site hazards can result in fatalities and serious injuries. To improve safety outcomes, different approaches have been implemented for health and safety training in the construction sector, such as traditional tools and computer-aided technologies (e.g., serious games and virtual reality). However, the effectiveness of these approaches has been barely explored. In o…
▽ More
For workers, the exposure to on-site hazards can result in fatalities and serious injuries. To improve safety outcomes, different approaches have been implemented for health and safety training in the construction sector, such as traditional tools and computer-aided technologies (e.g., serious games and virtual reality). However, the effectiveness of these approaches has been barely explored. In order to bridge this gap, a systematic review of existing studies was conducted. Unlike previous review studies in this field that focused on uncovering the technology characters and challenges, this study mainly evaluated the effectiveness of training using traditional tools and computer-aided technologies on the well-being of individuals. Measures of the effectiveness included knowledge acquisition, unsafe behaviour alteration, and injury rate reduction. Results indicated that: 1. the effectiveness of traditional tools is sufficiently supported by statistical evidence; and 2. the use of computer-aided technologies has evidence to support its effectiveness, but more solid evidence is required to support this statement. It was also found that the overall performance of computer-aided technologies is superior in several technical aspects compared to traditional tools, namely, representing actual workplace situations, providing text-free interfaces, having better user engagement, and being more cost-efficient. Finally, using the systematic review findings, a theoretical framework is proposed as a potential solution to help future research in this field systematically examine the effectiveness and usability of their approaches. This framework is theoretical in nature and requires further validation. A further study is therefore proposed to test and validate this framework.
△ Less
Submitted 5 August, 2018;
originally announced August 2018.
-
Immersive Virtual Reality Serious Games for Evacuation Training and Research: A Systematic Literature Review
Authors:
Zhenan Feng,
Vicente A. González,
Robert Amor,
Ruggiero Lovreglio,
Guillermo Cabrera
Abstract:
An appropriate and safe behavior for exiting a facility is key to reducing injuries and increasing survival when facing an emergency evacuation in a building. Knowledge on the best evacuation practice is commonly delivered by traditional training approaches such as videos, posters, or evacuation drills, but they may become ineffective in terms of knowledge acquisition and retention. Serious games…
▽ More
An appropriate and safe behavior for exiting a facility is key to reducing injuries and increasing survival when facing an emergency evacuation in a building. Knowledge on the best evacuation practice is commonly delivered by traditional training approaches such as videos, posters, or evacuation drills, but they may become ineffective in terms of knowledge acquisition and retention. Serious games (SGs) are an innovative approach devoted to training and educating people in a gaming environment. Recently, increasing attention has been paid to immersive virtual reality (IVR)-based SGs for evacuation knowledge delivery and behavior assessment because they are highly engaging and promote greater cognitive learning.
This paper aims to understand the development and implementation of IVR SGs in the context of building evacuation training and research, applied to various indoor emergencies such as fire and earthquake. Thus, a conceptual framework for effective design and implementation through the systematic literature review method was developed. As a result, this framework integrates critical aspects and provides connections between them, including pedagogical and behavioral impacts, gaming environment development, and outcome and participation experience measures.
△ Less
Submitted 13 September, 2018; v1 submitted 14 May, 2018;
originally announced May 2018.
-
Prototyping Virtual Reality Serious Games for Building Earthquake Preparedness: The Auckland City Hospital Case Study
Authors:
Ruggiero Lovreglio,
Vicente Gonzalez,
Zhenan Feng,
Robert Amor,
Michael Spearpoint,
Jared Thomas,
Margaret Trotter,
Rafael Sacks
Abstract:
Enhancing evacuee safety is a key factor in reducing the number of injuries and deaths that result from earthquakes. One way this can be achieved is by training occupants. Virtual Reality (VR) and Serious Games (SGs), represent novel techniques that may overcome the limitations of traditional training approaches. VR and SGs have been examined in the fire emergency context, however, their applicati…
▽ More
Enhancing evacuee safety is a key factor in reducing the number of injuries and deaths that result from earthquakes. One way this can be achieved is by training occupants. Virtual Reality (VR) and Serious Games (SGs), represent novel techniques that may overcome the limitations of traditional training approaches. VR and SGs have been examined in the fire emergency context, however, their application to earthquake preparedness has not yet been extensively examined. We provide a theoretical discussion of the advantages and limitations of using VR SGs to investigate how building occupants behave during earthquake evacuations and to train building occupants to cope with such emergencies. We explore key design components for developing a VR SG framework: (a) what features constitute an earthquake event, (b) which building types can be selected and represented within the VR environment, (c) how damage to the building can be determined and represented, (d) how non-player characters (NPC) can be designed, and (e) what level of interaction there can be between NPC and the human participants. We illustrate the above by presenting the Auckland City Hospital, New Zealand as a case study, and propose a possible VR SG training tool to enhance earthquake preparedness in public buildings.
△ Less
Submitted 25 February, 2018;
originally announced February 2018.
-
Knowledge management metrics for Public Organizations: A literature review-based proposal
Authors:
Pérez López-Portillo,
Héctor,
Vázquez González,
Edgar René,
Romero Hidalgo,
Jorge Alberto
Abstract:
Knowledge Management (KM) is a relatively new phenomenon that appears in the field of Public Sector Organizations (PSO) bringing new paradigms of organizational management, challenges, risks and opportunities for its implementation, development and evaluation. KM can be seen as a systematic and deliberate effort to coordinate people, technology, organizational structures and its environment throug…
▽ More
Knowledge Management (KM) is a relatively new phenomenon that appears in the field of Public Sector Organizations (PSO) bringing new paradigms of organizational management, challenges, risks and opportunities for its implementation, development and evaluation. KM can be seen as a systematic and deliberate effort to coordinate people, technology, organizational structures and its environment through knowledge reuse and innovation. This management approach has been established in parallel with the development and use of information and communications technologies (ICT). Nowadays more PSO are embodying KM practices in their core processes for support them, and as an advanced management strategy to create a new culture based on technology and resources efficiency. In this paper, we observed that KM can support organizational goals in PSO. The aim of this paper is to understand KM factors and its associated components, and propose KM metrics for measure KM programs in PSO. Through a critical literature review we analysed diverse studies related with KM performance indicators in PSO, then based on previous works we summarized the more convenient this purpose. We found that, in academic literature, studies about KM measurement in PSO are uncommon and emerging. As well, in the last section of this paper, we present a proposal of KM metrics for PSO, and some recommendations and practical implications for KM metrics development in PSO. This academic endeavour seeks to contribute to theoretical debate about KM measure development for KM initiatives in PSO.
△ Less
Submitted 29 September, 2016;
originally announced September 2016.
-
Tweeting Over The Border: An Empirical Study of Transnational Migration in San Diego and Tijuana
Authors:
Victor R. Martinez,
Antonio Mancilla,
Victor M. Gonzalez
Abstract:
Sociological studies on transnational migration are often based on surveys or interviews, an expensive and time consuming approach. On the other hand, the pervasiveness of mobile phones and location aware social networks has introduced new ways to understand human mobility patterns at a national or global scale. In this work, we leverage geo located information obtained from Twitter as to understa…
▽ More
Sociological studies on transnational migration are often based on surveys or interviews, an expensive and time consuming approach. On the other hand, the pervasiveness of mobile phones and location aware social networks has introduced new ways to understand human mobility patterns at a national or global scale. In this work, we leverage geo located information obtained from Twitter as to understand transnational migration patterns between two border cities (San Diego, USA and Tijuana, Mexico). We obtained 10.9 million geo located tweets from December 2013 to January 2015. Our method infers human mobility by inspecting tweet submissions and user's home locations. Our results depict a trans national community structure that exhibits the formation of a functional metropolitan area that physically transcends international borders. These results show the potential for re analysing sociology phenomena from a technology based empirical perspective.
△ Less
Submitted 21 July, 2015;
originally announced July 2015.