-
Can LLM Agents Maintain a Persona in Discourse?
Authors:
Pranav Bhandari,
Nicolas Fay,
Michael Wise,
Amitava Datta,
Stephanie Meek,
Usman Naseem,
Mehwish Nasim
Abstract:
Large Language Models (LLMs) are widely used as conversational agents in sectors such as education, law, medicine, and more. However, LLMs are often subject to context-shifting behaviour, resulting in a lack of consistent and interpretable personality-aligned interactions. Adherence to psychological traits lacks comprehensive analysis, especially in the case of dyadic (pairwise) conversations. We examine this challenge from two viewpoints, first using two conversation agents to generate a discourse on a certain topic, each with an assigned personality from the OCEAN framework (Openness, Conscientiousness, Extraversion, Agreeableness, and Neuroticism) set to High/Low for each trait. This is followed by using multiple judge agents to infer the originally assigned traits, in order to explore prediction consistency, inter-model agreement, and alignment with the assigned personality. Our findings indicate that while LLMs can be guided toward personality-driven dialogue, their ability to maintain personality traits varies significantly depending on the combination of models and discourse settings. These inconsistencies emphasise the challenges in achieving stable and interpretable personality-aligned interactions in LLMs.
Submitted 17 February, 2025;
originally announced February 2025.
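The abstract of this entry describes assigning each OCEAN trait as High/Low and then having judge agents infer the assignment. A minimal sketch of that bookkeeping follows, assuming profiles are plain dictionaries; the function names (`all_profiles`, `trait_accuracy`, `judge_agreement`) and the two scoring rules are hypothetical illustrations, not the authors' evaluation code.

```python
from itertools import product

TRAITS = ["Openness", "Conscientiousness", "Extraversion",
          "Agreeableness", "Neuroticism"]
LEVELS = ("High", "Low")


def all_profiles():
    """Enumerate every High/Low assignment over the five OCEAN traits."""
    return [dict(zip(TRAITS, combo))
            for combo in product(LEVELS, repeat=len(TRAITS))]


def trait_accuracy(assigned, predicted):
    """Fraction of traits where a judge's prediction matches the assignment."""
    matches = sum(assigned[t] == predicted[t] for t in TRAITS)
    return matches / len(TRAITS)


def judge_agreement(predictions):
    """Fraction of traits on which all judges predicted the same level."""
    unanimous = sum(len({p[t] for p in predictions}) == 1 for t in TRAITS)
    return unanimous / len(TRAITS)


if __name__ == "__main__":
    assigned = all_profiles()[0]                    # all traits High
    judge_a = dict(assigned)                        # hypothetical judge output
    judge_b = dict(assigned, Neuroticism="Low")     # hypothetical judge output
    print(len(all_profiles()))
    print(trait_accuracy(assigned, judge_b))
    print(judge_agreement([judge_a, judge_b]))
```

With five binary traits there are 2^5 = 32 possible profiles; how many of them the study actually uses is not stated in the abstract.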
-
Text Understanding in GPT-4 vs Humans
Authors:
Thomas R. Shultz,
Jamie M. Wise,
Ardavan Salehi Nobandegani
Abstract:
We examine whether a leading AI system, GPT4, understands text as well as humans do, first using a well-established standardized test of discourse comprehension. On this test, GPT4 performs slightly, but not statistically significantly, better than humans given the very high level of human performance. Both GPT4 and humans make correct inferences about information that is not explicitly stated in the text, a critical test of understanding. Next, we use more difficult passages to determine whether that could allow larger differences between GPT4 and humans. GPT4 does considerably better on this more difficult text than do the high school and university students for whom these text passages are designed, as admission tests of student reading comprehension. Deeper exploration of GPT4 performance on material from one of these admission tests reveals generally accepted signatures of genuine understanding, namely generalization and inference.
Submitted 17 January, 2025; v1 submitted 25 March, 2024;
originally announced March 2024.
-
Slice Transformer and Self-supervised Learning for 6DoF Localization in 3D Point Cloud Maps
Authors:
Muhammad Ibrahim,
Naveed Akhtar,
Saeed Anwar,
Michael Wise,
Ajmal Mian
Abstract:
Precise localization is critical for autonomous vehicles. We present a self-supervised learning method that employs Transformers for the first time for the task of outdoor localization using LiDAR data. We propose a pretext task that reorganizes the slices of a $360^\circ$ LiDAR scan to leverage its axial properties. Our model, called Slice Transformer, employs multi-head attention while systematically processing the slices. To the best of our knowledge, this is the first instance of leveraging multi-head attention for outdoor point clouds. We additionally introduce the Perth-WA dataset, which provides a large-scale LiDAR map of Perth city in Western Australia, covering an area of $\sim$4 km$^2$. Localization annotations are provided for Perth-WA. The proposed localization method is thoroughly evaluated on the Perth-WA and Apollo-SouthBay datasets. We also establish the efficacy of our self-supervised learning approach for the common downstream task of object classification using the ModelNet40 and ScanNN datasets. The code and Perth-WA data will be publicly released.
Submitted 13 August, 2023; v1 submitted 21 January, 2023;
originally announced January 2023.
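The pretext task in this entry reorganizes the slices of a $360^\circ$ LiDAR scan. One way such a task could be set up is sketched below, under the assumption that a slice is an angular sector around the vertical axis and that the self-supervised target is the applied permutation; the abstract specifies neither, and `slice_scan` and `shuffle_slices` are hypothetical names.

```python
import math
import random


def slice_scan(points, num_slices):
    """Group (x, y, z) points into equal angular sectors around the z-axis.

    Assumption: a "slice" is an azimuthal sector; the paper may define it
    differently.
    """
    slices = [[] for _ in range(num_slices)]
    width = 2 * math.pi / num_slices
    for x, y, z in points:
        azimuth = math.atan2(y, x) % (2 * math.pi)
        # Clamp guards against floating-point round-up at the 2*pi boundary.
        index = min(int(azimuth / width), num_slices - 1)
        slices[index].append((x, y, z))
    return slices


def shuffle_slices(slices, rng):
    """Permute the slices and return them together with the permutation.

    The permutation is what a model would be asked to recover.
    """
    order = list(range(len(slices)))
    rng.shuffle(order)
    return [slices[i] for i in order], order


if __name__ == "__main__":
    scan = [(1.0, 1.0, 0.0), (-1.0, 1.0, 0.2),
            (-1.0, -1.0, 0.1), (1.0, -1.0, 0.3)]
    sectors = slice_scan(scan, num_slices=4)
    shuffled, order = shuffle_slices(sectors, random.Random(0))
    print(order)
```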
-
Structure-preserving, energy stable numerical schemes for a liquid thin film coarsening model
Authors:
Juan Zhang,
Cheng Wang,
Steven M. Wise,
Zhengru Zhang
Abstract:
In this paper, two finite difference numerical schemes are proposed and analyzed for the droplet liquid film model, with a singular Lennard-Jones energy potential involved. Both first and second order accurate temporal algorithms are considered. In the first order scheme, the convex potential and the surface diffusion terms are treated implicitly, while the concave potential term is updated explicitly. Furthermore, we provide a theoretical justification that this numerical algorithm has a unique solution, such that the positivity is always preserved for the phase variable at a point-wise level, so that a singularity is avoided in the scheme. In fact, the singular nature of the Lennard-Jones potential term around the value of 0 prevents the numerical solution from reaching this singular value, so that the positivity structure is always preserved. Moreover, an unconditional energy stability of the numerical scheme is derived, without any restriction on the time step size. In the second order numerical scheme, the BDF temporal stencil is applied, and an alternate convex-concave decomposition is derived, so that the concave part corresponds to a quadratic energy. In turn, the combined Lennard-Jones potential term is treated implicitly, the concave part is approximated by a second order Adams-Bashforth explicit extrapolation, and an artificial Douglas-Dupont regularization term is added to ensure the energy stability. The unique solvability and the positivity-preserving property for the second order scheme can be similarly established. In addition, optimal rate convergence analysis is provided for both the first and second order accurate schemes. A few numerical simulation results are also presented, which demonstrate the robustness of the numerical schemes.
Submitted 21 December, 2020;
originally announced December 2020.
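The implicit-convex, explicit-concave treatment in this entry's first order scheme rests on a standard convexity argument, sketched here in generic form. The symbols $E_c$, $E_e$, $\phi^n$ and the inner product $\langle\cdot,\cdot\rangle$ are assumed notation rather than the paper's, and the specific energy of the thin film model is not reproduced.

```latex
% Assumed splitting: E = E_c - E_e with E_c and E_e both convex.
% Convexity of E_c:  E_c(\phi^{n+1}) - E_c(\phi^n)
%                      \le \langle \delta_\phi E_c(\phi^{n+1}), \phi^{n+1} - \phi^n \rangle
% Convexity of E_e:  E_e(\phi^{n+1}) - E_e(\phi^n)
%                      \ge \langle \delta_\phi E_e(\phi^n), \phi^{n+1} - \phi^n \rangle
% Subtracting the second inequality from the first gives
\[
  E(\phi^{n+1}) - E(\phi^n)
  \le \bigl\langle \delta_\phi E_c(\phi^{n+1}) - \delta_\phi E_e(\phi^n),\;
      \phi^{n+1} - \phi^n \bigr\rangle .
\]
```

If the time-stepping rule makes the right-hand side non-positive for every step size, the discrete energy cannot increase, which is the sense in which such a scheme is unconditionally energy stable.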
-
Astronomy and Computing: a New Journal for the Astronomical Computing Community
Authors:
Alberto Accomazzi,
Tamás Budavári,
Christopher Fluke,
Norman Gray,
Robert G Mann,
William O'Mullane,
Andreas Wicenec,
Michael Wise
Abstract:
We introduce \emph{Astronomy and Computing}, a new journal for the growing population of people working in the domain where astronomy overlaps with computer science and information technology. The journal aims to provide a new communication channel within that community, which is not well served by current journals, and to help secure recognition of its true importance within modern astronomy. In this inaugural editorial, we describe the rationale for creating the journal, outline its scope and ambitions, and seek input from the community in defining in detail how the journal should work towards its high-level goals.
Submitted 30 October, 2012;
originally announced October 2012.