Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs > arXiv:2305.04258

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Computer Science > Databases

arXiv:2305.04258 (cs)
[Submitted on 7 May 2023]

Title:From Unstructured to Structured: Transforming Chatbot Dialogues into Data Mart Schema for Visualization

Authors:Mark Edward M. Gonzales, Elyssia Barrie H. Ong, Charibeth K. Cheng, Ethel Chua Joy Ong, Judith J. Azcarraga
View a PDF of the paper titled From Unstructured to Structured: Transforming Chatbot Dialogues into Data Mart Schema for Visualization, by Mark Edward M. Gonzales and 4 other authors
View PDF
Abstract:Schools are among the primary avenues for public healthcare interventions. With resource limitations posing challenges to the routine conduct of health and wellness checks in Philippine public schools, the deployment of a chatbot-assisted health monitoring system may provide an alternative method. However, deriving insights from raw conversations is not straightforward due to the expressiveness of natural language that causes variances in the input. In this paper, we present a process for transforming unstructured dialogues into a structured schema. The process comprises four stages: (i) processing the dialogues through entity extraction and data aggregation, (ii) storing them as NoSQL documents on the cloud, (iii) transforming them into a star schema for online analytical processing and building an extract-transform-load workflow, and (iv) creating a web-based dashboard for visualizing summarized data and reports. Performance evaluation of this dashboard showed that increasing the number of stored dialogues by a factor of 100,000 increased the loading time for the display of roll-up, drill-down, and filter results by around only one second.
Comments: Accepted for paper presentation at the 23rd Philippine Computing Science Congress (PCSC 2023), held in Cebu, Philippines
Subjects: Databases (cs.DB); Human-Computer Interaction (cs.HC)
ACM classes: H.3; H.5.2
Cite as: arXiv:2305.04258 [cs.DB]
  (or arXiv:2305.04258v1 [cs.DB] for this version)
  https://doi.org/10.48550/arXiv.2305.04258
arXiv-issued DOI via DataCite

Submission history

From: Mark Edward Gonzales [view email]
[v1] Sun, 7 May 2023 12:18:05 UTC (1,366 KB)
Full-text links:

Access Paper:

    View a PDF of the paper titled From Unstructured to Structured: Transforming Chatbot Dialogues into Data Mart Schema for Visualization, by Mark Edward M. Gonzales and 4 other authors
  • View PDF
  • TeX Source
  • Other Formats
license icon view license
Current browse context:
cs
< prev   |   next >
new | recent | 2023-05
Change to browse by:
cs.DB
cs.HC

References & Citations

  • NASA ADS
  • Google Scholar
  • Semantic Scholar
a export BibTeX citation Loading...

BibTeX formatted citation

×
Data provided by:

Bookmark

BibSonomy logo Reddit logo

Bibliographic and Citation Tools

Bibliographic Explorer (What is the Explorer?)
Connected Papers (What is Connected Papers?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)

Code, Data and Media Associated with this Article

alphaXiv (What is alphaXiv?)
CatalyzeX Code Finder for Papers (What is CatalyzeX?)
DagsHub (What is DagsHub?)
Gotit.pub (What is GotitPub?)
Hugging Face (What is Huggingface?)
Papers with Code (What is Papers with Code?)
ScienceCast (What is ScienceCast?)

Demos

Replicate (What is Replicate?)
Hugging Face Spaces (What is Spaces?)
TXYZ.AI (What is TXYZ.AI?)

Recommenders and Search Tools

Influence Flower (What are Influence Flowers?)
CORE Recommender (What is CORE?)
  • Author
  • Venue
  • Institution
  • Topic

arXivLabs: experimental projects with community collaborators

arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.

Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.

Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack