Showing 1–50 of 75 results for author: Pedersen, T

Searching in archive cs.
  1. arXiv:2506.01364  [pdf, ps, other]

    cs.LG cs.AI

    Unraveling Spatio-Temporal Foundation Models via the Pipeline Lens: A Comprehensive Review

    Authors: Yuchen Fang, Hao Miao, Yuxuan Liang, Liwei Deng, Yue Cui, Ximu Zeng, Yuyang Xia, Yan Zhao, Torben Bach Pedersen, Christian S. Jensen, Xiaofang Zhou, Kai Zheng

    Abstract: Spatio-temporal deep learning models aim to utilize useful patterns in such data to support tasks like prediction. However, previous deep learning models designed for specific tasks typically require separate training for each use case, leading to increased computational and storage costs. To address this issue, spatio-temporal foundation models have emerged, offering a unified framework capable…

    Submitted 2 June, 2025; originally announced June 2025.

    Comments: 21 pages, 10 figures

  2. arXiv:2505.12616  [pdf, ps, other]

    cs.CL

    Duluth at SemEval-2025 Task 7: TF-IDF with Optimized Vector Dimensions for Multilingual Fact-Checked Claim Retrieval

    Authors: Shujauddin Syed, Ted Pedersen

    Abstract: This paper presents the Duluth approach to the SemEval-2025 Task 7 on Multilingual and Crosslingual Fact-Checked Claim Retrieval. We implemented a TF-IDF-based retrieval system with experimentation on vector dimensions and tokenization strategies. Our best-performing configuration used word-level tokenization with a vocabulary size of 15,000 features, achieving an average success@10 score of 0.78…

    Submitted 18 May, 2025; originally announced May 2025.

    Comments: SemEval-2025

    MSC Class: 68T50

  3. arXiv:2504.03595  [pdf, other]

    cs.CL

    Extending the SAREF4ENER Ontology with Flexibility Based on FlexOffers

    Authors: Fabio Lilliu, Amir Laadhar, Christian Thomsen, Diego Reforgiato Recupero, Torben Bach Pedersen

    Abstract: A key element to support the increased amounts of renewable energy in the energy system is flexibility, i.e., the possibility of changing energy loads in time and amount. Many flexibility models have been designed; however, exact models fail to scale for long time horizons or many devices. Because of this, the FlexOffer (FOs) model has been designed, to provide device-independent approximations of…

    Submitted 18 April, 2025; v1 submitted 4 April, 2025; originally announced April 2025.

    Comments: 13 pages, 5 figures, 4 tables. Submitted to SmartGridComm 2025

  4. arXiv:2501.14432  [pdf, other]

    cs.DB cs.IR cs.IT

    CAMEO: Autocorrelation-Preserving Line Simplification for Lossy Time Series Compression

    Authors: Carlos Enrique Muñiz-Cuza, Matthias Boehm, Torben Bach Pedersen

    Abstract: Time series data from a variety of sensors and IoT devices need effective compression to reduce storage and I/O bandwidth requirements. While most time series databases and systems rely on lossless compression, lossy techniques offer even greater space-saving with a small loss in precision. However, the unknown impact on downstream analytics applications requires a semi-manual trial-and-error expl…

    Submitted 24 January, 2025; originally announced January 2025.

    Comments: 14 pages, 13 figures

    ACM Class: E.2; H.3.2; H.2.8

  5. arXiv:2412.06940  [pdf, other]

    cs.LG eess.SP

    Digital Twin-Empowered Voltage Control for Power Systems

    Authors: Jiachen Xu, Yushuai Li, Torben Bach Pedersen, Yuqiang He, Kim Guldstrand Larsen, Tianyi Li

    Abstract: Emerging digital twin technology has the potential to revolutionize voltage control in power systems. However, the state-of-the-art digital twin method suffers from low computational and sampling efficiency, which hinders its applications. To address this issue, we propose a Gumbel-Consistency Digital Twin (GC-DT) method that enhances voltage control with improved computational and sampling effici…

    Submitted 9 December, 2024; originally announced December 2024.

    Comments: 6 pages, 1 figure, conference paper

  6. arXiv:2412.00034  [pdf, other]

    cs.DB

    Data-Driven Prescriptive Analytics Applications: A Comprehensive Survey

    Authors: Martin Moesmann, Torben Bach Pedersen

    Abstract: Prescriptive Analytics (PSA), an emerging business analytics field suggesting concrete options for solving business problems, has seen an increasing amount of interest after more than a decade of multidisciplinary research. This paper is a comprehensive survey of existing applications within PSA in terms of their use cases, methodologies, and possible future research directions. To ensure a manage…

    Submitted 22 May, 2025; v1 submitted 21 November, 2024; originally announced December 2024.

    Comments: This work has been submitted to Elsevier for possible publication

  7. Modular assurance of an Autonomous Ferry using Contract-Based Design and Simulation-based Verification Principles

    Authors: Jon Arne Glomsrud, Stephanie Kemna, Chanjei Vasanthan, Luman Zhao, Dag McGeorge, Tom Arne Pedersen, Tobias Rye Torben, Børge Rokseth, Dong Trong Nguyen

    Abstract: With the introduction of autonomous technology into our society, e.g. autonomous shipping, it is important to assess and assure the safety of autonomous systems in a real-world context. Simulation-based testing is a common approach to attempt to verify performance of autonomous systems, but assurance also requires formal evidence. This paper introduces the Assurance of Digital Assets (ADA) framewo…

    Submitted 30 October, 2024; v1 submitted 6 August, 2024; originally announced August 2024.

    Comments: 12 pages, 3 figures, final draft submitted to ICMASS/MTEC 2024 conference

  8. arXiv:2407.01544  [pdf, other]

    cs.NI cs.AI

    Decentralized Multi-Party Multi-Network AI for Global Deployment of 6G Wireless Systems

    Authors: Merim Dzaferagic, Marco Ruffini, Nina Slamnik-Krijestorac, Joao F. Santos, Johann Marquez-Barja, Christos Tranoris, Spyros Denazis, Thomas Kyriakakis, Panagiotis Karafotis, Luiz DaSilva, Shashi Raj Pandey, Junya Shiraishi, Petar Popovski, Soren Kejser Jensen, Christian Thomsen, Torben Bach Pedersen, Holger Claussen, Jinfeng Du, Gil Zussman, Tingjun Chen, Yiran Chen, Seshu Tirupathi, Ivan Seskar, Daniel Kilper

    Abstract: Multiple visions of 6G networks elicit Artificial Intelligence (AI) as a central, native element. When 6G systems are deployed at a large scale, end-to-end AI-based solutions will necessarily have to encompass both the radio and the fiber-optical domain. This paper introduces the Decentralized Multi-Party, Multi-Network AI (DMMAI) framework for integrating AI into 6G networks deployed at scale. DM…

    Submitted 15 April, 2024; originally announced July 2024.

  9. An Explainable and Conformal AI Model to Detect Temporomandibular Joint Involvement in Children Suffering from Juvenile Idiopathic Arthritis

    Authors: Lena Todnem Bach Christensen, Dikte Straadt, Stratos Vassis, Christian Marius Lillelund, Peter Bangsgaard Stoustrup, Ruben Pauwels, Thomas Klit Pedersen, Christian Fischer Pedersen

    Abstract: Juvenile idiopathic arthritis (JIA) is the most common rheumatic disease during childhood and adolescence. The temporomandibular joints (TMJ) are among the most frequently affected joints in patients with JIA, and mandibular growth is especially vulnerable to arthritic changes of the TMJ in children. A clinical examination is the most cost-effective method to diagnose TMJ involvement, but clinicia…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: Accepted at EMBC 2024

    Journal ref: Proceedings of the IEEE EMBC, 2024, Vol. 1, pp. 1-4, IEEE

  10. Is It Really You Who Forgot the Password? When Account Recovery Meets Risk-Based Authentication

    Authors: Andre Büttner, Andreas Thue Pedersen, Stephan Wiefling, Nils Gruschka, Luigi Lo Iacono

    Abstract: Risk-based authentication (RBA) is used in online services to protect user accounts from unauthorized takeover. RBA commonly uses contextual features that indicate a suspicious login attempt when the characteristic attributes of the login context deviate from known and thus expected values. Previous research on RBA and anomaly detection in authentication has mainly focused on the login process. Ho…

    Submitted 18 March, 2024; originally announced March 2024.

  11. A Comparative Study of Rapidly-exploring Random Tree Algorithms Applied to Ship Trajectory Planning and Behavior Generation

    Authors: Trym Tengesdal, Tom Arne Pedersen, Tor Arne Johansen

    Abstract: Rapidly Exploring Random Tree (RRT) algorithms, notably used for nonholonomic vehicle navigation in complex environments, are often not thoroughly evaluated for their specific challenges. This paper presents a first such comparison study of the variants Potential-Quick RRT* (PQ-RRT*), Informed RRT* (IRRT*), RRT*, and RRT, in maritime single-query nonholonomic motion planning. Additionally, the pra…

    Submitted 17 April, 2024; v1 submitted 2 March, 2024; originally announced March 2024.

  12. arXiv:2401.07944  [pdf, ps, other]

    cs.CL

    SemEval-2017 Task 4: Sentiment Analysis in Twitter using BERT

    Authors: Rupak Kumar Das, Ted Pedersen

    Abstract: This paper uses the BERT model, which is a transformer-based architecture, to solve task 4A, English Language, Sentiment Analysis in Twitter of SemEval2017. BERT is a very powerful large language model for classification tasks when the amount of training data is small. For this experiment, we have used the BERT(BASE) model, which has 12 hidden layers. This model provides better accuracy, precision…

    Submitted 19 June, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

  13. arXiv:2401.06524  [pdf, ps, other]

    cs.LG

    Domain Adaptation for Time series Transformers using One-step fine-tuning

    Authors: Subina Khanal, Seshu Tirupathi, Giulio Zizzo, Ambrish Rawat, Torben Bach Pedersen

    Abstract: The recent breakthrough of Transformers in deep learning has drawn significant attention of the time series community due to their ability to capture long-range dependencies. However, like other deep learning models, Transformers face limitations in time series prediction, including insufficient temporal understanding, generalization challenges, and data shift issues for the domains with limited d…

    Submitted 12 January, 2024; originally announced January 2024.

    Comments: Accepted at the Fourth Workshop of Artificial Intelligence for Time Series Analysis (AI4TS): Theory, Algorithms, and Applications, AAAI 2024, Vancouver, Canada

  14. arXiv:2312.08557  [pdf, other]

    cs.DB

    Creating and Querying Data Cubes in Python using pyCube

    Authors: Sigmundur Vang, Christian Thomsen, Torben Bach Pedersen

    Abstract: Data cubes are used for analyzing large data sets usually contained in data warehouses. The most popular data cube tools use graphical user interfaces (GUI) to do the data analysis. Traditionally this was fine since data analysts were not expected to be technical people. However, in the subsequent decades the data landscape changed dramatically requiring companies to employ large teams of highly t…

    Submitted 28 January, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: Extended version of DOLAP2024 submission

  15. arXiv:2309.12807  [pdf, other]

    cs.RO

    Teacher-Student Reinforcement Learning for Mapless Navigation using a Planetary Space Rover

    Authors: Anton Bjørndahl Mortensen, Emil Tribler Pedersen, Laia Vives Benedicto, Lionel Burg, Mads Rossen Madsen, Simon Bøgh

    Abstract: We address the challenge of enhancing navigation autonomy for planetary space rovers using reinforcement learning (RL). The ambition of future space missions necessitates advanced autonomous navigation capabilities for rovers to meet mission objectives. RL's potential in robotic autonomy is evident, but its reliance on simulations poses a challenge. Transferring policies to real-world scenarios of…

    Submitted 22 September, 2023; originally announced September 2023.

  16. arXiv:2306.10994  [pdf, other]

    cs.DB

    Efficient Generalized Temporal Pattern Mining in Big Time Series Using Mutual Information

    Authors: Van Long Ho, Nguyen Ho, Torben Bach Pedersen, Panagiotis Papapetrou

    Abstract: Big time series are increasingly available from an ever wider range of IoT-enabled sensors deployed in various environments. Significant insights can be gained by mining temporal patterns from these time series. Temporal pattern mining (TPM) extends traditional pattern mining by adding event time intervals into extracted patterns, making them more expressive at the expense of increased time and sp…

    Submitted 19 June, 2023; originally announced June 2023.

    Comments: arXiv admin note: text overlap with arXiv:2010.03653

  17. Goal-Oriented Scheduling in Sensor Networks with Application Timing Awareness

    Authors: Josefine Holm, Federico Chiariotti, Anders E. Kalør, Beatriz Soret, Torben Bach Pedersen, Petar Popovski

    Abstract: Taking inspiration from linguistics, the communications theoretical community has recently shown significant interest in pragmatic, or goal-oriented, communication. In this paper, we tackle the problem of pragmatic communication with multiple clients with different, and potentially conflicting, objectives. We capture the goal-oriented aspect through the metric of Value of Information (Vo…

    Submitted 6 June, 2023; originally announced June 2023.

    Journal ref: IEEE Transactions on Communications, 2023

  18. arXiv:2209.04635  [pdf, other]

    cs.LG cs.AI

    A Comparative Study on Unsupervised Anomaly Detection for Time Series: Experiments and Analysis

    Authors: Yan Zhao, Liwei Deng, Xuanhao Chen, Chenjuan Guo, Bin Yang, Tung Kieu, Feiteng Huang, Torben Bach Pedersen, Kai Zheng, Christian S. Jensen

    Abstract: The continued digitization of societal processes translates into a proliferation of time series data that cover applications such as fraud detection, intrusion detection, and energy management, where anomaly detection is often essential to enable reliability and safety. Many recent studies target anomaly detection for time series data. Indeed, the area of time series anomaly detection is characterized…

    Submitted 10 September, 2022; originally announced September 2022.

  19. arXiv:2206.14604  [pdf, other]

    cs.DB

    Mining Seasonal Temporal Patterns in Time Series

    Authors: Van Long Ho, Nguyen Ho, Torben Bach Pedersen

    Abstract: Very large time series are increasingly available from an ever wider range of IoT-enabled sensors, from which significant insights can be obtained through mining temporal patterns from them. A useful type of patterns found in many real-world applications exhibits periodic occurrences, and is thus called seasonal temporal pattern (STP). Compared to regular patterns, mining seasonal temporal pattern…

    Submitted 9 January, 2023; v1 submitted 28 June, 2022; originally announced June 2022.

  20. arXiv:2204.09131  [pdf, other]

    cs.DB

    A Unified Approach for Multi-Scale Synchronous Correlation Search in Big Time Series -- Full Version

    Authors: Nguyen Ho, Van Long Ho, Torben Bach Pedersen, Mai Vu, Christophe A. N. Biscio

    Abstract: The wide deployment of IoT sensors has enabled the collection of very big time series across different domains, from which advanced analytics can be performed to find unknown relationships, most importantly the correlations between them. However, current approaches for correlation search on time series are limited to only a single temporal scale and simple types of relations, and cannot handle noi…

    Submitted 19 April, 2022; originally announced April 2022.

    Comments: 18 pages

  21. arXiv:2204.07039  [pdf, other]

    cs.LO

    Methods for Efficient Unfolding of Colored Petri Nets

    Authors: Alexander Bilgram, Peter G. Jensen, Thomas Pedersen, Jiri Srba, Peter H. Taankvist

    Abstract: Colored Petri nets offer a compact and user friendly representation of the traditional P/T nets and colored nets with finite color ranges can be unfolded into the underlying P/T nets, however, at the expense of an exponential explosion in size. We present two novel techniques based on static analysis in order to reduce the size of unfolded colored nets. The first method identifies colors that beha…

    Submitted 11 October, 2023; v1 submitted 12 April, 2022; originally announced April 2022.

    Journal ref: Fundamenta Informaticae, Volume 189, Issues 3-4: Reachability Problems 2020 and 2021 (October 14, 2023) fi:9351

  22. arXiv:2202.08504  [pdf, other]

    cs.SI

    Finding Representative Sampling Subsets in Sensor Graphs using Time Series Similarities

    Authors: Roshni Chakraborty, Josefine Holm, Torben Bach Pedersen, Petar Popovski

    Abstract: With the increasing use of IoT-enabled sensors, it is important to have effective methods for querying the sensors. For example, in a dense network of battery-driven temperature sensors, it is often possible to query (sample) just a subset of the sensors at any given time, since the values of the non-sampled sensors can be estimated from the sampled values. If we can divide the set of sensors into…

    Submitted 18 February, 2022; v1 submitted 17 February, 2022; originally announced February 2022.

  23. arXiv:2110.09800  [pdf, other]

    eess.SY cs.CE

    Optimal Scheduling of Flexible Power-to-X Technologies in the Day-ahead Electricity Market

    Authors: Neeraj Dhanraj Bokde, Tim T Pedersen, Gorm Bruun Andresen

    Abstract: The ambitious CO2 emission targets of the Paris agreements are achievable only with renewable energy, CO2-free power generation, new policies, and planning. The main motivation of this paper is that future green fuels from power-to-X assets should be produced from power with the lowest possible emissions while still keeping the cost of electricity low. To this end we propose a power-to-X schedulin…

    Submitted 19 October, 2021; originally announced October 2021.

  24. arXiv:2109.11609  [pdf, ps, other]

    cs.DB

    Evolutionary Clustering of Streaming Trajectories

    Authors: Tianyi Li, Lu Chen, Christian S. Jensen, Torben Bach Pedersen, Jilin Hu

    Abstract: The widespread deployment of smartphones and location-enabled, networked in-vehicle devices renders it increasingly feasible to collect streaming trajectory data of moving objects. The continuous clustering of such data can enable a variety of real-time services, such as identifying representative paths or common moving trends among objects in real-time. However, little attention has so far been g…

    Submitted 23 September, 2021; originally announced September 2021.

  25. arXiv:2106.07385  [pdf, other]

    cs.CL cs.AI cs.DL cs.IR cs.LG

    SemEval-2021 Task 11: NLPContributionGraph -- Structuring Scholarly NLP Contributions for a Research Knowledge Graph

    Authors: Jennifer D'Souza, Sören Auer, Ted Pedersen

    Abstract: There is currently a gap between the natural language expression of scholarly publications and their structured semantic content modeling to enable intelligent content search. With the volume of research growing exponentially every year, a search feature operating over semantically structured content is compelling. The SemEval-2021 Shared Task NLPContributionGraph (a.k.a. 'the NCG task') tasks par…

    Submitted 15 October, 2021; v1 submitted 10 June, 2021; originally announced June 2021.

    Comments: 13 pages, 5 figures, 8 tables

    Journal ref: Proceedings of the 15th International Workshop on Semantic Evaluation (SemEval-2021), (pp. 364-376), ACL

  26. arXiv:2105.06845  [pdf, other]

    cs.NI

    Query Age of Information: Freshness in Pull-Based Communication

    Authors: Federico Chiariotti, Josefine Holm, Anders E. Kalør, Beatriz Soret, Søren K. Jensen, Torben B. Pedersen, Petar Popovski

    Abstract: Age of Information (AoI) has become an important concept in communications, as it allows system designers to measure the freshness of the information available to remote monitoring or control processes. However, its definition tacitly assumes that new information is used at any time, which is not always the case: the instants at which information is collected and used are dependent on a certain qu…

    Submitted 12 January, 2022; v1 submitted 14 May, 2021; originally announced May 2021.

    Comments: Accepted for publication in IEEE Transactions on Communications (preprint version). Extended version of conference paper arXiv:2011.00917

  27. arXiv:2102.05874  [pdf, other]

    cs.CV

    Explainability in CNN Models By Means of Z-Scores

    Authors: David Malmgren-Hansen, Allan Aasbjerg Nielsen, Leif Toudal Pedersen

    Abstract: This paper explores the similarities of output layers in Neural Networks (NNs) with logistic regression to explain importance of inputs by Z-scores. The network analyzed, a network for fusion of Synthetic Aperture Radar (SAR) and Microwave Radiometry (MWR) data, is applied to prediction of arctic sea ice. With the analysis the importance of MWR relative to SAR is found to favor MWR components. Fur…

    Submitted 11 February, 2021; originally announced February 2021.

    Comments: Intended and accepted for the "Deep Learning Meets Earth Sciences: From Hybrid Modeling to Explainability" workshop at IGARSS 2020, but was withdrawn due to authors being unable to participate when lockdown restrictions moved the conference days. The work was conducted in 2019 under the Automated Sea Ice Products (ASIP) project funded by the Innovation Fund Denmark

  28. The Forgotten Document-Oriented Database Management Systems: An Overview and Benchmark of Native XML DODBMSes in Comparison with JSON DODBMSes

    Authors: Ciprian-Octavian Truică, Elena-Simona Apostol, Jérôme Darmont, Torben Bach Pedersen

    Abstract: In the current context of Big Data, a multitude of new NoSQL solutions for storing, managing, and extracting information and patterns from semi-structured data have been proposed and implemented. These solutions were developed to relieve the issue of rigid data structures present in relational databases, by introducing semi-structured and flexible schema design. As current data generated by differ…

    Submitted 3 February, 2021; originally announced February 2021.

    Comments: 28 pages, 6 figures, 7 tables

    ACM Class: H.2

    Journal ref: Big Data Research, Vol. 25, July 2021

  29. A Multidisciplinary Definition of Privacy Labels: The Story of Princess Privacy and the Seven Helpers

    Authors: Johanna Johansen, Tore Pedersen, Simone Fischer-Hübner, Christian Johansen, Gerardo Schneider, Arnold Roosendaal, Harald Zwingelberg, Anders Jakob Sivesind, Josef Noll

    Abstract: Privacy is currently in distress and in need of rescue, much like princesses in the all-familiar fairytales. We employ storytelling and metaphors from fairytales to make reader-friendly and streamline our arguments about how a complex concept of Privacy Labeling (the 'knight in shining armor') can be a solution to the current state of Privacy (the 'princess in distress'). We give a precise definit…

    Submitted 9 May, 2021; v1 submitted 3 December, 2020; originally announced December 2020.

    Comments: 29 pages, 6 figures

    Journal ref: Information and Computer Security, Vol. 30, No. 3, (2022) pp. 452-469

  30. arXiv:2011.00917  [pdf, ps, other]

    cs.IT cs.NI

    Freshness on Demand: Optimizing Age of Information for the Query Process

    Authors: Josefine Holm, Anders E. Kalør, Federico Chiariotti, Beatriz Soret, Søren K. Jensen, Torben B. Pedersen, Petar Popovski

    Abstract: Age of Information (AoI) has become an important concept in communications, as it allows system designers to measure the freshness of the information available to remote monitoring or control processes. However, its definition tacitly assumed that new information is used at any time, which is not always the case and the instants at which information is collected and used are dependent on a certain…

    Submitted 2 November, 2020; originally announced November 2020.

    Comments: Submitted for publication

  31. arXiv:2010.15404  [pdf, other]

    cs.DB

    On Efficient and Scalable Time-Continuous Spatial Crowdsourcing -- Full Version

    Authors: Ting Wang, Xike Xie, Xin Cao, Torben Bach Pedersen, Yang Wang, Mingjun Xiao

    Abstract: The proliferation of advanced mobile terminals opened up a new crowdsourcing avenue, spatial crowdsourcing, to utilize the crowd potential to perform real-world tasks. In this work, we study a new type of spatial crowdsourcing, called time-continuous spatial crowdsourcing (TCSC in short). It supports broad applications for long-term continuous spatial data acquisition, ranging from environmental m…

    Submitted 29 October, 2020; originally announced October 2020.

  32. arXiv:2010.03653  [pdf, other]

    cs.DB cs.DS

    Efficient Temporal Pattern Mining in Big Time Series Using Mutual Information -- Full Version

    Authors: Van Long Ho, Nguyen Ho, Torben Bach Pedersen

    Abstract: Very large time series are increasingly available from an ever wider range of IoT-enabled sensors deployed in different environments. Significant insights can be gained by mining temporal patterns from these time series. Unlike traditional pattern mining, temporal pattern mining (TPM) adds event time intervals into extracted patterns, making them more expressive at the expense of increased mining…

    Submitted 17 November, 2021; v1 submitted 7 October, 2020; originally announced October 2020.

  33. Modeling all alternative solutions for highly renewable energy systems

    Authors: Tim T. Pedersen, Marta Victoria, Morten G. Rasmussen, Gorm B. Andresen

    Abstract: As the world is transitioning towards highly renewable energy systems, advanced tools are needed to analyze such complex networks. Energy system design is, however, challenged by real-world objective functions consisting of a blurry mix of technical and socioeconomic agendas, with limitations that cannot always be clearly stated. As a result, it is highly likely that solutions which are techno-eco…

    Submitted 29 June, 2021; v1 submitted 2 October, 2020; originally announced October 2020.

    Comments: 25 pages, 7 figures, also available as preprint at: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3682045

    Journal ref: Tim T. Pedersen, Marta Victoria, Morten G. Rasmussen, Gorm B. Andresen, Modeling all alternative solutions for highly renewable energy systems, Energy, Volume 234, 2021, 121294, ISSN 0360-5442

  34. arXiv:2009.02795  [pdf, other]

    cs.CL

    Duluth at SemEval-2020 Task 7: Using Surprise as a Key to Unlock Humorous Headlines

    Authors: Shuning Jin, Yue Yin, XianE Tang, Ted Pedersen

    Abstract: We use pretrained transformer-based language models in SemEval-2020 Task 7: Assessing the Funniness of Edited News Headlines. Inspired by the incongruity theory of humor, we use a contrastive approach to capture the surprise in the edited headlines. In the official evaluation, our system gets 0.531 RMSE in Subtask 1, 11th among 49 submissions. In Subtask 2, our system gets 0.632 accuracy, 9th amon…

    Submitted 6 September, 2020; originally announced September 2020.

    Comments: To appear in the Proceedings of the 14th International Workshop on Semantic Evaluation (SemEval 2020), December 12-13, 2020, Barcelona

  35. arXiv:2007.12949  [pdf, ps, other]

    cs.CL

    Duluth at SemEval-2019 Task 6: Lexical Approaches to Identify and Categorize Offensive Tweets

    Authors: Ted Pedersen

    Abstract: This paper describes the Duluth systems that participated in SemEval--2019 Task 6, Identifying and Categorizing Offensive Language in Social Media (OffensEval). For the most part these systems took traditional Machine Learning approaches that built classifiers from lexical features found in manually labeled training data. However, our most successful system for classifying a tweet as offensive (or…

    Submitted 25 July, 2020; originally announced July 2020.

    Comments: 7 pages, Appears in the Proceedings of the 13th International Workshop on Semantic Evaluation (SemEval 2019), June 2019, pp. 593-599, Minneapolis, MN (a NAACL-2019 workshop, aka OffenseEval--2019)

  36. arXiv:2007.12946  [pdf, ps, other]

    cs.CL

    Duluth at SemEval-2020 Task 12: Offensive Tweet Identification in English with Logistic Regression

    Authors: Ted Pedersen

    Abstract: This paper describes the Duluth systems that participated in SemEval--2020 Task 12, Multilingual Offensive Language Identification in Social Media (OffensEval--2020). We participated in the three English language tasks. Our systems provide a simple Machine Learning baseline using logistic regression. We trained our models on the distantly supervised training data made available by the task organiz…

    Submitted 25 July, 2020; originally announced July 2020.

    Comments: 10 pages, To appear in the Proceedings of the 14th International Workshop on Semantic Evaluation (SemEval--2020), December 12-13, 2020, Barcelona (a COLING-2020 workshop, aka OffensEval--2020)

  37. High-Level ETL for Semantic Data Warehouses -- Full Version

    Authors: Rudra Pratap Deb Nath, Oscar Romero, Torben Bach Pedersen, Katja Hose

    Abstract: The popularity of the Semantic Web (SW) encourages organizations to organize and publish semantic data using the RDF model. This growth poses new requirements to Business Intelligence (BI) technologies to enable On-Line Analytical Processing (OLAP)-like analysis over semantic data. The incorporation of semantic data into a Data Warehouse (DW) is not supported by the traditional Extract-Transform-L…

    Submitted 12 June, 2020; originally announced June 2020.

    Comments: 44 pages including reference, 13 figures and 4 tables. This paper is submitted to Semantic Web Journal and now it is under review

    Journal ref: Semantic Web, vol. 13, no. 1, pp. 85-132, 2022

  38. Studying the Transfer of Biases from Programmers to Programs

    Authors: Johanna Johansen, Tore Pedersen, Christian Johansen

    Abstract: It is generally agreed that one origin of machine bias is resulting from characteristics within the dataset on which the algorithms are trained, i.e., the data does not warrant a generalized inference. We, however, hypothesize that a different 'mechanism', hitherto not articulated in the literature, may also be responsible for machine's bias, namely that biases may originate from (i) the programme…

    Submitted 13 December, 2020; v1 submitted 17 May, 2020; originally announced May 2020.

    Comments: 40 pages of which 7 pages of Appendix, 26 Figures, 2 Tables

    Journal ref: AI & SOCIETY: Knowledge, Culture and Communication, 2023, Vol.38, No.4, pp.1659-1683

  39. arXiv:2002.06608  [pdf, other]

    cs.DB

    Multidimensional Enrichment of Spatial RDF Data for SOLAP -- Full Version

    Authors: Nurefsan Gür, Torben Bach Pedersen, Katja Hose, Mikael Midtgaard

    Abstract: Large volumes of spatial data and multidimensional data are being published on the Semantic Web, which has led to new opportunities for advanced analysis, such as Spatial Online Analytical Processing (SOLAP). The RDF Data Cube (QB) and QB4OLAP vocabularies have been widely used for annotating and publishing statistical and multidimensional RDF data. Although such statistical data sets might have s…

    Submitted 16 February, 2020; originally announced February 2020.

    Comments: 33 pages, 8 figures, 7 tables, 10 listings, 7 algorithms, under review in Semantic Web Journal, available on http://www.semantic-web-journal.net/content/multidimensional-enrichment-spatial-rdf-data-solap

  40. Multi-Source Spatial Entity Linkage

    Authors: Suela Isaj, Torben Bach Pedersen, Esteban Zimányi

    Abstract: Besides the traditional cartographic data sources, spatial information can also be derived from location-based sources. However, even though different location-based sources refer to the same physical world, each one has only partial coverage of the spatial entities, describes them with different attributes, and sometimes provides contradicting information. Hence, we introduce the spatial entity lin…

    Submitted 29 April, 2020; v1 submitted 20 November, 2019; originally announced November 2019.

  41. AMIC: An Adaptive Information Theoretic Method to Identify Multi-Scale Temporal Correlations in Big Time Series Data -- Accepted Version

    Authors: Nguyen Ho, Huy Vo, Mai Vu, Torben Bach Pedersen

    Abstract: Recent developments in computing, sensing and crowd-sourced data have resulted in an explosion in the availability of quantitative information. The possibilities of analyzing this so-called Big Data to inform research and the decision-making process are virtually endless. In general, analyses have to be done across multiple data sets in order to bring out the most value of Big Data. A first importa…

    Submitted 7 July, 2019; v1 submitted 24 June, 2019; originally announced June 2019.

  42. Scalable Model-Based Management of Correlated Dimensional Time Series in ModelarDB+

    Authors: Søren Kejser Jensen, Torben Bach Pedersen, Christian Thomsen

    Abstract: To monitor critical infrastructure, high quality sensors sampled at a high frequency are increasingly used. However, as they produce huge amounts of data, only simple aggregates are stored. This removes outliers and fluctuations that could indicate problems. As a remedy, we present a model-based approach for managing time series with dimensions that exploits correlation in and among time series. S…

    Submitted 29 June, 2021; v1 submitted 25 March, 2019; originally announced March 2019.

    Comments: 12 Pages, 28 Figures, and 1 Table

  43. arXiv:1901.06712  [pdf, other]

    cs.SI

    Seed-Driven Geo-Social Data Extraction -- Full Version

    Authors: Suela Isaj, Torben Bach Pedersen

    Abstract: Geo-social data has been an attractive source for a variety of problems such as mining mobility patterns, link prediction, location recommendation, and influence maximization. However, new geo-social data is increasingly unavailable and suffers from several limitations. In this paper, we aim to remedy the problem of effective data extraction from geo-social data sources. We first identify and categoriz…

    Submitted 23 June, 2019; v1 submitted 20 January, 2019; originally announced January 2019.

  44. arXiv:1805.10274  [pdf, other]

    cs.CL

    UMDSub at SemEval-2018 Task 2: Multilingual Emoji Prediction Multi-channel Convolutional Neural Network on Subword Embedding

    Authors: Zhenduo Wang, Ted Pedersen

    Abstract: This paper describes the UMDSub system that participated in Task 2 of SemEval-2018. We developed a system that predicts an emoji given the raw text in an English tweet. The system is a Multi-channel Convolutional Neural Network based on subword embeddings for the representation of tweets. This model improves on character or word based methods by about 2%. Our system placed 21st of 48 participating…

    Submitted 25 May, 2018; originally announced May 2018.

    Comments: 5 pages, to Appear in the Proceedings of the 12th International Workshop on Semantic Evaluation (SemEval 2018), June 2018, New Orleans, LA

  45. arXiv:1805.10271  [pdf, other]

    cs.CL

    UMDuluth-CS8761 at SemEval-2018 Task 9: Hypernym Discovery using Hearst Patterns, Co-occurrence frequencies and Word Embeddings

    Authors: Arshia Z. Hassan, Manikya S. Vallabhajosyula, Ted Pedersen

    Abstract: Hypernym Discovery is the task of identifying potential hypernyms for a given term. A hypernym is a more generalized word that is super-ordinate to more specific words. This paper explores several approaches that rely on co-occurrence frequencies of word pairs, Hearst Patterns based on regular expressions, and word embeddings created from the UMBC corpus. Our system Babbage participated in Subtask…

    Submitted 25 May, 2018; originally announced May 2018.

    Comments: 5 pages, to Appear in the Proceedings of the 12th International Workshop on Semantic Evaluation (SemEval 2018), June 2018, New Orleans, LA

  46. arXiv:1805.10267  [pdf, other]

    cs.CL

    Duluth UROP at SemEval-2018 Task 2: Multilingual Emoji Prediction with Ensemble Learning and Oversampling

    Authors: Shuning Jin, Ted Pedersen

    Abstract: This paper describes the Duluth UROP systems that participated in SemEval-2018 Task 2, Multilingual Emoji Prediction. We relied on a variety of ensembles made up of classifiers using Naive Bayes, Logistic Regression, and Random Forests. We used unigram and bigram features and tried to offset the skewness of the data through the use of oversampling. Our task evaluation results place us 19th of 48…

    Submitted 25 May, 2018; originally announced May 2018.

    Comments: 4 pages, to Appear in the Proceedings of the 12th International Workshop on Semantic Evaluation (SemEval 2018), June 2018, New Orleans, LA

  47. Adaptive User-Oriented Direct Load-Control of Residential Flexible Devices

    Authors: Davide Frazzetto, Bijay Neupane, Torben Bach Pedersen, Thomas Dyhre Nielsen

    Abstract: Demand Response (DR) schemes are effective tools to maintain a dynamic balance in energy markets with higher integration of fluctuating renewable energy sources. DR schemes can be used to harness residential devices' flexibility and to utilize it to achieve social and financial objectives. However, existing DR schemes suffer from low user participation as they fail at taking into account the users…

    Submitted 9 May, 2018; originally announced May 2018.

    Comments: 10 pages plus 1 page references, 11 figures, conference: ACM e-Energy 2018

  48. arXiv:1805.02301  [pdf, other]

    cs.CE

    Day-ahead Trading of Aggregated Energy Flexibility - Full Version

    Authors: Emmanouil Valsomatzis, Torben Bach Pedersen, Alberto Abello

    Abstract: Flexibility of small loads, in particular from Electric Vehicles (EVs), has recently attracted a lot of interest due to their possibility of participating in the energy market and the new commercial potentials. Different from existing work, the aggregation techniques proposed in this paper produce flexible aggregated loads from EVs taking into account technical market requirements. They can be fur…

    Submitted 24 May, 2018; v1 submitted 6 May, 2018; originally announced May 2018.

    Comments: 9 pages, 7 figures, note paper of the full version to appear at ACM e-Energy 2018

  49. arXiv:1805.00702  [pdf, other]

    cs.CE cs.LG

    Utilizing Device-level Demand Forecasting for Flexibility Markets - Full Version

    Authors: Bijay Neupane, Torben Bach Pedersen, Bo Thiesson

    Abstract: The uncertainty in the power supply due to fluctuating Renewable Energy Sources (RES) has severe (financial and other) implications for energy market players. In this paper, we present a device-level Demand Response (DR) scheme that captures the atomic (all available) flexibilities in energy demand and provides the largest possible solution space to generate demand/supply schedules that minimize m…

    Submitted 2 May, 2018; originally announced May 2018.

    Comments: 13 pages

  50. Time Series Management Systems: A Survey

    Authors: Søren Kejser Jensen, Torben Bach Pedersen, Christian Thomsen

    Abstract: The collection of time series data increases as more monitoring and automation are being deployed. These deployments range in scale from an Internet of things (IoT) device located in a household to enormous distributed Cyber-Physical Systems (CPSs) producing large volumes of data at high velocity. To store and analyze these vast amounts of data, specialized Time Series Management Systems (TSMSs) h…

    Submitted 3 October, 2017; originally announced October 2017.

    Comments: 20 Pages, 15 Figures, 2 Tables, Accepted for publication in IEEE TKDE

    ACM Class: G.1.2; D.2.11; E.4; E.2; E.1; H.2; H.2.4; C.2.4; H.2.8; G.3

    Journal ref: TKDE, 29, 11, 2017, 2581-2600