Skip to main content

Showing 1–5 of 5 results for author: Liyanage, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.02178  [pdf

    cs.CL

    Subasa - Adapting Language Models for Low-resourced Offensive Language Detection in Sinhala

    Authors: Shanilka Haturusinghe, Tharindu Cyril Weerasooriya, Marcos Zampieri, Christopher M. Homan, S. R. Liyanage

    Abstract: Accurate detection of offensive language is essential for a number of applications related to social media safety. There is a sharp contrast in performance in this task between low and high-resource languages. In this paper, we adapt fine-tuning strategies that have not been previously explored for Sinhala in the downstream task of offensive language detection. Using this approach, we introduce fo… ▽ More

    Submitted 2 April, 2025; originally announced April 2025.

    Comments: Accepted to appear at NAACL SRW 2025

  2. arXiv:2406.14765  [pdf, other

    cs.AI cs.CL cs.CY cs.IR cs.LG

    ChatGPT as Research Scientist: Probing GPT's Capabilities as a Research Librarian, Research Ethicist, Data Generator and Data Predictor

    Authors: Steven A. Lehr, Aylin Caliskan, Suneragiri Liyanage, Mahzarin R. Banaji

    Abstract: How good a research scientist is ChatGPT? We systematically probed the capabilities of GPT-3.5 and GPT-4 across four central components of the scientific process: as a Research Librarian, Research Ethicist, Data Generator, and Novel Data Predictor, using psychological science as a testing field. In Study 1 (Research Librarian), unlike human researchers, GPT-3.5 and GPT-4 hallucinated, authoritativ… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Main article is 14 pages, 1 table. Includes SI Appendix: 26 pages, 12 tables, 2 figures. Total: 40 pages, 13 tables, 2 figures. Under revised review at PNAS

    ACM Class: I.2.7; K.4.0; K.4.1; K.4.2

  3. arXiv:2209.07202  [pdf, other

    cs.CR cs.SI

    Dizzy: Large-Scale Crawling and Analysis of Onion Services

    Authors: Yazan Boshmaf, Isuranga Perera, Udesh Kumarasinghe, Sajitha Liyanage, Husam Al Jawaheri

    Abstract: With nearly 2.5m users, onion services have become the prominent part of the darkweb. Over the last five years alone, the number of onion domains has increased 20x, reaching more than 700k unique domains in January 2022. As onion services host various types of illicit content, they have become a valuable resource for darkweb research and an integral part of e-crime investigation and threat intelli… ▽ More

    Submitted 4 May, 2023; v1 submitted 15 September, 2022; originally announced September 2022.

  4. arXiv:2107.04172  [pdf, other

    cs.DC

    Experiences with Integrating Custos SecurityServices

    Authors: Isuru Ranawaka, Samitha Liyanage, Dannon Baker, Alexandru Mahmoud, Juleen Graham, Terry Fleury, Dimuthu Wannipurage, Yu Ma, Enis Afgan, Jim Basney, Suresh Marru, Marlon Pierce

    Abstract: Science gateways are user-facing cyberinfrastruc-ture that provide researchers and educators with Web-basedaccess to scientific software, computing, and data resources.Managing user identities, accounts, and permissions are essentialtasks for science gateways, and gateways likewise must man-age secure connections between their middleware and remoteresources. The Custos project is an effort to buil… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

    Comments: 9 pages, 12 figures

  5. arXiv:1708.02912  [pdf

    cs.CL cs.IR

    KeyXtract Twitter Model - An Essential Keywords Extraction Model for Twitter Designed using NLP Tools

    Authors: Tharindu Weerasooriya, Nandula Perera, S. R. Liyanage

    Abstract: Since a tweet is limited to 140 characters, it is ambiguous and difficult for traditional Natural Language Processing (NLP) tools to analyse. This research presents KeyXtract which enhances the machine learning based Stanford CoreNLP Part-of-Speech (POS) tagger with the Twitter model to extract essential keywords from a tweet. The system was developed using rule-based parsers and two corpora. The… ▽ More

    Submitted 9 August, 2017; originally announced August 2017.

    Comments: 7 Pages, 5 Figures, Proceedings of the 10th KDU International Research Conference