Skip to main content

Showing 1–32 of 32 results for author: Arslan, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2508.16243  [pdf, ps, other

    cs.CL

    TULIP: Adapting Open-Source Large Language Models for Underrepresented Languages and Specialized Financial Tasks

    Authors: İrem Demirtaş, Burak Payzun, Seçil Arslan

    Abstract: Thanks to the growing popularity of large language models over the years, there is great potential for their applications in finance. Despite the exceptional performance of larger proprietary models, which are presented as black-box solutions through APIs, smaller models that can be hosted on-premise present opportunities for adaptability and privacy. Especially in cases where the management of se… ▽ More

    Submitted 22 August, 2025; originally announced August 2025.

    Comments: IJCAI 2025 - FinLLM Workshop

  2. arXiv:2502.18871  [pdf, ps, other

    cs.CV cs.AI

    Inscanner: Dual-Phase Detection and Classification of Auxiliary Insulation Using YOLOv8 Models

    Authors: Youngtae Kim, Soonju Jeong, Sardar Arslan, Dhananjay Agnihotri, Yahya Ahmed, Ali Nawaz, Jinhee Song, Hyewon Kim

    Abstract: This study proposes a two-phase methodology for detecting and classifying auxiliary insulation in structural components. In the detection phase, a YOLOv8x model is trained on a dataset of complete structural blueprints, each annotated with bounding boxes indicating areas that should contain insulation. In the classification phase, these detected insulation patches are cropped and categorized into… ▽ More

    Submitted 26 February, 2025; originally announced February 2025.

  3. arXiv:2409.16001  [pdf, other

    cs.AI q-bio.NC

    Artificial Human Intelligence: The role of Humans in the Development of Next Generation AI

    Authors: Suayb S. Arslan

    Abstract: Human intelligence, the most evident and accessible form of source of reasoning, hosted by biological hardware, has evolved and been refined over thousands of years, positioning itself today to create new artificial forms and preparing to self--design their evolutionary path forward. Beginning with the advent of foundation models, the rate at which human and artificial intelligence interact with e… ▽ More

    Submitted 2 February, 2025; v1 submitted 24 September, 2024; originally announced September 2024.

    Comments: 26 pages, 8 figures, submitted to IEEE Trans. on NNLS

  4. TALICS$^3$: Tape Library Cloud Storage System Simulator

    Authors: Suayb S. Arslan, James Peng, Turguy Goker

    Abstract: High performance computing data is surging fast into the exabyte-scale world, where tape libraries are the main platform for long-term durable data storage besides high-cost DNA. Tape libraries are extremely hard to model, but accurate modeling is critical for system administrators to obtain valid performance estimates for their designs. This research introduces a discrete--event tape simulation p… ▽ More

    Submitted 12 June, 2024; v1 submitted 18 January, 2024; originally announced May 2024.

    Comments: 15 pages, 13 figures

    Journal ref: Simulation Modelling Practice and Theory, Volume 134, 2024, 102947

  5. arXiv:2306.13176  [pdf

    cs.CV cs.LG eess.IV

    Key Frame Extraction with Attention Based Deep Neural Networks

    Authors: Samed Arslan, Senem Tanberk

    Abstract: Automatic keyframe detection from videos is an exercise in selecting scenes that can best summarize the content for long videos. Providing a summary of the video is an important task to facilitate quick browsing and content summarization. The resulting photos are used for automated works (e.g. summarizing security footage, detecting different scenes used in music clips) in different industries. In… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: in Turkish language

  6. arXiv:2305.00538  [pdf, other

    cs.NI

    SFC: Near-Source Congestion Signaling and Flow Control

    Authors: Yanfang Le, Jeongkeun Lee, Jeremias Blendin, Jiayi Chen, Georgios Nikolaidis, Rong Pan, Robert Soule, Aditya Akella, Pedro Yebenes Segura, Arjun singhvi, Yuliang Li, Qingkai Meng, Changhoon Kim, Serhat Arslan

    Abstract: State-of-the-art congestion control algorithms for data centers alone do not cope well with transient congestion and high traffic bursts. To help with these, we revisit the concept of direct \emph{backward} feedback from switches and propose Back-to-Sender (BTS) signaling to many concurrent incast senders. Combining it with our novel approach to in-network caching, we achieve near-source sub-RTT c… ▽ More

    Submitted 30 April, 2023; originally announced May 2023.

  7. arXiv:2301.09057  [pdf, ps, other

    cs.PF cs.IR

    Durability and Availability of Erasure-Coded Storage Systems with Concurrent Maintenance

    Authors: Suayb S. Arslan

    Abstract: This initial version of this document was written back in 2014 for the sole purpose of providing fundamentals of reliability theory as well as to identify the theoretical types of machinery for the prediction of durability/availability of erasure-coded storage systems. Since the definition of a "system" is too broad, we specifically focus on warm/cold storage systems where the data is stored in a… ▽ More

    Submitted 21 January, 2023; originally announced January 2023.

    Comments: 58 pages, 20 figures, 9 tables. arXiv admin note: substantial text overlap with arXiv:1911.00329

  8. arXiv:2301.00488  [pdf, ps, other

    cs.HC cs.AI cs.IT cs.PF eess.SP

    Information Transfer Rate in BCIs: Towards Tightly Integrated Symbiosis

    Authors: Suayb S. Arslan, Pawan Sinha

    Abstract: The information transmission rate (ITR), or effective bit rate, is a popular and widely used information measurement metric, particularly popularized for SSVEP-based Brain-Computer (BCI) interfaces. By combining speed and accuracy into a single-valued parameter, this metric aids in the evaluation and comparison of various target identification algorithms across different BCI communities. In order… ▽ More

    Submitted 10 June, 2023; v1 submitted 1 January, 2023; originally announced January 2023.

    Comments: 27 pages, 10 figures, 2 tables. Accepted to Biomedical Signal Processing and Control, Elsevier, 2023

  9. arXiv:2212.06658  [pdf, other

    cs.NI

    Enabling the Reflex Plane with the nanoPU

    Authors: Stephen Ibanez, Alex Mallery, Serhat Arslan, Theo Jepsen, Muhammad Shahbaz, Changhoon Kim, Nick McKeown

    Abstract: Many recent papers have demonstrated fast in-network computation using programmable switches, running many orders of magnitude faster than CPUs. The main limitation of writing software for switches is the constrained programming model and limited state. In this paper we explore whether a new type of CPU, called the nanoPU, offers a useful middle ground, with a familiar C/C++ programming model, and… ▽ More

    Submitted 13 December, 2022; originally announced December 2022.

  10. Fault-Tolerant Strassen-Like Matrix Multiplication

    Authors: Osman B. Guney, Suayb S. Arslan

    Abstract: In this study, we propose a simple method for fault-tolerant Strassen-like matrix multiplications. The proposed method is based on using two distinct Strassen-like algorithms instead of replicating a given one. We have realized that using two different algorithms, new check relations arise resulting in more local computations. These local computations are found using computer aided search. To impr… ▽ More

    Submitted 10 October, 2022; originally announced October 2022.

    Comments: 6 pages, 2 figures

  11. arXiv:2201.09356  [pdf, ps, other

    cs.DC cs.NI

    SToN: A New Fundamental Trade-off for Distributed Data Storage Systems

    Authors: Bastien Confais, Şuayb Ş. Arslan, Benoît Parrein

    Abstract: Locating data efficiently is a key process in every distributed data storage solution and particularly those deployed in multi-site environments, such as found in Cloud and Fog computing. Nevertheless, the existing protocols dedicated to this task are not compatible with the requirements of the infrastructures that underlie such computing paradigms. In this paper, we initially review three fundame… ▽ More

    Submitted 23 January, 2022; originally announced January 2022.

    Comments: 19 pages, 9 figures

  12. arXiv:2109.11693  [pdf, other

    cs.NI cs.PF

    Updating the Theory of Buffer Sizing

    Authors: Bruce Spang, Serhat Arslan, Nick McKeown

    Abstract: Routers have packet buffers to reduce packet drops during times of congestion. It is important to correctly size the buffer: make it too small, and packets are dropped unnecessarily and the link may be underutilized; make it too big, and packets may wait for a long time, and the router itself may be more expensive to build. Despite its importance, there are few guidelines for picking the buffer si… ▽ More

    Submitted 23 September, 2021; originally announced September 2021.

    Comments: 21 pages, to be published in IFIP Performance '21. Code available at https://github.com/brucespang/ifip21-buffer-sizing

  13. arXiv:2105.02146  [pdf, other

    cs.IT cs.NI

    Cooperative Network Coding for Distributed Storage using Base Stations with Link Constraints

    Authors: Suayb S. Arslan, Massoud Pourmandi, Elif Haytaoglu

    Abstract: In this work, we consider a novel distributed data storage/caching scenario in a cellular setting where multiple nodes may fail/depart at the same time. In order to maintain the target reliability, we allow cooperative regeneration of lost nodes with the help of base stations allocated in a set of hierarchical layers. Due to this layered structure, a symbol download from each base station has a di… ▽ More

    Submitted 5 May, 2021; originally announced May 2021.

    Comments: 8 pages, 4 figures and 1 table

  14. arXiv:2010.14781  [pdf, ps, other

    cs.IT

    On the Fault Tolerant Distributed Data Caching using LDPC Codes in Cellular Networks

    Authors: Elif Haytaoglu, Erdi Kaya, Suayb S. Arslan

    Abstract: The base station-mobile device communication traffic has dramatically increased recently due to mobile data, which in turn heavily overloaded the underlying infrastructure. To decrease Base Station (BS) interaction, intra-cell communication between local devices, known as Device-to-Device, is utilized for distributed data caching. Nevertheless, due to the continuous departure of existing nodes and… ▽ More

    Submitted 1 October, 2021; v1 submitted 28 October, 2020; originally announced October 2020.

  15. arXiv:2010.12114  [pdf, other

    cs.AR cs.NI

    The nanoPU: Redesigning the CPU-Network Interface to Minimize RPC Tail Latency

    Authors: Stephen Ibanez, Alex Mallery, Serhat Arslan, Theo Jepsen, Muhammad Shahbaz, Nick McKeown, Changhoon Kim

    Abstract: The nanoPU is a new networking-optimized CPU designed to minimize tail latency for RPCs. By bypassing the cache and memory hierarchy, the nanoPU directly places arriving messages into the CPU register file. The wire-to-wire latency through the application is just 65ns, about 13x faster than the current state-of-the-art. The nanoPU moves key functions from software to hardware: reliable network tra… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

    Comments: 10 pages

    ACM Class: C.1.1; C.2.1

  16. Guessing Cost: Bounds and Applications to Data Repair in Distributed Storage

    Authors: Suayb S. Arslan, Elif Haytaoglu

    Abstract: The guesswork refers to the distribution of the minimum number of trials needed to guess a realization of a random variable accurately. In this study, a non-trivial generalization of the guesswork called guessing cost (also referred to as cost of guessing) is introduced, and an optimal strategy for finding the $ρ$-th moment of guessing cost is provided for a random variable defined on a finite set… ▽ More

    Submitted 8 December, 2023; v1 submitted 13 May, 2020; originally announced May 2020.

    Comments: 29 pages, 6 figures, 1 table. Accepted to IEEE Transactions on Information Theory, 2023

  17. arXiv:2004.12104  [pdf, other

    cs.CV

    Offline Signature Verification on Real-World Documents

    Authors: Deniz Engin, Alperen Kantarcı, Seçil Arslan, Hazım Kemal Ekenel

    Abstract: Research on offline signature verification has explored a large variety of methods on multiple signature datasets, which are collected under controlled conditions. However, these datasets may not fully reflect the characteristics of the signatures in some practical use cases. Real-world signatures extracted from the formal documents may contain different types of occlusions, for example, stamps, c… ▽ More

    Submitted 25 April, 2020; originally announced April 2020.

    Comments: CVPR 2020 Biometrics Workshop

  18. A Data-Assisted Reliability Model for Carrier-Assisted Cold Data Storage Systems

    Authors: Suayb S. Arslan, James Peng, Turguy Goker

    Abstract: Cold data storage systems are used to allow long term digital preservation for institutions' archives. The common functionality among cold and warm/hot data storage is that the data is stored on some physical medium for read-back at a later time. However in cold storage, write and read operations are not necessarily done in the same exact geographical location. Hence, a third party assistance is t… ▽ More

    Submitted 1 November, 2019; originally announced November 2019.

    Comments: 14 pages, 8 figures, accepted to Elsevier Reliability and Safety Journal, 2019 (unedited)

  19. arXiv:1905.10458  [pdf, other

    cs.DC

    Compress-Store on Blockchain: A Decentralized Data Processing and Immutable Storage for Multimedia Streaming

    Authors: Suayb S. Arslan, Turguy Goker

    Abstract: Decentralization for data storage is a challenging problem for blockchain-based solutions as the blocksize plays the key role for scalability. In addition, specific requirements of multimedia data calls for various changes in the blockchain technology internals. Considering one of the most popular applications of secure multimedia streaming, i.e., video surveillance, it is not clear how to judicio… ▽ More

    Submitted 12 March, 2022; v1 submitted 24 May, 2019; originally announced May 2019.

    Comments: 18 pages, 5 figures, 1 table, accepted to Cluster Computing

  20. Array BP-XOR Codes for Hierarchically Distributed Matrix Multiplication

    Authors: Suayb S. Arslan

    Abstract: A novel fault-tolerant computation technique based on array Belief Propagation (BP)-decodable XOR (BP-XOR) codes is proposed for distributed matrix-matrix multiplication. The proposed scheme is shown to be configurable and suited for modern hierarchical compute architectures such as Graphical Processing Units (GPUs) equipped with multiple nodes, whereby each has many small independent processing u… ▽ More

    Submitted 10 December, 2021; v1 submitted 25 April, 2019; originally announced April 2019.

    Comments: 22 pages, 5 figures, 4 tables. Accepted to IEEE Transactions on Information Theory, 2021. arXiv admin note: text overlap with arXiv:1709.07949

  21. arXiv:1810.10621  [pdf, ps, other

    cs.IT

    A Reliability Model for Dependent and Distributed MDS Disk Array Units

    Authors: Suayb S. Arslan

    Abstract: Archiving and systematic backup of large digital data generates a quick demand for multi-peta byte scale storage systems. As drive capacities continue to grow beyond the few terabytes range to address the demands of today's cloud, the likelihood of having multiple/simultaneous disk failures become a reality. Among the main factors causing catastrophic system failures, correlated disk failures and… ▽ More

    Submitted 24 October, 2018; originally announced October 2018.

    Comments: This Paper Has Been Accepted For Publication In IEEE Transactions On Reliability, Oct. 2018. (unedited Version)

  22. arXiv:1807.11605  [pdf, other

    cs.CL

    Doubly Attentive Transformer Machine Translation

    Authors: Hasan Sait Arslan, Mark Fishel, Gholamreza Anbarjafari

    Abstract: In this paper a doubly attentive transformer machine translation model (DATNMT) is presented in which a doubly-attentive transformer decoder normally joins spatial visual features obtained via pretrained convolutional neural networks, conquering any gap between image captioning and translation. In this framework, the transformer decoder figures out how to take care of source-language words and par… ▽ More

    Submitted 30 July, 2018; originally announced July 2018.

  23. arXiv:1806.01764  [pdf, other

    cs.CV

    Graph Saliency Maps through Spectral Convolutional Networks: Application to Sex Classification with Brain Connectivity

    Authors: Salim Arslan, Sofia Ira Ktena, Ben Glocker, Daniel Rueckert

    Abstract: Graph convolutional networks (GCNs) allow to apply traditional convolution operations in non-Euclidean domains, where data are commonly modelled as irregular graphs. Medical imaging and, in particular, neuroscience studies often rely on such graph representations, with brain connectivity networks being a characteristic example, while ultimately seeking the locus of phenotypic or disease-related di… ▽ More

    Submitted 5 June, 2018; originally announced June 2018.

  24. arXiv:1802.06772  [pdf, other

    q-bio.NC cs.CV

    Connectivity-Driven Parcellation Methods for the Human Cerebral Cortex

    Authors: Salim Arslan

    Abstract: In this thesis, we present robust and fully-automated methods for the subdivision of the entire human cerebral cortex based on connectivity information. Our contributions are four-fold: First, we propose a clustering approach to delineate a cortical parcellation that provides a reliable abstraction of the brain's functional organisation. Second, we cast the parcellation problem as a feature reduct… ▽ More

    Submitted 17 February, 2018; originally announced February 2018.

    Comments: Abstract is summarised to satisfy the character limit imposed by Arxiv. Please refer to the pdf for the full text. Forked from https://spiral.imperial.ac.uk/handle/10044/1/54760

  25. arXiv:1801.08863  [pdf, other

    cs.CV cs.GR

    3D Scanning: A Comprehensive Survey

    Authors: Morteza Daneshmand, Ahmed Helmi, Egils Avots, Fatemeh Noroozi, Fatih Alisinanoglu, Hasan Sait Arslan, Jelena Gorbova, Rain Eric Haamer, Cagri Ozcinar, Gholamreza Anbarjafari

    Abstract: This paper provides an overview of 3D scanning methodologies and technologies proposed in the existing scientific and industrial literature. Throughout the paper, various types of the related techniques are reviewed, which consist, mainly, of close-range, aerial, structure-from-motion and terrestrial photogrammetry, and mobile, terrestrial and airborne laser scanning, as well as time-of-flight, st… ▽ More

    Submitted 23 January, 2018; originally announced January 2018.

    Comments: 18 pages, 3 figures

  26. arXiv:1709.07949  [pdf, other

    cs.IT

    Asymptotically MDS Array BP-XOR Codes

    Authors: Suayb S. Arslan

    Abstract: Belief propagation or message passing on binary erasure channels (BEC) is a low complexity decoding algorithm that allows the recovery of message symbols based on bipartite graph prunning process. Recently, array XOR codes have attracted attention for storage systems due to their burst error recovery performance and easy arithmetic based on Exclusive OR (XOR)-only logic operations. Array BP-XOR co… ▽ More

    Submitted 22 September, 2017; originally announced September 2017.

    Comments: 8 pages, 4 figures, to be submitted

  27. arXiv:1703.10062  [pdf, other

    q-bio.NC cs.NE

    Exploring Heritability of Functional Brain Networks with Inexact Graph Matching

    Authors: Sofia Ira Ktena, Salim Arslan, Sarah Parisot, Daniel Rueckert

    Abstract: Data-driven brain parcellations aim to provide a more accurate representation of an individual's functional connectivity, since they are able to capture individual variability that arises due to development or disease. This renders comparisons between the emerging brain connectivity networks more challenging, since correspondences between their elements are not preserved. Unveiling these correspon… ▽ More

    Submitted 29 March, 2017; originally announced March 2017.

    Comments: accepted at ISBI 2017: International Symposium on Biomedical Imaging, Apr 2017, Melbourne, Australia

  28. arXiv:1702.07409  [pdf, other

    cs.IT

    Founsure 1.0: An Erasure Code Library with Efficient Repair and Update Features

    Authors: Şuayb Ş. Arslan

    Abstract: Founsure is an open-source software library that implements a multi-dimensional graph-based erasure coding entirely based on fast exclusive OR (XOR) logic. Its implementation utilizes compiler optimizations and multi-threading to generate the right assembly code for the given multi-core CPU architecture with vector processing capabilities. Founsure possesses important features that shall find vari… ▽ More

    Submitted 13 January, 2021; v1 submitted 23 February, 2017; originally announced February 2017.

    Comments: Accepted to Elsevier SoftwareX, 2021

  29. arXiv:1402.6016  [pdf, ps, other

    cs.IT

    Incremental Redundancy, Fountain Codes and Advanced Topics

    Authors: Suayb S. Arslan

    Abstract: This document is written in order to establish a common base ground on which the majority of the relevant research about linear fountain codes can be analyzed and compared. As far as I am concerned, there is no unified approach that outlines and compares most of the published linear fountain codes in a single and self-contained framework. This written document has not only resulted in the review o… ▽ More

    Submitted 14 July, 2014; v1 submitted 24 February, 2014; originally announced February 2014.

    Comments: 57 pages, 22 figures, Version 0.2

  30. Redundancy and Aging of Efficient Multidimensional MDS-Parity Protected Distributed Storage Systems

    Authors: Suayb S. Arslan

    Abstract: The effect of redundancy on the aging of an efficient Maximum Distance Separable (MDS) parity--protected distributed storage system that consists of multidimensional arrays of storage units is explored. In light of the experimental evidences and survey data, this paper develops generalized expressions for the reliability of array storage systems based on more realistic time to failure distribution… ▽ More

    Submitted 3 December, 2013; originally announced December 2013.

    Comments: 11 pages, 6 figures, Accepted for publication in IEEE Transactions on Device and Materials Reliability (TDMR), Nov. 2013

  31. arXiv:1307.5906  [pdf, ps, other

    cs.IT

    Embedding Noise Prediction into List-Viterbi Decoding using Error Detection Codes for Magnetic Tape Systems

    Authors: Suayb. S. Arslan, Jaewook Lee, Turguy Goker

    Abstract: A List Viterbi detector produces a rank ordered list of the N globally best candidates in a trellis search. A List Viterbi detector structure is proposed that incorporates the noise prediction with periodic state-metric updates based on outer error detection codes (EDCs). More specifically, a periodic decision making process is utilized for a non-overlapping sliding windows of P bits based on the… ▽ More

    Submitted 22 July, 2013; originally announced July 2013.

    Comments: 4 pages, 3 figures, Proceedings of the ASME 2013 Conference on information storage and processing systems (ISPS 2013)

    Report number: ISPS2013-2835

  32. arXiv:1210.2815  [pdf, ps, other

    cs.MM

    Minimum Distortion Variance Concatenated Block Codes for Embedded Source Transmission

    Authors: Suayb S. Arslan

    Abstract: Some state-of-art multimedia source encoders produce embedded source bit streams that upon the reliable reception of only a fraction of the total bit stream, the decoder is able reconstruct the source up to a basic quality. Reliable reception of later source bits gradually improve the reconstruction quality. Examples include scalable extensions of H.264/AVC and progressive image coders such as JPE… ▽ More

    Submitted 1 March, 2014; v1 submitted 10 October, 2012; originally announced October 2012.

    Comments: 6 pages, 4 figures, In Proc. of International Conference on Computing, Networking and Communications, ICNC 2014, Hawaii, USA