Skip to main content

Showing 1–9 of 9 results for author: Dogar, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.08894  [pdf, ps, other

    cs.HC cs.AI cs.CY

    WaLLM -- Insights from an LLM-Powered Chatbot deployment via WhatsApp

    Authors: Hiba Eltigani, Rukhshan Haroon, Asli Kocak, Abdullah Bin Faisal, Noah Martin, Fahad Dogar

    Abstract: Recent advances in generative AI, such as ChatGPT, have transformed access to information in education, knowledge-seeking, and everyday decision-making. However, in many developing regions, access remains a challenge due to the persistent digital divide. To help bridge this gap, we developed WaLLM - a custom AI chatbot over WhatsApp, a widely used communication platform in developing regions. Beyo… ▽ More

    Submitted 13 May, 2025; originally announced May 2025.

  2. arXiv:2410.11857  [pdf, other

    cs.DC cs.LG

    LLMProxy: Reducing Cost to Access Large Language Models

    Authors: Noah Martin, Abdullah Bin Faisal, Hiba Eltigani, Rukhshan Haroon, Swaminathan Lamelas, Fahad Dogar

    Abstract: In this paper, we make a case for a proxy for large language models which has explicit support for cost-saving optimizations. We design LLMProxy, which supports three key optimizations: model selection, context management, and caching. These optimizations present tradeoffs in terms of cost, inference time, and response quality, which applications can navigate through our high level, bidirectional… ▽ More

    Submitted 4 October, 2024; originally announced October 2024.

  3. arXiv:2407.17760  [pdf, other

    cs.HC cs.AI

    TwIPS: A Large Language Model Powered Texting Application to Simplify Conversational Nuances for Autistic Users

    Authors: Rukhshan Haroon, Fahad Dogar

    Abstract: Autistic individuals often experience difficulties in conveying and interpreting emotional tone and non-literal nuances. Many also mask their communication style to avoid being misconstrued by others, spending considerable time and mental effort in the process. To address these challenges in text-based communication, we present TwIPS, a prototype texting application powered by a large language mod… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

  4. arXiv:2401.10354  [pdf, other

    cs.DC cs.LG

    Towards providing reliable job completion time predictions using PCS

    Authors: Abdullah Bin Faisal, Noah Martin, Hafiz Mohsin Bashir, Swaminathan Lamelas, Fahad R. Dogar

    Abstract: In this paper we build a case for providing job completion time predictions to cloud users, similar to the delivery date of a package or arrival time of a booked ride. Our analysis reveals that providing predictability can come at the expense of performance and fairness. Existing cloud scheduling systems optimize for extreme points in the trade-off space, making them either extremely unpredictable… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

  5. arXiv:2401.08890  [pdf, other

    cs.NI

    Characterizing TCP's Performance for Low-Priority Flows Inside a Cloud

    Authors: Hafiz Mohsin Bashir, Abdullah Bin Faisal, Fahad R. Dogar

    Abstract: Many cloud systems utilize low-priority flows to achieve various performance objectives (e.g., low latency, high utilization), relying on TCP as their preferred transport protocol. However, the suitability of TCP for such low-priority flows is relatively unexplored. Specifically, how prioritization-induced delays in packet transmission can cause spurious timeouts and low utilization. In this paper… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

  6. arXiv:2304.05481  [pdf, other

    cs.NI

    Measuring Latency Reduction and the Digital Divide of Cloud Edge Datacenters

    Authors: Noah Martin, Fahad Dogar

    Abstract: Cloud providers are highly incentivized to reduce latency. One way they do this is by locating datacenters as close to users as possible. These "cloud edge" datacenters are placed in metropolitan areas and enable edge computing for residents of these cities. Therefore, which cities are selected to host edge datacenters determines who has the fastest access to applications requiring edge compute -… ▽ More

    Submitted 11 April, 2023; originally announced April 2023.

  7. arXiv:1906.02562  [pdf, other

    cs.NI

    Judicious QoS using Cloud Overlays

    Authors: Osama Haq, Cody Doucette, John W. Byers, Fahad R. Dogar

    Abstract: We revisit the long-standing problem of providing network QoS to applications, and propose the concept of judicious QoS -- combining the cheaper, best effort IP service with the cloud, which offers a highly reliable infrastructure and the ability to add in-network services, albeit at higher cost. Our proposed J-QoS framework offers a range of reliability services with different cost vs. delay trad… ▽ More

    Submitted 26 September, 2019; v1 submitted 6 June, 2019; originally announced June 2019.

    Comments: Compared to the previous version, we have made a number of changes, including new experiments on RIPE ATLAS testbed to evaluate the feasibility of our services, discussion on end-to-end working of the system, and several other changes in writing to clarify ambiguities in design or positioning of the work. arXiv admin note: substantial text overlap with arXiv:1812.10835

  8. arXiv:1905.13352  [pdf, other

    cs.NI cs.DC

    Reducing Tail Latency via Safe and Simple Duplication

    Authors: Hafiz Mohsin Bashir, Abdullah Bin Faisal, Muhammad Asim Jamshed, Peter Vondras, Ali Musa Iftikhar, Ihsan Ayyub Qazi, Fahad R. Dogar

    Abstract: Duplication can be a powerful strategy for overcoming stragglers in cloud services, but is often used conservatively because of the risk of overloading the system. We present duplicate-aware scheduling or DAS, which makes duplication safe and easy to use, by leveraging the two well-known primitives of prioritization and purging. To support DAS across diverse layers of a cloud system (e.g., network… ▽ More

    Submitted 30 May, 2019; originally announced May 2019.

  9. arXiv:1812.10835  [pdf, other

    cs.NI

    CASPR: Judiciously Using the Cloud for Wide-Area Packet Recovery

    Authors: Osama Haq, Cody Doucette, John W. Byers, Fahad R. Dogar

    Abstract: We revisit a classic networking problem -- how to recover from lost packets in the best-effort Internet. We propose CASPR, a system that judiciously leverages the cloud to recover from lost or delayed packets. CASPR supplements and protects best-effort connections by sending a small number of coded packets along the highly reliable but expensive cloud paths. When receivers detect packet loss, they… ▽ More

    Submitted 1 January, 2019; v1 submitted 27 December, 2018; originally announced December 2018.