Skip to main content

Showing 1–43 of 43 results for author: Akbari, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.05936  [pdf, other

    cs.CV

    CASP: Compression of Large Multimodal Models Based on Attention Sparsity

    Authors: Mohsen Gholami, Mohammad Akbari, Kevin Cannons, Yong Zhang

    Abstract: In this work, we propose an extreme compression technique for Large Multimodal Models (LMMs). While previous studies have explored quantization as an efficient post-training compression method for Large Language Models (LLMs), low-bit compression for multimodal models remains under-explored. The redundant nature of inputs in multimodal models results in a highly sparse attention matrix. We theoret… ▽ More

    Submitted 7 March, 2025; originally announced March 2025.

  2. arXiv:2503.02175  [pdf, other

    cs.CV cs.AI cs.LG

    DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models

    Authors: Saeed Ranjbar Alvar, Gursimran Singh, Mohammad Akbari, Yong Zhang

    Abstract: Large Multimodal Models (LMMs) have emerged as powerful models capable of understanding various data modalities, including text, images, and videos. LMMs encode both text and visual data into tokens that are then combined and processed by an integrated Large Language Model (LLM). Including visual tokens substantially increases the total token count, often by thousands. The increased input length f… ▽ More

    Submitted 1 April, 2025; v1 submitted 3 March, 2025; originally announced March 2025.

    Comments: Accepted to CVPR 2025

  3. arXiv:2502.15346  [pdf, other

    q-bio.QM cs.LG

    Drug-Target Interaction/Affinity Prediction: Deep Learning Models and Advances Review

    Authors: Ali Vefghi, Zahed Rahmati, Mohammad Akbari

    Abstract: Drug discovery remains a slow and expensive process that involves many steps, from detecting the target structure to obtaining approval from the Food and Drug Administration (FDA), and is often riddled with safety concerns. Accurate prediction of how drugs interact with their targets and the development of new drugs by using better methods and technologies have immense potential to speed up this p… ▽ More

    Submitted 21 February, 2025; originally announced February 2025.

    Comments: 64 pages, 7 figures, 10 tables

  4. arXiv:2412.12563  [pdf, other

    cs.CL

    Task-Agnostic Language Model Watermarking via High Entropy Passthrough Layers

    Authors: Vaden Masrani, Mohammad Akbari, David Ming Xuan Yue, Ahmad Rezaei, Yong Zhang

    Abstract: In the era of costly pre-training of large language models, ensuring the intellectual property rights of model owners, and insuring that said models are responsibly deployed, is becoming increasingly important. To this end, we propose model watermarking via passthrough layers, which are added to existing pre-trained networks and trained using a self-supervised loss such that the model produces hig… ▽ More

    Submitted 17 December, 2024; originally announced December 2024.

    Comments: Accepted to AAAI2025

  5. arXiv:2412.05585  [pdf

    cs.CV cs.AI cs.CL

    UNet++ and LSTM combined approach for Breast Ultrasound Image Segmentation

    Authors: Saba Hesaraki, Morteza Akbari, Ramin Mousa

    Abstract: Breast cancer stands as a prevalent cause of fatality among females on a global scale, with prompt detection playing a pivotal role in diminishing mortality rates. The utilization of ultrasound scans in the BUSI dataset for medical imagery pertaining to breast cancer has exhibited commendable segmentation outcomes through the application of UNet and UNet++ networks. Nevertheless, a notable drawbac… ▽ More

    Submitted 7 December, 2024; originally announced December 2024.

  6. arXiv:2409.00314  [pdf, other

    cs.CV

    Towards Secure and Usable 3D Assets: A Novel Framework for Automatic Visible Watermarking

    Authors: Gursimran Singh, Tianxi Hu, Mohammad Akbari, Qiang Tang, Yong Zhang

    Abstract: 3D models, particularly AI-generated ones, have witnessed a recent surge across various industries such as entertainment. Hence, there is an alarming need to protect the intellectual property and avoid the misuse of these valuable assets. As a viable solution to address these concerns, we rigorously define the novel task of automated 3D visible watermarking in terms of two competing aspects: water… ▽ More

    Submitted 17 September, 2024; v1 submitted 30 August, 2024; originally announced September 2024.

    Comments: Accepted to WACV2025

  7. arXiv:2408.05868  [pdf, ps, other

    cs.CV

    LaWa: Using Latent Space for In-Generation Image Watermarking

    Authors: Ahmad Rezaei, Mohammad Akbari, Saeed Ranjbar Alvar, Arezou Fatemi, Yong Zhang

    Abstract: With generative models producing high quality images that are indistinguishable from real ones, there is growing concern regarding the malicious usage of AI-generated images. Imperceptible image watermarking is one viable solution towards such concerns. Prior watermarking methods map the image to a latent space for adding the watermark. Moreover, Latent Diffusion Models (LDM) generate the image in… ▽ More

    Submitted 30 May, 2025; v1 submitted 11 August, 2024; originally announced August 2024.

    Comments: Accepted to ECCV 2024

  8. arXiv:2403.19754  [pdf, other

    cs.CL

    GOLD: Generalized Knowledge Distillation via Out-of-Distribution-Guided Language Data Generation

    Authors: Mohsen Gholami, Mohammad Akbari, Cindy Hu, Vaden Masrani, Z. Jane Wang, Yong Zhang

    Abstract: Knowledge distillation from LLMs is essential for the efficient deployment of language models. Prior works have proposed data generation using LLMs for preparing distilled models. We argue that generating data with LLMs is prone to sampling mainly from the center of original content distribution. This limitation hinders the distilled model from learning the true underlying data distribution and to… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  9. arXiv:2403.05628  [pdf, other

    cs.MM cs.CR

    AMUSE: Adaptive Multi-Segment Encoding for Dataset Watermarking

    Authors: Saeed Ranjbar Alvar, Mohammad Akbari, David Ming Xuan Yue, Yong Zhang

    Abstract: Curating high quality datasets that play a key role in the emergence of new AI applications requires considerable time, money, and computational resources. So, effective ownership protection of datasets is becoming critical. Recently, to protect the ownership of an image dataset, imperceptible watermarking techniques are used to store ownership information (i.e., watermark) into the individual ima… ▽ More

    Submitted 18 July, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  10. arXiv:2310.17737  [pdf, other

    cs.CL

    ArchBERT: Bi-Modal Understanding of Neural Architectures and Natural Languages

    Authors: Mohammad Akbari, Saeed Ranjbar Alvar, Behnam Kamranian, Amin Banitalebi-Dehkordi, Yong Zhang

    Abstract: Building multi-modal language models has been a trend in the recent years, where additional modalities such as image, video, speech, etc. are jointly learned along with natural languages (i.e., textual information). Despite the success of these multi-modal language models with different modalities, there is no existing solution for neural network architectures and natural languages. Providing neur… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: CoNLL 2023

  11. arXiv:2308.02027  [pdf, other

    cs.CV

    ETran: Energy-Based Transferability Estimation

    Authors: Mohsen Gholami, Mohammad Akbari, Xinglu Wang, Behnam Kamranian, Yong Zhang

    Abstract: This paper addresses the problem of ranking pre-trained models for object detection and image classification. Selecting the best pre-trained model by fine-tuning is an expensive and time-consuming task. Previous works have proposed transferability estimation based on features extracted by the pre-trained models. We argue that quantifying whether the target dataset is in-distribution (IND) or out-o… ▽ More

    Submitted 3 August, 2023; originally announced August 2023.

  12. EnrichEvent: Enriching Social Data with Contextual Information for Emerging Event Extraction

    Authors: Mohammadali Sefidi Esfahani, Mohammad Akbari

    Abstract: Social platforms have emerged as crucial platforms for distributing information and discussing social events, offering researchers an excellent opportunity to design and implement novel event detection frameworks. Identifying unspecified events and detecting events without prior knowledge enables governments, aid agencies, and experts to respond swiftly and effectively to unfolding situations, suc… ▽ More

    Submitted 11 June, 2025; v1 submitted 29 July, 2023; originally announced July 2023.

    Comments: Iran J Comput Sci (2025)

  13. arXiv:2303.04134  [pdf, other

    cs.CL cs.AI

    A Hybrid Architecture for Out of Domain Intent Detection and Intent Discovery

    Authors: Masoud Akbari, Ali Mohades, M. Hassan Shirali-Shahreza

    Abstract: Intent Detection is one of the tasks of the Natural Language Understanding (NLU) unit in task-oriented dialogue systems. Out of Scope (OOS) and Out of Domain (OOD) inputs may run these systems into a problem. On the other side, a labeled dataset is needed to train a model for Intent Detection in task-oriented dialogue systems. The creation of a labeled dataset is time-consuming and needs human res… ▽ More

    Submitted 30 July, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

  14. arXiv:2303.00408  [pdf, other

    cs.CL

    A Persian Benchmark for Joint Intent Detection and Slot Filling

    Authors: Masoud Akbari, Amir Hossein Karimi, Tayyebeh Saeedi, Zeinab Saeidi, Kiana Ghezelbash, Fatemeh Shamsezat, Mohammad Akbari, Ali Mohades

    Abstract: Natural Language Understanding (NLU) is important in today's technology as it enables machines to comprehend and process human language, leading to improved human-computer interactions and advancements in fields such as virtual assistants, chatbots, and language-based AI systems. This paper highlights the significance of advancing the field of NLU for low-resource languages. With intent detection… ▽ More

    Submitted 1 March, 2023; originally announced March 2023.

    Comments: 8 pages, 5 figures

    Report number: 2303.00408

  15. arXiv:2301.00604  [pdf

    cs.CL

    Russia-Ukraine war: Modeling and Clustering the Sentiments Trends of Various Countries

    Authors: Hamed Vahdat-Nejad, Mohammad Ghasem Akbari, Fatemeh Salmani, Faezeh Azizi, Hamid-Reza Nili-Sani

    Abstract: With Twitter's growth and popularity, a huge number of views are shared by users on various topics, making this platform a valuable information source on various political, social, and economic issues. This paper investigates English tweets on the Russia-Ukraine war to analyze trends reflecting users' opinions and sentiments regarding the conflict. The tweets' positive and negative sentiments are… ▽ More

    Submitted 2 January, 2023; originally announced January 2023.

  16. arXiv:2203.00748  [pdf, other

    cs.CL cs.LG

    E-LANG: Energy-Based Joint Inferencing of Super and Swift Language Models

    Authors: Mohammad Akbari, Amin Banitalebi-Dehkordi, Yong Zhang

    Abstract: Building huge and highly capable language models has been a trend in the past years. Despite their great performance, they incur high computational cost. A common solution is to apply model compression or choose light-weight architectures, which often need a separate fixed-size model for each desirable computational budget, and may lose performance in case of heavy compression. This paper proposes… ▽ More

    Submitted 1 March, 2022; originally announced March 2022.

    Comments: ACL 2022

  17. arXiv:2112.14796  [pdf, ps, other

    cs.CV

    Deep Learning meets Liveness Detection: Recent Advancements and Challenges

    Authors: Arian Sabaghi, Marzieh Oghbaie, Kooshan Hashemifard, Mohammad Akbari

    Abstract: Facial biometrics has been recently received tremendous attention as a convenient replacement for traditional authentication systems. Consequently, detecting malicious attempts has found great significance, leading to extensive studies in face anti-spoofing~(FAS),i.e., face presentation attack detection. Deep feature learning and techniques, as opposed to hand-crafted features, have promised a dra… ▽ More

    Submitted 29 December, 2021; originally announced December 2021.

  18. arXiv:2110.10343  [pdf, other

    cs.CV cs.AI cs.LG

    EBJR: Energy-Based Joint Reasoning for Adaptive Inference

    Authors: Mohammad Akbari, Amin Banitalebi-Dehkordi, Yong Zhang

    Abstract: State-of-the-art deep learning models have achieved significant performance levels on various benchmarks. However, the excellent performance comes at a cost of inefficient computational cost. Light-weight architectures, on the other hand, achieve moderate accuracies, but at a much more desirable latency. This paper presents a new method of jointly using the large accurate models together with the… ▽ More

    Submitted 19 October, 2021; originally announced October 2021.

    Comments: BMVC 2021

  19. arXiv:2110.07879  [pdf, other

    cs.CV

    Advances and Challenges in Deep Lip Reading

    Authors: Marzieh Oghbaie, Arian Sabaghi, Kooshan Hashemifard, Mohammad Akbari

    Abstract: Driven by deep learning techniques and large-scale datasets, recent years have witnessed a paradigm shift in automatic lip reading. While the main thrust of Visual Speech Recognition (VSR) was improving accuracy of Audio Speech Recognition systems, other potential applications, such as biometric identification, and the promised gains of VSR systems, have motivated extensive efforts on developing t… ▽ More

    Submitted 15 October, 2021; originally announced October 2021.

  20. arXiv:2108.07800  [pdf, other

    cs.LG stat.ML

    Bagging Supervised Autoencoder Classifier for Credit Scoring

    Authors: Mahsan Abdoli, Mohammad Akbari, Jamal Shahrabi

    Abstract: Credit scoring models, which are among the most potent risk management tools that banks and financial institutes rely on, have been a popular subject for research in the past few decades. Accordingly, many approaches have been developed to address the challenges in classifying loan applicants and improve and facilitate decision-making. The imbalanced nature of credit scoring datasets, as well as t… ▽ More

    Submitted 12 August, 2021; originally announced August 2021.

  21. arXiv:2107.06463  [pdf, other

    eess.IV cs.CV

    Learned Image Compression with Gaussian-Laplacian-Logistic Mixture Model and Concatenated Residual Modules

    Authors: Haisheng Fu, Feng Liang, Jianping Lin, Bing Li, Mohammad Akbari, Jie Liang, Guohe Zhang, Dong Liu, Chengjie Tu, Jingning Han

    Abstract: Recently deep learning-based image compression methods have achieved significant achievements and gradually outperformed traditional approaches including the latest standard Versatile Video Coding (VVC) in both PSNR and MS-SSIM metrics. Two key components of learned image compression are the entropy model of the latent representations and the encoding/decoding network architectures. Various models… ▽ More

    Submitted 9 February, 2024; v1 submitted 13 July, 2021; originally announced July 2021.

    Comments: IEEE Transactions On Image Processing

  22. arXiv:2105.04207  [pdf, other

    eess.SP cs.LG

    Age of Information Aware VNF Scheduling in Industrial IoT Using Deep Reinforcement Learning

    Authors: Mohammad Akbari, Mohammad Reza Abedi, Roghayeh Joda, Mohsen Pourghasemian, Nader Mokari, Melike Erol-Kantarci

    Abstract: In delay-sensitive industrial internet of things (IIoT) applications, the age of information (AoI) is employed to characterize the freshness of information. Meanwhile, the emerging network function virtualization provides flexibility and agility for service providers to deliver a given network service using a sequence of virtual network functions (VNFs). However, suitable VNF placement and schedul… ▽ More

    Submitted 10 May, 2021; originally announced May 2021.

  23. arXiv:2101.04756  [pdf, other

    cs.CV cs.MM

    A Compact Deep Learning Model for Face Spoofing Detection

    Authors: Seyedkooshan Hashemifard, Mohammad Akbari

    Abstract: In recent years, face biometric security systems are rapidly increasing, therefore, the presentation attack detection (PAD) has received significant attention from research communities and has become a major field of research. Researchers have tackled the problem with various methods, from exploiting conventional texture feature extraction such as LBP, BSIF, and LPQ to using deep neural networks w… ▽ More

    Submitted 12 January, 2021; originally announced January 2021.

  24. arXiv:2012.15463  [pdf, other

    cs.CV cs.LG eess.IV

    Learned Multi-Resolution Variable-Rate Image Compression with Octave-based Residual Blocks

    Authors: Mohammad Akbari, Jie Liang, Jingning Han, Chengjie Tu

    Abstract: Recently deep learning-based image compression has shown the potential to outperform traditional codecs. However, most existing methods train multiple networks for multiple bit rates, which increase the implementation complexity. In this paper, we propose a new variable-rate image compression framework, which employs generalized octave convolutions (GoConv) and generalized octave transposed-convol… ▽ More

    Submitted 31 December, 2020; originally announced December 2020.

    Comments: 10 pages, 9 figures, 1 table; accepted to IEEE Transactions on Multimedia 2020. arXiv admin note: substantial text overlap with arXiv:1912.05688

  25. arXiv:2011.14754  [pdf

    cs.CR cs.CY cs.HC cs.LG

    Twitter Spam Detection: A Systematic Review

    Authors: Sepideh Bazzaz Abkenar, Mostafa Haghi Kashani, Mohammad Akbari, Ebrahim Mahdipour

    Abstract: Nowadays, with the rise of Internet access and mobile devices around the globe, more people are using social networks for collaboration and receiving real-time information. Twitter, the microblogging that is becoming a critical source of communication and news propagation, has grabbed the attention of spammers to distract users. So far, researchers have introduced various defense techniques to det… ▽ More

    Submitted 1 December, 2020; v1 submitted 30 November, 2020; originally announced November 2020.

    Comments: 18 pages, 12 figures, 14 tables, 91 references

  26. arXiv:2010.08930  [pdf, ps, other

    cs.LG

    Dynamic Ensemble Learning for Credit Scoring: A Comparative Study

    Authors: Mahsan Abdoli, Mohammad Akbari, Jamal Shahrabi

    Abstract: Automatic credit scoring, which assesses the probability of default by loan applicants, plays a vital role in peer-to-peer lending platforms to reduce the risk of lenders. Although it has been demonstrated that dynamic selection techniques are effective for classification tasks, the performance of these techniques for credit scoring has not yet been determined. This study attempts to benchmark dif… ▽ More

    Submitted 18 October, 2020; originally announced October 2020.

  27. arXiv:2002.10032  [pdf, other

    eess.IV cs.CV cs.LG

    Generalized Octave Convolutions for Learned Multi-Frequency Image Compression

    Authors: Mohammad Akbari, Jie Liang, Jingning Han, Chengjie Tu

    Abstract: Learned image compression has recently shown the potential to outperform the standard codecs. State-of-the-art rate-distortion (R-D) performance has been achieved by context-adaptive entropy coding approaches in which hyperprior and autoregressive models are jointly utilized to effectively capture the spatial dependencies in the latent representations. However, the latents are feature maps of the… ▽ More

    Submitted 31 December, 2020; v1 submitted 23 February, 2020; originally announced February 2020.

    Comments: 13 pages, 10 figures, 5 tables; Extended version of the paper accepted to AAAI 2021

  28. arXiv:2001.09417  [pdf, other

    eess.IV cs.CV

    Deep Learning-based Image Compression with Trellis Coded Quantization

    Authors: Binglin Li, Mohammad Akbari, Jie Liang, Yang Wang

    Abstract: Recently many works attempt to develop image compression models based on deep learning architectures, where the uniform scalar quantizer (SQ) is commonly applied to the feature maps between the encoder and decoder. In this paper, we propose to incorporate trellis coded quantizer (TCQ) into a deep learning based image compression framework. A soft-to-hard strategy is applied to allow for back propa… ▽ More

    Submitted 26 January, 2020; originally announced January 2020.

    Comments: Accepted in Data Compression Conference (DCC) 2020

  29. arXiv:1912.05688  [pdf, other

    eess.IV cs.CV

    Learned Variable-Rate Image Compression with Residual Divisive Normalization

    Authors: Mohammad Akbari, Jie Liang, Jingning Han, Chengjie Tu

    Abstract: Recently it has been shown that deep learning-based image compression has shown the potential to outperform traditional codecs. However, most existing methods train multiple networks for multiple bit rates, which increases the implementation complexity. In this paper, we propose a variable-rate image compression framework, which employs more Generalized Divisive Normalization (GDN) layers than pre… ▽ More

    Submitted 11 December, 2019; originally announced December 2019.

    Comments: 6 pages, 5 figures

  30. arXiv:1909.06859  [pdf, other

    cs.LG cs.IR stat.ML

    MarlRank: Multi-agent Reinforced Learning to Rank

    Authors: Shihao Zou, Zhonghua Li, Mohammad Akbari, Jun Wang, Peng Zhang

    Abstract: When estimating the relevancy between a query and a document, ranking models largely neglect the mutual information among documents. A common wisdom is that if two documents are similar in terms of the same query, they are more likely to have similar relevance score. To mitigate this problem, in this paper, we propose a multi-agent reinforced ranking model, named MarlRank. In particular, by consid… ▽ More

    Submitted 15 September, 2019; originally announced September 2019.

    Comments: CIKM 2019

  31. arXiv:1909.01735  [pdf, other

    stat.ML cs.LG q-bio.QM

    Using Contextual Information to Improve Blood Glucose Prediction

    Authors: Mohammad Akbari, Rumi Chunara

    Abstract: Blood glucose value prediction is an important task in diabetes management. While it is reported that glucose concentration is sensitive to social context such as mood, physical activity, stress, diet, alongside the influence of diabetes pathologies, we need more research on data and methodologies to incorporate and evaluate signals about such temporal context into prediction models. Person-genera… ▽ More

    Submitted 24 August, 2019; originally announced September 2019.

    Comments: 17 pages, 3 figures

  32. arXiv:1907.06566  [pdf, other

    eess.IV cs.LG stat.ML

    Improved Hybrid Layered Image Compression using Deep Learning and Traditional Codecs

    Authors: Haisheng Fu, Feng Liang, Bo Lei, Nai Bian, Qian zhang, Mohammad Akbari, Jie Liang, Chengjie Tu

    Abstract: Recently deep learning-based methods have been applied in image compression and achieved many promising results. In this paper, we propose an improved hybrid layered image compression framework by combining deep learning and the traditional image codecs. At the encoder, we first use a convolutional neural network (CNN) to obtain a compact representation of the input image, which is losslessly enco… ▽ More

    Submitted 15 July, 2019; originally announced July 2019.

    Comments: Submitted to Signal Processing: Image Communication

    Report number: 1907.06566

    Journal ref: Volume 82, March 2020, 115774

  33. Detecting Target-Area Link-Flooding DDoS Attacks using Traffic Analysis and Supervised Learning

    Authors: Mostafa Rezazad, Matthias R. Brust, Mohammad Akbari, Pascal Bouvry, Ngai-Man Cheung

    Abstract: A novel class of extreme link-flooding DDoS (Distributed Denial of Service) attacks is designed to cut off entire geographical areas such as cities and even countries from the Internet by simultaneously targeting a selected set of network links. The Crossfire attack is a target-area link-flooding attack, which is orchestrated in three complex phases. The attack uses a massively distributed large-s… ▽ More

    Submitted 1 March, 2019; originally announced March 2019.

    Comments: arXiv admin note: text overlap with arXiv:1801.00235

    Journal ref: Advances in Intelligent Systems and Computing, 2018

  34. arXiv:1812.00912  [pdf, other

    cs.SI cs.AI cs.CY

    From the User to the Medium: Neural Profiling Across Web Communities

    Authors: Mohammad Akbari, Kunal Relia, Anas Elghafari, Rumi Chunara

    Abstract: Online communities provide a unique way for individuals to access information from those in similar circumstances, which can be critical for health conditions that require daily and personalized management. As these groups and topics often arise organically, identifying the types of topics discussed is necessary to understand their needs. As well, these communities and people in them can be quite… ▽ More

    Submitted 3 December, 2018; originally announced December 2018.

  35. Named Entity Disambiguation using Deep Learning on Graphs

    Authors: Alberto Cetoli, Mohammad Akbari, Stefano Bragaglia, Andrew D. O'Harney, Marc Sloan

    Abstract: We tackle \ac{NED} by comparing entities in short sentences with \wikidata{} graphs. Creating a context vector from graphs through deep learning is a challenging problem that has never been applied to \ac{NED}. Our main contribution is to present an experimental study of recent neural techniques, as well as a discussion about which graph features are most important for the disambiguation task. In… ▽ More

    Submitted 22 October, 2018; originally announced October 2018.

  36. arXiv:1806.03348  [pdf, other

    cs.CV

    DSSLIC: Deep Semantic Segmentation-based Layered Image Compression

    Authors: Mohammad Akbari, Jie Liang, Jingning Han

    Abstract: Deep learning has revolutionized many computer vision fields in the last few years, including learning-based image compression. In this paper, we propose a deep semantic segmentation-based layered image compression (DSSLIC) framework in which the semantic segmentation map of the input image is obtained and encoded as the base layer of the bit-stream. A compact representation of the input image is… ▽ More

    Submitted 18 April, 2019; v1 submitted 8 June, 2018; originally announced June 2018.

    Comments: - More Experimental results added

  37. arXiv:1806.00509  [pdf, other

    cs.LG stat.ML

    Semi-Recurrent CNN-based VAE-GAN for Sequential Data Generation

    Authors: Mohammad Akbari, Jie Liang

    Abstract: A semi-recurrent hybrid VAE-GAN model for generating sequential data is introduced. In order to consider the spatial correlation of the data in each frame of the generated sequence, CNNs are utilized in the encoder, generator, and discriminator. The subsequent frames are sampled from the latent distributions obtained by encoding the previous frames. As a result, the dependencies between the frames… ▽ More

    Submitted 1 June, 2018; originally announced June 2018.

    Comments: 5 pages, 6 figures, ICASSP 2018

    Journal ref: 2018 IEEE International Conference on Acoustics, Speech and Signal Processing, 2321-2325

  38. arXiv:1803.09002  [pdf, other

    cs.SI cs.CY

    Socio-spatial Self-organizing Maps: Using Social Media to Assess Relevant Geographies for Exposure to Social Processes

    Authors: Kunal Relia, Mohammad Akbari, Dustin Duncan, Rumi Chunara

    Abstract: Social media offers a unique window into attitudes like racism and homophobia, exposure to which are important, hard to measure and understudied social determinants of health. However, individual geo-located observations from social media are noisy and geographically inconsistent. Existing areas by which exposures are measured, like Zip codes, average over irrelevant administratively-defined bound… ▽ More

    Submitted 4 September, 2018; v1 submitted 23 March, 2018; originally announced March 2018.

    Comments: 23 pages, 4 figures, 3 tables

    ACM Class: I.5.3; I.5.1; H.3.3; J.4

    Journal ref: Proc. ACM Hum.-Comput.Interact.2, CSCW, Article 145 (November 2018), 23 pages

  39. arXiv:1802.08402  [pdf

    cs.CV

    Adaptive specular reflection detection and inpainting in colonoscopy video frames

    Authors: Mojtaba Akbari, Majid Mohrekesh, S. M. Reza Soroushmehr, Nader Karimi, Shadrokh Samavi, Kayvan Najarian

    Abstract: Colonoscopy video frames might be contaminated by bright spots with unsaturated values known as specular reflection. Detection and removal of such reflections could enhance the quality of colonoscopy images and facilitate diagnosis procedure. In this paper we propose a novel two-phase method for this purpose, consisting of detection and removal phases. In the detection phase, we employ both HSV an… ▽ More

    Submitted 23 February, 2018; originally announced February 2018.

    Comments: 5 pages, 5 figures

  40. arXiv:1802.07778  [pdf

    cs.CV

    Left Ventricle Segmentation in Cardiac MR Images Using Fully Convolutional Network

    Authors: Mina Nasr-Esfahani, Majid Mohrekesh, Mojtaba Akbari, S. M. Reza Soroushmehr, Ebrahim Nasr-Esfahani, Nader Karimi, Shadrokh Samavi, Kayvan Najarian

    Abstract: Medical image analysis, especially segmenting a specific organ, has an important role in developing clinical decision support systems. In cardiac magnetic resonance (MR) imaging, segmenting the left and right ventricles helps physicians diagnose different heart abnormalities. There are challenges for this task, including the intensity and shape similarity between left ventricle and other organs, i… ▽ More

    Submitted 21 February, 2018; originally announced February 2018.

    Comments: 4 pages, 3 figures

  41. Modeling and predicting measured response time of cloud-based web services using long-memory time series

    Authors: Hossein Nourikhah, Mohammad Kazem Akbari, Mohammad Kalantari

    Abstract: Predicting cloud performance from user's perspective is a complex task, because of several factors involved in providing the service to the consumer. In this work, the response time of 10 real-world services is analyzed. We have observed long memory in terms of the measured response time of the CPU-intensive services and statistically verified this observation using estimators of the Hurst exponen… ▽ More

    Submitted 11 April, 2016; originally announced April 2016.

    Journal ref: The Journal of Supercomputing, February 2015, Volume 71, Issue 2, pp 673-696

  42. arXiv:1407.1395  [pdf

    cs.IT

    CB-REFIM: A Practical Coordinated Beamforming in Multicell Networks

    Authors: Mohammad Hossein Akbari, Vahid Tabataba Vakili

    Abstract: Performance of multicell systems is inevitably limited by interference and available resources. Although intercell interference can be mitigated by Base Station (BS) Coordination, the demand on inter-BS information exchange and computational complexity grows rapidly with the number of cells, subcarriers, and users. On the other hand, some of the existing coordination beamforming methods need compu… ▽ More

    Submitted 9 July, 2014; v1 submitted 5 July, 2014; originally announced July 2014.

    Comments: 20 pages, 8 figures, to appear in IET Communication

  43. arXiv:1406.7285  [pdf

    cs.DC cs.NI cs.PF

    Near-Optimal Virtual Machine Packing Based on Resource Requirement of Service Demands Using Pattern Clustering

    Authors: Yaghoob Siahmargooei, Mohammad Kazem Akbari, Seyyed Alireza Hashemi Golpayegani, Saeed Sharifian

    Abstract: Upon the expansion of Cloud Computing and the positive outlook of organizations with regard to the movements towards using cloud computing and their expanding utilization of such valuable processing method, as well as the solutions provided by the cloud infrastructure providers with regard to the reduction of the costs of processing resources, the problem of organizing resources in a cloud environ… ▽ More

    Submitted 27 June, 2014; originally announced June 2014.

    Journal ref: IJASCSE journal, Volume 3, Issue 6, JUNE 2014