
Showing 1–50 of 53 results for author: R, M

Searching in archive cs.

  1. arXiv:2506.10225  [pdf, ps, other]

    cs.SD cs.AI eess.AS

    Fine-Grained control over Music Generation with Activation Steering

    Authors: Dipanshu Panda, Jayden Koshy Joe, Harshith M R, Swathi Narashiman, Pranay Mathur, Anish Veerakumar, Aniruddh Krishna, Keerthiharan A

    Abstract: We present a method for fine-grained control over music generation through inference-time interventions on an autoregressive generative music transformer called MusicGen. Our approach enables timbre transfer, style transfer, and genre fusion by steering the residual stream using weights of linear probes trained on it, or by steering the attention layer activations in a similar manner. We observe t…

    Submitted 11 June, 2025; originally announced June 2025.

  2. arXiv:2504.15179  [pdf, other]

    cs.CV

    FaceCraft4D: Animated 3D Facial Avatar Generation from a Single Image

    Authors: Fei Yin, Mallikarjun B R, Chun-Han Yao, Rafał Mantiuk, Varun Jampani

    Abstract: We present a novel framework for generating a high-quality, animatable 4D avatar from a single image. While recent advances have shown promising results in 4D avatar creation, existing methods either require extensive multiview data or struggle with shape accuracy and identity consistency. To address these limitations, we propose a comprehensive system that leverages shape, image, and video priors t…

    Submitted 21 April, 2025; originally announced April 2025.

  3. arXiv:2503.07902  [pdf, other]

    cs.RO

    LTLCodeGen: Code Generation of Syntactically Correct Temporal Logic for Robot Task Planning

    Authors: Behrad Rabiei, Mahesh Kumar A. R., Zhirui Dai, Surya L. S. R. Pilla, Qiyue Dong, Nikolay Atanasov

    Abstract: This paper focuses on planning robot navigation tasks from natural language specifications. We develop a modular approach, where a large language model (LLM) translates the natural language instructions into a linear temporal logic (LTL) formula with propositions defined by object classes in a semantic occupancy map. The LTL formula and the semantic occupancy map are provided to a motion planning…

    Submitted 10 March, 2025; originally announced March 2025.

  4. Lite2Relight: 3D-aware Single Image Portrait Relighting

    Authors: Pramod Rao, Gereon Fox, Abhimitra Meka, Mallikarjun B R, Fangneng Zhan, Tim Weyrich, Bernd Bickel, Hanspeter Pfister, Wojciech Matusik, Mohamed Elgharib, Christian Theobalt

    Abstract: Achieving photorealistic 3D view synthesis and relighting of human portraits is pivotal for advancing AR/VR applications. Existing methodologies in portrait relighting demonstrate substantial limitations in terms of generalization and 3D consistency, coupled with inaccuracies in physically realistic lighting and identity preservation. Furthermore, personalization from a single view is difficult to…

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: Accepted at SIGGRAPH '24: ACM SIGGRAPH 2024 Conference Papers

  5. arXiv:2407.06910  [pdf, other]

    cs.IR cs.AI cs.LG

    Fine-grained large-scale content recommendations for MSX sellers

    Authors: Manpreet Singh, Ravdeep Pasricha, Ravi Prasad Kondapalli, Kiran R, Nitish Singh, Akshita Agarwalla, Manoj R, Manish Prabhakar, Laurent Boué

    Abstract: One of the most critical tasks of Microsoft sellers is to meticulously track and nurture potential business opportunities through proactive engagement and tailored solutions. Recommender systems play a central role in helping sellers achieve their goals. In this paper, we present a content recommendation model which surfaces various types of content (technical documentation, comparison with competito…

    Submitted 9 July, 2024; originally announced July 2024.

    Journal ref: Microsoft Journal of Applied Research, Volume 21, 2024

  6. arXiv:2406.11858  [pdf, other]

    cs.CY

    Student Perspectives on Using a Large Language Model (LLM) for an Assignment on Professional Ethics

    Authors: Virginia Grande, Natalie Kiesler, Maria Andreina Francisco R

    Abstract: The advent of Large Language Models (LLMs) started a serious discussion among educators on how LLMs would affect, e.g., curricula, assessments, and students' competencies. Generative AI and LLMs also raised ethical questions and concerns for computing educators and professionals. This experience report presents an assignment within a course on professional competencies, including some related to e…

    Submitted 9 April, 2024; originally announced June 2024.

    Comments: accepted at ITiCSE 2024, Milan, Italy

  7. arXiv:2405.20959  [pdf, other]

    cs.AI cs.DB

    Navigating Tabular Data Synthesis Research: Understanding User Needs and Tool Capabilities

    Authors: Maria F. Davila R., Sven Groen, Fabian Panse, Wolfram Wingerath

    Abstract: In an era of rapidly advancing data-driven applications, there is a growing demand for data in both research and practice. Synthetic data have emerged as an alternative when no real data is available (e.g., due to privacy regulations). Synthesizing tabular data presents unique and complex challenges, especially handling (i) missing values, (ii) dataset imbalance, (iii) diverse column types, and (i…

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: 14 pages, 3 figures

  8. arXiv:2404.01122  [pdf]

    cs.LG

    Enhanced Precision in Rainfall Forecasting for Mumbai: Utilizing Physics Informed ConvLSTM2D Models for Finer Spatial and Temporal Resolution

    Authors: Ajay Devda, Akshay Sunil, Murthy R, B Deepthi

    Abstract: Forecasting rainfall in tropical areas is challenging due to complex atmospheric behaviour, elevated humidity levels, and the common presence of convective rain events. In the Indian context, the difficulty is further exacerbated by the monsoon intraseasonal oscillations, which introduce significant variability in rainfall patterns over short periods. Earlier investigations into rainfall…

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: Submitted to Computer and Geosciences. arXiv admin note: substantial text overlap with arXiv:2310.09311

  9. arXiv:2401.04732  [pdf, other]

    cs.IR cs.AI cs.LG

    A case study of Generative AI in MSX Sales Copilot: Improving seller productivity with a real-time question-answering system for content recommendation

    Authors: Manpreet Singh, Ravdeep Pasricha, Nitish Singh, Ravi Prasad Kondapalli, Manoj R, Kiran R, Laurent Boué

    Abstract: In this paper, we design a real-time question-answering system specifically targeted for helping sellers get relevant material/documentation they can share live with their customers or refer to during a call. Taking the Seismic content repository as a relatively large scale example of a diverse dataset of sales material, we demonstrate how LLM embeddings of sellers' queries can be matched with the…

    Submitted 4 January, 2024; originally announced January 2024.

    Journal ref: Microsoft Journal of Applied Research, Volume 20, 2024

  10. arXiv:2312.05797  [pdf, other]

    cs.CV

    Multimodality in Online Education: A Comparative Study

    Authors: Praneeta Immadisetty, Pooja Rajesh, Akshita Gupta, Anala M R, Soumya A, K. N. Subramanya

    Abstract: The commencement of the decade brought along with it a grave pandemic and in response the movement of education forums predominantly into the online world. With a surge in the usage of online video conferencing platforms and tools to better gauge student understanding, there needs to be a mechanism to assess whether instructors can grasp the extent to which students understand the subject and thei…

    Submitted 17 December, 2023; v1 submitted 10 December, 2023; originally announced December 2023.

  11. arXiv:2311.12719  [pdf]

    cs.AI

    Development of a Legal Document AI-Chatbot

    Authors: Pranav Nataraj Devaraj, Rakesh Teja P V, Aaryav Gangrade, Manoj Kumar R

    Abstract: With the exponential growth of digital data and the increasing complexity of legal documentation, there is a pressing need for efficient and intelligent tools to streamline the handling of legal documents. With the recent developments in the AI field, especially in chatbots, it cannot be ignored as a very compelling solution to this problem. An insight into the process of creating a Legal Documentat…

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: 5 pages, 5 figures

  12. arXiv:2310.06841  [pdf]

    cs.CR cs.LG

    Malware Classification using Deep Neural Networks: Performance Evaluation and Applications in Edge Devices

    Authors: Akhil M R, Adithya Krishna V Sharma, Harivardhan Swamy, Pavan A, Ashray Shetty, Anirudh B Sathyanarayana

    Abstract: With the increasing extent of malware attacks in the present day along with the difficulty in detecting modern malware, it is necessary to evaluate the effectiveness and performance of Deep Neural Networks (DNNs) for malware classification. Multiple DNN architectures can be designed and trained to detect and classify malware binaries. Results demonstrate the potential of DNNs in accurately classif…

    Submitted 21 August, 2023; originally announced October 2023.

  13. arXiv:2306.00547  [pdf, other]

    cs.CV cs.GR

    AvatarStudio: Text-driven Editing of 3D Dynamic Human Head Avatars

    Authors: Mohit Mendiratta, Xingang Pan, Mohamed Elgharib, Kartik Teotia, Mallikarjun B R, Ayush Tewari, Vladislav Golyanik, Adam Kortylewski, Christian Theobalt

    Abstract: Capturing and editing full head performances enables the creation of virtual characters with various applications such as extended reality and media production. The past few years witnessed a steep rise in the photorealism of human head avatars. Such avatars can be controlled through different input data modalities, including RGB, audio, depth, IMUs and others. While these data modalities provide…

    Submitted 2 June, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: 17 pages, 17 figures. Project page: https://vcai.mpi-inf.mpg.de/projects/AvatarStudio/

  14. arXiv:2304.13918  [pdf, other]

    cs.NE

    Neuromorphic Computing with AER using Time-to-Event-Margin Propagation

    Authors: Madhuvanthi Srivatsav R, Shantanu Chakrabartty, Chetan Singh Thakur

    Abstract: Address-Event-Representation (AER) is a spike-routing protocol that allows the scaling of neuromorphic and spiking neural network (SNN) architectures to a size that is comparable to that of digital neural network architectures. However, in conventional neuromorphic architectures, the AER protocol and, in general, any virtual interconnect plays only a passive role in computation, i.e., only for rou…

    Submitted 26 April, 2023; originally announced April 2023.

  15. arXiv:2303.18193  [pdf, other]

    cs.CV cs.GR cs.LG

    GVP: Generative Volumetric Primitives

    Authors: Mallikarjun B R, Xingang Pan, Mohamed Elgharib, Christian Theobalt

    Abstract: Advances in 3D-aware generative models have pushed the boundary of image synthesis with explicit camera control. To achieve high-resolution image synthesis, several attempts have been made to design efficient generators, such as hybrid architectures with both 3D and 2D components. However, such a design compromises multiview consistency, and the design of a pure 3D generator with high resolution i…

    Submitted 31 March, 2023; originally announced March 2023.

    Comments: https://vcai.mpi-inf.mpg.de/projects/GVP/index.html

  16. arXiv:2303.14471  [pdf, other]

    cs.CV cs.GR

    HQ3DAvatar: High Quality Controllable 3D Head Avatar

    Authors: Kartik Teotia, Mallikarjun B R, Xingang Pan, Hyeongwoo Kim, Pablo Garrido, Mohamed Elgharib, Christian Theobalt

    Abstract: Multi-view volumetric rendering techniques have recently shown great potential in modeling and synthesizing high-quality head avatars. A common approach to capture full head dynamic performances is to track the underlying geometry using a mesh-based template or 3D cube-based graphics primitives. While these model-based approaches achieve promising results, they often fail to learn complex geometri…

    Submitted 25 March, 2023; originally announced March 2023.

    Comments: 16 Pages, 15 Figures. Project page: https://vcai.mpi-inf.mpg.de/projects/HQ3DAvatar/

  17. arXiv:2302.07672  [pdf, other]

    cs.GR

    LiveHand: Real-time and Photorealistic Neural Hand Rendering

    Authors: Akshay Mundra, Mallikarjun B R, Jiayi Wang, Marc Habermann, Christian Theobalt, Mohamed Elgharib

    Abstract: The human hand is the main medium through which we interact with our surroundings, making its digitization an important problem. While there are several works modeling the geometry of hands, little attention has been paid to capturing photo-realistic appearance. Moreover, for applications in extended reality and gaming, real-time rendering is critical. We present the first neural-implicit approach…

    Submitted 20 August, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

    Comments: Project page: https://vcai.mpi-inf.mpg.de/projects/LiveHand/ | Accepted at ICCV '23 | 11 pages, 7 figures

  18. arXiv:2210.15664  [pdf, other]

    cs.CV cs.GR

    State of the Art in Dense Monocular Non-Rigid 3D Reconstruction

    Authors: Edith Tretschk, Navami Kairanda, Mallikarjun B R, Rishabh Dabral, Adam Kortylewski, Bernhard Egger, Marc Habermann, Pascal Fua, Christian Theobalt, Vladislav Golyanik

    Abstract: 3D reconstruction of deformable (or non-rigid) scenes from a set of monocular 2D image observations is a long-standing and actively researched area of computer vision and graphics. It is an ill-posed inverse problem, since -- without additional prior assumptions -- it permits infinitely many solutions leading to accurate projection to the input 2D images. Non-rigid reconstruction is a foundational…

    Submitted 24 March, 2023; v1 submitted 27 October, 2022; originally announced October 2022.

    Comments: 36 pages, 18 figures, 3 tables; State-of-the-Art Report at EUROGRAPHICS 2023

    Journal ref: Computer Graphics Forum, 2023

  19. arXiv:2210.12791  [pdf, other]

    astro-ph.IM astro-ph.SR cs.LG

    O-type Stars Stellar Parameter Estimation Using Recurrent Neural Networks

    Authors: Miguel Flores R., Luis J. Corral, Celia R. Fierro-Santillán, Silvana G. Navarro

    Abstract: In this paper, we present a deep learning system approach to estimating luminosity, effective temperature, and surface gravity of O-type stars using the optical region of the stellar spectra. In previous work, we compared a set of machine learning and deep learning algorithms in order to establish a reliable way to fit a stellar model using two methods: the classification of the stellar spectra mod…

    Submitted 27 October, 2022; v1 submitted 23 October, 2022; originally announced October 2022.

  20. arXiv:2210.04218  [pdf, other]

    cs.CV cs.LG

    Transformer-based Flood Scene Segmentation for Developing Countries

    Authors: Ahan M R, Roshan Roy, Shreyas Sunil Kulkarni, Vaibhav Soni, Ashish Chittora

    Abstract: Floods are large-scale natural disasters that often induce a massive number of deaths, extensive material damage, and economic turmoil. The effects are more extensive and longer-lasting in high-population and low-resource developing countries. Early Warning Systems (EWS) constantly assess water levels and other factors to forecast floods, to help minimize damage. Post-disaster, disaster response t…

    Submitted 9 October, 2022; originally announced October 2022.

    Comments: Presented at NeurIPS 2021 Workshop on Machine Learning for the Developing World

  21. arXiv:2209.13876  [pdf]

    quant-ph cs.ET

    An Interface for Variational Quantum Eigensolver based Energy (VQE-E) and Force (VQE-F) Calculator to Atomic Simulation Environment (ASE)

    Authors: Nirmal M R, Shampa Sarkar, Manoj Nambiar

    Abstract: The development of quantum algorithms to solve quantum chemistry problems has offered a promising new paradigm of performing computer simulations at the scale of atoms and molecules. Although the majority of the research so far has focused on designing quantum algorithms to compute ground and excited state energies and forces, it is useful to run different simulation tasks, such as geometry optimizati…

    Submitted 28 September, 2022; originally announced September 2022.

    Comments: 6 pages, 2 figures, 2 tables

  22. arXiv:2208.08760  [pdf]

    cs.CR

    Blockchain based digital vaccine passport

    Authors: Ms. Megha Rani R, Roshan R Acharya, Ramkishan, Ranjith K, Rakshith Ay Gowda

    Abstract: Travel has been challenging recently since different nations have implemented varied immigration and travel policies. For the time being, immigration officials want proof of each person's immunity to the virus. A vaccine passport serves as evidence that a person has tested negative for or is immune to a particular virus. In terms of COVID-19, those who hold a vaccine passport will be permitted ent…

    Submitted 18 August, 2022; originally announced August 2022.

  23. arXiv:2204.02236  [pdf, other]

    cs.IT eess.SP

    Designing Interference-Immune Doppler-Tolerant Waveforms for Automotive Radar Applications

    Authors: Robin Amar, Mohammad Alaee-Kerahroodi, Prabhu Babu, Bhavani Shankar M. R

    Abstract: Dynamic target detection using an FMCW waveform is challenging in the presence of interference for different radar applications. Degradation in SNR is irreparable and interference is difficult to mitigate in the time and frequency domains. In this paper, a waveform design problem is addressed using the Majorization-Minimization (MM) framework by considering PSL/ISL cost functions, resulting in a code sequ…

    Submitted 5 April, 2022; originally announced April 2022.

  24. arXiv:2203.15926  [pdf, other]

    cs.CV cs.GR

    Disentangled3D: Learning a 3D Generative Model with Disentangled Geometry and Appearance from Monocular Images

    Authors: Ayush Tewari, Mallikarjun B R, Xingang Pan, Ohad Fried, Maneesh Agrawala, Christian Theobalt

    Abstract: Learning 3D generative models from a dataset of monocular images enables self-supervised 3D reasoning and controllable synthesis. State-of-the-art 3D generative models are GANs which use neural 3D volumetric representations for synthesis. Images are synthesized by rendering the volumes from a given camera. These models can disentangle the 3D scene from the camera viewpoint in any generated image.…

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: CVPR 2022

  25. arXiv:2112.07170  [pdf, other]

    cs.NI

    Performance evaluation of the QOS provisioning ability of IEEE 802.11e WLAN standard for multimedia traffic

    Authors: Venkata Sitaram. A, Venkatesh. T. G, Arun George, Manivasakan. R, Bhasker Dappuri

    Abstract: This paper presents an analytical model for the average frame transmission delay and the jitter for the different Access Categories (ACs) of the IEEE 802.11e Enhanced Distributed Channel Access (EDCA) mechanism. The following are the salient features of our model. As defined by the standard, we consider (1) the virtual collisions among different ACs inside each EDCA station in addition to external coll…

    Submitted 14 December, 2021; originally announced December 2021.

  26. arXiv:2111.11734  [pdf, other]

    cs.CV eess.IV

    IR Motion Deblurring

    Authors: Nisha Varghese, Mahesh Mohan M. R., A. N. Rajagopalan

    Abstract: Camera gimbal systems are important in various air- or water-borne systems for applications such as navigation, target tracking, security and surveillance. A higher steering rate (rotation angle per second) of the gimbal is preferable for real-time applications since a given field-of-view (FOV) can be revisited within a short period of time. However, due to relative motion between the gimbal and scene…

    Submitted 23 November, 2021; originally announced November 2021.

  27. arXiv:2109.09620  [pdf, other]

    cs.RO

    NASA Space Robotics Challenge 2 Qualification Round: An Approach to Autonomous Lunar Rover Operations

    Authors: Cagri Kilic, Bernardo Martinez R. Jr., Christopher A. Tatsch, Jared Beard, Jared Strader, Shounak Das, Derek Ross, Yu Gu, Guilherme A. S. Pereira, Jason N. Gross

    Abstract: Plans for establishing a long-term human presence on the Moon will require substantial increases in robot autonomy and multi-robot coordination to support establishing a lunar outpost. To achieve these objectives, algorithm design choices for the software developments need to be tested and validated for expected scenarios such as autonomous in-situ resource utilization (ISRU), localization in chal…

    Submitted 20 September, 2021; originally announced September 2021.

    Comments: 15 pages, 15 figures, 5 tables. Accepted for publication in IEEE Aerospace and Electronic Systems Magazine, 2021. (preprint version)

  28. Heterogeneously-Distributed Joint Radar Communications: Bayesian Resource Allocation

    Authors: Linlong Wu, Kumar Vijay Mishra, Bhavani Shankar M. R., Björn Ottersten

    Abstract: Due to spectrum scarcity, the coexistence of radar and wireless communication has gained substantial research interest recently. Among many scenarios, the heterogeneously-distributed joint radar-communication system is promising due to its flexibility and compatibility with existing architectures. In this paper, we focus on a heterogeneous radar and communication network (HRCN), which consists of var…

    Submitted 4 March, 2022; v1 submitted 29 July, 2021; originally announced July 2021.

  29. arXiv:2104.00359  [pdf, other]

    cs.CV

    Efficient and Differentiable Shadow Computation for Inverse Problems

    Authors: Linjie Lyu, Marc Habermann, Lingjie Liu, Mallikarjun B R, Ayush Tewari, Christian Theobalt

    Abstract: Differentiable rendering has received increasing interest for image-based inverse problems. It can benefit traditional optimization-based solutions to inverse problems, but also allows for self-supervision of learning-based approaches for which training data with ground truth annotation is hard to obtain. However, existing differentiable renderers either do not model visibility of the light source…

    Submitted 1 April, 2021; originally announced April 2021.

  30. arXiv:2103.07658  [pdf, other]

    cs.CV cs.GR cs.LG

    PhotoApp: Photorealistic Appearance Editing of Head Portraits

    Authors: Mallikarjun B R, Ayush Tewari, Abdallah Dib, Tim Weyrich, Bernd Bickel, Hans-Peter Seidel, Hanspeter Pfister, Wojciech Matusik, Louis Chevallier, Mohamed Elgharib, Christian Theobalt

    Abstract: Photorealistic editing of portraits is a challenging task as humans are very sensitive to inconsistencies in faces. We present an approach for high-quality intuitive editing of the camera viewpoint and scene illumination in a portrait image. This requires our method to capture and control the full reflectance field of the person in the image. Most editing approaches rely on supervised learning usi…

    Submitted 13 May, 2021; v1 submitted 13 March, 2021; originally announced March 2021.

    Comments: http://gvv.mpi-inf.mpg.de/projects/PhotoApp/

  31. Design and implementation of Energy Efficient Lightweight Encryption (EELWE) algorithm for medical applications

    Authors: Radhika Rani Chintala, Narasinga Rao M R, Somu Venkateswarlu

    Abstract: Proportional to the growth in the usage of Human Sensor Networks (HSN), the volume of data exchanged between sensor devices is increasing at a rapid pace. In this paper, we propose an Energy Efficient Lightweight Encryption (EELWE) algorithm for providing confidentiality of data at the sensor level, particularly suitable for resource-constrained environments. Results obtained have pro…

    Submitted 8 March, 2021; originally announced March 2021.

    Comments: 11 pages

  32. arXiv:2101.00875  [pdf]

    cs.RO physics.app-ph

    Design and Development of Robots End Effector Test Rig

    Authors: Josephine Selvarani Ruth D, Saniya Zeba, Vibha M R, Rokesh Laishram, Gauthama Anand

    Abstract: A Test Rig for end-effectors of a robot is designed such that it achieves a prismatic motion in x-y-z axes for grasping an object. It is a structure designed with a compact combination of sensors and actuators. Sensors are used for detecting the presence, position and disturbance of the target workpiece or any object, and actuators with a motor driving system are meant for controlling and moving the mechanism…

    Submitted 4 January, 2021; originally announced January 2021.

  33. arXiv:2010.15668  [pdf]

    cs.CY cs.CR cs.SI

    PeopleXploit -- A hybrid tool to collect public data

    Authors: Arjun Anand V, Buvanasri A K, Meenakshi R, Karthika S, Ashok Kumar Mohan

    Abstract: This paper introduces the concept of Open Source Intelligence (OSINT) as an important application in intelligent profiling of individuals. With a variety of tools available, significant data can be obtained on an individual by analyzing his/her internet presence, but all of this comes at the cost of low relevance. To increase the relevance score in profiling, PeopleXploit is bein…

    Submitted 28 October, 2020; originally announced October 2020.

    Comments: 8 pages, 3 images, ICCCSP 2020

  34. arXiv:2010.01679  [pdf, other]

    cs.CV cs.AI cs.GR cs.LG cs.MM

    Learning Complete 3D Morphable Face Models from Images and Videos

    Authors: Mallikarjun B R, Ayush Tewari, Hans-Peter Seidel, Mohamed Elgharib, Christian Theobalt

    Abstract: Most 3D face reconstruction methods rely on 3D morphable models, which disentangle the space of facial deformations into identity geometry, expressions and skin reflectance. These models are typically learned from a limited number of 3D scans and thus do not generalize well across different identities and expressions. We present the first approach to learn complete 3D models of face identity geome…

    Submitted 4 October, 2020; originally announced October 2020.

    Comments: Project Page - https://gvv.mpi-inf.mpg.de/projects/LeMoMo

  35. arXiv:2009.09485  [pdf, other]

    cs.CV cs.GR

    PIE: Portrait Image Embedding for Semantic Control

    Authors: Ayush Tewari, Mohamed Elgharib, Mallikarjun B R., Florian Bernard, Hans-Peter Seidel, Patrick Pérez, Michael Zollhöfer, Christian Theobalt

    Abstract: Editing of portrait images is a very popular and important research topic with a large variety of applications. For ease of use, control should be provided via a semantically meaningful parameterization that is akin to computer animation controls. The vast majority of existing techniques do not provide such intuitive and fine-grained control, or only enable coarse editing of a single isolated cont…

    Submitted 20 September, 2020; originally announced September 2020.

    Comments: To appear in SIGGRAPH Asia 2020. Project webpage: https://gvv.mpi-inf.mpg.de/projects/PIE/

  36. arXiv:2008.10247  [pdf, other]

    cs.CV cs.GR cs.LG

    Monocular Reconstruction of Neural Face Reflectance Fields

    Authors: Mallikarjun B R., Ayush Tewari, Tae-Hyun Oh, Tim Weyrich, Bernd Bickel, Hans-Peter Seidel, Hanspeter Pfister, Wojciech Matusik, Mohamed Elgharib, Christian Theobalt

    Abstract: The reflectance field of a face describes the reflectance properties responsible for complex lighting effects including diffuse, specular, inter-reflection and self shadowing. Most existing methods for estimating the face reflectance from a monocular image assume faces to be diffuse with very few approaches adding a specular component. This still leaves out important perceptual aspects of reflecta…

    Submitted 24 August, 2020; originally announced August 2020.

    Comments: Project page - http://gvv.mpi-inf.mpg.de/projects/FaceReflectanceFields/

  37. arXiv:2006.08870  [pdf, other]

    cs.CL cs.SD eess.AS

    End-to-End Code Switching Language Models for Automatic Speech Recognition

    Authors: Ahan M. R., Shreyas Sunil Kulkarni

    Abstract: In this paper, we focus on code-switched text, one of the most common occurrences in bilingual communities across the world. Due to the discrepancies in the extraction of code-switched text from an Automated Speech Recognition (ASR) module, and thereby extracting the monolingual text from the code-switched text, we propose an approach for extracting monolingual text using Deep B…

    Submitted 15 June, 2020; originally announced June 2020.

    Comments: 5 pages, 2 figures, To appear in the proceedings of First Workshop on Speech Technologies for Code-switching in Multilingual Communities 2020

  38. arXiv:2005.06843  [pdf, ps, other]

    eess.SP cs.IT

    Joint User Grouping, Scheduling, and Precoding for Multicast Energy Efficiency in Multigroup Multicast Systems

    Authors: Ashok Bandi, Bhavani Shankar Mysore R, Symeon Chatzinotas, Björn Ottersten

    Abstract: This paper studies the joint design of user grouping, scheduling (or admission control) and precoding to optimize energy efficiency (EE) for multigroup multicast scenarios in single-cell multiuser MISO downlink channels. Noticing that the existing definition of EE fails to account for group sizes, a new metric called multicast energy efficiency (MEE) is proposed. In this context, the joint design…

    Submitted 14 May, 2020; originally announced May 2020.

  39. arXiv:2003.09968  [pdf, other]

    cs.RO

    Team Mountaineers Space Robotic Challenge Phase-2 Qualification Round Preparation Report

    Authors: Cagri Kilic, Christopher A. Tatsch, Bernardo Martinez R. Jr, Jared J. Beard, Derek W. Ross, Jason N. Gross

    Abstract: Team Mountaineers launched efforts on the NASA Space Robotics Challenge Phase-2 (SRC2). The challenge will be held on the lunar terrain with virtual robotic platforms to establish an in-situ resource utilization process. In this report, we provide an overview of a simulation environment, a virtual mobile robot, and a software architecture that was created by Team Mountaineers in order to prepare f…

    Submitted 22 March, 2020; originally announced March 2020.

    Comments: 6 pages, 5 figures, technical report

  40. arXiv:1912.01990  [pdf, other]

    cs.DS cs.DM

    On Computing the Hamiltonian Index of Graphs

    Authors: Geevarghese Philip, Rani M. R., Subashini R

    Abstract: The $r$-th iterated line graph $L^{r}(G)$ of a graph $G$ is defined by: (i) $L^{0}(G) = G$ and (ii) $L^{r}(G) = L(L^{r-1}(G))$ for $r > 0$, where $L(G)$ denotes the line graph of $G$. The Hamiltonian Index $h(G)$ of $G$ is the smallest $r$ such that $L^{r}(G)$ has a Hamiltonian cycle. Checking if $h(G) = k$ is NP-hard for any fixed integer $k \geq 0$ even for subcubic graphs $G$. We study the p…

    Submitted 4 December, 2019; originally announced December 2019.

    Comments: 46 pages

  41. arXiv:1902.02162  [pdf]

    cs.CL cs.AI

    Adaptive Artificial Intelligent Q&A Platform

    Authors: M. R, Akram, C. P, Singhabahu, M. S. M Saad, P, Deleepa, Anupiya, Nugaliyadde, Yashas, Mallawarachchi

    Abstract: The paper presents an approach to build a question and answer system that is capable of processing the information in a large dataset and allows the user to gain knowledge from this dataset by asking questions in natural language form. Key content of this research covers four dimensions: Corpus Preprocessing, Question Preprocessing, Deep Neural Network for Answer Extraction and Answer Ge…

    Submitted 19 January, 2019; originally announced February 2019.

  42. RNNSecureNet: Recurrent neural networks for Cyber security use-cases

    Authors: Mohammed Harun Babu R, Vinayakumar R, Soman KP

    Abstract: The recurrent neural network (RNN) is an effective neural network for solving very complex supervised and unsupervised tasks. RNNs have brought significant improvements in fields such as natural language processing, speech processing, computer vision and multiple other domains. This paper deals with RNN applications in different use cases such as Incident Detection, Fraud Detection, and Android Malware Cl…

    Submitted 5 January, 2019; originally announced January 2019.

    Comments: 12 pages. arXiv admin note: text overlap with arXiv:1812.03519

  43. arXiv:1812.06292  [pdf]

    cs.CR cs.AI cs.LG

    A short review on Applications of Deep learning for Cyber security

    Authors: Mohammed Harun Babu R, Vinayakumar R, Soman KP

    Abstract: Deep learning is an advanced model of traditional machine learning. It can extract optimal feature representations from raw input samples. It has been applied to various use cases in cyber security such as intrusion detection, malware classification, Android malware detection, spam and phishing detection, and binary analysis. This paper outlines the survey of all the works…

    Submitted 29 January, 2019; v1 submitted 15 December, 2018; originally announced December 2018.

    Comments: 15 pages

  44. arXiv:1811.08342  [pdf, other]

    cs.CV

    Multi-layer Pruning Framework for Compressing Single Shot MultiBox Detector

    Authors: Pravendra Singh, Manikandan R, Neeraj Matiyali, Vinay P. Namboodiri

    Abstract: We propose a framework for compressing the state-of-the-art Single Shot MultiBox Detector (SSD). The framework addresses compression in the following stages: Sparsity Induction, Filter Selection, and Filter Pruning. In the Sparsity Induction stage, the object detector model is sparsified via an improved global threshold. In the Filter Selection & Pruning stage, we select and remove filters using sparsity…

    Submitted 20 November, 2018; originally announced November 2018.

    Comments: IEEE Winter Conference on Applications of Computer Vision (WACV), 2019

  45. arXiv:1802.06185  [pdf, other]

    cs.CL cs.IR

    Building a Word Segmenter for Sanskrit Overnight

    Authors: Vikas Reddy, Amrith Krishna, Vishnu Dutt Sharma, Prateek Gupta, Vineeth M R, Pawan Goyal

    Abstract: There is an abundance of digitised texts available in Sanskrit. However, the word segmentation task in such texts is challenging due to the issue of 'Sandhi'. In Sandhi, words in a sentence often fuse together to form a single chunk of text, where the word delimiter vanishes and sounds at the word boundaries undergo transformations, which is also reflected in the written text. Here, we propose an…

    Submitted 16 February, 2018; originally announced February 2018.

    Comments: The work is accepted at LREC 2018, Miyazaki, Japan

  46. arXiv:1710.04154  [pdf]

    cs.CY

    Pengaruh Perangkat Server Terhadap Kualitas Pengontrolan Jarak Jauh Melalui Internet

    Authors: Gunawan, Imam Muslim R

    Abstract: The internet greatly assists people in improving their quality of life. Almost all areas of human life can be accessed using the internet. People are aided by the internet, which provides all sorts of information that they need. Along with the development of internet network infrastructure, remote control has begun to shift to using the internet. This study uses notebooks and Raspberry Pi servers to find o…

    Submitted 1 October, 2017; originally announced October 2017.

    Comments: in Indonesian

    Journal ref: Jurnal Teknik Informatika Prima, ISSN 2088-6101, p. 45, vol. 7, no. 2, October 2014

  47. arXiv:1504.07865  [pdf]

    cs.CE astro-ph.IM cs.LG

    ASTROMLSKIT: A New Statistical Machine Learning Toolkit: A Platform for Data Analytics in Astronomy

    Authors: Snehanshu Saha, Surbhi Agrawal, Manikandan. R, Kakoli Bora, Swati Routh, Anand Narasimhamurthy

    Abstract: Astroinformatics is a new impact area in the world of astronomy, occasionally called the final frontier, where several astrophysicists, statisticians and computer scientists work together to tackle various data intensive astronomical problems. Exponential growth in the data volume and increased complexity of the data add difficult questions to the existing challenges. Classical problems in As…

    Submitted 29 April, 2015; originally announced April 2015.

    Comments: Habitability Catalog (HabCat), Supernova classification, data analysis, Astroinformatics, Machine learning, ASTROMLS toolkit, Naïve Bayes, SVD, PCA, Random Forest, SVM, Decision Tree, LDA

  48. arXiv:1404.6544  [pdf, other]

    cs.IT

    Interference Mitigating Satellite Broadcast Receiver using Reduced Complexity List-Based Detection in Correlated Noise

    Authors: Zohair Abu-Shaban, Hani Mehrpouyan, Bhavani Shankar M. R., Bjorn Ottersten

    Abstract: The recent commercial trends towards using smaller dish antennas for satellite receivers, and the growing density of broadcasting satellites, necessitate the application of robust adjacent satellite interference (ASI) cancellation schemes. This orbital density growth along with the wider beamwidth of a smaller dish have imposed an overloaded scenario at the satellite receiver, where the number of…

    Submitted 25 April, 2014; originally announced April 2014.

  49. arXiv:1404.4443  [pdf, other]

    cs.IT

    Enhanced List-Based Group-Wise Overloaded Receiver with Application to Satellite Reception

    Authors: Zohair Abu-Shaban, Bhavani Shankar M. R, Hani Mehrpouyan, Bjorn Ottersten

    Abstract: The market trends towards the use of smaller dish antennas for TV satellite receivers, as well as the growing density of broadcasting satellites in orbit require the application of robust adjacent satellite interference (ASI) cancellation algorithms at the receivers. The wider beamwidth of a small size dish and the growing number of satellites in orbit impose an overloaded scenario, i.e., a scenar…

    Submitted 17 April, 2014; originally announced April 2014.

  50. A Survey on Mobile Data Gathering in Wireless Sensor Networks - Bounded Relay

    Authors: Ms. Rubia. R, Mr. SivanArulSelvan

    Abstract: Most wireless sensor networks consist of static sensors, which can be deployed in a wide environment for monitoring applications. While transmitting data from the source to a static sink, the energy consumption of the sensor node is high. This results in a reduced network lifetime. Some WSN architectures have been proposed based on Mobile Elements. There is a large number of…

    Submitted 6 February, 2014; originally announced February 2014.

    Comments: 4 pages, 1 figure, "Published with International Journal of Engineering Trends and Technology (IJETT)"

    Journal ref: IJETT, 7(5),205-208,2014 published by seventh sense research group