Showing 1–50 of 61 results for author: Jose, J

  1. arXiv:2406.18579  [pdf, other]

    cs.CV cs.IR

    Hire: Hybrid-modal Interaction with Multiple Relational Enhancements for Image-Text Matching

    Authors: Xuri Ge, Fuhai Chen, Songpei Xu, Fuxiang Tao, Jie Wang, Joemon M. Jose

    Abstract: Image-text matching (ITM) is a fundamental problem in computer vision. The key issue lies in jointly learning the visual and textual representation to estimate their similarity accurately. Most existing methods focus on feature enhancement within modality or feature interaction across modalities, which, however, neglects the contextual information of the object representation based on the inter-ob…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 22 pages, 5 figures, 6 tables, the extension of CMSEI in WACV23, and submitted to ACM TIST. arXiv admin note: text overlap with arXiv:2210.08908

  2. arXiv:2405.16701  [pdf, other]

    cs.CV

    Detail-Enhanced Intra- and Inter-modal Interaction for Audio-Visual Emotion Recognition

    Authors: Tong Shi, Xuri Ge, Joemon M. Jose, Nicolas Pugeault, Paul Henderson

    Abstract: Capturing complex temporal relationships between video and audio modalities is vital for Audio-Visual Emotion Recognition (AVER). However, existing methods lack attention to local details, such as facial state changes between video frames, which can reduce the discriminability of features and thus lower recognition accuracy. In this paper, we propose a Detail-Enhanced Intra- and Inter-modal Intera…

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: Submitted to 27th International Conference of Pattern Recognition (ICPR 2024)

  3. 3SHNet: Boosting Image-Sentence Retrieval via Visual Semantic-Spatial Self-Highlighting

    Authors: Xuri Ge, Songpei Xu, Fuhai Chen, Jie Wang, Guoxin Wang, Shan An, Joemon M. Jose

    Abstract: In this paper, we propose a novel visual Semantic-Spatial Self-Highlighting Network (termed 3SHNet) for high-precision, high-efficiency and high-generalization image-sentence retrieval. 3SHNet highlights the salient identification of prominent objects and their spatial locations within the visual modality, thus allowing the integration of visual semantics-spatial interactions and maintaining indep…

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: Accepted by Information Processing and Management (IP&M), 10 pages, 9 figures and 8 tables

    Journal ref: Information Processing & Management, Volume 61, Issue 4, July 2024, 103716

  4. IISAN: Efficiently Adapting Multimodal Representation for Sequential Recommendation with Decoupled PEFT

    Authors: Junchen Fu, Xuri Ge, Xin Xin, Alexandros Karatzoglou, Ioannis Arapakis, Jie Wang, Joemon M. Jose

    Abstract: Multimodal foundation models are transformative in sequential recommender systems, leveraging powerful representation learning capabilities. While Parameter-efficient Fine-tuning (PEFT) is commonly used to adapt foundation models for recommendation tasks, most research prioritizes parameter efficiency, often overlooking critical factors like GPU memory efficiency and training speed. Addressing thi…

    Submitted 11 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted by SIGIR2024

  5. arXiv:2404.01618  [pdf, other]

    cs.RO

    Multi-Robot Collaborative Navigation with Formation Adaptation

    Authors: Zihao Deng, Peng Gao, Williard Joshua Jose, Hao Zhang

    Abstract: Multi-robot collaborative navigation is an essential ability where teamwork and synchronization are key. In complex and uncertain environments, adaptive formation is vital, as rigid formations prove to be inadequate. The ability of robots to dynamically adjust their formation enables navigation through unpredictable spaces, maintaining cohesion, and effectively responding to environmental challen…

    Submitted 1 April, 2024; originally announced April 2024.

  6. arXiv:2403.16948  [pdf, other]

    cs.IR

    Reinforcement Learning-based Recommender Systems with Large Language Models for State Reward and Action Modeling

    Authors: Jie Wang, Alexandros Karatzoglou, Ioannis Arapakis, Joemon M. Jose

    Abstract: Reinforcement Learning (RL)-based recommender systems have demonstrated promising performance in meeting user expectations by learning to make accurate next-item recommendations from historical user-item interactions. However, existing offline RL-based sequential recommendation methods face the challenge of obtaining effective user feedback from the environment. Effectively modeling the user state…

    Submitted 25 March, 2024; originally announced March 2024.

  7. arXiv:2403.13690  [pdf, other]

    cs.SE cs.CV cs.HC

    MotorEase: Automated Detection of Motor Impairment Accessibility Issues in Mobile App UIs

    Authors: Arun Krishnavajjala, SM Hasan Mansur, Justin Jose, Kevin Moran

    Abstract: Recent research has begun to examine the potential of automatically finding and fixing accessibility issues that manifest in software. However, while recent work makes important progress, it has generally been skewed toward identifying issues that affect users with certain disabilities, such as those with visual or hearing impairments. However, there are other groups of users with different types…

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: Accepted to ICSE 2024 Research Track, 13 pages

  8. arXiv:2402.15276  [pdf, other]

    cs.IR cs.AI cs.CV

    CFIR: Fast and Effective Long-Text To Image Retrieval for Large Corpora

    Authors: Zijun Long, Xuri Ge, Richard Mccreadie, Joemon Jose

    Abstract: Text-to-image retrieval aims to find the relevant images based on a text query, which is important in various use-cases, such as digital libraries, e-commerce, and multimedia databases. Although Multimodal Large Language Models (MLLMs) demonstrate state-of-the-art performance, they exhibit limitations in handling large-scale, diverse, and ambiguous real-world needs of retrieval, due to the computa…

    Submitted 2 April, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

  9. arXiv:2402.06194  [pdf, other]

    cs.DC

    SuperBench: Improving Cloud AI Infrastructure Reliability with Proactive Validation

    Authors: Yifan Xiong, Yuting Jiang, Ziyue Yang, Lei Qu, Guoshuai Zhao, Shuguang Liu, Dong Zhong, Boris Pinzur, Jie Zhang, Yang Wang, Jithin Jose, Hossein Pourreza, Jeff Baxter, Kushal Datta, Prabhat Ram, Luke Melton, Joe Chau, Peng Cheng, Yongqiang Xiong, Lidong Zhou

    Abstract: Reliability in cloud AI infrastructure is crucial for cloud service providers, prompting the widespread use of hardware redundancies. However, these redundancies can inadvertently lead to hidden degradation, so-called "gray failure", for AI workloads, significantly affecting end-to-end performance and concealing performance issues, which complicates root cause analysis for failures and regressions…

    Submitted 7 June, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: USENIX ATC '24

  10. arXiv:2308.13976  [pdf, other]

    cs.LG cs.AI

    Label Denoising through Cross-Model Agreement

    Authors: Yu Wang, Xin Xin, Zaiqiao Meng, Joemon Jose, Fuli Feng

    Abstract: Learning from corrupted labels is very common in real-world machine-learning applications. Memorizing such noisy labels could affect the learning of the model, leading to sub-optimal performances. In this work, we propose a novel framework to learn robust machine-learning models from noisy labels. Through an empirical study, we find that different models make relatively similar predictions on clea…

    Submitted 18 December, 2023; v1 submitted 26 August, 2023; originally announced August 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2105.09605

  11. arXiv:2305.11081  [pdf, other]

    cs.IR

    Contrastive State Augmentations for Reinforcement Learning-Based Recommender Systems

    Authors: Zhaochun Ren, Na Huang, Yidan Wang, Pengjie Ren, Jun Ma, Jiahuan Lei, Xinlei Shi, Hengliang Luo, Joemon M Jose, Xin Xin

    Abstract: Learning reinforcement learning (RL)-based recommenders from historical user-item interaction sequences is vital to generate high-reward recommendations and improve long-term cumulative benefits. However, existing RL recommendation methods encounter difficulties (i) to estimate the value functions for states which are not contained in the offline training data, and (ii) to learn effective state re…

    Submitted 18 May, 2023; originally announced May 2023.

  12. DRackSim: Simulator for Rack-scale Memory Disaggregation

    Authors: Amit Puri, John Jose, Tamarapalli Venkatesh, Vijaykrishnan Narayanan

    Abstract: Memory disaggregation has emerged as an alternative to traditional server architecture in data centers. This paper introduces DRackSim, a simulation infrastructure to model rack-scale hardware disaggregated memory. DRackSim models multiple compute nodes, memory pools, and a rack-scale interconnect similar to GenZ. An application-level simulation approach simulates an x86 out-of-order multi-core pr…

    Submitted 19 September, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

  13. Improving Implicit Feedback-Based Recommendation through Multi-Behavior Alignment

    Authors: Xin Xin, Xiangyuan Liu, Hanbing Wang, Pengjie Ren, Zhumin Chen, Jiahuan Lei, Xinlei Shi, Hengliang Luo, Joemon Jose, Maarten de Rijke, Zhaochun Ren

    Abstract: Recommender systems that learn from implicit feedback often use large volumes of a single type of implicit user feedback, such as clicks, to enhance the prediction of sparse target behavior such as purchases. Using multiple types of implicit user feedback for such target behavior prediction purposes is still an open question. Existing studies that attempted to learn from multiple types of user beh…

    Submitted 9 May, 2023; originally announced May 2023.

  14. Design and Evaluation of a Rack-Scale Disaggregated Memory Architecture For Data Centers

    Authors: Amit Puri, John Jose, Tamarapalli Venkatesh

    Abstract: Memory disaggregation is being considered as a strong alternative to traditional architecture to deal with the memory under-utilization in data centers. Disaggregated memory can adapt to dynamically changing memory requirements for the data center applications like data analytics, big data, etc., that require in-memory processing. However, such systems can face high remote memory access latency du…

    Submitted 8 April, 2023; v1 submitted 11 March, 2023; originally announced March 2023.

  15. arXiv:2211.10155  [pdf, other]

    cs.CV cs.AI

    Structured Pruning Adapters

    Authors: Lukas Hedegaard, Aman Alok, Juby Jose, Alexandros Iosifidis

    Abstract: Adapters are a parameter-efficient alternative to fine-tuning, which augment a frozen base network to learn new tasks. Yet, the inference of the adapted model is often slower than the corresponding fine-tuned model. To improve on this, we propose Structured Pruning Adapters (SPAs), a family of compressing, task-switching network adapters, that accelerate and specialize networks using tiny paramete…

    Submitted 2 February, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

    Comments: 11 pages, 6 figures, 2 tables

  16. arXiv:2211.00718  [pdf]

    cs.CV cs.AI

    SleepyWheels: An Ensemble Model for Drowsiness Detection leading to Accident Prevention

    Authors: Jomin Jose, Andrew J, Kumudha Raimond, Shweta Vincent

    Abstract: Around 40 percent of accidents related to driving on highways in India occur due to the driver falling asleep behind the steering wheel. Several types of research are ongoing to detect driver drowsiness but they suffer from the complexity and cost of the models. In this paper, SleepyWheels a revolutionary method that uses a lightweight neural network in conjunction with facial landmark identificat…

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: 20 pages

  17. arXiv:2210.08908  [pdf, other]

    cs.CV

    Cross-modal Semantic Enhanced Interaction for Image-Sentence Retrieval

    Authors: Xuri Ge, Fuhai Chen, Songpei Xu, Fuxiang Tao, Joemon M. Jose

    Abstract: Image-sentence retrieval has attracted extensive research attention in multimedia and computer vision due to its promising application. The key issue lies in jointly learning the visual and textual representation to accurately estimate their similarity. To this end, the mainstream schema adopts an object-word based attention to calculate their relevance scores and refine their interactive represen…

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: accepted to WACV 2023

  18. arXiv:2208.09070  [pdf]

    cs.AR cs.CR

    Electronic, Wireless, and Photonic Network-on-Chip Security: Challenges and Countermeasures

    Authors: Sudeep Pasricha, John Jose, Sujay Deb

    Abstract: Networks-on-chips (NoCs) are an integral part of emerging manycore computing chips. They play a key role in facilitating communication among processing cores and between cores and memory. To meet the aggressive performance and energy-efficiency targets of machine learning and big data applications, NoCs have been evolving to leverage emerging paradigms such as silicon photonics and wireless commun…

    Submitted 18 August, 2022; originally announced August 2022.

  19. arXiv:2206.06190  [pdf, other]

    cs.IR

    TransRec: Learning Transferable Recommendation from Mixture-of-Modality Feedback

    Authors: Jie Wang, Fajie Yuan, Mingyue Cheng, Joemon M. Jose, Chenyun Yu, Beibei Kong, Xiangnan He, Zhijin Wang, Bo Hu, Zang Li

    Abstract: Learning large-scale pre-trained models on broad-ranging data and then transferring them to a wide range of target tasks has become the de facto paradigm in many machine learning (ML) communities. Such big models are not only strong performers in practice but also offer a promising way to break out of the task-specific modeling restrictions, thereby enabling task-agnostic and unified ML systems. However, s…

    Submitted 3 November, 2022; v1 submitted 13 June, 2022; originally announced June 2022.

  20. arXiv:2206.03382  [pdf, other]

    cs.DC cs.CL cs.CV

    Tutel: Adaptive Mixture-of-Experts at Scale

    Authors: Changho Hwang, Wei Cui, Yifan Xiong, Ziyue Yang, Ze Liu, Han Hu, Zilong Wang, Rafael Salas, Jithin Jose, Prabhat Ram, Joe Chau, Peng Cheng, Fan Yang, Mao Yang, Yongqiang Xiong

    Abstract: Sparsely-gated mixture-of-experts (MoE) has been widely adopted to scale deep learning models to trillion-plus parameters with fixed computational cost. The algorithmic performance of MoE relies on its token routing mechanism that forwards each input token to the right sub-models or experts. While token routing dynamically determines the amount of expert workload at runtime, existing systems suffe…

    Submitted 5 June, 2023; v1 submitted 7 June, 2022; originally announced June 2022.

  21. arXiv:2204.04993  [pdf, other]

    eess.IV cs.CV cs.LG

    Ischemic Stroke Lesion Segmentation Using Adversarial Learning

    Authors: Mobarakol Islam, N Rajiv Vaidyanathan, V Jeya Maria Jose, Hongliang Ren

    Abstract: Ischemic stroke occurs through a blockage of clogged blood vessels supplying blood to the brain. Segmentation of the stroke lesion is vital to improve diagnosis, outcome assessment and treatment planning. In this work, we propose a segmentation model with adversarial learning for ischemic lesion segmentation. We adopt U-Net with skip connection and dropout as segmentation baseline network and a fu…

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: Published in MICCAI ISLES Challenge 2018

  22. MGRR-Net: Multi-level Graph Relational Reasoning Network for Facial Action Units Detection

    Authors: Xuri Ge, Joemon M. Jose, Songpei Xu, Xiao Liu, Hu Han

    Abstract: The Facial Action Coding System (FACS) encodes the action units (AUs) in facial images, which has attracted extensive research attention due to its wide use in facial expression analysis. Many methods that perform well on automatic facial action unit (AU) detection primarily focus on modeling various types of AU relations between corresponding local muscle areas, or simply mining global attention-…

    Submitted 22 May, 2024; v1 submitted 4 April, 2022; originally announced April 2022.

    Comments: 20 pages, 5 figures, 8 tables

    Journal ref: ACM Transactions on Intelligent Systems and Technology, 2024

  23. arXiv:2203.01800  [pdf, other]

    cs.CV

    Automatic Facial Paralysis Estimation with Facial Action Units

    Authors: Xuri Ge, Joemon M. Jose, Pengcheng Wang, Arunachalam Iyer, Xiao Liu, Hu Han

    Abstract: Facial palsy is unilateral facial nerve weakness or paralysis of rapid onset with unknown causes. Automatically estimating facial palsy severeness can be helpful for the diagnosis and treatment of people suffering from it across the world. In this work, we develop and experiment with a novel model for estimating facial palsy severity. For this, an effective Facial Action Units (AU) detection techn…

    Submitted 30 March, 2022; v1 submitted 3 March, 2022; originally announced March 2022.

    Comments: 12 pages, 5 figures, resubmitted to IEEE Transactions on Affective Computing

  24. arXiv:2111.03474  [pdf, other]

    cs.LG cs.IR

    Supervised Advantage Actor-Critic for Recommender Systems

    Authors: Xin Xin, Alexandros Karatzoglou, Ioannis Arapakis, Joemon M. Jose

    Abstract: Casting session-based or sequential recommendation as reinforcement learning (RL) through reward signals is a promising research direction towards recommender systems (RS) that maximize cumulative profits. However, the direct use of RL algorithms in the RS setting is impractical due to challenges like off-policy training, huge action spaces and lack of sufficient reward signals. Recent RL approach…

    Submitted 5 November, 2021; originally announced November 2021.

    Comments: 9 pages, 4 figures, In Proceedings of the 15th ACM International Conference on Web Search and Data Mining (WSDM '22), February 21-25, 2022, Phoenix, Arizona. arXiv admin note: text overlap with arXiv:2006.05779

  25. arXiv:2110.05688  [pdf]

    cs.HC cs.CV cs.CY cs.LG

    Inclusive Design: Accessibility Settings for People with Cognitive Disabilities

    Authors: Trae Waggoner, Julia Ann Jose, Ashwin Nair, Sudarsan Manikandan

    Abstract: The advancement of technology has progressed faster than any other field in the world and with the development of these new technologies, it is important to make sure that these tools can be used by everyone, including people with disabilities. Accessibility options in computing devices help ensure that everyone has the same access to advanced technologies. Unfortunately, for those who require mor…

    Submitted 11 October, 2021; originally announced October 2021.

  26. arXiv:2110.05661  [pdf]

    cs.SI cs.LG

    BotNet Detection on Social Media

    Authors: Aniket Chandrakant Devle, Julia Ann Jose, Abhay Shrinivas Saraswathula, Shubham Mehta, Siddhant Srivastava, Sirisha Kona, Sudheera Daggumalli

    Abstract: As our reliance on social media platforms and web services increases day by day, exploiters view these platforms as an opportunity to manipulate our thoughts and actions. These platforms have become an open playground for social bot accounts. Social bots not only learn human conversations, manners, and presence but also manipulate public opinion, act as scammers, manipulate stock markets, and so on.…

    Submitted 27 November, 2021; v1 submitted 11 October, 2021; originally announced October 2021.

  27. Structured Multi-modal Feature Embedding and Alignment for Image-Sentence Retrieval

    Authors: Xuri Ge, Fuhai Chen, Joemon M. Jose, Zhilong Ji, Zhongqin Wu, Xiao Liu

    Abstract: The current state-of-the-art image-sentence retrieval methods implicitly align the visual-textual fragments, like regions in images and words in sentences, and adopt attention modules to highlight the relevance of cross-modal semantic correspondences. However, the retrieval performance remains unsatisfactory due to a lack of consistent representation in both semantics and structural spaces. In thi…

    Submitted 5 August, 2021; originally announced August 2021.

    Comments: 9 pages, 7 figures, Accepted by ACM MM 2021

  28. arXiv:2106.13271  [pdf, ps, other]

    cs.CY

    On Fairness and Interpretability

    Authors: Deepak P, Sanil V, Joemon M. Jose

    Abstract: Ethical AI spans a gamut of considerations. Among these, the most popular ones, fairness and interpretability, have remained largely distinct in technical pursuits. We discuss and elucidate the differences between fairness and interpretability across a variety of dimensions. Further, we develop two principles-based frameworks towards developing ethical AI for the future that embrace aspects of bot…

    Submitted 24 June, 2021; originally announced June 2021.

    Comments: in IJCAI 2021 Workshop on AI for Social Good, January 2021. [ Ref: https://crcs.seas.harvard.edu/publications/fairness-and-interpretability ]

  29. Learning Robust Recommenders through Cross-Model Agreement

    Authors: Yu Wang, Xin Xin, Zaiqiao Meng, Xiangnan He, Joemon Jose, Fuli Feng

    Abstract: Learning from implicit feedback is one of the most common cases in the application of recommender systems. Generally speaking, interacted examples are considered as positive while negative examples are sampled from uninteracted ones. However, noisy examples are prevalent in real-world implicit feedback. A noisy positive example could be interacted but it actually leads to negative user preference.…

    Submitted 13 March, 2022; v1 submitted 20 May, 2021; originally announced May 2021.

    Comments: 12 pages, 23 figures

    Journal ref: World Wide Web Conference 2022

  30. arXiv:2104.00985  [pdf, other]

    eess.IV cs.CV

    Brain Tumor Segmentation and Survival Prediction using 3D Attention UNet

    Authors: Mobarakol Islam, Vibashan VS, V Jeya Maria Jose, Navodini Wijethilake, Uppal Utkarsh, Hongliang Ren

    Abstract: In this work, we develop an attention convolutional neural network (CNN) to segment brain tumors from Magnetic Resonance Images (MRI). Further, we predict the survival rate using various machine learning methods. We adopt a 3D UNet architecture and integrate channel and spatial attention with the decoder network to perform segmentation. For survival prediction, we extract some novel radiomic featu…

    Submitted 2 April, 2021; originally announced April 2021.

    Comments: MICCAI-BrainLes Workshop

  31. arXiv:2104.00980  [pdf, other]

    eess.IV cs.CV cs.LG

    Glioma Prognosis: Segmentation of the Tumor and Survival Prediction using Shape, Geometric and Clinical Information

    Authors: Mobarakol Islam, V Jeya Maria Jose, Hongliang Ren

    Abstract: Segmentation of brain tumor from magnetic resonance imaging (MRI) is a vital process to improve diagnosis, treatment planning and to study the difference between subjects with tumor and healthy subjects. In this paper, we exploit a convolutional neural network (CNN) with hypercolumn technique to segment tumor from healthy brain tissue. Hypercolumn is the concatenation of a set of vectors which for…

    Submitted 2 April, 2021; originally announced April 2021.

    Comments: MICCAI-BrainLes Workshop

  32. arXiv:2101.02557  [pdf]

    q-bio.OT cs.LG

    Continuous Glucose Monitoring Prediction

    Authors: Julia Ann Jose, Trae Waggoner, Sudarsan Manikandan

    Abstract: Diabetes is one of the deadliest diseases in the world and affects nearly 10 percent of the global adult population. Fortunately, powerful new technologies allow for a consistent and reliable treatment plan for people with diabetes. One major development is a system called continuous blood glucose monitoring (CGM). In this review, we look at three different continuous meal detection algorithms tha…

    Submitted 4 January, 2021; originally announced January 2021.

  33. arXiv:2101.00055  [pdf, other]

    cs.AR

    Data Criticality in Multi-Threaded Applications: An Insight for Many-Core Systems

    Authors: Abhijit Das, John Jose, Prabhat Mishra

    Abstract: Multi-threaded applications are capable of exploiting the full potential of many-core systems. However, Network-on-Chip (NoC) based inter-core communication in many-core systems is responsible for 60-75% of the miss latency experienced by multi-threaded applications. Delay in the arrival of critical data at the requesting core severely hampers performance. This brief presents some interesting insi…

    Submitted 31 December, 2020; originally announced January 2021.

  34. arXiv:2009.13724  [pdf, other]

    cs.IR

    One Person, One Model, One World: Learning Continual User Representation without Forgetting

    Authors: Fajie Yuan, Guoxiao Zhang, Alexandros Karatzoglou, Joemon Jose, Beibei Kong, Yudong Li

    Abstract: Learning user representations is a vital technique toward effective user modeling and personalized recommender systems. Existing approaches often derive an individual set of model parameters for each task by training on separate data. However, the representation of the same user potentially has some commonalities, such as preference and personality, even in different tasks. As such, these separate…

    Submitted 9 May, 2021; v1 submitted 28 September, 2020; originally announced September 2020.

  35. arXiv:2008.10233  [pdf]

    eess.AS cs.SD

    AMRConvNet: AMR-Coded Speech Enhancement Using Convolutional Neural Networks

    Authors: Williard Joshua Jose

    Abstract: Speech is converted to digital signals using speech coding for efficient transmission. However, this often lowers the quality and bandwidth of speech. This paper explores the application of convolutional neural networks for Artificial Bandwidth Expansion (ABE) and speech enhancement on coded speech, particularly Adaptive Multi-Rate (AMR) used in 2G cellular phone calls. In this paper, we introduce…

    Submitted 24 August, 2020; originally announced August 2020.

    Comments: IEEE SMC 2020

  36. arXiv:2006.05779  [pdf, other]

    cs.LG cs.AI

    Self-Supervised Reinforcement Learning for Recommender Systems

    Authors: Xin Xin, Alexandros Karatzoglou, Ioannis Arapakis, Joemon M. Jose

    Abstract: In session-based or sequential recommendation, it is important to consider a number of factors like long-term user engagement, multiple types of user-item interactions such as clicks, purchases etc. The current state-of-the-art supervised approaches fail to model them appropriately. Casting sequential recommendation task as a reinforcement learning (RL) problem is a promising direction. A major co…

    Submitted 11 June, 2020; v1 submitted 10 June, 2020; originally announced June 2020.

    Comments: SIGIR2020

  37. arXiv:2006.04878  [pdf, other]

    eess.IV cs.CV

    KiU-Net: Towards Accurate Segmentation of Biomedical Images using Over-complete Representations

    Authors: Jeya Maria Jose, Vishwanath Sindagi, Ilker Hacihaliloglu, Vishal M. Patel

    Abstract: Due to its excellent performance, U-Net is the most widely used backbone architecture for biomedical image segmentation in recent years. However, in our studies, we observe that there is a considerable performance drop in the case of detecting smaller anatomical landmarks with blurred noisy boundaries. We analyze this issue in detail, and address it by proposing an over-complete architecture (…

    Submitted 8 July, 2020; v1 submitted 8 June, 2020; originally announced June 2020.

    Comments: Accepted at MICCAI 2020

  38. arXiv:2004.11460  [pdf, other]

    q-bio.QM cs.CY cs.LG stat.ML

    Development of a Machine Learning Model and Mobile Application to Aid in Predicting Dosage of Vitamin K Antagonists Among Indian Patients

    Authors: Amruthlal M, Devika S, Ameer Suhail P A, Aravind K Menon, Vignesh Krishnan, Alan Thomas, Manu Thomas, Sanjay G, Lakshmi Kanth L R, Jimmy Jose, Harikrishnan S

    Abstract: Patients who undergo mechanical heart valve replacements or have conditions like Atrial Fibrillation have to take Vitamin K Antagonists (VKA) drugs to prevent coagulation of blood. These drugs have a narrow therapeutic range and need to be very closely monitored due to life-threatening side effects. The dosage of VKA drug is determined and revised by a physician based on Prothrombin Time - Internati…

    Submitted 19 April, 2020; originally announced April 2020.

  39. arXiv:2004.04635  [pdf, other]

    cs.LG stat.ML

    Graph Highway Networks

    Authors: Xin Xin, Alexandros Karatzoglou, Ioannis Arapakis, Joemon M. Jose

    Abstract: Graph Convolution Networks (GCN) are widely used in learning graph representations due to their effectiveness and efficiency. However, they suffer from the notorious over-smoothing problem, in which the learned representations of densely connected nodes converge to alike vectors when many (>3) graph convolutional layers are stacked. In this paper, we argue that the re-normalization trick used in GC…

    Submitted 9 April, 2020; originally announced April 2020.

  40. arXiv:1908.08752  [pdf]

    cs.DL

    Ten years of research on ResearchGate, a scoping review using Google Scholar 2008_2017

    Authors: Prieto-Gutierrez, Juan Jose

    Abstract: Objective. To analyse quantitatively the articles published during 2008_2017 about the academic social networking site ResearchGate. Methods. A scoping bibliometric review of documents retrieved using Google Scholar was conducted, limited to publications that contained the word "ResearchGate" in their title and were published from 2008 to 2017. Results. The search yielded 159 documents, once a pre…

    Submitted 23 August, 2019; originally announced August 2019.

  41. arXiv:1905.09063  [pdf, other]

    cs.AI cs.PF

    NTP: A Neural Network Topology Profiler

    Authors: Raghavendra Bhat, Pravin Chandran, Juby Jose, Viswanath Dibbur, Prakash Sirra Ajith

    Abstract: Performance of end-to-end neural networks on a given hardware platform is a function of its compute and memory signature, which in turn, is governed by a wide range of parameters such as topology size, primitives used, framework used, batching strategy, latency requirements, precision etc. Current benchmarking tools suffer from limitations such as a) being either too granular like DeepBench [1] (o…

    Submitted 24 May, 2019; v1 submitted 22 May, 2019; originally announced May 2019.

  42. Towards a Skeleton-Based Action Recognition For Realistic Scenarios

    Authors: Cagatay Odabasi, Jewel Jose

    Abstract: Understanding human actions is a crucial problem for service robots. However, the general trend in Action Recognition is developing and testing these systems on structured datasets. That's why this work presents a practical Skeleton-based Action Recognition framework which can be used in realistic scenarios. Our results show that although non-augmented and non-normalized data may yield comparable…

    Submitted 14 May, 2019; originally announced May 2019.

  43. arXiv:1904.12796  [pdf, other]

    cs.IR cs.AI

    Relational Collaborative Filtering: Modeling Multiple Item Relations for Recommendation

    Authors: Xin Xin, Xiangnan He, Yongfeng Zhang, Yongdong Zhang, Joemon Jose

    Abstract: Existing item-based collaborative filtering (ICF) methods leverage only the relation of collaborative similarity. Nevertheless, there exist multiple relations between items in real-world scenarios. Distinct from the collaborative similarity that implies co-interact patterns from the user perspective, these relations reveal fine-grained knowledge on items from different perspectives of meta-data, f…

    Submitted 11 May, 2019; v1 submitted 29 April, 2019; originally announced April 2019.

  44. arXiv:1811.02629  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Identifying the Best Machine Learning Algorithms for Brain Tumor Segmentation, Progression Assessment, and Overall Survival Prediction in the BRATS Challenge

    Authors: Spyridon Bakas, Mauricio Reyes, Andras Jakab, Stefan Bauer, Markus Rempfler, Alessandro Crimi, Russell Takeshi Shinohara, Christoph Berger, Sung Min Ha, Martin Rozycki, Marcel Prastawa, Esther Alberts, Jana Lipkova, John Freymann, Justin Kirby, Michel Bilello, Hassan Fathallah-Shaykh, Roland Wiest, Jan Kirschke, Benedikt Wiestler, Rivka Colen, Aikaterini Kotrotsou, Pamela Lamontagne, Daniel Marcus, Mikhail Milchenko, et al. (402 additional authors not shown)

    Abstract: Gliomas are the most common primary brain malignancies, with different degrees of aggressiveness, variable prognosis and various heterogeneous histologic sub-regions, i.e., peritumoral edematous/invaded tissue, necrotic core, active and non-enhancing core. This intrinsic heterogeneity is also portrayed in their radio-phenotype, as their sub-regions are depicted by varying intensity profiles dissem…

    Submitted 23 April, 2019; v1 submitted 5 November, 2018; originally announced November 2018.

    Comments: The International Multimodal Brain Tumor Segmentation (BraTS) Challenge

  45. arXiv:1808.05163  [pdf, other]

    cs.IR cs.LG stat.ML

    A Simple Convolutional Generative Network for Next Item Recommendation

    Authors: Fajie Yuan, Alexandros Karatzoglou, Ioannis Arapakis, Joemon M Jose, Xiangnan He

    Abstract: Convolutional Neural Networks (CNNs) have been recently introduced in the domain of session-based next item recommendation. An ordered collection of past items the user has interacted with in a session (or sequence) are embedded into a 2-dimensional latent matrix, and treated as an image. The convolution and pooling operations are then applied to the mapped item embeddings. In this paper, we first…

    Submitted 28 November, 2018; v1 submitted 15 August, 2018; originally announced August 2018.

  46. arXiv:1710.09805  [pdf, other]

    cs.LG cs.CL stat.ML

    Improving Negative Sampling for Word Representation using Self-embedded Features

    Authors: Long Chen, Fajie Yuan, Joemon M. Jose, Weinan Zhang

    Abstract: Although the word-popularity based negative sampler has shown superb performance in the skip-gram model, the theoretical motivation behind oversampling popular (non-observed) words as negative samples is still not well understood. In this paper, we start from an investigation of the gradient vanishing issue in the skipgram model without a proper negative sampler. By performing an insightful analys…

    Submitted 26 June, 2018; v1 submitted 26 October, 2017; originally announced October 2017.

    Comments: Accepted in WSDM 2018

  47. arXiv:1601.06181  [pdf, ps, other]

    cs.IT

    Secure Content Distribution in Vehicular Networks

    Authors: Viet T. Nguyen, Jubin Jose, Xinzhou Wu, Tom Richardson

    Abstract: Dedicated short range communication (DSRC) relies on secure distribution to vehicles of a certificate revocation list (CRL) for enabling security protocols. CRL distribution utilizing vehicle-to-vehicle (V2V) communications is preferred to an infrastructure-only approach. One approach to V2V CRL distribution, using rateless coding at the source and forwarding at vehicle relays is vulnerable to a p…

    Submitted 22 January, 2016; originally announced January 2016.

  48. arXiv:1512.02059  [pdf, other]

    cs.NI

    Inter-Vehicle Range Estimation from Periodic Broadcasts

    Authors: Urs Niesen, Venkatesan N. Ekambaram, Jubin Jose, Xinzhou Wu

    Abstract: Dedicated short-range communication (DSRC) enables vehicular communication using periodic broadcast messages. We propose to use these periodic broadcasts to perform inter-vehicle ranging. Motivated by this scenario, we study the general problem of precise range estimation between pairs of moving vehicles using periodic broadcasts. Each vehicle has its own independent and unsynchronized clock, whic…

    Submitted 7 July, 2016; v1 submitted 7 December, 2015; originally announced December 2015.

    Comments: 16 pages

    Journal ref: IEEE Transactions on Vehicular Technology, vol. 66, pp. 10637-10646, December 2017

  49. arXiv:1511.01535  [pdf, ps, other]

    eess.SY cs.NI

    Distributed Rate and Power Control in Vehicular Networks

    Authors: Jubin Jose, Chong Li, Xinzhou Wu, Lei Ying, Kai Zhu

    Abstract: The focus of this paper is on the rate and power control algorithms in Dedicated Short Range Communication (DSRC) for vehicular networks. We first propose a utility maximization framework by leveraging the well-developed network congestion control, and formulate two subproblems, one on rate control with fixed transmit powers and the other on power control with fixed rates. Distributed rate control…

    Submitted 4 November, 2015; originally announced November 2015.

    Comments: Submitted to IEEE/ACM Transactions on Networking

  50. arXiv:1111.1022  [pdf]

    cs.SE

    Towards the integration of formal specification in the Áncora methodology

    Authors: Carlos Alberto Fernandez-y-Fernandez, Martín José José

    Abstract: There are some non-formal methodologies such as RUP, OpenUP, agile methodologies such as SCRUP, XP and techniques like those proposed by UML, which allow the development of software. The software industry has struggled to generate quality software, as importance has not been given to the engineering requirements, resulting in a poor specification of requirements and software of poor quality. In or…

    Submitted 3 November, 2011; originally announced November 2011.