Skip to main content

Showing 1–50 of 71 results for author: Chan, P

  1. arXiv:2406.19643  [pdf, other

    cs.CL cs.AI

    Unlocking Varied Perspectives: A Persona-Based Multi-Agent Framework with Debate-Driven Text Planning for Argument Generation

    Authors: Zhe Hu, Hou Pong Chan, Jing Li, Yu Yin

    Abstract: Writing persuasive arguments is a challenging task for both humans and machines. It entails incorporating high-level beliefs from various perspectives on the topic, along with deliberate reasoning and planning to construct a coherent narrative. Current language models often generate surface tokens autoregressively, lacking explicit integration of these underlying controls, resulting in limited out… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  2. arXiv:2406.04364  [pdf

    cs.CV cs.HC cs.LG

    Use of a Multiscale Vision Transformer to predict Nursing Activities Score from Low Resolution Thermal Videos in an Intensive Care Unit

    Authors: Isaac YL Lee, Thanh Nguyen-Duc, Ryo Ueno, Jesse Smith, Peter Y Chan

    Abstract: Excessive caregiver workload in hospital nurses has been implicated in poorer patient care and increased worker burnout. Measurement of this workload in the Intensive Care Unit (ICU) is often done using the Nursing Activities Score (NAS), but this is usually recorded manually and sporadically. Previous work has made use of Ambient Intelligence (AmI) by using computer vision to passively derive car… ▽ More

    Submitted 30 May, 2024; originally announced June 2024.

    Comments: 4 pages, 1 figure

  3. arXiv:2403.12027  [pdf, other

    cs.CL cs.AI cs.CV

    From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models

    Authors: Kung-Hsiang Huang, Hou Pong Chan, Yi R. Fung, Haoyi Qiu, Mingyang Zhou, Shafiq Joty, Shih-Fu Chang, Heng Ji

    Abstract: Data visualization in the form of charts plays a pivotal role in data analysis, offering critical insights and aiding in informed decision-making. Automatic chart understanding has witnessed significant advancements with the rise of large foundation models in recent years. Foundation models, such as large language models, have revolutionized various natural language processing tasks and are increa… ▽ More

    Submitted 25 March, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

  4. arXiv:2403.01433  [pdf, other

    cs.CE q-bio.NC

    BrainMass: Advancing Brain Network Analysis for Diagnosis with Large-scale Self-Supervised Learning

    Authors: Yanwu Yang, Chenfei Ye, Guinan Su, Ziyao Zhang, Zhikai Chang, Hairui Chen, Piu Chan, Yue Yu, Ting Ma

    Abstract: Foundation models pretrained on large-scale datasets via self-supervised learning demonstrate exceptional versatility across various tasks. Due to the heterogeneity and hard-to-collect medical data, this approach is especially beneficial for medical image analysis and neuroscience research, as it streamlines broad downstream tasks without the need for numerous costly annotations. However, there ha… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

  5. arXiv:2402.11060  [pdf, other

    cs.CL cs.AI cs.IR

    Persona-DB: Efficient Large Language Model Personalization for Response Prediction with Collaborative Data Refinement

    Authors: Chenkai Sun, Ke Yang, Revanth Gangi Reddy, Yi R. Fung, Hou Pong Chan, ChengXiang Zhai, Heng Ji

    Abstract: The increasing demand for personalized interactions with large language models (LLMs) calls for the development of methodologies capable of accurately and efficiently identifying user opinions and preferences. Retrieval augmentation emerges as an effective strategy, as it can accommodate a vast number of users without the costs from fine-tuning. Existing research, however, has largely focused on e… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  6. arXiv:2402.07401  [pdf, other

    cs.CL

    Can LLMs Produce Faithful Explanations For Fact-checking? Towards Faithful Explainable Fact-Checking via Multi-Agent Debate

    Authors: Kyungha Kim, Sangyun Lee, Kung-Hsiang Huang, Hou Pong Chan, Manling Li, Heng Ji

    Abstract: Fact-checking research has extensively explored verification but less so the generation of natural-language explanations, crucial for user trust. While Large Language Models (LLMs) excel in text generation, their capability for producing faithful explanations in fact-checking remains underexamined. Our study investigates LLMs' ability to generate such explanations, finding that zero-shot prompts o… ▽ More

    Submitted 11 February, 2024; originally announced February 2024.

  7. arXiv:2312.10160  [pdf, other

    cs.CL

    Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning

    Authors: Kung-Hsiang Huang, Mingyang Zhou, Hou Pong Chan, Yi R. Fung, Zhenhailong Wang, Lingyu Zhang, Shih-Fu Chang, Heng Ji

    Abstract: Recent advancements in large vision-language models (LVLMs) have led to significant progress in generating natural language descriptions for visual content and thus enhancing various applications. One issue with these powerful models is that they sometimes produce texts that are factually inconsistent with the visual input. While there has been some effort to mitigate such inconsistencies in natur… ▽ More

    Submitted 30 May, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: ACL 2024 Findings

  8. arXiv:2310.20352  [pdf, other

    cs.CL

    AMERICANO: Argument Generation with Discourse-driven Decomposition and Agent Interaction

    Authors: Zhe Hu, Hou Pong Chan, Yu Yin

    Abstract: Argument generation is a challenging task in natural language processing, which requires rigorous reasoning and proper content organization. Inspired by recent chain-of-thought prompting that breaks down a complex task into intermediate steps, we propose Americano, a novel framework with agent interaction for argument generation. Our approach decomposes the generation process into sequential actio… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

  9. arXiv:2310.13297  [pdf, other

    cs.CL cs.AI cs.LG

    Decoding the Silent Majority: Inducing Belief Augmented Social Graph with Large Language Model for Response Forecasting

    Authors: Chenkai Sun, Jinning Li, Yi R. Fung, Hou Pong Chan, Tarek Abdelzaher, ChengXiang Zhai, Heng Ji

    Abstract: Automatic response forecasting for news media plays a crucial role in enabling content producers to efficiently predict the impact of news releases and prevent unexpected negative outcomes such as social conflict and moral injury. To effectively forecast responses, it is essential to develop measures that leverage the social dynamics and contextual information surrounding individuals, especially i… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: Accepted at EMNLP 2023 Main Conference

  10. arXiv:2309.16066  [pdf, other

    cs.LG

    Label Augmentation Method for Medical Landmark Detection in Hip Radiograph Images

    Authors: Yehyun Suh, Peter Chan, J. Ryan Martin, Daniel Moyer

    Abstract: This work reports the empirical performance of an automated medical landmark detection method for predict clinical markers in hip radiograph images. Notably, the detection method was trained using a label-only augmentation scheme; our results indicate that this form of augmentation outperforms traditional data augmentation and produces highly sample efficient estimators. We train a generic U-Net-b… ▽ More

    Submitted 8 December, 2023; v1 submitted 27 September, 2023; originally announced September 2023.

  11. arXiv:2309.11478  [pdf, other

    cs.AI

    Fictional Worlds, Real Connections: Developing Community Storytelling Social Chatbots through LLMs

    Authors: Yuqian Sun, Hanyi Wang, Pok Man Chan, Morteza Tabibi, Yan Zhang, Huan Lu, Yuheng Chen, Chang Hee Lee, Ali Asadipour

    Abstract: We address the integration of storytelling and Large Language Models (LLMs) to develop engaging and believable Social Chatbots (SCs) in community settings. Motivated by the potential of fictional characters to enhance social interactions, we introduce Storytelling Social Chatbots (SSCs) and the concept of story engineering to transform fictional game characters into "live" social entities within p… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

  12. arXiv:2305.16470  [pdf, other

    cs.CL cs.LG

    Measuring the Effect of Influential Messages on Varying Personas

    Authors: Chenkai Sun, Jinning Li, Hou Pong Chan, ChengXiang Zhai, Heng Ji

    Abstract: Predicting how a user responds to news events enables important applications such as allowing intelligent agents or content producers to estimate the effect on different communities and revise unreleased messages to prevent unexpected bad outcomes such as social conflict and moral injury. We present a new task, Response Forecasting on Personas for News Media, to estimate the response a persona (ch… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

  13. arXiv:2305.14647  [pdf, other

    cs.CL

    Scientific Opinion Summarization: Paper Meta-review Generation Dataset, Methods, and Evaluation

    Authors: Qi Zeng, Mankeerat Sidhu, Ansel Blume, Hou Pong Chan, Lu Wang, Heng Ji

    Abstract: Opinions in scientific research papers can be divergent, leading to controversies among reviewers. However, most existing datasets for opinion summarization are centered around product reviews and assume that the analyzed opinions are non-controversial, failing to account for the variability seen in other contexts such as academic papers, political debates, or social media discussions. To address… ▽ More

    Submitted 15 June, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: IJCAI 2024 AI4Research Workshop

  14. arXiv:2305.14548  [pdf, other

    cs.CL

    Interpretable Automatic Fine-grained Inconsistency Detection in Text Summarization

    Authors: Hou Pong Chan, Qi Zeng, Heng Ji

    Abstract: Existing factual consistency evaluation approaches for text summarization provide binary predictions and limited insights into the weakness of summarization systems. Therefore, we propose the task of fine-grained inconsistency detection, the goal of which is to predict the fine-grained types of factual errors in a summary. Motivated by how humans inspect factual inconsistency in summaries, we prop… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted by ACL Findings 2023. Code and data are available at https://github.com/kenchan0226/fineGrainedFact

  15. arXiv:2305.14225  [pdf, other

    cs.CL

    ManiTweet: A New Benchmark for Identifying Manipulation of News on Social Media

    Authors: Kung-Hsiang Huang, Hou Pong Chan, Kathleen McKeown, Heng Ji

    Abstract: Considerable advancements have been made to tackle the misrepresentation of information derived from reference articles in the domains of fact-checking and faithful summarization. However, an unaddressed aspect remains - the identification of social media posts that manipulate information within associated news articles. This task presents a significant challenge, primarily due to the prevalence o… ▽ More

    Submitted 12 June, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

  16. arXiv:2305.07982  [pdf, other

    cs.CL

    Zero-shot Faithful Factual Error Correction

    Authors: Kung-Hsiang Huang, Hou Pong Chan, Heng Ji

    Abstract: Faithfully correcting factual errors is critical for maintaining the integrity of textual knowledge bases and preventing hallucinations in sequence-to-sequence models. Drawing on humans' ability to identify and correct factual errors, we present a zero-shot framework that formulates questions about input claims, looks for correct answers in the given evidence, and assesses the faithfulness of each… ▽ More

    Submitted 27 May, 2023; v1 submitted 13 May, 2023; originally announced May 2023.

    Comments: Accepted by ACL 2023

  17. Robot Gaze During Autonomous Navigation and its Effect on Social Presence

    Authors: Kerry He, Wesley P. Chan, Akansel Cosgun, Albin Joy, Elizabeth A. Croft

    Abstract: As robots have become increasingly common in human-rich environments, it is critical that they are able to exhibit social cues to be perceived as a cooperative and socially-conformant team member. We investigate the effect of robot gaze cues on people's subjective perceptions of a mobile robot as a socially present entity in three common hallway navigation scenarios. The tested robot gaze behavior… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: Submitted to IJSR 2022

  18. arXiv:2305.01951  [pdf, other

    cs.CL

    Can LMs Generalize to Future Data? An Empirical Analysis on Text Summarization

    Authors: Chi Seng Cheang, Hou Pong Chan, Derek F. Wong, Xuebo Liu, Zhaocong Li, Yanming Sun, Shudong Liu, Lidia S. Chao

    Abstract: Recent pre-trained language models (PLMs) achieve promising results in existing abstractive summarization datasets. However, existing summarization benchmarks overlap in time with the standard pre-training corpora and finetuning datasets. Hence, the strong performance of PLMs may rely on the parametric knowledge that is memorized during pre-training and fine-tuning. Moreover, the knowledge memoriz… ▽ More

    Submitted 2 November, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

    Comments: Accepted at EMNLP 2023

  19. arXiv:2304.09182  [pdf, other

    cs.LG cs.AI

    A Deep Learning Framework for Traffic Data Imputation Considering Spatiotemporal Dependencies

    Authors: Li Jiang, Ting Zhang, Qiruyi Zuo, Chenyu Tian, George P. Chan, Wai Kin, Chan

    Abstract: Spatiotemporal (ST) data collected by sensors can be represented as multi-variate time series, which is a sequence of data points listed in an order of time. Despite the vast amount of useful information, the ST data usually suffer from the issue of missing or incomplete data, which also limits its applications. Imputation is one viable solution and is often used to prepossess the data for further… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: accepted at ICITE 2022

  20. arXiv:2302.05550  [pdf, other

    cs.IR cs.AI cs.LG cs.SI

    PDSum: Prototype-driven Continuous Summarization of Evolving Multi-document Sets Stream

    Authors: Susik Yoon, Hou Pong Chan, Jiawei Han

    Abstract: Summarizing text-rich documents has been long studied in the literature, but most of the existing efforts have been made to summarize a static and predefined multi-document set. With the rapid development of online platforms for generating and distributing text-rich documents, there arises an urgent need for continuously summarizing dynamically evolving multi-document sets where the composition of… ▽ More

    Submitted 10 February, 2023; originally announced February 2023.

    Comments: Accepted by WWW'23

  21. arXiv:2212.13257  [pdf

    physics.med-ph cs.CV eess.IV eess.SY physics.optics

    A portable widefield fundus camera with high dynamic range imaging capability

    Authors: Alfa Rossi, Mojtaba Rahimi, David Le, Taeyoon son, Michael J. Heiferman, R. V. Paul Chan, Xincheng Yao

    Abstract: Fundus photography is indispensable for clinical detection and management of eye diseases. Limited image contrast and field of view (FOV) are common limitations of conventional fundus cameras, making it difficult to detect subtle abnormalities at the early stages of eye diseases. Further improvements of image contrast and FOV coverage are important to improve early disease detection and reliable t… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

    Comments: 12 pages, 8 figures

  22. arXiv:2212.01146  [pdf, other

    cs.CL

    SumREN: Summarizing Reported Speech about Events in News

    Authors: Revanth Gangi Reddy, Heba Elfardy, Hou Pong Chan, Kevin Small, Heng Ji

    Abstract: A primary objective of news articles is to establish the factual record for an event, frequently achieved by conveying both the details of the specified event (i.e., the 5 Ws; Who, What, Where, When and Why regarding the event) and how people reacted to it (i.e., reported statements). However, existing work on news summarization almost exclusively focuses on the event details. In this work, we pro… ▽ More

    Submitted 7 March, 2023; v1 submitted 2 December, 2022; originally announced December 2022.

    Comments: Accepted at AAAI 2023

  23. arXiv:2210.14650  [pdf, other

    cs.CL

    MOCHA: A Multi-Task Training Approach for Coherent Text Generation from Cognitive Perspective

    Authors: Zhe Hu, Hou Pong Chan, Lifu Huang

    Abstract: Teaching neural models to generate narrative coherent texts is a critical problem. Recent pre-trained language models have achieved promising results, but there is still a gap between human written texts and machine-generated outputs. In this work, we propose a novel multi-task training strategy for coherent text generation grounded on the cognitive theory of writing, which empowers the model to l… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

    Comments: Accepted at EMNLP 2022

  24. arXiv:2209.14385  [pdf, other

    cs.CV cs.AI cs.CR cs.LG

    Feature Decoupling in Self-supervised Representation Learning for Open Set Recognition

    Authors: Jingyun Jia, Philip K. Chan

    Abstract: Assuming unknown classes could be present during classification, the open set recognition (OSR) task aims to classify an instance into a known class or reject it as unknown. In this paper, we use a two-stage training strategy for the OSR problems. In the first stage, we introduce a self-supervised feature decoupling method that finds the content features of the input samples from the known classes… ▽ More

    Submitted 28 September, 2022; originally announced September 2022.

  25. arXiv:2208.12306  [pdf, other

    cs.CL cs.AI cs.CV

    Multimedia Generative Script Learning for Task Planning

    Authors: Qingyun Wang, Manling Li, Hou Pong Chan, Lifu Huang, Julia Hockenmaier, Girish Chowdhary, Heng Ji

    Abstract: Goal-oriented generative script learning aims to generate subsequent steps to reach a particular goal, which is an essential task to assist robots or humans in performing stereotypical activities. An important aspect of this process is the ability to capture historical states visually, which provides detailed information that is not covered by text and will guide subsequent steps. Therefore, we pr… ▽ More

    Submitted 10 July, 2023; v1 submitted 25 August, 2022; originally announced August 2022.

    Comments: 21 pages, Accepted by Findings of the Association for Computational Linguistics: ACL 2023, Code and Resources at https://github.com/EagleW/Multimedia-Generative-Script-Learning

  26. arXiv:2208.11903  [pdf, other

    cs.RO

    Autonomous social robot navigation in unknown urban environments using semantic segmentation

    Authors: Sophie Buckeridge, Pamela Carreno-Medrano, Akansel Cosgun, Elizabeth Croft, Wesley P. Chan

    Abstract: For autonomous robots navigating in urban environments, it is important for the robot to stay on the designated path of travel (i.e., the footpath), and avoid areas such as grass and garden beds, for safety and social conformity considerations. This paper presents an autonomous navigation approach for unknown urban environments that combines the use of semantic segmentation and LiDAR data. The pro… ▽ More

    Submitted 12 September, 2022; v1 submitted 25 August, 2022; originally announced August 2022.

  27. arXiv:2208.11856  [pdf, other

    cs.RO

    Design and Implementation of a Human-Robot Joint Action Framework using Augmented Reality and Eye Gaze

    Authors: Wesley P. Chan, Morgan Crouch, Khoa Hoang, Charlie Chen, Nicole Robinson, Elizabeth Croft

    Abstract: When humans work together to complete a joint task, each person builds an internal model of the situation and how it will evolve. Efficient collaboration is dependent on how these individual models overlap to form a shared mental model among team members, which is important for collaborative processes in human-robot teams. The development and maintenance of an accurate shared mental model requires… ▽ More

    Submitted 24 August, 2022; originally announced August 2022.

  28. arXiv:2208.11563  [pdf

    eess.IV cs.CV q-bio.QM

    Contrastive learning-based pretraining improves representation and transferability of diabetic retinopathy classification models

    Authors: Minhaj Nur Alam, Rikiya Yamashita, Vignav Ramesh, Tejas Prabhune, Jennifer I. Lim, R. V. P. Chan, Joelle Hallak, Theodore Leng, Daniel Rubin

    Abstract: Self supervised contrastive learning based pretraining allows development of robust and generalized deep learning models with small, labeled datasets, reducing the burden of label generation. This paper aims to evaluate the effect of CL based pretraining on the performance of referrable vs non referrable diabetic retinopathy (DR) classification. We have developed a CL based framework with neural s… ▽ More

    Submitted 24 August, 2022; originally announced August 2022.

  29. arXiv:2206.02363  [pdf, other

    cs.CL cs.IR

    Knowledge-based Document Classification with Shannon Entropy

    Authors: AtMa P. O. Chan

    Abstract: Document classification is the detection specific content of interest in text documents. In contrast to the data-driven machine learning classifiers, knowledge-based classifiers can be constructed based on domain specific knowledge, which usually takes the form of a collection of subject related keywords. While typical knowledge-based classifiers compute a prediction score based on the keyword abu… ▽ More

    Submitted 6 June, 2022; originally announced June 2022.

  30. arXiv:2205.06918  [pdf, other

    cs.CR cs.LG

    Representation learning with function call graph transformations for malware open set recognition

    Authors: Jingyun Jia, Philip K. Chan

    Abstract: Open set recognition (OSR) problem has been a challenge in many machine learning (ML) applications, such as security. As new/unknown malware families occur regularly, it is difficult to exhaust samples that cover all the classes for the training process in ML systems. An advanced malware classification system should classify the known classes correctly while sensitive to the unknown class. In this… ▽ More

    Submitted 12 July, 2022; v1 submitted 13 May, 2022; originally announced May 2022.

  31. arXiv:2203.09100  [pdf, other

    cs.CL

    PLANET: Dynamic Content Planning in Autoregressive Transformers for Long-form Text Generation

    Authors: Zhe Hu, Hou Pong Chan, Jiachen Liu, Xinyan Xiao, Hua Wu, Lifu Huang

    Abstract: Despite recent progress of pre-trained language models on generating fluent text, existing methods still suffer from incoherence problems in long-form text generation tasks that require proper content control and planning to form a coherent high-level logical flow. In this work, we propose PLANET, a novel generation framework leveraging autoregressive self-attention mechanism to conduct content pl… ▽ More

    Submitted 17 March, 2022; originally announced March 2022.

  32. arXiv:2203.08343  [pdf, other

    cs.RO

    Design and Evaluation of an Augmented Reality Head-Mounted Display Interface for Human Robot Teams Collaborating in Physically Shared Manufacturing Tasks

    Authors: Wesley P Chan, Geoffrey Hanks, Maram Sakr, Haomiao Zhang, Tiger Zuo, H F Machiel Van der Loos, Elizabeth Croft

    Abstract: We provide an experimental evaluation of a wearable augmented reality (AR) system we have developed for human-robot teams working on tasks requiring collaboration in shared physical workspace. Recent advances in AR technology have facilitated the development of more intuitive user interfaces for many human-robot interaction applications. While it has been anticipated that AR can provided a more in… ▽ More

    Submitted 15 March, 2022; originally announced March 2022.

  33. arXiv:2203.08144  [pdf, other

    q-fin.ST cs.CL cs.LG cs.SI

    DeepTrust: A Reliable Financial Knowledge Retrieval Framework For Explaining Extreme Pricing Anomalies

    Authors: Pok Wah Chan

    Abstract: Extreme pricing anomalies may occur unexpectedly without a trivial cause, and equity traders typically experience a meticulous process to source disparate information and analyze its reliability before integrating it into the trusted knowledge base. We introduce DeepTrust, a reliable financial knowledge retrieval framework on Twitter to explain extreme price moves at speed, while ensuring data ver… ▽ More

    Submitted 11 March, 2022; originally announced March 2022.

    Comments: 72 pages

  34. arXiv:2203.06822  [pdf, other

    cs.CV cs.CL cs.RO

    Grounding Commands for Autonomous Vehicles via Layer Fusion with Region-specific Dynamic Layer Attention

    Authors: Hou Pong Chan, Mingxi Guo, Cheng-Zhong Xu

    Abstract: Grounding a command to the visual environment is an essential ingredient for interactions between autonomous vehicles and humans. In this work, we study the problem of language grounding for autonomous vehicles, which aims to localize a region in a visual scene according to a natural language command from a passenger. Prior work only employs the top layer representations of a vision-and-language p… ▽ More

    Submitted 13 March, 2022; originally announced March 2022.

    Comments: Submitted to IROS 2022

  35. arXiv:2202.07169  [pdf, other

    cs.SE

    Documentation based Semantic-Aware Log Parsing

    Authors: Lei Yu, Tian Wu, Jiaqi Li, Patrick Chan, Hong Min, Fanjing Meng

    Abstract: With the recent advances of deep learning techniques, there are rapidly growing interests in applying machine learning to log data. As a fundamental part of log analytics, accurate log parsing that transforms raw logs to structured events is critical for subsequent machine learning and data mining tasks. Previous approaches either analyze the source code for parsing or are data-driven such as text… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

  36. arXiv:2202.04748  [pdf, other

    cs.CV cs.HC cs.LG

    Estimation of Clinical Workload and Patient Activity using Deep Learning and Optical Flow

    Authors: Thanh Nguyen-Duc, Peter Y Chan, Andrew Tay, David Chen, John Tan Nguyen, Jessica Lyall, Maria De Freitas

    Abstract: Contactless monitoring using thermal imaging has become increasingly proposed to monitor patient deterioration in hospital, most recently to detect fevers and infections during the COVID-19 pandemic. In this letter, we propose a novel method to estimate patient motion and observe clinical workload using a similar technical setup but combined with open source object detection algorithms (YOLOv4) an… ▽ More

    Submitted 9 February, 2022; originally announced February 2022.

  37. Metrics for Evaluating Social Conformity of Crowd Navigation Algorithms

    Authors: Junxian Wang, Wesley P. Chan, Pamela Carreno-Medrano, Akansel Cosgun, Elizabeth Croft

    Abstract: Recent protocols and metrics for training and evaluating autonomous robot navigation through crowds are inconsistent due to diversified definitions of "social behavior". This makes it difficult, if not impossible, to effectively compare published navigation algorithms. Furthermore, with the lack of a good evaluation protocol, resulting algorithms may fail to generalize, due to lack of diversity in… ▽ More

    Submitted 2 February, 2022; originally announced February 2022.

    Journal ref: 2022 IEEE International Conference on Advanced Robotics and Its Social Impacts (ARSO)

  38. arXiv:2110.15521  [pdf, other

    cs.RO cs.HC

    ARviz -- An Augmented Reality-enabled Visualization Platform for ROS Applications

    Authors: Khoa C. Hoang, Wesley P. Chan, Steven Lay, Akansel Cosgun, Elizabeth A. Croft

    Abstract: Current robot interfaces such as teach pendants and 2D screen displays used for task visualization and interaction often seem unintuitive and limited in terms of information flow. This compromises task efficiency as interacting with the interface can distract the user from the task at hand. Augmented Reality (AR) technology offers the capability to create visually rich displays and intuitive inter… ▽ More

    Submitted 28 October, 2021; originally announced October 2021.

    Comments: 9 pages, 10 figures, accepted for IEEE RAM - Special Issue on Extended Reality

  39. arXiv:2109.13845  [pdf

    cs.CV

    Not Color Blind: AI Predicts Racial Identity from Black and White Retinal Vessel Segmentations

    Authors: Aaron S. Coyner, Praveer Singh, James M. Brown, Susan Ostmo, R. V. Paul Chan, Michael F. Chiang, Jayashree Kalpathy-Cramer, J. Peter Campbell

    Abstract: Background: Artificial intelligence (AI) may demonstrate racial bias when skin or choroidal pigmentation is present in medical images. Recent studies have shown that convolutional neural networks (CNNs) can predict race from images that were not previously thought to contain race-specific features. We evaluate whether grayscale retinal vessel maps (RVMs) of patients screened for retinopathy of pre… ▽ More

    Submitted 28 September, 2021; originally announced September 2021.

    Comments: 31 pages, 6 figures

  40. arXiv:2109.09908  [pdf, other

    cs.RO

    A Proposed Set of Communicative Gestures for Human Robot Interaction and an RGB Image-based Gesture Recognizer Implemented in ROS

    Authors: Jia Chuan A. Tan, Wesley P. Chan, Nicole L. Robinson, Elizabeth A. Croft, Dana Kulic

    Abstract: We propose a set of communicative gestures and develop a gesture recognition system with the aim of facilitating more intuitive Human-Robot Interaction (HRI) through gestures. First, we propose a set of commands commonly used for human-robot interaction. Next, an online user study with 190 participants was performed to investigate if there was an agreed set of gestures that people intuitively use… ▽ More

    Submitted 20 September, 2021; originally announced September 2021.

    Comments: 6 pages, 5 figures, 3 tables, ICRA 2022 Conference

  41. arXiv:2109.06717  [pdf, other

    cs.CL cs.AI

    Controllable Dialogue Generation with Disentangled Multi-grained Style Specification and Attribute Consistency Reward

    Authors: Zhe Hu, Zhiwei Cao, Hou Pong Chan, Jiachen Liu, Xinyan Xiao, Jinsong Su, Hua Wu

    Abstract: Controllable text generation is an appealing but challenging task, which allows users to specify particular attributes of the generated outputs. In this paper, we propose a controllable dialogue generation model to steer response generation under multi-attribute constraints. Specifically, we define and categorize the commonly used control attributes into global and local ones, which possess differ… ▽ More

    Submitted 21 October, 2022; v1 submitted 14 September, 2021; originally announced September 2021.

    Comments: Accepted as a regular paper in IEEE/ACM TASLP

  42. arXiv:2108.12780  [pdf, other

    cs.RO

    An Experimental Validation and Comparison of Reaching Motion Models for Unconstrained Handovers: Towards Generating Humanlike Motions for Human-Robot Handovers

    Authors: Wesley P. Chan, Tin Tran, Sara Sheikholeslami, Elizabeth Croft

    Abstract: The Minimum Jerk motion model has long been cited in literature for human point-to-point reaching motions in single-person tasks. While it has been demonstrated that applying minimum-jerk-like trajectories to robot reaching motions in the joint action task of human-robot handovers allows a robot giver to be perceived as more careful, safe, and skilled, it has not been verified whether human reachi… ▽ More

    Submitted 29 August, 2021; originally announced August 2021.

    Comments: Accepted at Humanoids 2020, "The 2020 IEEE-RAS International Conference on Humanoid Robots"; 6 pages, 7 figures, 1 table

  43. arXiv:2108.03405  [pdf, other

    cs.CL

    Controllable Summarization with Constrained Markov Decision Process

    Authors: Hou Pong Chan, Lu Wang, Irwin King

    Abstract: We study controllable text summarization which allows users to gain control on a particular attribute (e.g., length limit) of the generated summaries. In this work, we propose a novel training framework based on Constrained Markov Decision Process (CMDP), which conveniently includes a reward function along with a set of constraints, to facilitate better summarization control. The reward function e… ▽ More

    Submitted 7 August, 2021; originally announced August 2021.

    Comments: To appear in TACL

  44. arXiv:2108.01268  [pdf, other

    cs.CL

    Dialogue Summarization with Supporting Utterance Flow Modeling and Fact Regularization

    Authors: Wang Chen, Piji Li, Hou Pong Chan, Irwin King

    Abstract: Dialogue summarization aims to generate a summary that indicates the key points of a given dialogue. In this work, we propose an end-to-end neural model for dialogue summarization with two novel modules, namely, the \emph{supporting utterance flow modeling module} and the \emph{fact regularization module}. The supporting utterance flow modeling helps to generate a coherent summary by smoothly shif… ▽ More

    Submitted 2 August, 2021; originally announced August 2021.

    Comments: Knowledge-Based Systems (KBS)

  45. A Condense-then-Select Strategy for Text Summarization

    Authors: Hou Pong Chan, Irwin King

    Abstract: Select-then-compress is a popular hybrid, framework for text summarization due to its high efficiency. This framework first selects salient sentences and then independently condenses each of the selected sentences into a concise version. However, compressing sentences separately ignores the context information of the document, and is therefore prone to delete salient information. To address this l… ▽ More

    Submitted 19 June, 2021; originally announced June 2021.

    Comments: Accepted by Knowledge-Based Systems (KBS) journal

  46. arXiv:2105.13557  [pdf, other

    cs.LG cs.CV

    Self-supervised Detransformation Autoencoder for Representation Learning in Open Set Recognition

    Authors: Jingyun Jia, Philip K. Chan

    Abstract: The objective of Open set recognition (OSR) is to learn a classifier that can reject the unknown samples while classifying the known classes accurately. In this paper, we propose a self-supervision method, Detransformation Autoencoder (DTAE), for the OSR problem. This proposed method engages in learning representations that are invariant to the transformations of the input data. Experiments on sev… ▽ More

    Submitted 6 July, 2022; v1 submitted 27 May, 2021; originally announced May 2021.

    Comments: arXiv admin note: text overlap with arXiv:2006.15117

  47. arXiv:2105.06822  [pdf, other

    cs.CV

    Multi-task Graph Convolutional Neural Network for Calcification Morphology and Distribution Analysis in Mammograms

    Authors: Hao Du, Melissa Min-Szu Yao, Liangyu Chen, Wing P. Chan, Mengling Feng

    Abstract: The morphology and distribution of microcalcifications in a cluster are the most important characteristics for radiologists to diagnose breast cancer. However, it is time-consuming and difficult for radiologists to identify these characteristics, and there also lacks of effective solutions for automatic characterization. In this study, we proposed a multi-task deep graph convolutional network (GCN… ▽ More

    Submitted 14 May, 2021; originally announced May 2021.

  48. Group Surfing: A Pedestrian-Based Approach to Sidewalk Robot Navigation

    Authors: Yuqing Du, Nicholas J. Hetherington, Chu Lip Oon, Wesley P. Chan, Camilo Perez Quintero, Elizabeth Croft, H. F. Machiel Van der Loos, .

    Abstract: In this paper, we propose a novel navigation system for mobile robots in pedestrian-rich sidewalk environments. Sidewalks are unique in that the pedestrian-shared space has characteristics of both roads and indoor spaces. Like vehicles on roads, pedestrian movement often manifests as linear flows in opposing directions. On the other hand, pedestrians also form crowds and can exhibit much more rand… ▽ More

    Submitted 13 April, 2021; originally announced April 2021.

    Comments: 7 pages, 9 figures. Published in the proceedings of the 2019 IEEE International Conference on Robotics and Automation (ICRA'19)

  49. arXiv:2104.05211  [pdf, other

    cs.RO

    Virtual Barriers in Augmented Reality for Safe and Effective Human-Robot Cooperation in Manufacturing

    Authors: Khoa Cong Hoang, Wesley P. Chan, Steven Lay, Akansel Cosgun, Elizabeth Croft

    Abstract: Safety is a fundamental requirement in any human-robot collaboration scenario. To ensure the safety of users for such scenarios, we propose a novel Virtual Barrier system facilitated by an augmented reality interface. Our system provides two kinds of Virtual Barriers to ensure safety: 1) a Virtual Person Barrier which encapsulates and follows the user to protect them from colliding with the robot,… ▽ More

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: 6 pages, submitted to IROS 2021, waiting for result

  50. Seeing Thru Walls: Visualizing Mobile Robots in Augmented Reality

    Authors: Morris Gu, Akansel Cosgun, Wesley P. Chan, Tom Drummond, Elizabeth Croft

    Abstract: We present an approach for visualizing mobile robots through an Augmented Reality headset when there is no line-of-sight visibility between the robot and the human. Three elements are visualized in Augmented Reality: 1) Robot's 3D model to indicate its position, 2) An arrow emanating from the robot to indicate its planned movement direction, and 3) A 2D grid to represent the ground plane. We condu… ▽ More

    Submitted 16 March, 2022; v1 submitted 8 April, 2021; originally announced April 2021.

    Comments: Accepted at RO-MAN 2021 "30th IEEE International Conference on Robot and Human Interactive Communication", 6 pages, 5 figures, 5 Tables