Skip to main content

Showing 1–50 of 451 results for author: Tan, W

  1. arXiv:2407.04840  [pdf

    eess.SY

    Analysis of Dead Reckoning Accuracy in Swarm Robotics System

    Authors: Weihang Tan, Timothy Anglea, Yongqiang Wang

    Abstract: The objective of this paper is to determine the position of a single mobile robot in a swarm using dead reckoning techniques. We investigate the accuracy of navigation by using this process. The paper begins with the research background and social importance. Then, the specific experimental setup and analysis of experimental results are presented. Finally, the results are detailed and some potenti… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  2. arXiv:2407.04069  [pdf, other

    cs.CL cs.AI cs.LG

    A Systematic Survey and Critical Review on Evaluating Large Language Models: Challenges, Limitations, and Recommendations

    Authors: Md Tahmid Rahman Laskar, Sawsan Alqahtani, M Saiful Bari, Mizanur Rahman, Mohammad Abdullah Matin Khan, Haidar Khan, Israt Jahan, Amran Bhuiyan, Chee Wei Tan, Md Rizwan Parvez, Enamul Hoque, Shafiq Joty, Jimmy Huang

    Abstract: Large Language Models (LLMs) have recently gained significant attention due to their remarkable capabilities in performing diverse tasks across various domains. However, a thorough evaluation of these models is crucial before deploying them in real-world applications to ensure they produce reliable performance. Despite the well-established importance of evaluating LLMs in the community, the comple… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  3. arXiv:2407.02376  [pdf, other

    astro-ph.HE

    A new subclass of gamma-ray burst originating from compact binary merger

    Authors: Chen-Wei Wang, Wen-Jun Tan, Shao-Lin Xiong, Shu-Xu Yi, Rahim Moradi, Bing Li, Zhen Zhang, Yu Wang, Yan-Zhi Meng, Jia-Cong Liu, Yue Wang, Sheng-Lun Xie, Wang-Chen Xue, Zheng-Hang Yu, Peng Zhang, Wen-Long Zhang, Yan-Qiu Zhang, Chao Zheng

    Abstract: Type I gamma-ray bursts (GRBs) are believed to originate from compact binary merger usually with duration less than 2 seconds for the main emission. However, recent observations of GRB 211211A and GRB 230307A indicate that some merger-origin GRBs could last much longer. Since they show strikingly similar properties (indicating a common mechanism) which are different from the classic "long"-short b… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  4. arXiv:2407.01634  [pdf, other

    physics.optics physics.ins-det

    Brownian thermal birefringent noise due to non-diagonal anisotropic photoelastic effect in multilayer coated mirrors

    Authors: Yu-Pei Zhang, Shi-Xiang Yang, Wen-Hai Tan, Cheng-Gang Shao, Yiqiu Ma, Shan-Qing Yang

    Abstract: Thermal noise in the mirror coatings limits the accuracy of today's most optical precision measurement experiments. Unlike the more commonly discussed thermal phase noise, the crystalline coating can generate thermal birefringent noise due to its anisotropic nature. In this study, we propose that the non-diagonal anisotropic photoelastic effect induced by the Brownian motion of mirror coating laye… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: 8 pages, 4 figures, Accepted by Physical Review D

  5. arXiv:2406.10447  [pdf, other

    cs.CV

    The BabyView dataset: High-resolution egocentric videos of infants' and young children's everyday experiences

    Authors: Bria Long, Violet Xiang, Stefan Stojanov, Robert Z. Sparks, Zi Yin, Grace E. Keene, Alvin W. M. Tan, Steven Y. Feng, Chengxu Zhuang, Virginia A. Marchman, Daniel L. K. Yamins, Michael C. Frank

    Abstract: Human children far exceed modern machine learning algorithms in their sample efficiency, achieving high performance in key domains with much less data than current models. This ''data gap'' is a key challenge both for building intelligent artificial systems and for understanding human development. Egocentric video capturing children's experience -- their ''training data'' -- is a key ingredient fo… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 9 pages, 2 figures, 4 tables and SI. Submitted to NeurIPS Datasets and Benchmarks

  6. arXiv:2406.10215  [pdf, other

    cs.CL cs.LG

    DevBench: A multimodal developmental benchmark for language learning

    Authors: Alvin Wei Ming Tan, Sunny Yu, Bria Long, Wanjing Anya Ma, Tonya Murray, Rebecca D. Silverman, Jason D. Yeatman, Michael C. Frank

    Abstract: How (dis)similar are the learning trajectories of vision-language models and children? Recent modeling work has attempted to understand the gap between models' and humans' data efficiency by constructing models trained on less data, especially multimodal naturalistic data. However, such models are often evaluated on adult-level benchmarks, with limited breadth in language abilities tested, and wit… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  7. arXiv:2406.07971  [pdf, other

    cs.CL cs.AI cs.LG

    It Takes Two: On the Seamlessness between Reward and Policy Model in RLHF

    Authors: Taiming Lu, Lingfeng Shen, Xinyu Yang, Weiting Tan, Beidi Chen, Huaxiu Yao

    Abstract: Reinforcement Learning from Human Feedback (RLHF) involves training policy models (PMs) and reward models (RMs) to align language models with human preferences. Instead of focusing solely on PMs and RMs independently, we propose to examine their interactions during fine-tuning, introducing the concept of seamlessness. Our study starts with observing the saturation phenomenon, where continual impro… ▽ More

    Submitted 13 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

  8. DHR+S: Distributed Hybrid Rendering with Realistic Real-time Shadows for Interactive Thin Client Metaverse and Game Applications

    Authors: Yu Wei Tan, Siang Ern Low, Jonas Chow, Javon Teo, Anand Bhojan

    Abstract: Distributed hybrid rendering (DHR) is a real-time rendering approach that incorporates cloud-based ray tracing with locally rasterized graphics for interactive thin client metaverse and game applications. With cloud assistance, DHR can generate high-fidelity ray-traced graphics contents remotely and deliver them to thin clients with low graphics capability, including standalone extended reality de… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    MSC Class: 68U05 ACM Class: I.3

  9. arXiv:2405.13274  [pdf, other

    cs.CL

    DiffNorm: Self-Supervised Normalization for Non-autoregressive Speech-to-speech Translation

    Authors: Weiting Tan, Jingyu Zhang, Lingfeng Shen, Daniel Khashabi, Philipp Koehn

    Abstract: Non-autoregressive Transformers (NATs) are recently applied in direct speech-to-speech translation systems, which convert speech across different languages without intermediate text data. Although NATs generate high-quality outputs and offer faster inference than autoregressive models, they tend to produce incoherent and repetitive results due to complex data distribution (e.g., acoustic and lingu… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  10. arXiv:2405.04940  [pdf, other

    cs.CV

    Harnessing the Power of MLLMs for Transferable Text-to-Image Person ReID

    Authors: Wentao Tan, Changxing Ding, Jiayu Jiang, Fei Wang, Yibing Zhan, Dapeng Tao

    Abstract: Text-to-image person re-identification (ReID) retrieves pedestrian images according to textual descriptions. Manually annotating textual descriptions is time-consuming, restricting the scale of existing datasets and therefore the generalization ability of ReID models. As a result, we study the transferable text-to-image ReID problem, where we train a model on our proposed large-scale database and… ▽ More

    Submitted 30 June, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: CVPR 2024

  11. arXiv:2405.01881  [pdf

    q-fin.RM cs.LG

    Explainable Risk Classification in Financial Reports

    Authors: Xue Wen Tan, Stanley Kok

    Abstract: Every publicly traded company in the US is required to file an annual 10-K financial report, which contains a wealth of information about the company. In this paper, we propose an explainable deep-learning model, called FinBERT-XRC, that takes a 10-K report as input, and automatically assesses the post-event return volatility risk of its associated company. In contrast to previous systems, our pro… ▽ More

    Submitted 6 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

    Comments: ICIS 2023 Proceedings. 3. https://aisel.aisnet.org/icis2023/blockchain/blockchain/3

  12. How Can I Improve? Using GPT to Highlight the Desired and Undesired Parts of Open-ended Responses

    Authors: Jionghao Lin, Eason Chen, Zeifei Han, Ashish Gurung, Danielle R. Thomas, Wei Tan, Ngoc Dang Nguyen, Kenneth R. Koedinger

    Abstract: Automated explanatory feedback systems play a crucial role in facilitating learning for a large cohort of learners by offering feedback that incorporates explanations, significantly enhancing the learning process. However, delivering such explanatory feedback in real-time poses challenges, particularly when high classification accuracy for domain-specific, nuanced responses is essential. Our study… ▽ More

    Submitted 30 April, 2024; originally announced May 2024.

    Comments: 11 pages, full research paper, EDM 2024

    Journal ref: A&A 687, A227 (2024)

  13. arXiv:2404.09814  [pdf, other

    cs.IT

    A Novel HARQ-CC Assisted SCMA Scheme

    Authors: Man Wang, Zheng Shi, Yunfei Li, Xianda Wu, Weiqiang Tan, Xinrong Ye

    Abstract: This letter proposes a novel hybrid automatic repeat request with chase combining assisted sparse code multiple access (HARQ-CC-SCMA) scheme. Depending on whether the same superimposed packet are retransmitted, synchronous and asynchronous modes are considered for retransmissions. Moreover, factor graph aggregation (FGA) and Log-likelihood ratio combination (LLRC) are proposed for multi-user detec… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  14. arXiv:2404.09538  [pdf, other

    hep-ph hep-ex

    Light single-gluon hybrid states with various (exotic) quantum numbers

    Authors: Wei-Han Tan, Niu Su, Hua-Xing Chen

    Abstract: We apply the QCD sum rule method to study the light single-gluon hybrid states with various (exotic) quantum numbers. We construct twenty-four single-gluon hybrid currents, and use eighteen of them to calculate the masses of forty-four single-gluon hybrid states with the quark-gluon contents $\bar q q g$ ($q=u/d$) and $\bar s s g$. We concentrate on the hybrid states with the exotic quantum number… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 16 pages 5 figures, 3 tables, suggestions and comments welcome

  15. arXiv:2404.04244  [pdf, other

    cs.CV

    Fast Diffeomorphic Image Registration using Patch based Fully Convolutional Networks

    Authors: Jiong Wu, Shuang Zhou, Li Lin, Xin Wang, Wenxue Tan

    Abstract: Diffeomorphic image registration is a fundamental step in medical image analysis, owing to its capability to ensure the invertibility of transformations and preservation of topology. Currently, unsupervised learning-based registration techniques primarily extract features at the image level, potentially limiting their efficacy. This paper proposes a novel unsupervised learning-based fully convolut… ▽ More

    Submitted 3 May, 2024; v1 submitted 5 April, 2024; originally announced April 2024.

  16. arXiv:2404.03229  [pdf, other

    astro-ph.HE

    Relation between the keV-MeV and TeV emission of GRB 221009A and its implications

    Authors: Yan-Qiu Zhang, Hao-Xiang Lin, Shao-Lin Xiong, Zhuo Li, Ming-Yu Ge, Chen-Wei Wang, Shu-Xu Yi, Zhen Zhang, Shuang-Nan Zhang, Li-Ming Song, Chao Zheng, Wang-Chen Xue, Jia-Cong Liu, Wen-Jun Tan, Yue Wang, Wen-Long Zhang

    Abstract: Gamma-ray bursts (GRBs) are believed to launch relativistic jets, which generate prompt emission by their internal processes and drive external shocks into surrounding medium, accounting for the long-lasting afterglow emission. However, how the jet powers the external shock is an open question. The unprecedented observations of the keV-MeV emission with GECAM and the TeV emission with LHAASO of so… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  17. arXiv:2403.17207  [pdf, other

    cond-mat.mtrl-sci

    Unified Differentiable Learning of Electric Response

    Authors: Stefano Falletta, Andrea Cepellotti, Anders Johansson, Chuin Wei Tan, Albert Musaelian, Cameron J. Owen, Boris Kozinsky

    Abstract: Predicting response of materials to external stimuli is a primary objective of computational materials science. However, current methods are limited to small-scale simulations due to the unfavorable scaling of computational costs. Here, we implement an equivariant machine-learning framework where response properties stem from exact differential relationships between a generalized potential functio… ▽ More

    Submitted 7 June, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

    Comments: 15 pages, 6 figures

  18. arXiv:2403.14890  [pdf

    cs.SI

    Unraveling Contagion Origins: Optimal Estimation through Maximum-Likelihood and Starlike Tree Approximation in Markovian Spreading Models

    Authors: Pei-Duo Yu, Chee Wei Tan, Liang Zheng, Chao Zhao

    Abstract: Identifying the source of epidemic-like spread in networks is crucial for tasks like removing internet viruses or finding the rumor source in online social networks. The challenge lies in tracing the source from a snapshot observation of infected nodes. How do we accurately pinpoint the source? Utilizing snapshot data, we apply a probabilistic approach, focusing on the graph boundary and the obser… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  19. Observation of spectral lines in the exceptional GRB 221009A

    Authors: Yan-Qiu Zhang, Shao-Lin Xiong, Ji-Rong Mao, Shuang-Nan Zhang, Wang-Chen Xue, Chao Zheng, Jia-Cong Liu, Zhen Zhang, Xi-Lu Wang, Ming-Yu Ge, Shu-Xu Yi, Li-Ming Song, Zheng-Hua An, Ce Cai, Xin-Qiao Li, Wen-Xi Peng, Wen-Jun Tan, Chen-Wei Wang, Xiang-Yang Wen, Yue Wang, Shuo Xiao, Fan Zhang, Peng Zhang, Shi-Jie Zheng

    Abstract: As the brightest gamma-ray burst ever observed, GRB 221009A provided a precious opportunity to explore spectral line features. In this paper, we performed a comprehensive spectroscopy analysis of GRB 221009A jointly with GECAM-C and Fermi/GBM data to search for emission and absorption lines. For the first time we investigated the line feature throughout this GRB including the most bright part wher… ▽ More

    Submitted 28 May, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: Accepted by SCIENCE CHINA Physics, Mechanics & Astronomy (SCPMA)

    Journal ref: Observation of spectral lines in the exceptional GRB 221009A. Sci. China-Phys. Mech. Astron. 67, 289511 (2024)

  20. New constraints on Triton's atmosphere from the 6 October 2022 stellar occultation

    Authors: Ye Yuan, Chen Zhang, Fan Li, Jian Chen, Yanning Fu, Chunhai Bai, Xing Gao, Yong Wang, Tuhong Zhong, Yixing Gao, Liang Wang, Donghua Chen, Yixing Zhang, Yang Zhang, Wenpeng Xie, Shupi Zhang, Ding Liu, Jun Cao, Xiangdong Yin, Xiaojun Mo, Jing Liu, Xinru Han, Tong Liu, Yuqiang Chen, Zhendong Gao , et al. (25 additional authors not shown)

    Abstract: The atmosphere of Triton was probed directly by observing a ground-based stellar occultation on 6 October 2022. This rare event yielded 23 positive light curves collected from 13 separate observation stations contributing to our campaign. The significance of this event lies in its potential to directly validate the modest pressure fluctuation on Triton, a phenomenon not definitively verified by pr… ▽ More

    Submitted 24 March, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: Astronomy & Astrophysics, in press. 9 pages, 2 figures, 3 tables

    Journal ref: A&A 684, L13 (2024)

  21. arXiv:2403.07312  [pdf, other

    cs.RO

    Multi-task Manipulation Policy Modeling with Visuomotor Latent Diffusion

    Authors: Wenhui Tan, Bei Liu, Junbo Zhang, Ruihua Song, Jianlong Fu

    Abstract: Modeling a generalized visuomotor policy has been a longstanding challenge for both computer vision and robotics communities. Existing approaches often fail to efficiently leverage cross-dataset resources or rely on heavy Vision-Language models, which require substantial computational resources, thereby limiting their multi-task performance and application potential. In this paper, we introduce a… ▽ More

    Submitted 1 April, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  22. arXiv:2403.06700  [pdf, other

    eess.IV

    Enhancing Adversarial Training with Prior Knowledge Distillation for Robust Image Compression

    Authors: Zhi Cao, Youneng Bao, Fanyang Meng, Chao Li, Wen Tan, Genhong Wang, Yongsheng Liang

    Abstract: Deep neural network-based image compression (NIC) has achieved excellent performance, but NIC method models have been shown to be susceptible to backdoor attacks. Adversarial training has been validated in image compression models as a common method to enhance model robustness. However, the improvement effect of adversarial training on model robustness is limited. In this paper, we propose a prior… ▽ More

    Submitted 15 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  23. arXiv:2403.06400  [pdf, other

    cs.CV

    DivCon: Divide and Conquer for Progressive Text-to-Image Generation

    Authors: Yuhao Jia, Wenhan Tan

    Abstract: Diffusion-driven text-to-image (T2I) generation has achieved remarkable advancements. To further improve T2I models' capability in numerical and spatial reasoning, the layout is employed as an intermedium to bridge large language models and layout-based diffusion models. However, these methods still struggle with generating images from textural prompts with multiple objects and complicated spatial… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

  24. arXiv:2403.03809  [pdf, other

    eess.SP

    Variational Bayesian Learning based Joint Localization and Channel Estimation with Distance-dependent Noise

    Authors: Yunfei Li, Yiting Luo, Weiqiang Tan, Chunguo Li, Shaodan Ma, Guanghua Yang

    Abstract: In the Industrial Internet of Things (IIoTs) and Ocean of Things (OoTs), the advent of massive intelligent services has imposed stringent requirements on both communication and localization, particularly emphasizing precise localization and channel information. This paper focuses on the challenge of jointly optimizing localization and communication in IoT networks. Departing from the conventional… ▽ More

    Submitted 6 March, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

  25. arXiv:2403.03186  [pdf, other

    cs.AI

    Cradle: Empowering Foundation Agents Towards General Computer Control

    Authors: Weihao Tan, Wentao Zhang, Xinrun Xu, Haochong Xia, Ziluo Ding, Boyu Li, Bohan Zhou, Junpeng Yue, Jiechuan Jiang, Yewen Li, Ruyi An, Molei Qin, Chuqiao Zong, Longtao Zheng, Yujie Wu, Xiaoqiang Chai, Yifei Bi, Tianbao Xie, Pengjie Gu, Xiyun Li, Ceyao Zhang, Long Tian, Chaojie Wang, Xinrun Wang, Börje F. Karlsson , et al. (3 additional authors not shown)

    Abstract: Despite the success in specific scenarios, existing foundation agents still struggle to generalize across various virtual scenarios, mainly due to the dramatically different encapsulations of environments with manually designed observation and action spaces. To handle this issue, we propose the General Computer Control (GCC) setting to restrict foundation agents to interact with software through t… ▽ More

    Submitted 2 July, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  26. arXiv:2402.13575  [pdf, other

    cs.CV cs.AI

    Flexible Physical Camouflage Generation Based on a Differential Approach

    Authors: Yang Li, Wenyi Tan, Chenxing Zhao, Shuangju Zhou, Xinkai Liang, Quan Pan

    Abstract: This study introduces a novel approach to neural rendering, specifically tailored for adversarial camouflage, within an extensive 3D rendering framework. Our method, named FPA, goes beyond traditional techniques by faithfully simulating lighting conditions and material variations, ensuring a nuanced and realistic representation of textures on a 3D target. To achieve this, we employ a generative ap… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  27. arXiv:2402.01172  [pdf, other

    cs.CL cs.SD eess.AS

    Streaming Sequence Transduction through Dynamic Compression

    Authors: Weiting Tan, Yunmo Chen, Tongfei Chen, Guanghui Qin, Haoran Xu, Heidi C. Zhang, Benjamin Van Durme, Philipp Koehn

    Abstract: We introduce STAR (Stream Transduction with Anchor Representations), a novel Transformer-based model designed for efficient sequence-to-sequence transduction over streams. STAR dynamically segments input streams to create compressed anchor representations, achieving nearly lossless compression (12x) in Automatic Speech Recognition (ASR) and outperforming existing methods. Moreover, STAR demonstrat… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  28. arXiv:2401.17542  [pdf, other

    cs.LG cs.AI cs.CV

    A Medical Data-Effective Learning Benchmark for Highly Efficient Pre-training of Foundation Models

    Authors: Wenxuan Yang, Weimin Tan, Yuqi Sun, Bo Yan

    Abstract: Foundation models, pre-trained on massive datasets, have achieved unprecedented generalizability. However, is it truly necessary to involve such vast amounts of data in pre-training, consuming extensive computational resources? This paper introduces data-effective learning, aiming to use data in the most impactful way to pre-train foundation models. This involves strategies that focus on data qual… ▽ More

    Submitted 15 April, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

  29. arXiv:2401.15814  [pdf, other

    cs.LG

    OntoMedRec: Logically-Pretrained Model-Agnostic Ontology Encoders for Medication Recommendation

    Authors: Weicong Tan, Weiqing Wang, Xin Zhou, Wray Buntine, Gordon Bingham, Hongzhi Yin

    Abstract: Most existing medication recommendation models learn representations for medical concepts based on electronic health records (EHRs) and make recommendations with learnt representations. However, most medications appear in the dataset for limited times, resulting in insufficient learning of their representations. Medical ontologies are the hierarchical classification systems for medical terms where… ▽ More

    Submitted 14 February, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

  30. arXiv:2401.14873  [pdf, ps, other

    hep-th

    Lessons from discrete light-cone quantization for physics at null infinity: Bosons in two dimensions

    Authors: Glenn Barnich, Sucheta Majumdar, Simone Speziale, Wen-Di Tan

    Abstract: Motivated by issues in the context of asymptotically flat spacetimes at null infinity, we discuss in the simplest example of a massless scalar field in two dimensions several subtleties that arise when setting up the canonical formulation on a single or on two intersecting null hyperplanes with a special emphasis on the infinite-dimensional global and conformal symmetries and their canonical gener… ▽ More

    Submitted 20 May, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: 52 pages, 3 figures, cosmetic changes

  31. arXiv:2401.14151  [pdf, other

    cs.LG cs.AI cs.CL

    True Knowledge Comes from Practice: Aligning LLMs with Embodied Environments via Reinforcement Learning

    Authors: Weihao Tan, Wentao Zhang, Shanqi Liu, Longtao Zheng, Xinrun Wang, Bo An

    Abstract: Despite the impressive performance across numerous tasks, large language models (LLMs) often fail in solving simple decision-making tasks due to the misalignment of the knowledge in LLMs with environments. On the contrary, reinforcement learning (RL) agents learn policies from scratch, which makes them always align with environments but difficult to incorporate prior knowledge for efficient explor… ▽ More

    Submitted 10 March, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: Accepted by ICLR2024

  32. arXiv:2401.13136  [pdf, other

    cs.CL cs.AI

    The Language Barrier: Dissecting Safety Challenges of LLMs in Multilingual Contexts

    Authors: Lingfeng Shen, Weiting Tan, Sihao Chen, Yunmo Chen, Jingyu Zhang, Haoran Xu, Boyuan Zheng, Philipp Koehn, Daniel Khashabi

    Abstract: As the influence of large language models (LLMs) spans across global communities, their safety challenges in multilingual settings become paramount for alignment research. This paper examines the variations in safety challenges faced by LLMs across different languages and discusses approaches to alleviating such concerns. By comparing how state-of-the-art LLMs respond to the same set of malicious… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

  33. arXiv:2401.11754  [pdf, ps, other

    astro-ph.HE

    Rotating massive strangeon stars and X-ray plateau of short GRBs

    Authors: Xi-Yan Yang, Xiao-Yu Lai, Wei-Wei Tan, Ren-Xin Xu

    Abstract: Strangeon stars, which are proposed to describe the nature of pulsar-like compact stars, have passed various observational tests. The maximum mass of a non-rotating strangeon star could be high, which implies that the remnants of binary strangeon star mergers could even be long-lived massive strangeon stars. We study rigidly rotating strangeon stars in the slowly rotating approximation, using the… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: Accepted by RAA

  34. arXiv:2401.08703  [pdf, other

    cs.LG

    Decoupled Prototype Learning for Reliable Test-Time Adaptation

    Authors: Guowei Wang, Changxing Ding, Wentao Tan, Mingkui Tan

    Abstract: Test-time adaptation (TTA) is a task that continually adapts a pre-trained source model to the target domain during inference. One popular approach involves fine-tuning model with cross-entropy loss according to estimated pseudo-labels. However, its performance is significantly affected by noisy pseudo-labels. This study reveals that minimizing the classification error of each sample causes the cr… ▽ More

    Submitted 25 January, 2024; v1 submitted 14 January, 2024; originally announced January 2024.

    Comments: 12 pages, 5 figures

  35. arXiv:2401.08417  [pdf, other

    cs.CL

    Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation

    Authors: Haoran Xu, Amr Sharaf, Yunmo Chen, Weiting Tan, Lingfeng Shen, Benjamin Van Durme, Kenton Murray, Young Jin Kim

    Abstract: Moderate-sized large language models (LLMs) -- those with 7B or 13B parameters -- exhibit promising machine translation (MT) performance. However, even the top-performing 13B LLM-based translation models, like ALMA, does not match the performance of state-of-the-art conventional encoder-decoder translation models or larger-scale LLMs such as GPT-4. In this study, we bridge this performance gap. We… ▽ More

    Submitted 2 June, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: Accepted at ICML 2024

  36. arXiv:2401.08216  [pdf, other

    cs.CR cs.LG

    Towards Efficient and Certified Recovery from Poisoning Attacks in Federated Learning

    Authors: Yu Jiang, Jiyuan Shen, Ziyao Liu, Chee Wei Tan, Kwok-Yan Lam

    Abstract: Federated learning (FL) is vulnerable to poisoning attacks, where malicious clients manipulate their updates to affect the global model. Although various methods exist for detecting those clients in FL, identifying malicious clients requires sufficient model updates, and hence by the time malicious clients are detected, FL models have been already poisoned. Thus, a method is needed to recover an a… ▽ More

    Submitted 19 January, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

  37. arXiv:2401.07395  [pdf, other

    cs.LG cs.AI

    Harnessing the Power of Beta Scoring in Deep Active Learning for Multi-Label Text Classification

    Authors: Wei Tan, Ngoc Dang Nguyen, Lan Du, Wray Buntine

    Abstract: Within the scope of natural language processing, the domain of multi-label text classification is uniquely challenging due to its expansive and uneven label distribution. The complexity deepens due to the demand for an extensive set of annotated data for training an advanced deep learning model, especially in specialized fields where the labeling task can be labor-intensive and often requires doma… ▽ More

    Submitted 14 January, 2024; originally announced January 2024.

    Comments: 7 pages AAAI 2024

  38. arXiv:2401.05264  [pdf

    q-fin.PM

    Comparison of Markowitz Model and Single-Index Model on Portfolio Selection of Malaysian Stocks

    Authors: Zhang Chern Lee, Wei Yun Tan, Hoong Khen Koo, Wilson Pang

    Abstract: Our article is focused on the application of Markowitz Portfolio Theory and the Single Index Model on 10-year historical monthly return data for 10 stocks included in FTSE Bursa Malaysia KLCI, which is also our market index, as well as a risk-free asset which is the monthly fixed deposit rate. We will calculate the minimum variance portfolio and maximum Sharpe portfolio for both the Markowitz mode… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: 19 pages, 5 figures

  39. arXiv:2312.16907  [pdf, other

    cs.CV

    DOEPatch: Dynamically Optimized Ensemble Model for Adversarial Patches Generation

    Authors: Wenyi Tan, Yang Li, Chenxing Zhao, Zhunga Liu, Quan Pan

    Abstract: Object detection is a fundamental task in various applications ranging from autonomous driving to intelligent security systems. However, recognition of a person can be hindered when their clothing is decorated with carefully designed graffiti patterns, leading to the failure of object detection. To achieve greater attack potential against unknown black-box models, adversarial patches capable of af… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

  40. arXiv:2312.13614  [pdf, other

    cs.LG cs.CL

    Structure-Aware Path Inference for Neural Finite State Transducers

    Authors: Weiting Tan, Chu-cheng Lin, Jason Eisner

    Abstract: Neural finite-state transducers (NFSTs) form an expressive family of neurosymbolic sequence transduction models. An NFST models each string pair as having been generated by a latent path in a finite-state transducer. As they are deep generative models, both training and inference of NFSTs require inference networks that approximate posterior distributions over such latent variables. In this paper,… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: In Proceedings of ICBINB Workshop at NeurIPS 2023

  41. arXiv:2312.10890  [pdf, other

    cs.CV cs.GR

    Low-latency Space-time Supersampling for Real-time Rendering

    Authors: Ruian He, Shili Zhou, Yuqi Sun, Ri Cheng, Weimin Tan, Bo Yan

    Abstract: With the rise of real-time rendering and the evolution of display devices, there is a growing demand for post-processing methods that offer high-resolution content in a high frame rate. Existing techniques often suffer from quality and latency issues due to the disjointed treatment of frame supersampling and extrapolation. In this paper, we recognize the shared context and mechanisms between frame… ▽ More

    Submitted 17 December, 2023; originally announced December 2023.

    Comments: Accepted to AAAI 2024

  42. Bayesian Estimate of Mean Proper Scores for Diversity-Enhanced Active Learning

    Authors: Wei Tan, Lan Du, Wray Buntine

    Abstract: The effectiveness of active learning largely depends on the sampling efficiency of the acquisition function. Expected Loss Reduction (ELR) focuses on a Bayesian estimate of the reduction in classification error, and more general costs fit in the same framework. We propose Bayesian Estimate of Mean Proper Scores (BEMPS) to estimate the increase in strictly proper scores such as log probability or n… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: 16 pages, TPAMI. arXiv admin note: text overlap with arXiv:2110.14171

    Journal ref: TPAMI, 2023

  43. arXiv:2312.07180  [pdf, other

    cs.CV

    Context-Aware Iteration Policy Network for Efficient Optical Flow Estimation

    Authors: Ri Cheng, Ruian He, Xuhao Jiang, Shili Zhou, Weimin Tan, Bo Yan

    Abstract: Existing recurrent optical flow estimation networks are computationally expensive since they use a fixed large number of iterations to update the flow field for each sample. An efficient network should skip iterations when the flow improvement is limited. In this paper, we develop a Context-Aware Iteration Policy Network for efficient optical flow estimation, which determines the optimal number of… ▽ More

    Submitted 5 January, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: 2024, Association for the Advancement of Artificial Intelligence

  44. arXiv:2312.03998  [pdf, other

    cs.LG

    Series2Vec: Similarity-based Self-supervised Representation Learning for Time Series Classification

    Authors: Navid Mohammadi Foumani, Chang Wei Tan, Geoffrey I. Webb, Hamid Rezatofighi, Mahsa Salehi

    Abstract: We argue that time series analysis is fundamentally different in nature to either vision or natural language processing with respect to the forms of meaningful self-supervised learning tasks that can be defined. Motivated by this insight, we introduce a novel approach called \textit{Series2Vec} for self-supervised representation learning. Unlike other self-supervised methods in time series, which… ▽ More

    Submitted 12 December, 2023; v1 submitted 6 December, 2023; originally announced December 2023.

  45. arXiv:2312.02425  [pdf, other

    cond-mat.mtrl-sci

    Universal Symmetry Constraints on Spin Polarization in Non-centrosymmetric Crystals

    Authors: Wei Tan, Jianfeng Wang, Yang Li, Bing Huang

    Abstract: The current understanding of spin-polarization phenomena in crystals relying on the crystalline symmetries is far from complete. Here, we develop a universal theory, consisting of five basic symmetry-constrained rules, to capture the diverse spin textures (STs) in non-centrosymmetric crystals, via exhaustively classifying the crystalline symmetry operations and their combinations with time-reversa… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

  46. Predicted $Ξ_b(6087)^0$ and further predictions

    Authors: Wei-Han Tan, Hui-Min Yang, Hua-Xing Chen

    Abstract: The methods of QCD sum rules and light-cone sum rules within the framework of heavy quark effective theory have been widely applied to study the singly heavy baryons, and especially, we have applied these methods to predict not only the mass and width of the $Ξ_b(6087)^0$ recently discovered by LHCb, but also its observation channel and its mass difference from the $Ξ_b(6100)^-$. We apply the same… ▽ More

    Submitted 20 March, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: 6 pages, 2 figures, 1 table, revised version to be published in EPJC

    Journal ref: Eur. Phys. J. C84 (2024) 382

  47. arXiv:2311.14708  [pdf, other

    cs.CY cs.AI cs.CL cs.HC

    Large Language Model-Driven Classroom Flipping: Empowering Student-Centric Peer Questioning with Flipped Interaction

    Authors: Chee Wei Tan

    Abstract: Reciprocal questioning is essential for effective teaching and learning, fostering active engagement and deeper understanding through collaborative interactions, especially in large classrooms. Can large language model (LLM), such as OpenAI's GPT (Generative Pre-trained Transformer) series, assist in this? This paper investigates a pedagogical approach of classroom flipping based on flipped intera… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: Submitted

  48. arXiv:2311.12315  [pdf, other

    cs.CL

    AcademicGPT: Empowering Academic Research

    Authors: Shufa Wei, Xiaolong Xu, Xianbiao Qi, Xi Yin, Jun Xia, Jingyi Ren, Peijun Tang, Yuxiang Zhong, Yihao Chen, Xiaoqin Ren, Yuxin Liang, Liankai Huang, Kai Xie, Weikang Gui, Wei Tan, Shuanglong Sun, Yongquan Hu, Qinxian Liu, Nanjin Li, Chihao Dai, Lihua Wang, Xiaohui Liu, Lei Zhang, Yutao Xie

    Abstract: Large Language Models (LLMs) have demonstrated exceptional capabilities across various natural language processing tasks. Yet, many of these advanced LLMs are tailored for broad, general-purpose applications. In this technical report, we introduce AcademicGPT, designed specifically to empower academic research. AcademicGPT is a continual training model derived from LLaMA2-70B. Our training corpus… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: Technical Report. arXiv admin note: text overlap with arXiv:2310.12081, arXiv:2310.10053 by other authors

  49. arXiv:2311.05707  [pdf, other

    cs.CV

    FMViT: A multiple-frequency mixing Vision Transformer

    Authors: Wei Tan, Yifeng Geng, Xuansong Xie

    Abstract: The transformer model has gained widespread adoption in computer vision tasks in recent times. However, due to the quadratic time and memory complexity of self-attention, which is proportional to the number of input tokens, most existing Vision Transformers (ViTs) encounter challenges in achieving efficient performance in practical industrial deployment scenarios, such as TensorRT and CoreML, wher… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

  50. arXiv:2311.04918  [pdf, other

    cs.CL cs.LG

    Low-Resource Named Entity Recognition: Can One-vs-All AUC Maximization Help?

    Authors: Ngoc Dang Nguyen, Wei Tan, Lan Du, Wray Buntine, Richard Beare, Changyou Chen

    Abstract: Named entity recognition (NER), a task that identifies and categorizes named entities such as persons or organizations from text, is traditionally framed as a multi-class classification problem. However, this approach often overlooks the issues of imbalanced label distributions, particularly in low-resource settings, which is common in certain NER contexts, like biomedical NER (bioNER). To address… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: 6 pages, 3 figures, ICDM 2023