Showing 51–100 of 753 results for author: Jin, H

  1. arXiv:2403.03414  [pdf, other]

    cs.LG q-bio.NC

    Leveraging The Finite States of Emotion Processing to Study Late-Life Mental Health

    Authors: Yuanzhe Huang, Saurab Faruque, Minjie Wu, Akiko Mizuno, Eduardo Diniz, Shaolin Yang, George Dewitt Stetten, Noah Schweitzer, Hecheng Jin, Linghai Wang, Howard J. Aizenstein

    Abstract: Traditional approaches in mental health research apply General Linear Models (GLM) to describe the longitudinal dynamics of observed psycho-behavioral measurements (questionnaire summary scores). Similarly, GLMs are also applied to characterize relationships between neurobiological measurements (regional fMRI signals) and perceptual stimuli or other regional signals. While these methods are useful…

    Submitted 5 March, 2024; originally announced March 2024.

  2. arXiv:2403.03004  [pdf, other]

    astro-ph.CO gr-qc hep-ph

    Ultralight vector dark matter search using data from the KAGRA O3GK run

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, H. Abe, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, et al. (1778 additional authors not shown)

    Abstract: Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we prese…

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 20 pages, 5 figures

    Report number: LIGO-P2300250

  3. arXiv:2403.02901  [pdf, other]

    cs.AI

    A Comprehensive Survey on Process-Oriented Automatic Text Summarization with Exploration of LLM-Based Methods

    Authors: Hanlei Jin, Yang Zhang, Dan Meng, Jun Wang, Jinghua Tan

    Abstract: Automatic Text Summarization (ATS), utilizing Natural Language Processing (NLP) algorithms, aims to create concise and accurate summaries, thereby significantly reducing the human effort required in processing large volumes of text. ATS has drawn considerable interest in both academic and industrial circles. Many studies have been conducted in the past to survey ATS methods; however, they generall…

    Submitted 5 March, 2024; originally announced March 2024.

  4. arXiv:2403.02742  [pdf, other]

    cs.CL

    Towards Training A Chinese Large Language Model for Anesthesiology

    Authors: Zhonghai Wang, Jie Jiang, Yibing Zhan, Bohao Zhou, Yanhong Li, Chong Zhang, Liang Ding, Hua Jin, Jun Peng, Xu Lin, Weifeng Liu

    Abstract: Medical large language models (LLMs) have gained popularity recently due to their significant practical utility. However, most existing research focuses on general medicine, and there is a need for in-depth study of LLMs in specific fields like anesthesiology. To fill the gap, we introduce Hypnos, a Chinese Anesthesia model built upon existing LLMs, e.g., Llama. Hypnos' contributions have three as…

    Submitted 5 March, 2024; originally announced March 2024.

  5. arXiv:2403.01479  [pdf, other]

    cs.CL cs.AI

    Align-to-Distill: Trainable Attention Alignment for Knowledge Distillation in Neural Machine Translation

    Authors: Heegon Jin, Seonil Son, Jemin Park, Youngseok Kim, Hyungjong Noh, Yeonsoo Lee

    Abstract: The advent of scalable deep models and large datasets has improved the performance of Neural Machine Translation. Knowledge Distillation (KD) enhances efficiency by transferring knowledge from a teacher model to a more compact student model. However, KD approaches to Transformer architecture often rely on heuristics, particularly when deciding which teacher layers to distill from. In this paper, w…

    Submitted 25 March, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

    Comments: Accepted to LREC-COLING 2024

    MSC Class: 68T50 ACM Class: I.2.7

  6. arXiv:2403.01415  [pdf]

    cond-mat.mtrl-sci

    Phonon-pair-driven Ferroelectricity Causes Costless Domain-walls and Bulk-boundary Duality

    Authors: Hyun-Jae Lee, Kyoung-June Go, Pawan Kumar, Chang Hoon Kim, Yungyeom Kim, Kyoungjun Lee, Takao Shimizu, Seung Chul Chae, Hosub Jin, Minseong Lee, Umesh Waghmare, Si-Young Choi, Jun Hee Lee

    Abstract: Ferroelectric domain walls, recognized as distinct from the bulk in terms of symmetry, structure, and electronic properties, host exotic phenomena including conductive walls, ferroelectric vortices, novel topologies, and negative capacitance. Contrary to conventional understanding, our study reveals that the structure of domain walls in HfO2 closely resembles its bulk. First, our first-principles…

    Submitted 3 March, 2024; originally announced March 2024.

    Comments: 24 pages, 4 figures

  7. arXiv:2402.14360  [pdf, other]

    math.SG

    Orbifold Kodaira-Spencer maps and closed-string mirror symmetry for punctured Riemann surfaces

    Authors: Hansol Hong, Hyeongjun Jin, Sangwook Lee

    Abstract: When a Weinstein manifold admits an action of a finite abelian group, we propose its mirror construction following the equivariant TQFT-type construction, and obtain as a mirror the orbifolding of the mirror of the quotient with respect to the induced dual group action. As an application, we construct an orbifold Landau-Ginzburg mirror of a punctured Riemann surface given as an abelian cover of th…

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: 42 pages and 11 figures

    MSC Class: 53D37; 53D40; 55N32

  8. arXiv:2402.12977  [pdf, other]

    astro-ph.SR astro-ph.HE

    A sequence of Type Ib, IIb, II-L, and II-P supernovae from binary-star progenitors of varying initial separation

    Authors: Luc Dessart, Claudia P. Gutierrez, Andrea Ercolino, Harim Jin, Norbert Langer

    Abstract: Over the last decade, evidence has accumulated that massive stars do not typically evolve in isolation but instead follow a tumultuous journey with a companion star on their way to core collapse. While Roche-lobe overflow appears instrumental for the production of a large fraction of supernovae (SNe) of Type Ib and Ic, variations in the initial orbital period Pinit of massive interacting binaries…

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: Submitted to A&A on Dec 22nd, 2023

    Journal ref: A&A 685, A169 (2024)

  9. arXiv:2402.12721  [pdf, other]

    cs.CV cs.AI

    PAC-FNO: Parallel-Structured All-Component Fourier Neural Operators for Recognizing Low-Quality Images

    Authors: Jinsung Jeon, Hyundong Jin, Jonghyun Choi, Sanghyun Hong, Dongeun Lee, Kookjin Lee, Noseong Park

    Abstract: A standard practice in developing image recognition models is to train a model on a specific image resolution and then deploy it. However, in real-world inference, models often encounter images different from the training sets in resolution and/or subject to natural variations such as weather changes, noise types and compression artifacts. While traditional solutions involve training multiple mode…

    Submitted 14 March, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: Accepted at ICLR 2024

  10. arXiv:2402.05350  [pdf, other]

    cs.CV eess.IV

    Descanning: From Scanned to the Original Images with a Color Correction Diffusion Model

    Authors: Junghun Cha, Ali Haider, Seoyun Yang, Hoeyeong Jin, Subin Yang, A. F. M. Shahab Uddin, Jaehyoung Kim, Soo Ye Kim, Sung-Ho Bae

    Abstract: A significant volume of analog information, i.e., documents and images, have been digitized in the form of scanned copies for storing, sharing, and/or analyzing in the digital world. However, the quality of such contents is severely degraded by various distortions caused by printing, storing, and scanning processes in the physical world. Although restoring high-quality content from scanned copies…

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: Accepted to AAAI 2024

  11. arXiv:2402.05292  [pdf, other]

    cond-mat.mes-hall

    Hot Carriers from Intra- and Interband Transitions in Gold-Silver Alloy Nanoparticles

    Authors: Shreyas Ramachandran, Simao Joao, Hanwen Jin, Johannes Lischner

    Abstract: Hot electrons and holes generated from the decay of localized surface plasmons in metallic nanoparticles can be harnessed for applications in solar energy conversion and sensing. In this paper, we study the generation of hot carriers in large spherical gold-silver alloy nanoparticles using a recently developed atomistic modelling approach that combines a solution of Maxwell's equations with large-…

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: 19 pages, 5 figures

  12. arXiv:2402.03299  [pdf, other]

    cs.LG cs.CL cs.CV

    GUARD: Role-playing to Generate Natural-language Jailbreakings to Test Guideline Adherence of Large Language Models

    Authors: Haibo Jin, Ruoxi Chen, Andy Zhou, Yang Zhang, Haohan Wang

    Abstract: The discovery of "jailbreaks" to bypass safety filters of Large Language Models (LLMs) and harmful responses have encouraged the community to implement safety measures. One major safety measure is to proactively test the LLMs with jailbreaks prior to the release. Therefore, such testing will require a method that can generate jailbreaks massively and efficiently. In this paper, we follow a novel y…

    Submitted 30 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: 28 pages

  13. KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache

    Authors: Zirui Liu, Jiayi Yuan, Hongye Jin, Shaochen Zhong, Zhaozhuo Xu, Vladimir Braverman, Beidi Chen, Xia Hu

    Abstract: Efficiently serving large language models (LLMs) requires batching many requests together to reduce the cost per request. Yet, the key-value (KV) cache, which stores attention keys and values to avoid re-computations, significantly increases memory demands and becomes the new bottleneck in speed and memory usage. This memory demand increases with larger batch sizes and longer context lengths. Addi…

    Submitted 5 February, 2024; originally announced February 2024.

  14. arXiv:2402.00320  [pdf]

    eess.IV

    DARCS: Memory-Efficient Deep Compressed Sensing Reconstruction for Acceleration of 3D Whole-Heart Coronary MR Angiography

    Authors: Zhihao Xue, Fan Yang, Juan Gao, Zhuo Chen, Hao Peng, Chao Zou, Hang Jin, Chenxi Hu

    Abstract: Three-dimensional coronary magnetic resonance angiography (CMRA) demands reconstruction algorithms that can significantly suppress the artifacts from a heavily undersampled acquisition. While unrolling-based deep reconstruction methods have achieved state-of-the-art performance on 2D image reconstruction, their application to 3D reconstruction is hindered by the large amount of memory needed to tr…

    Submitted 2 February, 2024; v1 submitted 31 January, 2024; originally announced February 2024.

    Comments: 10 pages, 8 figures

  15. arXiv:2401.17855  [pdf, other]

    stat.AP cs.HC cs.IR

    Network-based Topic Structure Visualization

    Authors: Yeseul Jeon, Jina Park, Ick Hoon Jin, Dongjun Chung

    Abstract: In the real world, many topics are inter-correlated, making it challenging to investigate their structure and relationships. Understanding the interplay between topics and their relevance can provide valuable insights for researchers, guiding their studies and informing the direction of research. In this paper, we utilize the topic-words distribution, obtained from topic models, as item-response d…

    Submitted 31 January, 2024; originally announced January 2024.

  16. arXiv:2401.13329  [pdf, other]

    cs.CV

    Generative Video Diffusion for Unseen Cross-Domain Video Moment Retrieval

    Authors: Dezhao Luo, Shaogang Gong, Jiabo Huang, Hailin Jin, Yang Liu

    Abstract: Video Moment Retrieval (VMR) requires precise modelling of fine-grained moment-text associations to capture intricate visual-language relationships. Due to the lack of a diverse and generalisable VMR dataset to facilitate learning scalable moment-text associations, existing methods resort to joint training on both source and target domain videos for cross-domain applications. Meanwhile, recent dev…

    Submitted 29 January, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

  17. arXiv:2401.12918  [pdf, other]

    astro-ph.SR astro-ph.GA

    Boron Abundances in Early B Dwarfs of the Galactic Open Cluster NGC 3293

    Authors: Charles R. Proffitt, Harim Jin, Simone Daflon, Daniel J. Lennon, Norbert Langer, Katia Cunha, Talawanda Monroe

    Abstract: New boron abundances or upper limits have been determined for 8 early-B stars in the young Galactic open cluster NGC 3293, using ultraviolet spectra obtained by the Hubble Space Telescope Cosmic Origins Spectrograph. With previous observations, there are now 18 early-B stars in this cluster with boron measurements. Six of the newly observed stars have projected rotational velocities greater than 2…

    Submitted 2 May, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Comments: 18 pages, 7 figures, Submitted to AAS Journals

  18. arXiv:2401.11178  [pdf]

    physics.app-ph

    Large Transverse Thermopower in Shape-Engineered Tilted Leg Thermopile

    Authors: Ki Mun Bang, Sang J. Park, Hyun Yu, Hyungyu Jin

    Abstract: We demonstrate that a novel device design, where a shape-engineered tilted-leg thermopile structure is employed, significantly enhances the output voltage in the transverse direction. Owing to the shape engineering of the leg geometry, an additional temperature gradient develops along the long direction of the leg, which is perpendicular to the direction of the applied temperature gradient, thereb…

    Submitted 20 January, 2024; originally announced January 2024.

  19. arXiv:2401.11158  [pdf, ps, other]

    q-fin.PR

    Data-driven Option Pricing

    Authors: Min Dai, Hanqing Jin, Xi Yang

    Abstract: We propose an innovative data-driven option pricing methodology that relies exclusively on the dataset of historical underlying asset prices. While the dataset is rooted in the objective world, option prices are commonly expressed as discounted expectations of their terminal payoffs in a risk-neutral world. Bridging this gap motivates us to identify a pricing kernel process, transforming option pr…

    Submitted 20 January, 2024; originally announced January 2024.

    Comments: 15 pages, 3 figures

  20. arXiv:2401.11089  [pdf, other]

    cs.CR cs.AI cs.DC cs.IR

    FedRKG: A Privacy-preserving Federated Recommendation Framework via Knowledge Graph Enhancement

    Authors: Dezhong Yao, Tongtong Liu, Qi Cao, Hai Jin

    Abstract: Federated Learning (FL) has emerged as a promising approach for preserving data privacy in recommendation systems by training models locally. Recently, Graph Neural Networks (GNN) have gained popularity in recommendation tasks due to their ability to capture high-order interactions between users and items. However, privacy concerns prevent the global sharing of the entire user-item graph. To addre…

    Submitted 19 January, 2024; originally announced January 2024.

  21. arXiv:2401.09767  [pdf, other]

    cs.CR cs.SE

    On the Effectiveness of Function-Level Vulnerability Detectors for Inter-Procedural Vulnerabilities

    Authors: Zhen Li, Ning Wang, Deqing Zou, Yating Li, Ruqian Zhang, Shouhuai Xu, Chao Zhang, Hai Jin

    Abstract: Software vulnerabilities are a major cyber threat and it is important to detect them. One important approach to detecting vulnerabilities is to use deep learning while treating a program function as a whole, known as function-level vulnerability detectors. However, the limitation of this approach is not understood. In this paper, we investigate its limitation in detecting one class of vulnerabilit…

    Submitted 20 January, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: 12 pages, 7 figures. To appear in the Proceedings of the 46th International Conference on Software Engineering (ICSE'24)

  22. arXiv:2401.08160  [pdf, ps, other]

    cond-mat.str-el

    Non-Fermi-liquid behavior in a ferromagnetic heavy fermion system CeTi$_{1-x}$V$_{x}$Ge$_{3}$

    Authors: R. -Z. Lin, H. Jin, P. Klavins, W. -T. Chen, Y. -Y. Chang, C. -H. Chung, V. Taufour, C. -L. Huang

    Abstract: An investigation of the thermodynamic and electrical transport properties of the isoelectronic chemical substitution series CeTi$_{1-x}$V$_{x}$Ge$_{3}$ (CTVG) single crystals is reported. As x increases, the ferromagnetic (FM) transition temperature is suppressed, reaching absolute zero at the critical concentration x = 0.4, where a non-Fermi-liquid low-temperature specific heat and electrical res…

    Submitted 16 January, 2024; originally announced January 2024.

  23. arXiv:2401.08045  [pdf, other]

    cs.CV

    Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities

    Authors: Xu Yan, Haiming Zhang, Yingjie Cai, Jingming Guo, Weichao Qiu, Bin Gao, Kaiqiang Zhou, Yue Zhao, Huan Jin, Jiantao Gao, Zhen Li, Lihui Jiang, Wei Zhang, Hongbo Zhang, Dengxin Dai, Bingbing Liu

    Abstract: The rise of large foundation models, trained on extensive datasets, is revolutionizing the field of AI. Models such as SAM, DALL-E2, and GPT-4 showcase their adaptability by extracting intricate patterns and performing effectively across diverse tasks, thereby serving as potent building blocks for a wide range of AI applications. Autonomous driving, a vibrant front in AI applications, remains chal…

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: Github Repo: https://github.com/zhanghm1995/Forge_VFM4AD

  24. arXiv:2401.01325  [pdf, other]

    cs.CL cs.AI cs.LG

    LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

    Authors: Hongye Jin, Xiaotian Han, Jingfeng Yang, Zhimeng Jiang, Zirui Liu, Chia-Yuan Chang, Huiyuan Chen, Xia Hu

    Abstract: It is well known that LLMs cannot generalize well to long contexts whose lengths are larger than the training sequence length. This poses challenges when employing LLMs for processing long input sequences during inference. In this work, we argue that LLMs themselves have inherent capabilities to handle long contexts without fine-tuning. To achieve this goal, we propose SelfExtend to extend the con…

    Submitted 3 February, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

  25. arXiv:2401.00288  [pdf, other]

    cs.SE cs.AI

    Deep Learning for Code Intelligence: Survey, Benchmark and Toolkit

    Authors: Yao Wan, Yang He, Zhangqian Bi, Jianguo Zhang, Hongyu Zhang, Yulei Sui, Guandong Xu, Hai Jin, Philip S. Yu

    Abstract: Code intelligence leverages machine learning techniques to extract knowledge from extensive code corpora, with the aim of developing intelligent tools to improve the quality and productivity of computer programming. Currently, there is already a thriving research community focusing on code intelligence, with efforts ranging from software engineering, machine learning, data mining, natural language…

    Submitted 30 December, 2023; originally announced January 2024.

  26. Towards Mitigating Dimensional Collapse of Representations in Collaborative Filtering

    Authors: Huiyuan Chen, Vivian Lai, Hongye Jin, Zhimeng Jiang, Mahashweta Das, Xia Hu

    Abstract: Contrastive Learning (CL) has shown promising performance in collaborative filtering. The key idea is to generate augmentation-invariant embeddings by maximizing the Mutual Information between different augmented views of the same instance. However, we empirically observe that existing CL models suffer from the dimensional collapse issue, where user/item embeddings only span a low-dimensi…

    Submitted 28 December, 2023; originally announced December 2023.

  27. arXiv:2312.15815  [pdf, other]

    cs.CL

    Compositional Generalization in Spoken Language Understanding

    Authors: Avik Ray, Yilin Shen, Hongxia Jin

    Abstract: State-of-the-art spoken language understanding (SLU) models have shown tremendous success in benchmark SLU datasets, yet they still fail in many practical scenarios due to the lack of model compositionality when trained on limited training data. In this paper, we study two types of compositionality: (a) novel slot combination, and (b) length generalization. We first conduct in-depth analysis, and f…

    Submitted 25 December, 2023; originally announced December 2023.

    Comments: Published in INTERSPEECH 2023

    Journal ref: Proceedings of 24th INTERSPEECH Conference (INTERSPEECH 2023), Dublin, Ireland

  28. arXiv:2312.15234  [pdf, other]

    cs.LG cs.AI cs.DC cs.PF

    Towards Efficient Generative Large Language Model Serving: A Survey from Algorithms to Systems

    Authors: Xupeng Miao, Gabriele Oliaro, Zhihao Zhang, Xinhao Cheng, Hongyi Jin, Tianqi Chen, Zhihao Jia

    Abstract: In the rapidly evolving landscape of artificial intelligence (AI), generative large language models (LLMs) stand at the forefront, revolutionizing how we interact with our data. However, the computational intensity and memory consumption of deploying these models present substantial challenges in terms of serving efficiency, particularly in scenarios demanding low latency and high throughput. This…

    Submitted 23 December, 2023; originally announced December 2023.

  29. arXiv:2312.11026  [pdf, other]

    cs.LG cs.CR cs.DC

    MISA: Unveiling the Vulnerabilities in Split Federated Learning

    Authors: Wei Wan, Yuxuan Ning, Shengshan Hu, Lulu Xue, Minghui Li, Leo Yu Zhang, Hai Jin

    Abstract: Federated learning (FL) and split learning (SL) are prevailing distributed paradigms in recent years. They both enable shared global model training while keeping data localized on users' devices. The former excels in parallel execution capabilities, while the latter enjoys low dependence on edge computing resources and strong privacy protection. Split federated learning…

    Submitted 19 December, 2023; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: This paper has been accepted by the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024)

  30. arXiv:2312.10274  [pdf, other]

    cs.LG cs.AI

    Operator-learning-inspired Modeling of Neural Ordinary Differential Equations

    Authors: Woojin Cho, Seunghyeon Cho, Hyundong Jin, Jinsung Jeon, Kookjin Lee, Sanghyun Hong, Dongeun Lee, Jonghyun Choi, Noseong Park

    Abstract: Neural ordinary differential equations (NODEs), one of the most influential works of the differential equation-based deep learning, are to continuously generalize residual networks and opened a new field. They are currently utilized for various downstream tasks, e.g., image classification, time series classification, image generation, etc. Its key part is how to model the time-derivative of the hi…

    Submitted 15 December, 2023; originally announced December 2023.

  31. arXiv:2312.06683  [pdf, other]

    cs.IR

    AT4CTR: Auxiliary Match Tasks for Enhancing Click-Through Rate Prediction

    Authors: Qi Liu, Xuyang Hou, Defu Lian, Zhe Wang, Haoran Jin, Jia Cheng, Jun Lei

    Abstract: Click-through rate (CTR) prediction is a vital task in industrial recommendation systems. Most existing methods focus on the network architecture design of the CTR model for better accuracy and suffer from the data sparsity problem. Especially in industrial recommendation systems, the widely applied negative sample down-sampling technique due to resource limitation worsens the problem, resulting i…

    Submitted 18 December, 2023; v1 submitted 9 December, 2023; originally announced December 2023.

  32. arXiv:2312.05772  [pdf, other]

    cs.SE

    A^3-CodGen: A Repository-Level Code Generation Framework for Code Reuse with Local-Aware, Global-Aware, and Third-Party-Library-Aware

    Authors: Dianshu Liao, Shidong Pan, Xiaoyu Sun, Xiaoxue Ren, Qing Huang, Zhenchang Xing, Huan Jin, Qinying Li

    Abstract: Code generation tools are essential to help developers in the software development process. Existing tools often disconnect with the working context, i.e., the code repository, causing the generated code to be not similar to human developers. In this paper, we propose a novel code generation framework, dubbed A^3-CodGen, to harness information within the code repository to generate code with fewer…

    Submitted 5 March, 2024; v1 submitted 10 December, 2023; originally announced December 2023.

  33. arXiv:2312.03378  [pdf, other]

    cs.CV

    Riemannian Complex Matrix Convolution Network for PolSAR Image Classification

    Authors: Junfei Shi, Wei Wang, Haiyan Jin, Mengmeng Nie, Shanshan Ji

    Abstract: Recently, deep learning methods have achieved superior performance for Polarimetric Synthetic Aperture Radar (PolSAR) image classification. Existing deep learning methods learn PolSAR data by converting the covariance matrix into a feature vector or complex-valued vector as the input. However, all these methods cannot learn the structure of complex matrix directly and destroy the channel correlatio…

    Submitted 6 December, 2023; originally announced December 2023.

  34. arXiv:2312.02191  [pdf, other]

    cs.CV cs.AI

    Prompt Tuning for Zero-shot Compositional Learning

    Authors: Lingyu Zhang, Ting Hua, Yilin Shen, Hongxia Jin

    Abstract: Open World Compositional Zero-Shot Learning (OW-CZSL) is known to be an extremely challenging task, which aims to recognize unseen compositions formed from seen attributes and objects without any prior assumption of the output space. In order to achieve this goal, a model has to be "smart" and "knowledgeable". To be smart, a model should be good at reasoning the interactions between attributes and…

    Submitted 2 December, 2023; originally announced December 2023.

  35. arXiv:2312.01026  [pdf, other]

    cs.CV

    Token Fusion: Bridging the Gap between Token Pruning and Token Merging

    Authors: Minchul Kim, Shangqian Gao, Yen-Chang Hsu, Yilin Shen, Hongxia Jin

    Abstract: Vision Transformers (ViTs) have emerged as powerful backbones in computer vision, outperforming many traditional CNNs. However, their computational overhead, largely attributed to the self-attention mechanism, makes deployment on resource-constrained edge devices challenging. Multiple solutions rely on token pruning or token merging. In this paper, we introduce "Token Fusion" (ToFu), a method that…

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: To appear in WACV 2024

  36. arXiv:2311.18763  [pdf, other]

    cs.CV cs.AI cs.LG

    Continual Diffusion with STAMINA: STack-And-Mask INcremental Adapters

    Authors: James Seale Smith, Yen-Chang Hsu, Zsolt Kira, Yilin Shen, Hongxia Jin

    Abstract: Recent work has demonstrated a remarkable ability to customize text-to-image diffusion models to multiple, fine-grained concepts in a sequential (i.e., continual) manner while only providing a few example images for each concept. This setting is known as continual diffusion. Here, we ask the question: Can we scale these methods to longer concept sequences without forgetting? Although prior work mi…

    Submitted 2 May, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: CVPR-W 2024

  37. arXiv:2311.13812  [pdf, other]

    cond-mat.mtrl-sci cs.AI

    Mechanical Characterization and Inverse Design of Stochastic Architected Metamaterials Using Neural Operators

    Authors: Hanxun Jin, Enrui Zhang, Boyu Zhang, Sridhar Krishnaswamy, George Em Karniadakis, Horacio D. Espinosa

    Abstract: Machine learning (ML) is emerging as a transformative tool for the design of architected materials, offering properties that far surpass those achievable through lab-based trial-and-error methods. However, a major challenge in current inverse design strategies is their reliance on extensive computational and/or experimental datasets, which becomes particularly problematic for designing micro-scale…

    Submitted 10 December, 2023; v1 submitted 23 November, 2023; originally announced November 2023.

    Comments: 29 pages, 5 figures

  38. arXiv:2311.13761  [pdf, other]

    cs.HC

    On the Feasibility of Reasoning about the Internal States of Blackbox IoT Devices Using Side-Channel Information

    Authors: Wei Sun, Yuwei Xiao, Haojian Jin, Dinesh Bharadia

    Abstract: Internet of Things (IoT) devices are typically designed to function in a secure, closed environment, making it difficult for users to comprehend devices' behaviors. This paper shows that a user can leverage side-channel information to reason fine-grained internal states of black box IoT devices. The key enablers for our design are a multi-model sensing technique that fuses power consumption, netwo…

    Submitted 22 November, 2023; originally announced November 2023.

    ACM Class: H.5.2

  39. arXiv:2311.12066  [pdf, other]

    cs.CR

    EditShield: Protecting Unauthorized Image Editing by Instruction-guided Diffusion Models

    Authors: Ruoxi Chen, Haibo Jin, Jinyin Chen, Lichao Sun

    Abstract: Text-to-image diffusion models have emerged as an evolutionary for producing creative content in image synthesis. Based on the impressive generation abilities of these models, instruction-guided diffusion models can edit images with simple instructions and input images. While they empower users to obtain their desired edited images with ease, they have raised concerns about unauthorized image mani…

    Submitted 19 November, 2023; originally announced November 2023.

  40. arXiv:2311.10764  [pdf, other]

    cs.IR cs.AI

    Deep Group Interest Modeling of Full Lifelong User Behaviors for CTR Prediction

    Authors: Qi Liu, Xuyang Hou, Haoran Jin, Jin Chen, Zhe Wang, Defu Lian, Tan Qu, Jia Cheng, Jun Lei

    Abstract: Extracting users' interests from their lifelong behavior sequence is crucial for predicting Click-Through Rate (CTR). Most current methods employ a two-stage process for efficiency: they first select historical behaviors related to the candidate item and then deduce the user's interest from this narrowed-down behavior sub-sequence. This two-stage paradigm, though effective, leads to information lo…

    Submitted 15 November, 2023; originally announced November 2023.

  41. arXiv:2311.09189  [pdf, other]

    cs.CL

    PsyEval: A Suite of Mental Health Related Tasks for Evaluating Large Language Models

    Authors: Haoan Jin, Siyuan Chen, Dilawaier Dilixiati, Yewei Jiang, Mengyue Wu, Kenny Q. Zhu

    Abstract: Evaluating Large Language Models (LLMs) in the mental health domain poses distinct challenges from other domains, given the subtle and highly subjective nature of symptoms that exhibit significant variability among individuals. This paper presents PsyEval, the first comprehensive suite of mental health-related tasks for evaluating LLMs. PsyEval encompasses five sub-tasks that evaluate three critic…

    Submitted 3 June, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  42. arXiv:2311.07553  [pdf, other

    cs.CR cs.AI

    An Extensive Study on Adversarial Attack against Pre-trained Models of Code

    Authors: Xiaohu Du, Ming Wen, Zichao Wei, Shangwen Wang, Hai Jin

    Abstract: Transformer-based pre-trained models of code (PTMC) have been widely utilized and have achieved state-of-the-art performance in many mission-critical applications. However, they can be vulnerable to adversarial attacks through identifier substitution or coding style transformation, which can significantly degrade accuracy and may further incur security concerns. Although several approaches have be… ▽ More

    Submitted 23 November, 2023; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: Accepted to ESEC/FSE 2023

  43. arXiv:2311.06734  [pdf, other

    physics.app-ph cond-mat.soft

    Mechanical Metamaterials Fabricated from Self-assembly: A Perspective

    Authors: Hanxun Jin, Horacio D. Espinosa

    Abstract: Mechanical metamaterials, whose unique mechanical properties stem from their structural design rather than material constituents, are gaining popularity in engineering applications. In particular, recent advances in self-assembly techniques offer the potential to fabricate load-bearing mechanical metamaterials with unparalleled feature size control and scalability compared to those produced by add… ▽ More

    Submitted 11 November, 2023; originally announced November 2023.

    Comments: 24 pages, 3 figures

    Journal ref: J. Appl. Mech. 2023, 1-25

  44. arXiv:2311.02639  [pdf, other

    cond-mat.str-el

    A variational Monte Carlo approach to the SU(4) spin-orbital model on the triangular lattice

    Authors: Chun Zhang, Hui-Ke Jin, Yi Zhou

    Abstract: Previous investigations have suggested that the simplest spin-orbital model on the simplest frustrated lattice can host a nematic quantum spin-orbital liquid state. Namely, the orbital degeneracy of the SU(4) Kugel-Khomskii (KK) model tends to enhance quantum fluctuations and stabilize a quantum spin-orbital liquid exhibiting stripy features on the triangular lattice, as revealed by the state-of-t… ▽ More

    Submitted 7 November, 2023; v1 submitted 5 November, 2023; originally announced November 2023.

    Comments: Typos in Eqs. are corrected

  45. arXiv:2311.02103  [pdf, other

    cs.LG cs.AI cs.PL

    Relax: Composable Abstractions for End-to-End Dynamic Machine Learning

    Authors: Ruihang Lai, Junru Shao, Siyuan Feng, Steven S. Lyubomirsky, Bohan Hou, Wuwei Lin, Zihao Ye, Hongyi Jin, Yuchen Jin, Jiawei Liu, Lesheng Jin, Yaxing Cai, Ziheng Jiang, Yong Wu, Sunghyun Park, Prakalp Srivastava, Jared G. Roesch, Todd C. Mowry, Tianqi Chen

    Abstract: Dynamic shape computations have become critical in modern machine learning workloads, especially in emerging large language models. The success of these models has driven demand for deploying them to a diverse set of backend environments. In this paper, we present Relax, a compiler abstraction for optimizing end-to-end dynamic machine learning workloads. Relax introduces first-class symbolic shape… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

  46. arXiv:2311.01266  [pdf, other

    cs.SE

    Let's Discover More API Relations: A Large Language Model-based AI Chain for Unsupervised API Relation Inference

    Authors: Qing Huang, Yanbang Sun, Zhenchang Xing, Yuanlong Cao, Jieshan Chen, Xiwei Xu, Huan Jin, Jiaxing Lu

    Abstract: APIs have intricate relations that can be described in text and represented as knowledge graphs to aid software engineering tasks. Existing relation extraction methods have limitations, such as a limited API text corpus and sensitivity to the characteristics of the input text. To address these limitations, we propose utilizing large language models (LLMs) (e.g., GPT-3.5) as a neural knowledge base for A… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

  47. arXiv:2310.18698  [pdf, other

    cs.CV cs.LG

    Triplet Attention Transformer for Spatiotemporal Predictive Learning

    Authors: Xuesong Nie, Xi Chen, Haoyuan Jin, Zhihang Zhu, Yunfeng Yan, Donglian Qi

    Abstract: Spatiotemporal predictive learning offers a self-supervised learning paradigm that enables models to learn both spatial and temporal patterns by predicting future sequences based on historical sequences. Mainstream methods are dominated by recurrent units, yet they are limited by their lack of parallelization and often underperform in real-world scenarios. To improve prediction quality while maint… ▽ More

    Submitted 28 October, 2023; originally announced October 2023.

    Comments: Accepted to WACV 2024

  48. Sky location of Galactic white dwarf binaries in space-based gravitational wave detection

    Authors: Pan Guo, Hong-Bo Jin, Cong-Feng Qiao, Yue-Liang Wu

    Abstract: Quickly localizing identified white dwarf (WD) binaries is a basic requirement for space-based gravitational wave (GW) detection. In fact, the amplitudes of GW signals are modulated by the periodic motion of GW detectors on the solar orbit. The intensity of the observed signals is enhanced with observation time beyond a year to achieve a high signal-to-noise ratio (SNR). As da… ▽ More

    Submitted 28 March, 2024; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: 22 pages, 15 figures

    Journal ref: Results in Physics, 2024

  49. arXiv:2310.15019  [pdf, other

    cs.LG cs.AI cs.CL

    Meta learning with language models: Challenges and opportunities in the classification of imbalanced text

    Authors: Apostol Vassilev, Honglan Jin, Munawar Hasan

    Abstract: Detecting out of policy speech (OOPS) content is important but difficult. While machine learning is a powerful tool to tackle this challenging task, it is hard to break the performance ceiling due to factors like quantity and quality limitations on training data and inconsistencies in OOPS definition and data labeling. To realize the full potential of available limited resources, we propose a meta… ▽ More

    Submitted 24 October, 2023; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: 22 pages, including 5 figures, 12 tables, 1 appendix

  50. arXiv:2310.11664  [pdf, other

    cs.LG cs.AI

    Hetero$^2$Net: Heterophily-aware Representation Learning on Heterogenerous Graphs

    Authors: Jintang Li, Zheng Wei, Jiawang Dan, Jing Zhou, Yuchang Zhu, Ruofan Wu, Baokun Wang, Zhang Zhen, Changhua Meng, Hong Jin, Zibin Zheng, Liang Chen

    Abstract: Real-world graphs are typically complex, exhibiting heterogeneity in the global structure, as well as strong heterophily within local neighborhoods. While a growing body of literature has revealed the limitations of common graph neural networks (GNNs) in handling homogeneous graphs with heterophily, little work has been conducted on investigating the heterophily properties in the context of hetero… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: Preprint