Skip to main content

Showing 1–50 of 1,718 results for author: Lin, L

  1. arXiv:2407.10625  [pdf, other

    cs.CV

    WildVidFit: Video Virtual Try-On in the Wild via Image-Based Controlled Diffusion Models

    Authors: Zijian He, Peixin Chen, Guangrun Wang, Guanbin Li, Philip H. S. Torr, Liang Lin

    Abstract: Video virtual try-on aims to generate realistic sequences that maintain garment identity and adapt to a person's pose and body shape in source videos. Traditional image-based methods, relying on warping and blending, struggle with complex human movements and occlusions, limiting their effectiveness in video try-on applications. Moreover, video-based models require extensive, high-quality data and… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

  2. arXiv:2407.09652  [pdf, other

    cs.CL

    How Chinese are Chinese Language Models? The Puzzling Lack of Language Policy in China's LLMs

    Authors: Andrea W Wen-Yi, Unso Eun Seo Jo, Lu Jia Lin, David Mimno

    Abstract: Contemporary language models are increasingly multilingual, but Chinese LLM developers must navigate complex political and business considerations of language diversity. Language policy in China aims at influencing the public discourse and governing a multi-ethnic society, and has gradually transitioned from a pluralist to a more assimilationist approach since 1949. We explore the impact of these… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: Wen-Yi and Jo contributed equally to this work

  3. arXiv:2407.09342  [pdf, other

    cs.RO

    MIXED-SENSE: A Mixed Reality Sensor Emulation Framework for Test and Evaluation of UAVs Against False Data Injection Attacks

    Authors: Kartik A. Pant, Li-Yu Lin, Jaehyeok Kim, Worawis Sribunma, James M. Goppert, Inseok Hwang

    Abstract: We present a high-fidelity Mixed Reality sensor emulation framework for testing and evaluating the resilience of Unmanned Aerial Vehicles (UAVs) against false data injection (FDI) attacks. The proposed approach can be utilized to assess the impact of FDI attacks, benchmark attack detector performance, and validate the effectiveness of mitigation/reconfiguration strategies in single-UAV and UAV swa… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 6 pages, 5 figures, IROS 2024

  4. arXiv:2407.07327  [pdf, other

    cs.AI

    Fuse, Reason and Verify: Geometry Problem Solving with Parsed Clauses from Diagram

    Authors: Ming-Liang Zhang, Zhong-Zhi Li, Fei Yin, Liang Lin, Cheng-Lin Liu

    Abstract: Geometry problem solving (GPS) requires capacities of multi-modal understanding, multi-hop reasoning and theorem knowledge application. In this paper, we propose a neural-symbolic model for plane geometry problem solving (PGPS), named PGPSNet-v2, with three key steps: modal fusion, reasoning process and knowledge verification. In modal fusion, we leverage textual clauses to express fine-grained st… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: under review by journal

  5. arXiv:2407.06886  [pdf, other

    cs.CV cs.AI cs.LG cs.MA cs.RO

    Aligning Cyber Space with Physical World: A Comprehensive Survey on Embodied AI

    Authors: Yang Liu, Weixing Chen, Yongjie Bai, Jingzhou Luo, Xinshuai Song, Kaixuan Jiang, Zhida Li, Ganlong Zhao, Junyi Lin, Guanbin Li, Wen Gao, Liang Lin

    Abstract: Embodied Artificial Intelligence (Embodied AI) is crucial for achieving Artificial General Intelligence (AGI) and serves as a foundation for various applications that bridge cyberspace and the physical world. Recently, the emergence of Multi-modal Large Models (MLMs) and World Models (WMs) have attracted significant attention due to their remarkable perception, interaction, and reasoning capabilit… ▽ More

    Submitted 11 July, 2024; v1 submitted 9 July, 2024; originally announced July 2024.

    Comments: The first comprehensive review of Embodied AI in the era of MLMs, 37 pages. We also provide the paper list for Embodied AI: https://github.com/HCPLab-SYSU/Embodied_AI_Paper_List

  6. arXiv:2407.06844  [pdf, other

    cs.CV

    Dynamic Correlation Learning and Regularization for Multi-Label Confidence Calibration

    Authors: Tianshui Chen, Weihang Wang, Tao Pu, Jinghui Qin, Zhijing Yang, Jie Liu, Liang Lin

    Abstract: Modern visual recognition models often display overconfidence due to their reliance on complex deep neural networks and one-hot target supervision, resulting in unreliable confidence scores that necessitate calibration. While current confidence calibration techniques primarily address single-label scenarios, there is a lack of focus on more practical and generalizable multi-label contexts. This pa… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: submitted to TIP

  7. arXiv:2407.06744  [pdf, other

    quant-ph

    Suppression of Local Decay in non-Markovian Waveguide QED

    Authors: Yuan liu, Linhan Lin, Hong-Bo Sun

    Abstract: Atoms coupled to the same environment interfere with each other to yield super- or sub-radiance. Specifically, atoms in subradiant states are promising candidates for long-lifetime qubits and quantum memory because of the immunity to the common environment. However, subradiant states can still be influenced by local environments, which are incoherent for different atoms and cannot be canceled out… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 6 pages, 4 figures

  8. arXiv:2407.05634  [pdf, other

    quant-ph math.CA math.NA

    Infinite quantum signal processing for arbitrary Szegő functions

    Authors: Michel Alexis, Lin Lin, Gevorg Mnatsakanyan, Christoph Thiele, Jiasu Wang

    Abstract: We provide a complete solution to the problem of infinite quantum signal processing for the class of Szegő functions, which are functions that satisfy a logarithmic integrability condition and include almost any function that allows for a quantum signal processing representation. We do so by introducing a new algorithm called the Riemann-Hilbert-Weiss algorithm, which can compute any individual ph… ▽ More

    Submitted 10 July, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

    Comments: 45 pages, 5 figures. Small updates to main text, and ArXiv title/abstract now renders correctly

    MSC Class: 68Q12; 81P68; 34L25; 42C99

  9. arXiv:2407.05235  [pdf, other

    cs.CV

    Tracking Reflected Objects: A Benchmark

    Authors: Xiaoyu Guo, Pengzhi Zhong, Lizhi Lin, Hao Zhang, Ling Huang, Shuiwang Li

    Abstract: Visual tracking has advanced significantly in recent years, mainly due to the availability of large-scale training datasets. These datasets have enabled the development of numerous algorithms that can track objects with high accuracy and robustness.However, the majority of current research has been directed towards tracking generic objects, with less emphasis on more specialized and challenging sc… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

  10. arXiv:2407.04613  [pdf, other

    astro-ph.IM astro-ph.CO

    Thermal and mechanical study of a parametrised cryostat model for optical characterisation of upcoming CMB experiments

    Authors: Thomas J. L. J. Gascard, Yi Wang, Jon E. Gudmundsson, Eve M. Vavagiakis, Cody J. Duell, Zachary B. Huber, Lawrence T. Lin, Michael D. Niemack, Rodrigo G. Freundt

    Abstract: Current and future experiments observing the cosmic microwave background require a detailed understanding of optical performance at cryogenic temperatures. Pre-deployment analysis of optics can be performed in custom-engineered cryogenic test beds, such as Mod-Cam, a first light camera for the CCAT project. This work presents studies of the mechanical and thermal performance of CryoSim, a model of… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: To be published in the SPIE Astronomical Telescopes + Instrumentation (AS24) proceedings

  11. arXiv:2407.03234  [pdf, other

    cs.LG cs.CL cs.CR

    Self-Evaluation as a Defense Against Adversarial Attacks on LLMs

    Authors: Hannah Brown, Leon Lin, Kenji Kawaguchi, Michael Shieh

    Abstract: When LLMs are deployed in sensitive, human-facing settings, it is crucial that they do not output unsafe, biased, or privacy-violating outputs. For this reason, models are both trained and instructed to refuse to answer unsafe prompts such as "Tell me how to build a bomb." We find that, despite these safeguards, it is possible to break model defenses simply by appending a space to the end of a mod… ▽ More

    Submitted 15 July, 2024; v1 submitted 3 July, 2024; originally announced July 2024.

    Comments: 8 pages, 7 figures

  12. arXiv:2407.03232  [pdf, other

    cs.LG cs.CL

    Single Character Perturbations Break LLM Alignment

    Authors: Leon Lin, Hannah Brown, Kenji Kawaguchi, Michael Shieh

    Abstract: When LLMs are deployed in sensitive, human-facing settings, it is crucial that they do not output unsafe, biased, or privacy-violating outputs. For this reason, models are both trained and instructed to refuse to answer unsafe prompts such as "Tell me how to build a bomb." We find that, despite these safeguards, it is possible to break model defenses simply by appending a space to the end of a mod… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 8 pages, 6 figures

  13. arXiv:2407.02818  [pdf, other

    cs.SE cs.ET cs.PL

    WizardMerge -- Save Us From Merging Without Any Clues

    Authors: Qingyu Zhang, Junzhe Li, Jiayi Lin, Jie Ding, Lanteng Lin, Chenxiong Qian

    Abstract: Modern software development necessitates efficient version-oriented collaboration among developers. While Git is the most popular version control system, it generates unsatisfactory version merging results due to textual-based workflow, leading to potentially unexpected results in the merged version of the project. Although numerous merging tools have been proposed for improving merge results, dev… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 22 pages

    ACM Class: D.2; D.3

  14. arXiv:2407.01093  [pdf, other

    cs.CL cs.AI cs.MA

    IBSEN: Director-Actor Agent Collaboration for Controllable and Interactive Drama Script Generation

    Authors: Senyu Han, Lu Chen, Li-Min Lin, Zhengshan Xu, Kai Yu

    Abstract: Large language models have demonstrated their capabilities in storyline creation and human-like character role-playing. Current language model agents mainly focus on reasonable behaviors from the level of individuals, and their behaviors might be hard to constraint on the level of the whole storyline. In this paper we introduce IBSEN, a director-actor coordinate agent framework that generates dram… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Accepted by ACL 2024 Main

  15. arXiv:2407.01017  [pdf, other

    cs.CV

    Coding for Intelligence from the Perspective of Category

    Authors: Wenhan Yang, Zixuan Hu, Lilang Lin, Jiaying Liu, Ling-Yu Duan

    Abstract: Coding, which targets compressing and reconstructing data, and intelligence, often regarded at an abstract computational level as being centered around model learning and prediction, interweave recently to give birth to a series of significant progress. The recent trends demonstrate the potential homogeneity of these two fields, especially when deep-learning models aid these two categories for bet… ▽ More

    Submitted 2 July, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

  16. arXiv:2406.19233  [pdf, other

    astro-ph.HE

    X-ray and gamma-ray study for 2023 nova eruption of V1716 Sco

    Authors: H. -H. Wang, H. -D. Yan, J. Takata, L. C. -C. Lin

    Abstract: We report the results of X-ray and gamma-ray analyses of the nova V1716 Sco taken by Swift, NICER, NuSTAR and F ermi-LAT. We have detected gamma-ray emission at a significant level exceeding 8 σ in daily bins starting the day after the optical eruption. The gamma-ray emission, characterized by a Test Statistic (TS) value more than four, persisted for approximately 40 days. Notably, harder X-ray em… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 16 pages,13 figures,submitted to AAS Journals. arXiv admin note: text overlap with arXiv:2404.08409

  17. arXiv:2406.18365  [pdf, other

    cs.CL

    Themis: Towards Flexible and Interpretable NLG Evaluation

    Authors: Xinyu Hu, Li Lin, Mingqi Gao, Xunjian Yin, Xiaojun Wan

    Abstract: The evaluation of natural language generation (NLG) tasks is a significant and longstanding research issue. With the recent emergence of powerful large language models (LLMs), some studies have turned to LLM-based automatic evaluation methods, which demonstrate great potential to become a new evaluation paradigm following traditional string-based and model-based metrics. However, despite the impro… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  18. arXiv:2406.18327  [pdf, other

    eess.IV cs.CV cs.LG

    Multi-modal Evidential Fusion Network for Trusted PET/CT Tumor Segmentation

    Authors: Yuxuan Qi, Li Lin, Jiajun Wang, Jingya Zhang, Bin Zhang

    Abstract: Accurate segmentation of tumors in PET/CT images is important in computer-aided diagnosis and treatment of cancer. The key issue of such a segmentation problem lies in the effective integration of complementary information from PET and CT images. However, the quality of PET and CT images varies widely in clinical settings, which leads to uncertainty in the modality information extracted by network… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  19. arXiv:2406.15931  [pdf, other

    eess.SY cs.CE cs.LG stat.AP

    Multistep Criticality Search and Power Shaping in Microreactors with Reinforcement Learning

    Authors: Majdi I. Radaideh, Leo Tunkle, Dean Price, Kamal Abdulraheem, Linyu Lin, Moutaz Elias

    Abstract: Reducing operation and maintenance costs is a key objective for advanced reactors in general and microreactors in particular. To achieve this reduction, developing robust autonomous control algorithms is essential to ensure safe and autonomous reactor operation. Recently, artificial intelligence and machine learning algorithms, specifically reinforcement learning (RL) algorithms, have seen rapid i… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: 15 pages, 3 figures, and 2 tables

  20. arXiv:2406.14892  [pdf, other

    astro-ph.IM physics.ins-det

    CCAT: Detector Noise Limited Performance of the RFSoC-based Readout Electronics for mm/sub-mm/far-IR KIDs

    Authors: Adrian K. Sinclair, James Burgoyne, Anthony I. Huber, Colin Murphy, Steve K. Choi, Cody J. Duell, Zachary B. Huber, Yaqiong Li, Scott C. Chapman, Michael D. Niemack, Thomas Nikola, Eve M. Vavagiakis, Samantha Walker, Jordan D. Wheeler, Jason Austermann, Lawrence Lin, Ruixuan Xie, Bugao Zou, Philip D. Mauskopf

    Abstract: The Fred Young Submillimeter Telescope (FYST), on Cerro Chajnantor in the Atacama desert of Chile, will conduct wide-field and small deep-field surveys of the sky with more than 100,000 detectors on the Prime-Cam instrument. Kinetic inductance detectors (KIDs) were chosen as the primary sensor technology for their high density focal plane packing. Additionally, they benefit from low cost, ease of… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: draft submitted to SPIE

  21. MEAT: Median-Ensemble Adversarial Training for Improving Robustness and Generalization

    Authors: Zhaozhe Hu, Jia-Li Yin, Bin Chen, Luojun Lin, Bo-Hao Chen, Ximeng Liu

    Abstract: Self-ensemble adversarial training methods improve model robustness by ensembling models at different training epochs, such as model weight averaging (WA). However, previous research has shown that self-ensemble defense methods in adversarial training (AT) still suffer from robust overfitting, which severely affects the generalization performance. Empirically, in the late phases of training, the A… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  22. arXiv:2406.13368  [pdf

    cond-mat.mtrl-sci

    Lewis Acidity and Basicity Diagnostics of Molten Salt for its Properties and Structure Online Monitoring

    Authors: Changzu Zhu, Jia Song, Xiaorui Xu, Chengyu Wang, Yang Tong, Lve Lin, Shaoqiang Guo, Wentao Zhou, Adrien Couet, Yafei Wang

    Abstract: Analogous to the aqueous solution where the pH of the solvent affects its multiple behaviors, the Lewis acidity-basicity of molten salts also greatly influences their thermophysical and thermochemical properties. In the study, we develop ion probes to quantitatively determine the acidity-basicity scale of molten NaCl-xAlCl3 (x = 1.5-2.1) salt using in-situ ultra-violet visible (UV-Vis) spectroscop… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  23. arXiv:2406.12501  [pdf, other

    cs.IR

    Improving Multi-modal Recommender Systems by Denoising and Aligning Multi-modal Content and User Feedback

    Authors: Guipeng Xv, Xinyu Li, Ruobing Xie, Chen Lin, Chong Liu, Feng Xia, Zhanhui Kang, Leyu Lin

    Abstract: Multi-modal recommender systems (MRSs) are pivotal in diverse online web platforms and have garnered considerable attention in recent years. However, previous studies overlook the challenges of (1) noisy multi-modal content, (2) noisy user feedback, and (3) aligning multi-modal content with user feedback. In order to tackle these challenges, we propose Denoising and Aligning Multi-modal Recommende… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  24. arXiv:2406.12148  [pdf, other

    math.NA math.CV

    Complex variable solution on noncircular and asymmetrical tunnelling embedded by bidirectional conformal mapping incorporating Charge Simulation Method

    Authors: Luobin Lin, Fuquan Chen, Changjie Zheng, Shangshun Lin

    Abstract: Mechanical issues of noncircular and asymmetrical tunnelling can be estimated using complex variable method with suitable conformal mapping. Exsiting solution schemes of conformal mapping for noncircular tunnel generally need iteration or optimization strategy, and are thereby mathematically complicated. This paper proposes a new bidirectional conformal mapping for deep and shallow tunnels of nonc… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 41 pages, 13 figures

  25. arXiv:2406.10801  [pdf, other

    cs.CV

    Saliency-guided and Patch-based Mixup for Long-tailed Skin Cancer Image Classification

    Authors: Tianyunxi Wei, Yijin Huang, Li Lin, Pujin Cheng, Sirui Li, Xiaoying Tang

    Abstract: Medical image datasets often exhibit long-tailed distributions due to the inherent challenges in medical data collection and annotation. In long-tailed contexts, some common disease categories account for most of the data, while only a few samples are available in the rare disease categories, resulting in poor performance of deep learning methods. To address this issue, previous approaches have em… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: IEEE ISBI2024

  26. arXiv:2406.08466  [pdf, other

    cs.LG cs.AI math.ST stat.ML

    Scaling Laws in Linear Regression: Compute, Parameters, and Data

    Authors: Licong Lin, Jingfeng Wu, Sham M. Kakade, Peter L. Bartlett, Jason D. Lee

    Abstract: Empirically, large-scale deep learning models often satisfy a neural scaling law: the test error of the trained model improves polynomially as the model size and data size grow. However, conventional wisdom suggests the test error consists of approximation, bias, and variance errors, where the variance error increases with model size. This disagrees with the general form of neural scaling laws, wh… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  27. arXiv:2406.07574  [pdf, other

    cs.SI cs.LG

    Biharmonic Distance of Graphs and its Higher-Order Variants: Theoretical Properties with Applications to Centrality and Clustering

    Authors: Mitchell Black, Lucy Lin, Amir Nayyeri, Weng-Keen Wong

    Abstract: Effective resistance is a distance between vertices of a graph that is both theoretically interesting and useful in applications. We study a variant of effective resistance called the biharmonic distance. While the effective resistance measures how well-connected two vertices are, we prove several theoretical results supporting the idea that the biharmonic distance measures how important an edge i… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted to ICML 2024

  28. arXiv:2406.07357  [pdf, other

    cs.CC

    PSMC: Provable and Scalable Algorithms for Motif Conductance Based Graph Clustering

    Authors: Longlong Lin, Tao Jia, Zeli Wang, Jin Zhao, Rong-Hua Li

    Abstract: Higher-order graph clustering aims to partition the graph using frequently occurring subgraphs. Motif conductance is one of the most promising higher-order graph clustering models due to its strong interpretability. However, existing motif conductance based graph clustering algorithms are mainly limited by a seminal two-stage reweighting computing framework, needing to enumerate all motif instance… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  29. arXiv:2406.06828  [pdf, other

    astro-ph.IM

    CCAT: Comparisons of 280 GHz TiN and Al Kinetic Inductance Detector Arrays

    Authors: Cody J. Duell, Jason Austermann, James Beall, James R. Burgoyne, Scott C. Chapman, Steve K. Choi, Rodrigo G. Freundt, Jiansong Gao, Christopher Groppi, Anthony I. Huber, Zachary B. Huber, Johannes Hubmayr, Ben Keller, Yaqiong Li, Lawrence T. Lin, Justin Matthewson, Philip Mauskopf, Alicia Middleton, Colin C. Murphy, Michael D. Niemack, Thomas Nikola, Adrian K. Sinclair, Ema Smith, Jeff van Lanen, Anna Vaskuri , et al. (5 additional authors not shown)

    Abstract: The CCAT Collaboration's six-meter Fred Young Submillimeter Telescope is scheduled to begin observing in the Chilean Atacama in 2025, targeting a variety of science goals throughout cosmic history. Prime-Cam is a 1.8-meter diameter cryostat that will host up to seven independent instrument modules designed for simultaneous spectroscopic and broadband, polarimetric surveys at millimeter to submilli… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 6 pages, 3 figures, conference proceedings submitted to the Journal of Low Temperature Physics

  30. arXiv:2406.02990  [pdf, other

    cs.CV

    Predicting Genetic Mutation from Whole Slide Images via Biomedical-Linguistic Knowledge Enhanced Multi-label Classification

    Authors: Gexin Huang, Chenfei Wu, Mingjie Li, Xiaojun Chang, Ling Chen, Ying Sun, Shen Zhao, Xiaodan Liang, Liang Lin

    Abstract: Predicting genetic mutations from whole slide images is indispensable for cancer diagnosis. However, existing work training multiple binary classification models faces two challenges: (a) Training multiple binary classifiers is inefficient and would inevitably lead to a class imbalance problem. (b) The biological relationships among genes are overlooked, which limits the prediction performance. To… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 16 pages, 8 figures, and 3 tables

  31. arXiv:2406.02978  [pdf, other

    cs.CV

    Self-Supervised Skeleton Action Representation Learning: A Benchmark and Beyond

    Authors: Jiahang Zhang, Lilang Lin, Shuai Yang, Jiaying Liu

    Abstract: Self-supervised learning (SSL), which aims to learn meaningful prior representations from unlabeled data, has been proven effective for label-efficient skeleton-based action understanding. Different from the image domain, skeleton data possesses sparser spatial structures and diverse representation forms, with the absence of background clues and the additional temporal dimension. This presents the… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  32. arXiv:2406.02086  [pdf, other

    quant-ph math.NA

    Multi-level quantum signal processing with applications to ground state preparation using fast-forwarded Hamiltonian evolution

    Authors: Yulong Dong, Lin Lin

    Abstract: The preparation of the ground state of a Hamiltonian $H$ with a large spectral radius has applications in many areas such as electronic structure theory and quantum field theory. Given an initial state with a constant overlap with the ground state, and assuming that the Hamiltonian $H$ can be efficiently simulated with an ideal fast-forwarding protocol, we first demonstrate that employing a linear… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 25 pages, 6 figures

  33. arXiv:2406.02059  [pdf, other

    cs.LG

    Graph Adversarial Diffusion Convolution

    Authors: Songtao Liu, Jinghui Chen, Tianfan Fu, Lu Lin, Marinka Zitnik, Dinghao Wu

    Abstract: This paper introduces a min-max optimization formulation for the Graph Signal Denoising (GSD) problem. In this formulation, we first maximize the second term of GSD by introducing perturbations to the graph structure based on Laplacian distance and then minimize the overall loss of the GSD. By solving the min-max optimization problem, we derive a new variant of the Graph Diffusion Convolution (GDC… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted by ICML 2024

  34. arXiv:2406.01858  [pdf, other

    astro-ph.IM

    CCAT: FYST Prime-Cam Readout Software: A framework for massively scalable KID arrays

    Authors: James R. Burgoyne, Adrian K. Sinclair, Scott C. Chapman, Steve K. Choi, Cody J. Duell, Anthony I. Huber, Zachary B. Huber, Ben Keller, Lawrence Lin, Michael D. Niemack, Douglas Scott, Eve M. Vavagiakis, Samantha Walker, Matt Xie, the CCAT collaboration

    Abstract: We outline the development of the readout software for the Prime-Cam and Mod-Cam instruments on the CCAT Fred Young Submillimeter Telescope (FYST), primecam_readout. The instruments feature lumped-element kinetic inductance detector (LEKID) arrays driven by Xilinx ZCU111 RFSoC boards. In the current configuration, each board can drive up to 4000 KIDs, and Prime-Cam is implementing approximately 25… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: SPIE Astronomical Telescopes + Instrumentation conference proceedings

  35. arXiv:2406.01007  [pdf, other

    hep-ex

    Measurement of Electron Antineutrino Oscillation Amplitude and Frequency via Neutron Capture on Hydrogen at Daya Bay

    Authors: Daya Bay collaboration, F. P. An, W. D. Bai, A. B. Balantekin, M. Bishai, S. Blyth, G. F. Cao, J. Cao, J. F. Chang, Y. Chang, H. S. Chen, H. Y. Chen, S. M. Chen, Y. Chen, Y. X. Chen, Z. Y. Chen, J. Cheng, J. Cheng, Y. -C. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, J. P. Cummings, O. Dalager, F. S. Deng , et al. (177 additional authors not shown)

    Abstract: This Letter reports the first measurement of the oscillation amplitude and frequency of reactor antineutrinos at Daya Bay via neutron capture on hydrogen using 1958 days of data. With over 3.6 million signal candidates, an optimized candidate selection, improved treatment of backgrounds and efficiencies, refined energy calibration, and an energy response model for the capture-on-hydrogen sensitive… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  36. arXiv:2406.00783  [pdf, other

    cs.CV

    AI-Face: A Million-Scale Demographically Annotated AI-Generated Face Dataset and Fairness Benchmark

    Authors: Li Lin, Santosh, Xin Wang, Shu Hu

    Abstract: AI-generated faces have enriched human life, such as entertainment, education, and art. However, they also pose misuse risks. Therefore, detecting AI-generated faces becomes crucial, yet current detectors show biased performance across different demographic groups. Mitigating biases can be done by designing algorithmic fairness methods, which usually require demographically annotated face datasets… ▽ More

    Submitted 4 June, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

  37. arXiv:2406.00659  [pdf, other

    physics.acc-ph

    High Performance Operation of a Direct-Current and Superconducting Radio-Frequency Combined Photocathode Gun

    Authors: H. Jia, T. Li, T. Wang, Y. Zhao, X. Zhang, H. Xu, Z. Liu, J. Liu, L. Lin, H. Xie, L. Feng, F. Wang, F. Zhu, J. Hao, S. Quan, K. Liu, S. Huang

    Abstract: Superconducting radio-frequency (SRF) guns are promising candidates to deliver high brightness continuous-wave (CW) electron beams for new generations of coherent linac light sources, ultrafast electron diffractions, MeV pulsed beam applications, etc. To solve the compatibility problem of semiconductor photocathodes, a hybrid gun combining a direct-current gap and an SRF cavity has been developed.… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: 6 pages, 5 figures

  38. arXiv:2406.00632  [pdf, other

    cs.CV

    Diff-Mosaic: Augmenting Realistic Representations in Infrared Small Target Detection via Diffusion Prior

    Authors: Yukai Shi, Yupei Lin, Pengxu Wei, Xiaoyu Xian, Tianshui Chen, Liang Lin

    Abstract: Recently, researchers have proposed various deep learning methods to accurately detect infrared targets with the characteristics of indistinct shape and texture. Due to the limited variety of infrared datasets, training deep learning models with good generalization poses a challenge. To augment the infrared dataset, researchers employ data augmentation techniques, which often involve generating ne… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  39. arXiv:2406.00510  [pdf, other

    cs.CV

    Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection

    Authors: Jiaming Li, Jiacheng Zhang, Jichang Li, Ge Li, Si Liu, Liang Lin, Guanbin Li

    Abstract: Open vocabulary object detection (OVD) aims at seeking an optimal object detector capable of recognizing objects from both base and novel categories. Recent advances leverage knowledge distillation to transfer insightful knowledge from pre-trained large-scale vision-language models to the task of object detection, significantly generalizing the powerful capabilities of the detector to identify mor… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: CVPR2024

  40. arXiv:2406.00045  [pdf, other

    cs.CL cs.LG

    Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization

    Authors: Yuanpu Cao, Tianrong Zhang, Bochuan Cao, Ziyi Yin, Lu Lin, Fenglong Ma, Jinghui Chen

    Abstract: Researchers have been studying approaches to steer the behavior of Large Language Models (LLMs) and build personalized LLMs tailored for various applications. While fine-tuning seems to be a direct solution, it requires substantial computational resources and may significantly affect the utility of the original LLM. Recent endeavors have introduced more lightweight strategies, focusing on extracti… ▽ More

    Submitted 28 May, 2024; originally announced June 2024.

  41. arXiv:2405.20404  [pdf, other

    cs.CL cs.LG

    XPrompt:Explaining Large Language Model's Generation via Joint Prompt Attribution

    Authors: Yurui Chang, Bochuan Cao, Yujia Wang, Jinghui Chen, Lu Lin

    Abstract: Large Language Models (LLMs) have demonstrated impressive performances in complex text generation tasks. However, the contribution of the input prompt to the generated content still remains obscure to humans, underscoring the necessity of elucidating and explaining the causality between input and output pairs. Existing works for providing prompt-specific explanation often confine model output to b… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  42. arXiv:2405.18386  [pdf, other

    cs.SD cs.AI cs.LG cs.MM eess.AS

    Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tuning

    Authors: Yixiao Zhang, Yukara Ikemiya, Woosung Choi, Naoki Murata, Marco A. Martínez-Ramírez, Liwei Lin, Gus Xia, Wei-Hsiang Liao, Yuki Mitsufuji, Simon Dixon

    Abstract: Recent advances in text-to-music editing, which employ text queries to modify music (e.g.\ by changing its style or adjusting instrumental components), present unique challenges and opportunities for AI-assisted music creation. Previous approaches in this domain have been constrained by the necessity to train specific editing models from scratch, which is both resource-intensive and inefficient; o… ▽ More

    Submitted 29 May, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: Code and demo are available at: https://github.com/ldzhangyx/instruct-musicgen

  43. arXiv:2405.17496  [pdf, other

    eess.IV

    UU-Mamba: Uncertainty-aware U-Mamba for Cardiac Image Segmentation

    Authors: Ting Yu Tsai, Li Lin, Shu Hu, Ming-Ching Chang, Hongtu Zhu, Xin Wang

    Abstract: Biomedical image segmentation is critical for accurate identification and analysis of anatomical structures in medical imaging, particularly in cardiac MRI. Manual segmentation is labor-intensive, time-consuming, and prone to errors, highlighting the need for automated methods. However, current machine learning approaches face challenges like overfitting and data demands. To tackle these issues, w… ▽ More

    Submitted 4 June, 2024; v1 submitted 25 May, 2024; originally announced May 2024.

  44. arXiv:2405.16768  [pdf, other

    math.NA

    Time-dependent complex variable solution on quasi three-dimensional shallow tunnelling in gravititational geomaterial with reasonable far-field displacement

    Authors: Luobin Lin, Fuquan Chen, Changjie Zheng

    Abstract: Three-dimensional effect of tunnel face and gravitational excavation generally occur in shallow tunnelling, which are nevertheless not adequately considered in present complex variable solutions. In this paper, a new time-dependent complex variable solution on quasi three-dimensional shallow tunnelling in gravitational geomaterial is derived, and the far-field displacement singularity is eliminate… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: 38p pages, 11 figures

  45. arXiv:2405.15768  [pdf, other

    stat.ML cs.AI cs.LG

    Canonical Variates in Wasserstein Metric Space

    Authors: Jia Li, Lin Lin

    Abstract: In this paper, we address the classification of instances each characterized not by a singular point, but by a distribution on a vector space. We employ the Wasserstein metric to measure distances between distributions, which are then used by distance-based classification algorithms such as k-nearest neighbors, k-means, and pseudo-mixture modeling. Central to our investigation is dimension reducti… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: double space 37 pages, 6 figures

  46. High-field magnetoelectric coupling and successive magnetic transitions in Mn-doped polar antiferromagnet Ni3TeO6

    Authors: J. H. Zhang, L. Lin, C. Dong, Y. T. Chang, J. F. Wang, C. L. Lu, P. Z. Chen, W. J. Zhai, G. Z. Zhou, L. Huang, Y. S. Tang, S. H. Zheng, M. F. Liu, X. H. Zhou, Z. B. Yan, J. -M. Liu

    Abstract: Among the 3d transition metal ions doped polar Ni3TeO6, Mn-doped Ni3TeO6 has stimulated great interest due to its high magnetic ordering temperature and complex magnetic phases, but the mechanism of magnetoelectric (ME) coupling is far from understood. Herein we report our systematic investigation of the chemical control of magnetism, metamagnetic transition, and ME properties of Ni3-xMnxTeO6 sing… ▽ More

    Submitted 29 May, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: 30 pages with 8 figures

    Journal ref: Phys. Rev. B 109, 184112 (2024)

  47. arXiv:2405.15280  [pdf, other

    cs.IR cs.AI cs.LG

    DFGNN: Dual-frequency Graph Neural Network for Sign-aware Feedback

    Authors: Yiqing Wu, Ruobing Xie, Zhao Zhang, Xu Zhang, Fuzhen Zhuang, Leyu Lin, Zhanhui Kang, Yongjun Xu

    Abstract: The graph-based recommendation has achieved great success in recent years. However, most existing graph-based recommendations focus on capturing user preference based on positive edges/feedback, while ignoring negative edges/feedback (e.g., dislike, low rating) that widely exist in real-world recommender systems. How to utilize negative feedback in graph-based recommendations still remains underex… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: Accepted by KDD 2024 Research Track

  48. arXiv:2405.14767  [pdf, other

    q-fin.ST cs.CL cs.LG q-fin.TR

    FinRobot: An Open-Source AI Agent Platform for Financial Applications using Large Language Models

    Authors: Hongyang Yang, Boyu Zhang, Neng Wang, Cheng Guo, Xiaoli Zhang, Likun Lin, Junlin Wang, Tianyu Zhou, Mao Guan, Runjia Zhang, Christina Dan Wang

    Abstract: As financial institutions and professionals increasingly incorporate Large Language Models (LLMs) into their workflows, substantial barriers, including proprietary data and specialized knowledge, persist between the finance sector and the AI community. These challenges impede the AI community's ability to enhance financial tasks effectively. Acknowledging financial analysis's critical role, we aim… ▽ More

    Submitted 27 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: FinRobot Whitepaper V1.0

  49. arXiv:2405.14023  [pdf, other

    cs.LG

    WordGame: Efficient & Effective LLM Jailbreak via Simultaneous Obfuscation in Query and Response

    Authors: Tianrong Zhang, Bochuan Cao, Yuanpu Cao, Lu Lin, Prasenjit Mitra, Jinghui Chen

    Abstract: The recent breakthrough in large language models (LLMs) such as ChatGPT has revolutionized production processes at an unprecedented pace. Alongside this progress also comes mounting concerns about LLMs' susceptibility to jailbreaking attacks, which leads to the generation of harmful or unsafe content. While safety alignment measures have been implemented in LLMs to mitigate existing jailbreak atte… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  50. arXiv:2405.12521  [pdf, other

    cs.LG

    Unleash Graph Neural Networks from Heavy Tuning

    Authors: Lequan Lin, Dai Shi, Andi Han, Zhiyong Wang, Junbin Gao

    Abstract: Graph Neural Networks (GNNs) are deep-learning architectures designed for graph-type data, where understanding relationships among individual observations is crucial. However, achieving promising GNN performance, especially on unseen data, requires comprehensive hyperparameter tuning and meticulous training. Unfortunately, these processes come with high computational costs and significant human ef… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.