Skip to main content

Showing 1–50 of 213 results for author: Pu, Y

  1. arXiv:2407.08922  [pdf, other

    cs.LG

    Leveraging large language models for nano synthesis mechanism explanation: solid foundations or mere conjectures?

    Authors: Yingming Pu, Liping Huang, Tao Lin, Hongyu Chen

    Abstract: With the rapid development of artificial intelligence (AI), large language models (LLMs) such as GPT-4 have garnered significant attention in the scientific community, demonstrating great potential in advancing scientific discovery. This progress raises a critical question: are these LLMs well-aligned with real-world physicochemical principles? Current evaluation strategies largely emphasize fact-… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  2. arXiv:2407.05700  [pdf, other

    cs.CL cs.AI cs.SE

    InverseCoder: Unleashing the Power of Instruction-Tuned Code LLMs with Inverse-Instruct

    Authors: Yutong Wu, Di Huang, Wenxuan Shi, Wei Wang, Lingzhe Gao, Shihao Liu, Ziyuan Nan, Kaizhao Yuan, Rui Zhang, Xishan Zhang, Zidong Du, Qi Guo, Yewen Pu, Dawei Yin, Xing Hu, Yunji Chen

    Abstract: Recent advancements in open-source code large language models (LLMs) have demonstrated remarkable coding abilities by fine-tuning on the data generated from powerful closed-source LLMs such as GPT-3.5 and GPT-4 for instruction tuning. This paper explores how to further improve an instruction-tuned code LLM by generating data from itself rather than querying closed-source LLMs. Our key observation… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  3. arXiv:2407.05118  [pdf, other

    cs.CV

    SHINE: Saliency-aware HIerarchical NEgative Ranking for Compositional Temporal Grounding

    Authors: Zixu Cheng, Yujiang Pu, Shaogang Gong, Parisa Kordjamshidi, Yu Kong

    Abstract: Temporal grounding, also known as video moment retrieval, aims at locating video segments corresponding to a given query sentence. The compositional nature of natural language enables the localization beyond predefined events, posing a certain challenge to the compositional generalizability of existing methods. Recent studies establish the correspondence between videos and queries through a decomp… ▽ More

    Submitted 15 July, 2024; v1 submitted 6 July, 2024; originally announced July 2024.

    Comments: Accepted to ECCV 2024

  4. arXiv:2407.02499  [pdf, other

    cs.PL cs.AI

    Amortizing Pragmatic Program Synthesis with Rankings

    Authors: Yewen Pu, Saujas Vaduguru, Priyan Vaithilingam, Elena Glassman, Daniel Fried

    Abstract: The usage of Rational Speech Acts (RSA) framework has been successful in building \emph{pragmatic} program synthesizers that return programs which, in addition to being logically consistent with user-generated examples, account for the fact that a user chooses their examples informatively. We present a general method of amortizing the slow, exact RSA synthesizer. Our method first query the exact R… ▽ More

    Submitted 1 June, 2024; originally announced July 2024.

    Comments: icml 2024. arXiv admin note: substantial text overlap with arXiv:2309.03225

  5. arXiv:2406.10667  [pdf, other

    cs.LG

    UniZero: Generalized and Efficient Planning with Scalable Latent World Models

    Authors: Yuan Pu, Yazhe Niu, Jiyuan Ren, Zhenjie Yang, Hongsheng Li, Yu Liu

    Abstract: Learning predictive world models is essential for enhancing the planning capabilities of reinforcement learning agents. Notably, the MuZero-style algorithms, based on the value equivalence principle and Monte Carlo Tree Search (MCTS), have achieved superhuman performance in various domains. However, in environments that require capturing long-term dependencies, MuZero's performance deteriorates ra… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: 32 pages, 16 figures

  6. arXiv:2405.19636  [pdf, other

    cs.GR

    Creating Language-driven Spatial Variations of Icon Images

    Authors: Xianghao Xu, Aditya Ganeshan, Karl D. D. Willis, Yewen Pu, Daniel Ritchie

    Abstract: Editing 2D icon images can require significant manual effort from designers. It involves manipulating multiple geometries while maintaining the logical or physical coherence of the objects depicted in the image. Previous language driven image editing methods can change the texture and geometry of objects in the image but fail at producing spatial variations, i.e. modifying spatial relations betwee… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  7. arXiv:2405.18717  [pdf

    physics.app-ph physics.optics

    Silicon-integrated scandium-doped aluminum nitride electro-optic modulator

    Authors: Tianqi Xu, Yushuai Liu, Yuanmao Pu, Yongxiang Yang, Qize Zhong, Xingyan Zhao, Yang Qiu, Yuan Dong, Tao Wu, Shaonan Zheng, Ting Hu

    Abstract: Scandium-doped aluminum nitride (AlScN) with an asymmetric hexagonal wurtzite structure exhibits enhanced second-order nonlinear and piezoelectric properties compared to aluminum nitride (AlN), while maintaining a relatively large bandgap. It provides a promising platform for photonic integration and facilitates the seamless integration of passive and active functional devices. Here, we present th… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  8. arXiv:2405.16863  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    All-voltage control of Giant Magnetoresistance

    Authors: Lujun Wei, Yiyang Zhang, Fei Huang, Jiajv Yang, Jincheng Peng, Yanghui Li, Yu Lu, Jiarui Chen, Tianyu Liu, Yong Pu, Jun Du

    Abstract: The aim of voltage control of magnetism is to reduce the power consumption of spintronic devices. For a spin valve, the magnetization directions of two ferromagnetic layers determine the giant magnetoresistance magnitude. However, achieving all-voltage manipulation of the magnetization directions between parallel and antiparallel states is a significant challenge. Here, we demonstrate that by util… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  9. arXiv:2405.16605  [pdf, other

    cs.CV

    Demystify Mamba in Vision: A Linear Attention Perspective

    Authors: Dongchen Han, Ziyi Wang, Zhuofan Xia, Yizeng Han, Yifan Pu, Chunjiang Ge, Jun Song, Shiji Song, Bo Zheng, Gao Huang

    Abstract: Mamba is an effective state space model with linear computation complexity. It has recently shown impressive efficiency in dealing with high-resolution inputs across various vision tasks. In this paper, we reveal that the powerful Mamba model shares surprising similarities with linear attention Transformer, which typically underperform conventional Transformer in practice. By exploring the similar… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  10. arXiv:2405.13369  [pdf, other

    quant-ph

    Realization of a crosstalk-free multi-ion node for long-distance quantum networking

    Authors: P. -C. Lai, Y. Wang, J. -X. Shi, Z. -B. Cui, Z. -Q. Wang, S. Zhang, P. -Y. Liu, Z. -C. Tian, Y. -D. Sun, X. -Y. Chang, B. -X. Qi, Y. -Y. Huang, Z. -C. Zhou, Y. -K. Wu, Y. Xu, Y. -F. Pu, L. -M. Duan

    Abstract: Trapped atomic ions constitute one of the leading physical platforms for building the quantum repeater nodes to realize large-scale quantum networks. In a long-distance trapped-ion quantum network, it is essential to have crosstalk-free dual-type qubits: one type, called the communication qubit, to establish entangling interface with telecom photons; and the other type, called the memory qubit, to… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 12 pages, 12 figures

  11. arXiv:2405.12786  [pdf, other

    cs.CR

    Rethinking the Vulnerabilities of Face Recognition Systems:From a Practical Perspective

    Authors: Jiahao Chen, Zhiqiang Shen, Yuwen Pu, Chunyi Zhou, Changjiang Li, Jiliang Li, Ting Wang, Shouling Ji

    Abstract: Face Recognition Systems (FRS) have increasingly integrated into critical applications, including surveillance and user authentication, highlighting their pivotal role in modern security systems. Recent studies have revealed vulnerabilities in FRS to adversarial (e.g., adversarial patch attacks) and backdoor attacks (e.g., training data poisoning), raising significant concerns about their reliabil… ▽ More

    Submitted 8 June, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

    Comments: 19 pages,version 3

  12. arXiv:2405.12751  [pdf, other

    cs.CR

    A Stealthy Backdoor Attack for Without-Label-Sharing Split Learning

    Authors: Yuwen Pu, Zhuoyuan Ding, Jiahao Chen, Chunyi Zhou, Qingming Li, Chunqiang Hu, Shouling Ji

    Abstract: As a novel privacy-preserving paradigm aimed at reducing client computational costs and achieving data utility, split learning has garnered extensive attention and proliferated widespread applications across various fields, including smart health and smart transportation, among others. While recent studies have primarily concentrated on addressing privacy leakage concerns in split learning, such a… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 15 pages

  13. arXiv:2405.12719  [pdf, other

    cs.CR

    How to Train a Backdoor-Robust Model on a Poisoned Dataset without Auxiliary Data?

    Authors: Yuwen Pu, Jiahao Chen, Chunyi Zhou, Zhou Feng, Qingming Li, Chunqiang Hu, Shouling Ji

    Abstract: Backdoor attacks have attracted wide attention from academia and industry due to their great security threat to deep neural networks (DNN). Most of the existing methods propose to conduct backdoor attacks by poisoning the training dataset with different strategies, so it's critical to identify the poisoned samples and then train a clean model on the unreliable dataset in the context of defending b… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 13 pages, under review

  14. arXiv:2405.11398  [pdf, other

    physics.optics

    Second-harmonic optical diffraction tomography

    Authors: Amirhossein Saba, Carlo Gigli, Ye Pu, Demetri Psaltis

    Abstract: Optical diffraction tomography (ODT) has emerged as an important label-free tool in biomedicine to measure the three-dimensional (3D) structure of a biological sample. In this paper, we describe ODT using second-harmonic generation (SHG) which is a coherent nonlinear optical process with a strict symmetry selectivity and has several advantages over traditional fluorescence methods. We report the t… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

  15. arXiv:2404.17873  [pdf

    physics.bio-ph q-bio.BM q-bio.CB q-bio.SC

    Bacterial stress granule protects mRNA through ribonucleases exclusion

    Authors: Linsen Pei, Yujia Xian, Xiaodan Yan, Charley Schaefer, Aisha H. Syeda, Jamieson Howard, Hebin Liao, Fan Bai, Mark C. Leake, Yingying Pu

    Abstract: Membraneless droplets formed through liquid-liquid phase separation (LLPS) play a crucial role in mRNA storage, enabling organisms to swiftly respond to environmental changes. However, the mechanisms underlying mRNA integration and protection within droplets remain unclear. Here, we unravel the role of bacterial aggresomes as stress granules (SGs) in safeguarding mRNA during stress. We discovered… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

  16. arXiv:2404.16364  [pdf, other

    cs.AI

    ReZero: Boosting MCTS-based Algorithms by Backward-view and Entire-buffer Reanalyze

    Authors: Chunyu Xuan, Yazhe Niu, Yuan Pu, Shuai Hu, Yu Liu, Jing Yang

    Abstract: Monte Carlo Tree Search (MCTS)-based algorithms, such as MuZero and its derivatives, have achieved widespread success in various decision-making domains. These algorithms employ the reanalyze process to enhance sample efficiency from stale data, albeit at the expense of significant wall-clock time consumption. To address this issue, we propose a general approach named ReZero to boost tree search o… ▽ More

    Submitted 28 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

  17. arXiv:2404.04140  [pdf, other

    cs.CV cs.LG

    Improving Detection in Aerial Images by Capturing Inter-Object Relationships

    Authors: Botao Ren, Botian Xu, Yifan Pu, Jingyi Wang, Zhidong Deng

    Abstract: In many image domains, the spatial distribution of objects in a scene exhibits meaningful patterns governed by their semantic relationships. In most modern detection pipelines, however, the detection proposals are processed independently, overlooking the underlying relationships between objects. In this work, we introduce a transformer-based approach to capture these inter-object relationships to… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  18. arXiv:2403.16601  [pdf, other

    math.AP

    Singular profile of free boundary of incompressible inviscid fluid with external force

    Authors: Lili Du, Yang Pu, Jing Yang

    Abstract: This article is devoted to investigate the singular profile of the free boundary of two-dimensional incompressible inviscid fluid with external force near the stagnation point. More precisely, given an external force with some polynomial type decay close to the stagnation point, the singular profile of the free boundary at stagnation point possible are corner wave, flat and cusp singularity. Throu… ▽ More

    Submitted 20 June, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

    Comments: 40 pages. Any comments are welcome

  19. arXiv:2403.14369  [pdf, other

    eess.SY

    A Control Barrier Function Composition Approach for Multi-Agent Systems in Marine Applications

    Authors: Yujia Yang, Chris Manzie, Ye Pu

    Abstract: The agents within a multi-agent system (MAS) operating in marine environments often need to utilize task payloads and avoid collisions in coordination, necessitating adherence to a set of relative-pose constraints, which may include field-of-view, line-of-sight, collision-avoidance, and range constraints. A nominal controller designed for reference tracking may not guarantee the marine MAS stays s… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 11 pages, 8 figures

  20. arXiv:2403.13623  [pdf, other

    quant-ph

    Fast delivery of heralded atom-photon quantum correlation over 12km fiber through multiplexing enhancement

    Authors: Sheng Zhang, Jixuan Shi, Yibo Liang, Yuedong Sun, Yukai Wu, Luming Duan, Yunfei Pu

    Abstract: Distributing quantum entanglement between distant parties is a significant but difficult task in quantum information science, as it can enable numerous applications but suffers from exponential decay in the quantum channel. Quantum repeater is one of the most promising approaches towards this goal. In a quantum repeater protocol, it is essential that the entanglement generation speed within each e… ▽ More

    Submitted 21 March, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

    Comments: 13 pages, 10 figures

  21. arXiv:2403.11127  [pdf, other

    cs.CV

    GRA: Detecting Oriented Objects through Group-wise Rotating and Attention

    Authors: Jiangshan Wang, Yifan Pu, Yizeng Han, Jiayi Guo, Yiru Wang, Xiu Li, Gao Huang

    Abstract: Oriented object detection, an emerging task in recent years, aims to identify and locate objects across varied orientations. This requires the detector to accurately capture the orientation information, which varies significantly within and across images. Despite the existing substantial efforts, simultaneously ensuring model effectiveness and parameter efficiency remains challenging in this scena… ▽ More

    Submitted 19 March, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

    Comments: tech report

  22. arXiv:2403.08357  [pdf

    cond-mat.mtrl-sci physics.comp-ph

    Geometric and electronic properties of two kinds of CrO2 magnetic monolayers: D3d and D2h phases

    Authors: Yang Zhang, Xianggong Bo, Jimeng Jing, Lixia Wang, Shiqian Qiao, Hong Wu, Yong Pu, Feng Li

    Abstract: Due to the high magnetic coupling strength between the Cr elements, the bulk phase CrO2 is one of several ferromagnetic oxides known to have the highest Curie temperature. When the dimensionality of the material is reduced from 3D to 2D, the 2D CrO2 system material is expected to maintain a high Curie temperature. In this work, we predict two new phases of CrO2 monolayer (D3d and D2h) by using fir… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 5 pages,4 figures

  23. arXiv:2403.07153  [pdf, other

    cs.CV

    2023 Low-Power Computer Vision Challenge (LPCVC) Summary

    Authors: Leo Chen, Benjamin Boardley, Ping Hu, Yiru Wang, Yifan Pu, Xin Jin, Yongqiang Yao, Ruihao Gong, Bo Li, Gao Huang, Xianglong Liu, Zifu Wan, Xinwang Chen, Ning Liu, Ziyi Zhang, Dongping Liu, Ruijie Shan, Zhengping Che, Fachao Zhang, Xiaofeng Mou, Jian Tang, Maxim Chuprov, Ivan Malofeev, Alexander Goncharenko, Andrey Shcherbin , et al. (5 additional authors not shown)

    Abstract: This article describes the 2023 IEEE Low-Power Computer Vision Challenge (LPCVC). Since 2015, LPCVC has been an international competition devoted to tackling the challenge of computer vision (CV) on edge devices. Most CV researchers focus on improving accuracy, at the expense of ever-growing sizes of machine models. LPCVC balances accuracy with resource requirements. Winners must achieve high accu… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: LPCVC 2023, website: https://lpcv.ai/

  24. arXiv:2402.12326  [pdf, other

    cs.CL cs.CY cs.HC cs.LG cs.MA

    LLM Agents for Psychology: A Study on Gamified Assessments

    Authors: Qisen Yang, Zekun Wang, Honghui Chen, Shenzhi Wang, Yifan Pu, Xin Gao, Wenhao Huang, Shiji Song, Gao Huang

    Abstract: Psychological measurement is essential for mental health, self-understanding, and personal development. Traditional methods, such as self-report scales and psychologist interviews, often face challenges with engagement and accessibility. While game-based and LLM-based tools have been explored to improve user interest and automate assessment, they struggle to balance engagement with generalizabilit… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  25. arXiv:2402.03741  [pdf, other

    cs.LG cs.AI cs.CR

    SUB-PLAY: Adversarial Policies against Partially Observed Multi-Agent Reinforcement Learning Systems

    Authors: Oubo Ma, Yuwen Pu, Linkang Du, Yang Dai, Ruo Wang, Xiaolei Liu, Yingcai Wu, Shouling Ji

    Abstract: Recent advancements in multi-agent reinforcement learning (MARL) have opened up vast application prospects, such as swarm control of drones, collaborative manipulation by robotic arms, and multi-target encirclement. However, potential security threats during the MARL deployment need more attention and thorough investigation. Recent research reveals that attackers can rapidly exploit the victim's v… ▽ More

    Submitted 26 June, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: To appear in the ACM Conference on Computer and Communications Security (CCS'24), October 14-18, 2024, Salt Lake City, UT, USA

  26. arXiv:2401.14027  [pdf, other

    cs.LG

    The Risk of Federated Learning to Skew Fine-Tuning Features and Underperform Out-of-Distribution Robustness

    Authors: Mengyao Du, Miao Zhang, Yuwen Pu, Kai Xu, Shouling Ji, Quanjun Yin

    Abstract: To tackle the scarcity and privacy issues associated with domain-specific datasets, the integration of federated learning in conjunction with fine-tuning has emerged as a practical solution. However, our findings reveal that federated learning has the risk of skewing fine-tuning features and compromising the out-of-distribution robustness of the model. By introducing three robustness indicators an… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: 12 pages, 10 figures

  27. arXiv:2401.11942  [pdf

    physics.optics physics.app-ph

    Single-Photon-Assisted Two-Photon Polymerization

    Authors: Buse Unlu, Maria Isabel Álvarez-Castaño, Antoine Boniface, Ye Pu, Christophe Moser

    Abstract: Light-based additive manufacturing (AM) has revolutionized the fabrication of complex three-dimensional (3D) objects offering a cost-effective and high-speed alternative to traditional machining. One-photon polymerization is a key process in this advancement, standing out for rapid printing time, albeit with limited resolution. Two-photon polymerization (2PP) empowers AM with unprecedented resolut… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: 18 pages, 11 figures

  28. arXiv:2312.14677  [pdf, other

    cs.CR cs.AI

    MEAOD: Model Extraction Attack against Object Detectors

    Authors: Zeyu Li, Chenghui Shi, Yuwen Pu, Xuhong Zhang, Yu Li, Jinbao Li, Shouling Ji

    Abstract: The widespread use of deep learning technology across various industries has made deep neural network models highly valuable and, as a result, attractive targets for potential attackers. Model extraction attacks, particularly query-based model extraction attacks, allow attackers to replicate a substitute model with comparable functionality to the victim model and present a significant threat to th… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

  29. arXiv:2312.10072  [pdf, other

    cs.HC cs.AI cs.LG stat.AP

    Assessing the Usability of GutGPT: A Simulation Study of an AI Clinical Decision Support System for Gastrointestinal Bleeding Risk

    Authors: Colleen Chan, Kisung You, Sunny Chung, Mauro Giuffrè, Theo Saarinen, Niroop Rajashekar, Yuan Pu, Yeo Eun Shin, Loren Laine, Ambrose Wong, René Kizilcec, Jasjeet Sekhon, Dennis Shung

    Abstract: Applications of large language models (LLMs) like ChatGPT have potential to enhance clinical decision support through conversational interfaces. However, challenges of human-algorithmic interaction and clinician trust are poorly understood. GutGPT, a LLM for gastrointestinal (GI) bleeding risk prediction and management guidance, was deployed in clinical simulation scenarios alongside the electroni… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2023, December 10, 2023, New Orleans, United States, 11 pages

  30. arXiv:2312.09708  [pdf, other

    cs.LG cs.AI

    GraphRARE: Reinforcement Learning Enhanced Graph Neural Network with Relative Entropy

    Authors: Tianhao Peng, Wenjun Wu, Haitao Yuan, Zhifeng Bao, Zhao Pengrui, Xin Yu, Xuetao Lin, Yu Liang, Yanjun Pu

    Abstract: Graph neural networks (GNNs) have shown advantages in graph-based analysis tasks. However, most existing methods have the homogeneity assumption and show poor performance on heterophilic graphs, where the linked nodes have dissimilar features and different class labels, and the semantically related nodes might be multi-hop away. To address this limitation, this paper presents GraphRARE, a general… ▽ More

    Submitted 13 April, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: 14 pages, 7 figures

  31. arXiv:2312.06408  [pdf, other

    cs.LG cs.AI cs.RO

    DiffVL: Scaling Up Soft Body Manipulation using Vision-Language Driven Differentiable Physics

    Authors: Zhiao Huang, Feng Chen, Yewen Pu, Chunru Lin, Hao Su, Chuang Gan

    Abstract: Combining gradient-based trajectory optimization with differentiable physics simulation is an efficient technique for solving soft-body manipulation problems. Using a well-crafted optimization objective, the solver can quickly converge onto a valid trajectory. However, writing the appropriate objective functions requires expert knowledge, making it difficult to collect a large set of naturalistic… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  32. arXiv:2312.06226  [pdf, other

    cs.CV cs.AI

    Invariant Representation via Decoupling Style and Spurious Features from Images

    Authors: Ruimeng Li, Yuanhao Pu, Zhaoyi Li, Hong Xie, Defu Lian

    Abstract: This paper considers the out-of-distribution (OOD) generalization problem under the setting that both style distribution shift and spurious features exist and domain labels are missing. This setting frequently arises in real-world applications and is underlooked because previous approaches mainly handle either of these two factors. The critical challenge is decoupling style and spurious features i… ▽ More

    Submitted 1 April, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: 10 pages, 12 figures

    ACM Class: I.2.6; I.2.10

  33. arXiv:2312.04410  [pdf, other

    cs.CV

    Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models

    Authors: Jiayi Guo, Xingqian Xu, Yifan Pu, Zanlin Ni, Chaofei Wang, Manushree Vasu, Shiji Song, Gao Huang, Humphrey Shi

    Abstract: Recently, diffusion models have made remarkable progress in text-to-image (T2I) generation, synthesizing images with high fidelity and diverse contents. Despite this advancement, latent space smoothness within diffusion models remains largely unexplored. Smooth latent spaces ensure that a perturbation on an input latent corresponds to a steady change in the output image. This property proves benef… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: GitHub: https://github.com/SHI-Labs/Smooth-Diffusion

  34. arXiv:2311.17400  [pdf, other

    cs.CL cs.CR cs.LG

    Improving the Robustness of Transformer-based Large Language Models with Dynamic Attention

    Authors: Lujia Shen, Yuwen Pu, Shouling Ji, Changjiang Li, Xuhong Zhang, Chunpeng Ge, Ting Wang

    Abstract: Transformer-based models, such as BERT and GPT, have been widely adopted in natural language processing (NLP) due to their exceptional performance. However, recent studies show their vulnerability to textual adversarial attacks where the model's output can be misled by intentionally manipulating the text inputs. Despite various methods that have been proposed to enhance the model's robustness and… ▽ More

    Submitted 29 November, 2023; v1 submitted 29 November, 2023; originally announced November 2023.

  35. arXiv:2311.13455  [pdf, other

    cs.AI cs.CL

    Generation of Explanations for Logic Reasoning

    Authors: Yanyi Pu

    Abstract: This thesis delves into a fortiori arguments in deductive reasoning, underscoring their relevance in various domains such as law, philosophy, and artificial intelligence. The research is centred on employing GPT-3.5-turbo to automate the analysis of these arguments, with a focus on understanding intricate reasoning processes, generating clear and coherent explanations, and creating novel arguments… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

    Comments: 78 Pages, 16 Figures, Thesis Presentation is available at https://drive.google.com/file/d/1wLIBsjfLvO11PjCS6qx4Y9UgRBUfq3wQ/view?usp=sharing

  36. arXiv:2311.12255  [pdf, other

    cs.LG cs.SI

    Exploring Time Granularity on Temporal Graphs for Dynamic Link Prediction in Real-world Networks

    Authors: Xiangjian Jiang, Yanyi Pu

    Abstract: Dynamic Graph Neural Networks (DGNNs) have emerged as the predominant approach for processing dynamic graph-structured data. However, the influence of temporal information on model performance and robustness remains insufficiently explored, particularly regarding how models address prediction tasks with different time granularities. In this paper, we explore the impact of time granularity when tra… ▽ More

    Submitted 22 November, 2023; v1 submitted 20 November, 2023; originally announced November 2023.

    Comments: Presented at the Temporal Graph Learning Workshop @ NeurIPS 2023

  37. arXiv:2311.10292  [pdf, other

    quant-ph cs.ET physics.optics

    Realization of a programmable multi-purpose photonic quantum memory with over-thousand qubit manipulations

    Authors: Sheng Zhang, Jixuan Shi, Zhaibin Cui, Ye Wang, Yukai Wu, Luming Duan, Yunfei Pu

    Abstract: Quantum networks can enable various applications such as distributed quantum computing, long-distance quantum communication, and network-based quantum sensing with unprecedented performances. One of the most important building blocks for a quantum network is a photonic quantum memory which serves as the interface between the communication channel and the local functional unit. A programmable quant… ▽ More

    Submitted 29 April, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: 17 pages, 19 figures

    Journal ref: Phys. Rev. X 14, 021018 (2024)

  38. arXiv:2311.05740  [pdf, other

    cs.LG cs.AI cs.PL

    Generating Pragmatic Examples to Train Neural Program Synthesizers

    Authors: Saujas Vaduguru, Daniel Fried, Yewen Pu

    Abstract: Programming-by-example is the task of synthesizing a program that is consistent with a set of user-provided input-output examples. As examples are often an under-specification of one's intent, a good synthesizer must choose the intended program from the many that are consistent with the given set of examples. Prior work frames program synthesis as a cooperative game between a listener (that synthe… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

  39. arXiv:2310.17815  [pdf, other

    math.AP math-ph nlin.PS physics.flu-dyn

    Stability of Inverse Problems for Steady Supersonic Flows Past Lipschitz Perturbed Cones

    Authors: Gui-Qiang G. Chen, Yun Pu, Yongqian Zhang

    Abstract: We are concerned with inverse problems for supersonic potential flows past infinite axisymmetric Lipschitz cones. The supersonic flows under consideration are governed by the steady isentropic Euler equations for axisymmetric potential flows, which involve a singular geometric source term. We first study the inverse problem for the stability of an oblique conical shock as an initial-boundary value… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: 41 pages, 5 figures. arXiv admin note: text overlap with arXiv:2008.02409

    MSC Class: 35B07; 35B20; 35D30; 35L65; 35L67; 76J20; 76L05; 76N10

  40. arXiv:2310.15590  [pdf, other

    cs.CR cs.CV

    Facial Data Minimization: Shallow Model as Your Privacy Filter

    Authors: Yuwen Pu, Jiahao Chen, Jiayu Pan, Hao li, Diqun Yan, Xuhong Zhang, Shouling Ji

    Abstract: Face recognition service has been used in many fields and brings much convenience to people. However, once the user's facial data is transmitted to a service provider, the user will lose control of his/her private data. In recent years, there exist various security and privacy issues due to the leakage of facial data. Although many privacy-preserving methods have been proposed, they usually fail w… ▽ More

    Submitted 12 November, 2023; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: 14 pages, 11 figures

  41. arXiv:2310.11881  [pdf, other

    cs.CV

    A Comparative Study of Image Restoration Networks for General Backbone Network Design

    Authors: Xiangyu Chen, Zheyuan Li, Yuandong Pu, Yihao Liu, Jiantao Zhou, Yu Qiao, Chao Dong

    Abstract: Despite the significant progress made by deep models in various image restoration tasks, existing image restoration networks still face challenges in terms of task generality. An intuitive manifestation is that networks which excel in certain tasks often fail to deliver satisfactory results in others. To illustrate this point, we select five representative networks and conduct a comparative study… ▽ More

    Submitted 16 July, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

    Comments: Accepted to ECCV2024

  42. arXiv:2310.11614  [pdf, other

    cs.AI

    Learning a Hierarchical Planner from Humans in Multiple Generations

    Authors: Leonardo Hernandez Cano, Yewen Pu, Robert D. Hawkins, Josh Tenenbaum, Armando Solar-Lezama

    Abstract: A typical way in which a machine acquires knowledge from humans is by programming. Compared to learning from demonstrations or experiences, programmatic learning allows the machine to acquire a novel skill as soon as the program is written, and, by building a library of programs, a machine can quickly learn how to perform complex tasks. However, as programs often take their execution contexts for… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: First two authors contributed equally

  43. arXiv:2310.08854  [pdf, other

    cs.CV cs.LG

    Rank-DETR for High Quality Object Detection

    Authors: Yifan Pu, Weicong Liang, Yiduo Hao, Yuhui Yuan, Yukang Yang, Chao Zhang, Han Hu, Gao Huang

    Abstract: Modern detection transformers (DETRs) use a set of object queries to predict a list of bounding boxes, sort them by their classification confidence scores, and select the top-ranked predictions as the final detection results for the given input image. A highly performant object detector requires accurate ranking for the bounding box predictions. For DETR-based detectors, the top-ranked bounding bo… ▽ More

    Submitted 2 November, 2023; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023

  44. arXiv:2310.08348  [pdf, other

    cs.LG

    LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios

    Authors: Yazhe Niu, Yuan Pu, Zhenjie Yang, Xueyan Li, Tong Zhou, Jiyuan Ren, Shuai Hu, Hongsheng Li, Yu Liu

    Abstract: Building agents based on tree-search planning capabilities with learned models has achieved remarkable success in classic decision-making problems, such as Go and Atari. However, it has been deemed challenging or even infeasible to extend Monte Carlo Tree Search (MCTS) based algorithms to diverse real-world applications, especially when these environments involve complex action spaces and signific… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023 Spotlight

  45. arXiv:2309.16077  [pdf, other

    cs.RO cs.LG eess.SY

    Task-Oriented Koopman-Based Control with Contrastive Encoder

    Authors: Xubo Lyu, Hanyang Hu, Seth Siriya, Ye Pu, Mo Chen

    Abstract: We present task-oriented Koopman-based control that utilizes end-to-end reinforcement learning and contrastive encoder to simultaneously learn the Koopman latent embedding, operator, and associated linear controller within an iterative loop. By prioritizing the task cost as the main objective for controller learning, we reduce the reliance of controller design on a well-identified model, which, fo… ▽ More

    Submitted 1 November, 2023; v1 submitted 27 September, 2023; originally announced September 2023.

    Comments: Accepted by the 7th Annual Conference on Robot Learning (CoRL), 2023 (oral spotlight)

  46. arXiv:2309.05660  [pdf, other

    cs.LG cs.AI cs.CL

    Hypothesis Search: Inductive Reasoning with Language Models

    Authors: Ruocheng Wang, Eric Zelikman, Gabriel Poesia, Yewen Pu, Nick Haber, Noah D. Goodman

    Abstract: Inductive reasoning is a core problem-solving capacity: humans can identify underlying principles from a few examples, which robustly generalize to novel scenarios. Recent work evaluates large language models (LLMs) on inductive reasoning tasks by directly prompting them yielding "in context learning." This works well for straightforward inductive tasks but performs poorly on complex tasks such as… ▽ More

    Submitted 30 May, 2024; v1 submitted 11 September, 2023; originally announced September 2023.

    Comments: ICLR 2024. The first two authors contributed equally. Code: https://github.com/Relento/hypothesis_search

  47. arXiv:2309.03225  [pdf, other

    cs.PL cs.AI

    Amortizing Pragmatic Program Synthesis with Rankings

    Authors: Yewen Pu, Saujas Vaduguru, Priyan Vaithilingam, Elena Glassman, Daniel Fried

    Abstract: In program synthesis, an intelligent system takes in a set of user-generated examples and returns a program that is logically consistent with these examples. The usage of Rational Speech Acts (RSA) framework has been successful in building \emph{pragmatic} program synthesizers that return programs which -- in addition to being logically consistent -- account for the fact that a user chooses their… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

    ACM Class: I.2.2; D.3.0

  48. arXiv:2309.00399  [pdf, other

    cs.CV

    Fine-grained Recognition with Learnable Semantic Data Augmentation

    Authors: Yifan Pu, Yizeng Han, Yulin Wang, Junlan Feng, Chao Deng, Gao Huang

    Abstract: Fine-grained image recognition is a longstanding computer vision challenge that focuses on differentiating objects belonging to multiple subordinate categories within the same meta-category. Since images belonging to the same meta-category usually share similar visual appearances, mining discriminative visual cues is the key to distinguishing fine-grained categories. Although commonly used image-l… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

  49. arXiv:2308.15949  [pdf, other

    cs.CV

    Latency-aware Unified Dynamic Networks for Efficient Image Recognition

    Authors: Yizeng Han, Zeyu Liu, Zhihang Yuan, Yifan Pu, Chaofei Wang, Shiji Song, Gao Huang

    Abstract: Dynamic computation has emerged as a promising avenue to enhance the inference efficiency of deep networks. It allows selective activation of computational units, leading to a reduction in unnecessary computations for each input sample. However, the actual efficiency of these dynamic models can deviate from theoretical predictions. This mismatch arises from: 1) the lack of a unified approach due t… ▽ More

    Submitted 20 February, 2024; v1 submitted 30 August, 2023; originally announced August 2023.

  50. arXiv:2308.06656  [pdf, other

    cs.HC

    The Usability of Pragmatic Communication in Regular Expression Synthesis

    Authors: Priyan Vaithilingam, Yewen Pu, Elena L. Glassman

    Abstract: Programming-by-example (PBE) systems aim to alleviate the burden of programming. However, user-specified examples are often ambiguous, leaving multiple programs to satisfy the specification. Consequently, in most prior work, users have had to provide additional examples, particularly negative ones, to further constrain the search over compatible programs. Recent work resolves additional ambiguity… ▽ More

    Submitted 12 August, 2023; originally announced August 2023.