Skip to main content

Showing 1–50 of 433 results for author: Sun, R

  1. arXiv:2407.11331  [pdf, ps, other

    math.MG math.CO

    Illuminating 1-unconditional convex bodies in ${\mathbb R}^3$ and ${\mathbb R}^4$, and certain cases in higher dimensions

    Authors: Wen Rui Sun, Beatrice-Helen Vritsiou

    Abstract: We settle the Hadwiger-Boltyanski Illumination Conjecture for all 1-unconditional convex bodies in ${\mathbb R}^3$ and in ${\mathbb R}^4$. Moreover, we settle the conjecture for those higher-dimensional 1-unconditional convex bodies which have at least one coordinate hyperplane projection equal to the corresponding projection of the circumscribing rectangular box. Finally, we confirm the conjectur… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 62 pages

    MSC Class: 52A40; 52A37 (Primary); 52A20; 52C07 (Secondary)

  2. arXiv:2407.10956  [pdf, other

    cs.AI cs.CL

    Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

    Authors: Ruisheng Cao, Fangyu Lei, Haoyuan Wu, Jixuan Chen, Yeqiao Fu, Hongcheng Gao, Xinzhuang Xiong, Hanchong Zhang, Yuchen Mao, Wenjing Hu, Tianbao Xie, Hongshen Xu, Danyang Zhang, Sida Wang, Ruoxi Sun, Pengcheng Yin, Caiming Xiong, Ansong Ni, Qian Liu, Victor Zhong, Lu Chen, Kai Yu, Tao Yu

    Abstract: Data science and engineering workflows often span multiple stages, from warehousing to orchestration, using tools like BigQuery, dbt, and Airbyte. As vision language models (VLMs) advance in multimodal understanding and code generation, VLM-based agents could potentially automate these workflows by generating SQL queries, Python code, and GUI operations. This automation can improve the productivit… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 34 pages, 14 figures, 10 tables

  3. arXiv:2407.10314  [pdf, other

    math.MG math.CO

    On the illumination of 1-symmetric convex bodies

    Authors: Wen Rui Sun, Beatrice-Helen Vritsiou

    Abstract: In ["Illumination of convex bodies with many symmetries", Mathematika 63 (2017)], Tikhomirov verified the Hadwiger-Boltyanski Illumination Conjecture for the class of 1-symmetric convex bodies of sufficiently large dimension. We propose an alternative approach which allows us to settle the conjecture for this class in all dimensions in a uniform way. We also demonstrate that an alternative approac… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: 1 figure

    MSC Class: 52A40; 52A37 (Primary); 52A20; 52C07 (Secondary)

  4. arXiv:2407.08995  [pdf, other

    cs.CL

    Self-Prompt Tuning: Enable Autonomous Role-Playing in LLMs

    Authors: Aobo Kong, Shiwan Zhao, Hao Chen, Qicheng Li, Yong Qin, Ruiqi Sun, Xin Zhou, Jiaming Zhou, Haoqin Sun

    Abstract: Recent advancements in LLMs have showcased their remarkable role-playing capabilities, able to accurately simulate the dialogue styles and cognitive processes of various roles based on different instructions and contexts. Studies indicate that assigning LLMs the roles of experts, a strategy known as role-play prompting, can enhance their performance in the corresponding domains. However, the promp… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

  5. arXiv:2407.08377  [pdf, other

    cs.CV

    Long-range Turbulence Mitigation: A Large-scale Dataset and A Coarse-to-fine Framework

    Authors: Shengqi Xu, Run Sun, Yi Chang, Shuning Cao, Xueyao Xiao, Luxin Yan

    Abstract: Long-range imaging inevitably suffers from atmospheric turbulence with severe geometric distortions due to random refraction of light. The further the distance, the more severe the disturbance. Despite existing research has achieved great progress in tackling short-range turbulence, there is less attention paid to long-range turbulence with significant distortions. To address this dilemma and adva… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: This paper is accepted by ECCV 2024

  6. arXiv:2407.05314  [pdf, other

    astro-ph.IM physics.optics

    Additive manufacturing in ceramics: targeting lightweight mirror applications in the visible, ultraviolet and X-ray

    Authors: Carolyn Atkins, Younes Chahid, Gregory Lister, Rhys Tuck, David Isherwood, Nan Yu, Rongyan Sun, Itsuki Noto, Kazuya Yamamura, Marta Civitani, Gabriele Vecchi, Giovanni Pareschi, Simon G. Alcock, Ioana-Theodora Nistea, Murilo Bazan Da Silva

    Abstract: Additive manufacturing (AM; 3D printing) has clear benefits in the production of lightweight mirrors for astronomy: it can create optimised lightweight structures and combine multiple components into one. New capabilities in AM ceramics, silicon carbide infiltrated with silicon and fused silica, offer the possibility to combine the design benefits of AM with a material suitable for visible, ultrav… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: 15 pages, 15 figures, submitted to SPIE Astronomical Telescopes & Instrumentation, Advances in Optical and Mechanical Technologies for Telescopes and Instrumentation VI (Conference 13100, Paper 123)

  7. arXiv:2407.03954  [pdf, other

    cs.DB

    Efficient Maximal Frequent Group Enumeration in Temporal Bipartite Graphs

    Authors: Yanping Wu, Renjie Sun, Xiaoyang Wang, Dong Wen, Ying Zhang, Lu Qin, Xuemin Lin

    Abstract: Cohesive subgraph mining is a fundamental problem in bipartite graph analysis. In reality, relationships between two types of entities often occur at some specific timestamps, which can be modeled as a temporal bipartite graph. However, the temporal information is widely neglected by previous studies. Moreover, directly extending the existing models may fail to find some critical groups in tempora… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  8. arXiv:2407.02532  [pdf

    physics.app-ph

    Broadband planar electromagnetic hyper-lens with uniform magnification in air

    Authors: Ran Sun, Fei Sun, Hanchuan Chen, Yichao Liu, Qi Wang

    Abstract: A planar hyper-lens, capable of creating sub-wavelength imaging for broadband electromagnetic wave, is designed based on electromagnetic null medium. Subsequently, a scheme for the implementation of the proposed hyper-lens is given by using well-designed flexural metal plates, which function as the reduced electromagnetic null medium for TM-polarized microwaves. Both simulated and measured results… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  9. arXiv:2406.16793  [pdf, other

    cs.LG cs.AI

    Adam-mini: Use Fewer Learning Rates To Gain More

    Authors: Yushun Zhang, Congliang Chen, Ziniu Li, Tian Ding, Chenwei Wu, Yinyu Ye, Zhi-Quan Luo, Ruoyu Sun

    Abstract: We propose Adam-mini, an optimizer that achieves on-par or better performance than AdamW with 45% to 50% less memory footprint. Adam-mini reduces memory by cutting down the learning rate resources in Adam (i.e., $1/\sqrt{v}$). We find that $\geq$ 90% of these learning rates in $v$ could be harmlessly removed if we (1) carefully partition the parameters into blocks following our proposed principle… ▽ More

    Submitted 3 July, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

  10. arXiv:2406.15708  [pdf, other

    cs.CL cs.AI cs.LG

    Teach Better or Show Smarter? On Instructions and Exemplars in Automatic Prompt Optimization

    Authors: Xingchen Wan, Ruoxi Sun, Hootan Nakhost, Sercan O. Arik

    Abstract: Large language models have demonstrated remarkable capabilities, but their performance is heavily reliant on effective prompt engineering. Automatic prompt optimization (APO) methods are designed to automate this and can be broadly categorized into those targeting instructions (instruction optimization, IO) vs. those targeting exemplars (exemplar selection, ES). Despite their shared objective, the… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  11. arXiv:2406.08688  [pdf, other

    cs.SE cs.AI

    On Security Weaknesses and Vulnerabilities in Deep Learning Systems

    Authors: Zhongzheng Lai, Huaming Chen, Ruoxi Sun, Yu Zhang, Minhui Xue, Dong Yuan

    Abstract: The security guarantee of AI-enabled software systems (particularly using deep learning techniques as a functional core) is pivotal against the adversarial attacks exploiting software vulnerabilities. However, little attention has been paid to a systematic investigation of vulnerabilities in such systems. A common situation learned from the open source software community is that deep learning engi… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  12. arXiv:2406.05372  [pdf, ps, other

    stat.ML cs.LG

    Bridging the Gap: Rademacher Complexity in Robust and Standard Generalization

    Authors: Jiancong Xiao, Ruoyu Sun, Qi Long, Weijie J. Su

    Abstract: Training Deep Neural Networks (DNNs) with adversarial examples often results in poor generalization to test-time adversarial data. This paper investigates this issue, known as adversarially robust generalization, through the lens of Rademacher complexity. Building upon the studies by Khim and Loh (2018); Yin et al. (2019), numerous works have been dedicated to this problem, yet achieving a satisfa… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

    Comments: COLT 2024

  13. arXiv:2406.03746  [pdf, other

    cs.CL cs.AI

    Efficient Knowledge Infusion via KG-LLM Alignment

    Authors: Zhouyu Jiang, Ling Zhong, Mengshu Sun, Jun Xu, Rui Sun, Hui Cai, Shuhan Luo, Zhiqiang Zhang

    Abstract: To tackle the problem of domain-specific knowledge scarcity within large language models (LLMs), knowledge graph-retrievalaugmented method has been proven to be an effective and efficient technique for knowledge infusion. However, existing approaches face two primary challenges: knowledge mismatch between public available knowledge graphs and the specific domain of the task at hand, and poor infor… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: ACL2024 Findings

  14. arXiv:2406.02818  [pdf, other

    cs.CL

    Chain of Agents: Large Language Models Collaborating on Long-Context Tasks

    Authors: Yusen Zhang, Ruoxi Sun, Yanfei Chen, Tomas Pfister, Rui Zhang, Sercan Ö. Arik

    Abstract: Addressing the challenge of effectively processing long contexts has become a critical issue for Large Language Models (LLMs). Two common strategies have emerged: 1) reducing the input length, such as retrieving relevant chunks by Retrieval-Augmented Generation (RAG), and 2) expanding the context window limit of LLMs. However, both strategies have drawbacks: input reduction has no guarantee of cov… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 19 pages, 6 figures

  15. arXiv:2406.01908  [pdf, other

    cs.LG math.OC

    PDHG-Unrolled Learning-to-Optimize Method for Large-Scale Linear Programming

    Authors: Bingheng Li, Linxin Yang, Yupeng Chen, Senmiao Wang, Qian Chen, Haitao Mao, Yao Ma, Akang Wang, Tian Ding, Jiliang Tang, Ruoyu Sun

    Abstract: Solving large-scale linear programming (LP) problems is an important task in various areas such as communication networks, power systems, finance and logistics. Recently, two distinct approaches have emerged to expedite LP solving: (i) First-order methods (FOMs); (ii) Learning to optimize (L2O). In this work, we propose an FOM-unrolled neural network (NN) called PDHG-Net, and propose a two-stage L… ▽ More

    Submitted 6 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted by ICML 2024

  16. arXiv:2406.00222  [pdf, other

    cs.CL cs.AI cs.LG

    Learning to Clarify: Multi-turn Conversations with Action-Based Contrastive Self-Training

    Authors: Maximillian Chen, Ruoxi Sun, Sercan Ö. Arık, Tomas Pfister

    Abstract: Large language models (LLMs) aligned through reinforcement learning from human feedback (RLHF) have quickly become one of the dominant paradigms for building intelligent conversational assistant agents. However, despite their strong performance across many benchmarks, LLM-based agents still lack conversational skills such as disambiguation: when generalized assistants are faced with ambiguity, the… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

  17. arXiv:2405.20639  [pdf

    physics.bio-ph cond-mat.soft

    Effects of degumming conditions on the structure of the regenerated silk fibroin and the properties of its film

    Authors: Ruixue Sun, Junli Hu

    Abstract: The traditional degumming method using sodium carbonate solution severely damages the structure of silk fibroin and results in low molecular weight, which limits the properties and applications of silk materials. In this study, we report a modified degumming method and compared it with the traditional one. The results indicate that compared with the traditional degumming method, the modified degum… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

  18. arXiv:2405.15258  [pdf, other

    cs.CR

    Leakage-Resilient and Carbon-Neutral Aggregation Featuring the Federated AI-enabled Critical Infrastructure

    Authors: Zehang Deng, Ruoxi Sun, Minhui Xue, Sheng Wen, Seyit Camtepe, Surya Nepal, Yang Xiang

    Abstract: AI-enabled critical infrastructures (ACIs) integrate artificial intelligence (AI) technologies into various essential systems and services that are vital to the functioning of society, offering significant implications for efficiency, security and resilience. While adopting decentralized AI approaches (such as federated learning technology) in ACIs is plausible, private and sensitive data are stil… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  19. arXiv:2405.13900  [pdf, other

    cs.LG cs.CV

    Rehearsal-free Federated Domain-incremental Learning

    Authors: Rui Sun, Haoran Duan, Jiahua Dong, Varun Ojha, Tejal Shah, Rajiv Ranjan

    Abstract: We introduce a rehearsal-free federated domain incremental learning framework, RefFiL, based on a global prompt-sharing paradigm to alleviate catastrophic forgetting challenges in federated domain-incremental learning, where unseen domains are continually learned. Typical methods for mitigating forgetting, such as the use of additional datasets and the retention of private data from earlier tasks,… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  20. arXiv:2405.11145  [pdf, other

    cs.CV cs.AI cs.MM

    Detecting Multimodal Situations with Insufficient Context and Abstaining from Baseless Predictions

    Authors: Junzhang Liu, Zhecan Wang, Hammad Ayyubi, Haoxuan You, Chris Thomas, Rui Sun, Shih-Fu Chang, Kai-Wei Chang

    Abstract: Despite the widespread adoption of Vision-Language Understanding (VLU) benchmarks such as VQA v2, OKVQA, A-OKVQA, GQA, VCR, SWAG, and VisualCOMET, our analysis reveals a pervasive issue affecting their integrity: these benchmarks contain samples where answers rely on assumptions unsupported by the provided context. Training models on such data foster biased learning and hallucinations as models te… ▽ More

    Submitted 25 May, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

  21. arXiv:2405.10674  [pdf, other

    cs.CV cs.AI

    From Sora What We Can See: A Survey of Text-to-Video Generation

    Authors: Rui Sun, Yumin Zhang, Tejal Shah, Jiahao Sun, Shuoying Zhang, Wenqi Li, Haoran Duan, Bo Wei, Rajiv Ranjan

    Abstract: With impressive achievements made, artificial intelligence is on the path forward to artificial general intelligence. Sora, developed by OpenAI, which is capable of minute-level world-simulative abilities can be considered as a milestone on this developmental path. However, despite its notable successes, Sora still encounters various obstacles that need to be resolved. In this survey, we embark fr… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: A comprehensive list of text-to-video generation studies in this survey is available at https://github.com/soraw-ai/Awesome-Text-to-Video-Generation

  22. arXiv:2405.06263  [pdf, other

    cs.LG cs.AI

    Learning Latent Dynamic Robust Representations for World Models

    Authors: Ruixiang Sun, Hongyu Zang, Xin Li, Riashat Islam

    Abstract: Visual Model-Based Reinforcement Learning (MBRL) promises to encapsulate agent's knowledge about the underlying dynamics of the environment, enabling learning a world model as a useful planner. However, top MBRL agents such as Dreamer often struggle with visual pixel-based inputs in the presence of exogenous or irrelevant noise in the observation space, due to failure to capture task-specific feat… ▽ More

    Submitted 30 May, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

    Journal ref: ICML 2024

  23. arXiv:2405.05068  [pdf, other

    quant-ph cond-mat.other physics.chem-ph physics.comp-ph

    Chemistry Beyond Exact Solutions on a Quantum-Centric Supercomputer

    Authors: Javier Robledo-Moreno, Mario Motta, Holger Haas, Ali Javadi-Abhari, Petar Jurcevic, William Kirby, Simon Martiel, Kunal Sharma, Sandeep Sharma, Tomonori Shirakawa, Iskandar Sitdikov, Rong-Yang Sun, Kevin J. Sung, Maika Takita, Minh C. Tran, Seiji Yunoki, Antonio Mezzacapo

    Abstract: A universal quantum computer can be used as a simulator capable of predicting properties of diverse quantum systems. Electronic structure problems in chemistry offer practical use cases around the hundred-qubit mark. This appears promising since current quantum processors have reached these sizes. However, mapping these use cases onto quantum computers yields deep circuits, and for for pre-fault-t… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  24. arXiv:2404.19093  [pdf, other

    cs.IR cs.AI cs.HC

    Large Language Models as Conversational Movie Recommenders: A User Study

    Authors: Ruixuan Sun, Xinyi Li, Avinash Akella, Joseph A. Konstan

    Abstract: This paper explores the effectiveness of using large language models (LLMs) for personalized movie recommendations from users' perspectives in an online field experiment. Our study involves a combination of between-subject prompt and historic consumption assessments, along with within-subject recommendation scenario evaluations. By examining conversation and survey response data from 160 active us… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  25. arXiv:2404.18416  [pdf, other

    cs.AI cs.CL cs.CV cs.LG

    Capabilities of Gemini Models in Medicine

    Authors: Khaled Saab, Tao Tu, Wei-Hung Weng, Ryutaro Tanno, David Stutz, Ellery Wulczyn, Fan Zhang, Tim Strother, Chunjong Park, Elahe Vedadi, Juanma Zambrano Chaves, Szu-Yeu Hu, Mike Schaekermann, Aishwarya Kamath, Yong Cheng, David G. T. Barrett, Cathy Cheung, Basil Mustafa, Anil Palepu, Daniel McDuff, Le Hou, Tomer Golany, Luyang Liu, Jean-baptiste Alayrac, Neil Houlsby , et al. (42 additional authors not shown)

    Abstract: Excellence in a wide variety of medical applications poses considerable challenges for AI, requiring advanced reasoning, access to up-to-date medical knowledge and understanding of complex multimodal data. Gemini models, with strong general capabilities in multimodal and long-context reasoning, offer exciting possibilities in medicine. Building on these core strengths of Gemini, we introduce Med-G… ▽ More

    Submitted 1 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

  26. arXiv:2404.07889  [pdf, other

    cs.RO

    On the Performance of Jerk-Constrained Time-Optimal Trajectory Planning for Industrial Manipulators

    Authors: Jee-eun Lee, Andrew Bylard, Robert Sun, Luis Sentis

    Abstract: Jerk-constrained trajectories offer a wide range of advantages that collectively improve the performance of robotic systems, including increased energy efficiency, durability, and safety. In this paper, we present a novel approach to jerk-constrained time-optimal trajectory planning (TOTP), which follows a specified path while satisfying up to third-order constraints to ensure safety and smooth mo… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  27. arXiv:2404.07425  [pdf, ps, other

    eess.SP cs.IT

    Precoder Design for User-Centric Network Massive MIMO with Matrix Manifold Optimization

    Authors: Rui Sun, Li You, An-An Lu, Chen Sun, Xiqi Gao, Xiang-Gen Xia

    Abstract: In this paper, we investigate the precoder design for user-centric network (UCN) massive multiple-input multiple-output (mMIMO) downlink with matrix manifold optimization. In UCN mMIMO systems, each user terminal (UT) is served by a subset of base stations (BSs) instead of all the BSs, facilitating the implementation of the system and lowering the dimension of the precoders to be designed. By prov… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 13 pages, 9 figures, journal

  28. arXiv:2404.07293  [pdf, other

    physics.optics cs.MS

    sCWatter: Open source coupled wave scattering simulation for spectroscopy and microscopy

    Authors: Ruijiao Sun, Rohith Reddy, David Mayerich

    Abstract: Several emerging microscopy imaging methods rely on complex interactions between the incident light and the sample. These include interferometry, spectroscopy, and nonlinear optics. Reconstructing a sample from the measured scattered field relies on fast and accurate optical models. Fast approaches like ray tracing and the Born approximation have limitations that are limited when working with high… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  29. arXiv:2404.01780  [pdf, other

    astro-ph.IM astro-ph.GA cs.CV

    CSST Strong Lensing Preparation: a Framework for Detecting Strong Lenses in the Multi-color Imaging Survey by the China Survey Space Telescope (CSST)

    Authors: Xu Li, Ruiqi Sun, Jiameng Lv, Peng Jia, Nan Li, Chengliang Wei, Zou Hu, Xinzhong Er, Yun Chen, Zhang Ban, Yuedong Fang, Qi Guo, Dezi Liu, Guoliang Li, Lin Lin, Ming Li, Ran Li, Xiaobo Li, Yu Luo, Xianmin Meng, Jundan Nie, Zhaoxiang Qi, Yisheng Qiu, Li Shao, Hao Tian , et al. (7 additional authors not shown)

    Abstract: Strong gravitational lensing is a powerful tool for investigating dark matter and dark energy properties. With the advent of large-scale sky surveys, we can discover strong lensing systems on an unprecedented scale, which requires efficient tools to extract them from billions of astronomical objects. The existing mainstream lens-finding tools are based on machine learning algorithms and applied to… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: The paper is accepted by the AJ. The complete code could be downloaded with DOI of: 10.12149/101393. Comments are welcome

  30. arXiv:2404.00262  [pdf, other

    cs.CV

    Image-to-Image Matching via Foundation Models: A New Perspective for Open-Vocabulary Semantic Segmentation

    Authors: Yuan Wang, Rui Sun, Naisong Luo, Yuwen Pan, Tianzhu Zhang

    Abstract: Open-vocabulary semantic segmentation (OVS) aims to segment images of arbitrary categories specified by class labels or captions. However, most previous best-performing methods, whether pixel grouping methods or region recognition methods, suffer from false matches between image features and category labels. We attribute this to the natural gap between the textual features and visual features. In… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: Accepted to CVPR2024

  31. arXiv:2403.16718  [pdf, other

    quant-ph cond-mat.str-el

    Unveiling clean two-dimensional discrete time quasicrystals on a digital quantum computer

    Authors: Kazuya Shinjo, Kazuhiro Seki, Tomonori Shirakawa, Rong-Yang Sun, Seiji Yunoki

    Abstract: In periodically driven (Floquet) systems, evolution typically results in an infinite-temperature thermal state due to continuous energy absorption over time. However, before reaching thermal equilibrium, such systems may transiently pass through a meta-stable state known as a prethermal state. This prethermal state can exhibit phenomena not commonly observed in equilibrium, such as discrete time c… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 8 pages, 4 figures + Supplemental Material (17 pages, 20 figures)

    Report number: RIKEN-iTHEMS-Report-24

  32. arXiv:2403.16579  [pdf

    physics.gen-ph

    Experimental demonstration of a thermal-EM concentrator for enhancing EM signals and converging heat fluxes simultaneously

    Authors: Hanchuan Chen, Yichao Liu, Fei Sun, Qianhan Sun, Xiaoxiao Wu, Ran Sun

    Abstract: Simultaneously concentrating EM waves and heat fluxes to the same target region within an on-chip system carries substantial academic research importance and practical application value. Nevertheless, existing researches are primarily aimed at the design and experimentation of concentrators for individual EM waves or temperature fields. In this work, a thermal-EM concentrator, capable of simultane… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 15 pages, 5 figures

    Report number: https://onlinelibrary.wiley.com/doi/full/10.1002/lpor.202400488

    Journal ref: Laser Photonics Rev. 2024, 2400488

  33. arXiv:2403.15776  [pdf, other

    cs.CL cs.AI

    Modeling Unified Semantic Discourse Structure for High-quality Headline Generation

    Authors: Minghui Xu, Hao Fei, Fei Li, Shengqiong Wu, Rui Sun, Chong Teng, Donghong Ji

    Abstract: Headline generation aims to summarize a long document with a short, catchy title that reflects the main idea. This requires accurately capturing the core document semantics, which is challenging due to the lengthy and background information-rich na ture of the texts. In this work, We propose using a unified semantic discourse structure (S3) to represent document semantics, achieved by combining do… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

  34. arXiv:2403.15146  [pdf, ps, other

    cs.LG math.OC

    On the Convergence of Adam under Non-uniform Smoothness: Separability from SGDM and Beyond

    Authors: Bohan Wang, Huishuai Zhang, Qi Meng, Ruoyu Sun, Zhi-Ming Ma, Wei Chen

    Abstract: This paper aims to clearly distinguish between Stochastic Gradient Descent with Momentum (SGDM) and Adam in terms of their convergence rates. We demonstrate that Adam achieves a faster convergence compared to SGDM under the condition of non-uniformly bounded smoothness. Our findings reveal that: (1) in deterministic environments, Adam can attain the known lower bound for the convergence rate of de… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

  35. arXiv:2403.13081  [pdf, other

    stat.AP math.PR q-bio.PE

    Parameter Estimation from Single Patient, Single Time-Point Sequencing Data of Recurrent Tumors

    Authors: Kevin Leder, Ruping Sun, Zicheng Wang, Xuanming Zhang

    Abstract: In this study, we develop consistent estimators for key parameters that govern the dynamics of tumor cell populations when subjected to pharmacological treatments. While these treatments often lead to an initial reduction in the abundance of drug-sensitive cells, a population of drug-resistant cells frequently emerges over time, resulting in cancer recurrence. Samples from recurrent tumors present… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  36. arXiv:2403.12970  [pdf

    eess.IV cs.CV physics.bio-ph physics.optics

    Hybrid deep learning and physics-based neural network for programmable illumination computational microscopy

    Authors: Ruiqing Sun, Delong Yang, Shaohui Zhang, Qun Hao

    Abstract: Relying on either deep models or physical models are two mainstream approaches for solving inverse sample reconstruction problems in programmable illumination computational microscopy. Solutions based on physical models possess strong generalization capabilities while struggling with global optimization of inverse problems due to a lack of insufficient physical constraints. In contrast, deep learn… ▽ More

    Submitted 17 January, 2024; originally announced March 2024.

  37. arXiv:2403.05040  [pdf, ps, other

    math.CO

    Spectral radius and the 2-power of Hamilton paths

    Authors: Te Pi, Rui Sun, Long-Tu Yuan

    Abstract: We determine the maximum number of a graph without containing the 2-power of a Hamilton path. Using this result, we establish a spectral condition for a graph containing the 2-power of a Hamilton path.

    Submitted 7 March, 2024; originally announced March 2024.

  38. arXiv:2403.00875  [pdf, other

    q-bio.QM cs.AI cs.LG q-bio.BM

    Enhancing Protein Predictive Models via Proteins Data Augmentation: A Benchmark and New Directions

    Authors: Rui Sun, Lirong Wu, Haitao Lin, Yufei Huang, Stan Z. Li

    Abstract: Augmentation is an effective alternative to utilize the small amount of labeled protein data. However, most of the existing work focuses on design-ing new architectures or pre-training tasks, and relatively little work has studied data augmentation for proteins. This paper extends data augmentation techniques previously used for images and texts to proteins and then benchmarks these techniques on… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

  39. arXiv:2402.16788  [pdf, other

    cs.LG cs.AI

    Why Transformers Need Adam: A Hessian Perspective

    Authors: Yushun Zhang, Congliang Chen, Tian Ding, Ziniu Li, Ruoyu Sun, Zhi-Quan Luo

    Abstract: SGD performs worse than Adam by a significant margin on Transformers, but the reason remains unclear. In this work, we provide an explanation through the lens of Hessian: (i) Transformers are "heterogeneous": the Hessian spectrum across parameter blocks vary dramatically, a phenomenon we call "block heterogeneity"; (ii) Heterogeneity hampers SGD: SGD performs worse than Adam on problems with block… ▽ More

    Submitted 24 June, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  40. arXiv:2402.16609  [pdf

    q-fin.PM cs.LG

    Combining Transformer based Deep Reinforcement Learning with Black-Litterman Model for Portfolio Optimization

    Authors: Ruoyu Sun, Angelos Stefanidis, Zhengyong Jiang, Jionglong Su

    Abstract: As a model-free algorithm, deep reinforcement learning (DRL) agent learns and makes decisions by interacting with the environment in an unsupervised way. In recent years, DRL algorithms have been widely applied by scholars for portfolio optimization in consecutive trading periods, since the DRL agent can dynamically adapt to market changes and does not rely on the specification of the joint dynami… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: 46 pages, 15 figures

  41. arXiv:2402.15972  [pdf, other

    cs.LG cs.NI

    Structural Knowledge-Driven Meta-Learning for Task Offloading in Vehicular Networks with Integrated Communications, Sensing and Computing

    Authors: Ruijin Sun, Yao Wen, Nan Cheng, Wei Wan, Rong Chai, Yilong Hui

    Abstract: Task offloading is a potential solution to satisfy the strict requirements of computation-intensive and latency-sensitive vehicular applications due to the limited onboard computing resources. However, the overwhelming upload traffic may lead to unacceptable uploading time. To tackle this issue, for tasks taking environmental data as input, the data perceived by roadside units (RSU) equipped with… ▽ More

    Submitted 24 February, 2024; originally announced February 2024.

  42. arXiv:2402.11389  [pdf, other

    math.OC

    Spaceport Facility Location Planning within the US National Airspace System

    Authors: Haochen Wu, Kevin R. Sun, Jackson A. Miller, Oliver Jia-Richards, Max Z. Li

    Abstract: The burgeoning commercial space transportation industry necessitates an expansion of launch infrastructure to meet rising demands. However, future operations from these large-scale infrastructures can result in new impacts, particularly to air traffic operations. To rigorously reason about where such future spaceports might be located and what their impacts might be, we introduce a facility locati… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

    Comments: 10 pages, 5 figures

  43. arXiv:2402.04089  [pdf, other

    hep-th math-ph

    Large Volume Scenario from Schoen Manifold with de Sitter under Swampland Conjecture

    Authors: Rui Sun

    Abstract: To naturally allow for string compactification with duality manifested, here we investigate in the self-mirror large volume scenarios from Schoen Calabi-Yau manifold. We explicitly study the geometry of Schoen Calabi-Yau threefold and complete its triple intersection from both ambient and non-ambient spaces. Based on these, we study the large volume scenario of self-mirror Calabi-Yau compactificat… ▽ More

    Submitted 13 February, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: typos corrected, references added

  44. arXiv:2402.03569  [pdf, other

    cs.CR

    The Invisible Game on the Internet: A Case Study of Decoding Deceptive Patterns

    Authors: Zewei Shi, Ruoxi Sun, Jieshan Chen, Jiamou Sun, Minhui Xue

    Abstract: Deceptive patterns are design practices embedded in digital platforms to manipulate users, representing a widespread and long-standing issue in the web and mobile software development industry. Legislative actions highlight the urgency of globally regulating deceptive patterns. However, despite advancements in detection tools, a significant gap exists in assessing deceptive pattern risks. In this… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  45. arXiv:2402.01665  [pdf, other

    cs.NI cs.LG eess.SP

    Knowledge-Driven Deep Learning Paradigms for Wireless Network Optimization in 6G

    Authors: Ruijin Sun, Nan Cheng, Changle Li, Fangjiong Chen, Wen Chen

    Abstract: In the sixth-generation (6G) networks, newly emerging diversified services of massive users in dynamic network environments are required to be satisfied by multi-dimensional heterogeneous resources. The resulting large-scale complicated network optimization problems are beyond the capability of model-based theoretical methods due to the overwhelming computational complexity and the long processing… ▽ More

    Submitted 15 January, 2024; originally announced February 2024.

    Comments: 9 pages, 5 figures

  46. arXiv:2402.00395  [pdf, other

    cs.AR eess.SP

    ONE-SA: Enabling Nonlinear Operations in Systolic Arrays for Efficient and Flexible Neural Network Inference

    Authors: Ruiqi Sun, Yinchen Ni, Xin He, Jie Zhao, An Zou

    Abstract: The computation and memory-intensive nature of DNNs limits their use in many mobile and embedded contexts. Application-specific integrated circuit (ASIC) hardware accelerators employ matrix multiplication units (such as the systolic arrays) and dedicated nonlinear function units to speed up DNN computations. A close examination of these ASIC accelerators reveals that the designs are often speciali… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: Accepted to DATE 2024

  47. arXiv:2401.14688  [pdf, other

    cs.CL

    Taiyi-Diffusion-XL: Advancing Bilingual Text-to-Image Generation with Large Vision-Language Model Support

    Authors: Xiaojun Wu, Dixiang Zhang, Ruyi Gan, Junyu Lu, Ziwei Wu, Renliang Sun, Jiaxing Zhang, Pingjian Zhang, Yan Song

    Abstract: Recent advancements in text-to-image models have significantly enhanced image generation capabilities, yet a notable gap of open-source models persists in bilingual or Chinese language support. To address this need, we present Taiyi-Diffusion-XL, a new Chinese and English bilingual text-to-image model which is developed by extending the capabilities of CLIP and Stable-Diffusion-XL through a proces… ▽ More

    Submitted 17 June, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: Taiyi-Diffusion-XL Tech Report

  48. arXiv:2401.13567  [pdf, other

    hep-th math-ph

    Self-mirror Large Volume Scenario with de Sitter

    Authors: Rui Sun

    Abstract: The large volume scenario has been an important issue for flux compactifications with T-dual non-geometric fluxes. As one solution to this issue, to naturally embed duality in string compactification, we investigate in self-mirror Calabi-Yau flux compactification with large volume scenario visited. In particular, at the large volume limit, the non-perturbative terms contribute a special dominant u… ▽ More

    Submitted 13 February, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: typos corrected, discussion expanded, references added

  49. arXiv:2401.11632  [pdf, other

    cs.IR cs.HC cs.LG

    What Are We Optimizing For? A Human-centric Evaluation of Deep Learning-based Movie Recommenders

    Authors: Ruixuan Sun, Xinyi Wu, Avinash Akella, Ruoyan Kong, Bart Knijnenburg, Joseph A. Konstan

    Abstract: In the past decade, deep learning (DL) models have gained prominence for their exceptional accuracy on benchmark datasets in recommender systems (RecSys). However, their evaluation has primarily relied on offline metrics, overlooking direct user perception and experience. To address this gap, we conduct a human-centric evaluation case study of four leading DL-RecSys models in the movie domain. We… ▽ More

    Submitted 1 May, 2024; v1 submitted 21 January, 2024; originally announced January 2024.

  50. arXiv:2401.10444  [pdf

    cs.AI cs.CY

    Can A Cognitive Architecture Fundamentally Enhance LLMs? Or Vice Versa?

    Authors: Ron Sun

    Abstract: The paper discusses what is needed to address the limitations of current LLM-centered AI systems. The paper argues that incorporating insights from human cognition and psychology, as embodied by a computational cognitive architecture, can help develop systems that are more capable, more reliable, and more human-like. It emphasizes the importance of the dual-process architecture and the hybrid neur… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.