Showing 1–50 of 1,609 results for author: Cheng, J

  1. arXiv:2407.11562  [pdf, other]

    cs.RO cs.AI cs.LG

    RobotKeyframing: Learning Locomotion with High-Level Objectives via Mixture of Dense and Sparse Rewards

    Authors: Fatemeh Zargarbashi, Jin Cheng, Dongho Kang, Robert Sumner, Stelian Coros

    Abstract: This paper presents a novel learning-based control framework that uses keyframing to incorporate high-level objectives in natural locomotion for legged robots. These high-level objectives are specified as a variable number of partial or complete pose targets that are spaced arbitrarily in time. Our proposed framework utilizes a multi-critic reinforcement learning algorithm to effectively handle th…

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 15 pages

  2. arXiv:2407.09948  [pdf, other]

    math.OC

    An Optimal Pricing Formula for Smart Grid based on Stackelberg Game

    Authors: Jiangjiang Cheng, Ge Chen, Zhouming Wu, Yifen Mu

    Abstract: The dynamic pricing of electricity is one of the most crucial demand response (DR) strategies in smart grid, where the utility company typically adjust electricity prices to influence user electricity demand. This paper models the relationship between the utility company and flexible electricity users as a Stackelberg game. Based on this model, we present a series of analytical results under certa…

    Submitted 13 July, 2024; originally announced July 2024.

  3. arXiv:2407.06653  [pdf, other]

    cs.CV

    Toward Motion Robustness: A masked attention regularization framework in remote photoplethysmography

    Authors: Pengfei Zhao, Qigong Sun, Xiaolin Tian, Yige Yang, Shuo Tao, Jie Cheng, Jiantong Chen

    Abstract: There has been growing interest in facial video-based remote photoplethysmography (rPPG) measurement recently, with a focus on assessing various vital signs such as heart rate and heart rate variability. Despite previous efforts on static datasets, their approaches have been hindered by inaccurate region of interest (ROI) localization and motion issues, and have shown limited generalization in rea…

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: CVPR workshop 2024 accepted

  4. arXiv:2407.06142  [pdf, ps, other]

    cs.NI eess.SY math.OC

    Delay-Aware Robust Edge Network Hardening Under Decision-Dependent Uncertainty

    Authors: Jiaming Cheng, Duong Thuy Anh Nguyen, Ni Trieu, Duong Tung Nguyen

    Abstract: Edge computing promises to offer low-latency and ubiquitous computation to numerous devices at the network edge. For delay-sensitive applications, link delays can have a direct impact on service quality. These delays can fluctuate drastically over time due to various factors such as network congestion, changing traffic conditions, cyberattacks, component failures, and natural disasters. Thus, it i…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 14 pages, 18 figures

  5. arXiv:2407.05681  [pdf]

    cond-mat.supr-con cond-mat.str-el

    Bulk high-temperature superconductivity in the high-pressure tetragonal phase of bilayer La2PrNi2O7

    Authors: Ningning Wang, Gang Wang, Xiaoling Shen, Jun Hou, Jun Luo, Xiaoping Ma, Huaixin Yang, Lifen Shi, Jie Dou, Jie Feng, Jie Yang, Yunqing Shi, Zhian Ren, Hanming Ma, Pengtao Yang, Ziyi Liu, Yue Liu, Hua Zhang, Xiaoli Dong, Yuxin Wang, Kun Jiang, Jiangping Hu, Stuart Calder, Jiaqiang Yan, Jianping Sun, et al. (4 additional authors not shown)

    Abstract: The Ruddlesden-Popper (R-P) bilayer nickelate, La3Ni2O7, was recently found to show signatures of high-temperature superconductivity (HTSC) at pressures above 14 GPa. Subsequent investigations achieved zero resistance in single- and poly-crystalline samples under hydrostatic pressure conditions. Yet, obvious diamagnetic signals, the other hallmark of superconductors, are still lacking owing to the…

    Submitted 8 July, 2024; originally announced July 2024.

  6. arXiv:2407.05617  [pdf, other]

    eess.IV

    LINEAR: Learning Implicit Neural Representation With Explicit Physical Priors for Accelerated Quantitative T1rho Mapping

    Authors: Yuanyuan Liu, Jinwen Xie, Zhuo-Xu Cui, Qingyong Zhu, Jing Cheng, Dong Liang, Yanjie Zhu

    Abstract: Quantitative T1rho parameter mapping has shown promise in clinical and research studies. However, it suffers from long scan times. Deep learning-based techniques have been successfully applied in accelerated quantitative MR parameter mapping. However, most methods require fully-sampled training dataset, which is impractical in the clinic. In this study, a novel subject-specific unsupervised method…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: Yuanyuan Liu and Jinwen Xie contributed equally to this work

  7. arXiv:2407.04938  [pdf, other]

    cs.CV

    SAM-Med3D-MoE: Towards a Non-Forgetting Segment Anything Model via Mixture of Experts for 3D Medical Image Segmentation

    Authors: Guoan Wang, Jin Ye, Junlong Cheng, Tianbin Li, Zhaolin Chen, Jianfei Cai, Junjun He, Bohan Zhuang

    Abstract: Volumetric medical image segmentation is pivotal in enhancing disease diagnosis, treatment planning, and advancing medical research. While existing volumetric foundation models for medical image segmentation, such as SAM-Med3D and SegVol, have shown remarkable performance on general organs and tumors, their ability to segment certain categories in clinical downstream tasks remains limited. Supervi…

    Submitted 5 July, 2024; originally announced July 2024.

    Journal ref: MICCAI 2024

  8. arXiv:2407.03590  [pdf, other]

    cs.RO

    A Fast Dynamic Point Detection Method for LiDAR-Inertial Odometry in Driving Scenarios

    Authors: Zikang Yuan, Xiaoxiang Wang, Jingying Wu, Junda Cheng, Xin Yang

    Abstract: Existing 3D point-based dynamic point detection and removal methods have a significant time overhead, making them difficult to adapt to LiDAR-inertial odometry systems. This paper proposes a label consistency based dynamic point detection and removal method for handling moving vehicles and pedestrians in autonomous driving scenarios, and embeds the proposed dynamic point detection and removal meth…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 8 pages, submitted to RA-L

  9. arXiv:2407.01571  [pdf, other]

    cs.RO cs.LG

    Interpretable DRL-based Maneuver Decision of UCAV Dogfight

    Authors: Haoran Han, Jian Cheng, Maolong Lv

    Abstract: This paper proposes a three-layer unmanned combat aerial vehicle (UCAV) dogfight frame where Deep reinforcement learning (DRL) is responsible for high-level maneuver decision. A four-channel low-level control law is firstly constructed, followed by a library containing eight basic flight maneuvers (BFMs). Double deep Q network (DDQN) is applied for BFM selection in UCAV dogfight, where the opponen…

    Submitted 27 May, 2024; originally announced July 2024.

  10. arXiv:2406.19680  [pdf, other]

    cs.CV cs.AI cs.MM

    MimicMotion: High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance

    Authors: Yuang Zhang, Jiaxi Gu, Li-Wen Wang, Han Wang, Junqi Cheng, Yuefeng Zhu, Fangyuan Zou

    Abstract: In recent years, generative artificial intelligence has achieved significant advancements in the field of image generation, spawning a variety of applications. However, video generation still faces considerable challenges in various aspects, such as controllability, video length, and richness of details, which hinder the application and popularization of this technology. In this work, we propose a…

    Submitted 28 June, 2024; originally announced June 2024.

  11. arXiv:2406.18598  [pdf, other]

    eess.SP cs.IT

    CubeSat-Enabled Free-Space Optics: Joint Data Communication and Fine Beam Tracking

    Authors: Hossein Safi, Mohammad Taghi Dabiri, Julian Cheng, Iman Tavakkolnia, Harald Haas

    Abstract: The integration of CubeSats with Free Space Optical (FSO) links accelerates a major advancement in high-throughput, low-Earth orbit communication systems. However, CubeSats face challenges such as size, weight, and power (SWaP) limitations, as well as vibrations that cause fluctuations in the angle-of-arrival (AoA) of the optical beam at the receiver. These practical challenges make establishing C…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 13 pages, 7 figures

  12. arXiv:2406.18152  [pdf, other]

    cs.MA

    Intrinsic Action Tendency Consistency for Cooperative Multi-Agent Reinforcement Learning

    Authors: Junkai Zhang, Yifan Zhang, Xi Sheryl Zhang, Yifan Zang, Jian Cheng

    Abstract: Efficient collaboration in the centralized training with decentralized execution (CTDE) paradigm remains a challenge in cooperative multi-agent systems. We identify divergent action tendencies among agents as a significant obstacle to CTDE's training efficiency, requiring a large number of training samples to achieve a unified consensus on agents' policies. This divergence stems from the lack of a…

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: The AAAI-2024 paper with the appendix

  13. arXiv:2406.16967  [pdf, other]

    eess.SP eess.SY

    Remaining useful life prediction of rolling bearings based on refined composite multi-scale attention entropy and dispersion entropy

    Authors: Yunchong Long, Qinkang Pang, Guangjie Zhu, Junxian Cheng, Xiangshun Li

    Abstract: Remaining useful life (RUL) prediction based on vibration signals is crucial for ensuring the safe operation and effective health management of rotating machinery. Existing studies often extract health indicators (HI) from time domain and frequency domain features to analyze complex vibration signals, but these features may not accurately capture the degradation process. In this study, we propose…

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: 12 pages, 9 figures

  14. arXiv:2406.16714  [pdf, other]

    cs.CL cs.AI cs.LG

    AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models

    Authors: Jiale Cheng, Yida Lu, Xiaotao Gu, Pei Ke, Xiao Liu, Yuxiao Dong, Hongning Wang, Jie Tang, Minlie Huang

    Abstract: Although Large Language Models (LLMs) are becoming increasingly powerful, they still exhibit significant but subtle weaknesses, such as mistakes in instruction-following or coding tasks. As these unexpected errors could lead to severe consequences in practical deployments, it is crucial to investigate the limitations within LLMs systematically. Traditional benchmarking approaches cannot thoroughly…

    Submitted 24 June, 2024; originally announced June 2024.

  15. arXiv:2406.16299  [pdf, other]

    cs.CL cs.AI

    Compensate Quantization Errors: Make Weights Hierarchical to Compensate Each Other

    Authors: Yifei Gao, Jie Ou, Lei Wang, Yuting Xiao, Zhiyuan Xiang, Ruiting Dai, Jun Cheng

    Abstract: Emergent Large Language Models (LLMs) use their extraordinary performance and powerful deduction capacity to discern from traditional language models. However, the expenses of computational resources and storage for these LLMs are stunning, quantization then arises as a trending conversation. To address accuracy decay caused by quantization, two streams of works in post-training quantization metho…

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: Efficient quantization method

    MSC Class: F.2.3

  16. arXiv:2406.16253  [pdf, other]

    cs.CL

    LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing

    Authors: Jiangshu Du, Yibo Wang, Wenting Zhao, Zhongfen Deng, Shuaiqi Liu, Renze Lou, Henry Peng Zou, Pranav Narayanan Venkit, Nan Zhang, Mukund Srinath, Haoran Ranran Zhang, Vipul Gupta, Yinghui Li, Tao Li, Fei Wang, Qin Liu, Tianlin Liu, Pengzhi Gao, Congying Xia, Chen Xing, Jiayang Cheng, Zhaowei Wang, Ying Su, Raj Sanjay Shah, Ruohao Guo, et al. (15 additional authors not shown)

    Abstract: This work is motivated by two key trends. On one hand, large language models (LLMs) have shown remarkable versatility in various generative tasks such as writing, drawing, and question answering, significantly reducing the time required for many routine tasks. On the other hand, researchers, whose work is not only time-consuming but also highly expertise-demanding, face increasing challenges as th…

    Submitted 25 June, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

  17. arXiv:2406.14796  [pdf, other]

    cs.LG cs.AI

    MU-Bench: A Multitask Multimodal Benchmark for Machine Unlearning

    Authors: Jiali Cheng, Hadi Amiri

    Abstract: Recent advancements in Machine Unlearning (MU) have introduced solutions to selectively remove certain training samples, such as those with outdated or sensitive information, from trained models. Despite these advancements, evaluation of MU methods have been inconsistent, employing different trained models and architectures, and sample removal strategies, which hampers accurate comparison. In addi…

    Submitted 20 June, 2024; originally announced June 2024.

  18. arXiv:2406.14098  [pdf, ps, other]

    cs.CV

    HeartBeat: Towards Controllable Echocardiography Video Synthesis with Multimodal Conditions-Guided Diffusion Models

    Authors: Xinrui Zhou, Yuhao Huang, Wufeng Xue, Haoran Dou, Jun Cheng, Han Zhou, Dong Ni

    Abstract: Echocardiography (ECHO) video is widely used for cardiac examination. In clinical, this procedure heavily relies on operator experience, which needs years of training and maybe the assistance of deep learning-based systems for enhanced accuracy and efficiency. However, it is challenging since acquiring sufficient customized data (e.g., abnormal cases) for novice training and deep model development…

    Submitted 4 July, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: Accepted by MICCAI 2024

  19. arXiv:2406.14021  [pdf, other]

    cs.CL cs.LG q-bio.QM

    HIGHT: Hierarchical Graph Tokenization for Graph-Language Alignment

    Authors: Yongqiang Chen, Quanming Yao, Juzheng Zhang, James Cheng, Yatao Bian

    Abstract: Recently there has been a surge of interest in extending the success of large language models (LLMs) to graph modality, such as social networks and molecules. As LLMs are predominantly trained with 1D text data, most existing approaches adopt a graph neural network to represent a graph as a series of node tokens and feed these tokens to LLMs for graph-language alignment. Despite achieving some suc…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Preliminary version of an ongoing project: https://higraphllm.github.io/

  20. arXiv:2406.13864  [pdf, other]

    cs.LG q-bio.BM

    Evaluating representation learning on the protein structure universe

    Authors: Arian R. Jamasb, Alex Morehead, Chaitanya K. Joshi, Zuobai Zhang, Kieran Didi, Simon V. Mathis, Charles Harris, Jian Tang, Jianlin Cheng, Pietro Lio, Tom L. Blundell

    Abstract: We introduce ProteinWorkshop, a comprehensive benchmark suite for representation learning on protein structures with Geometric Graph Neural Networks. We consider large-scale pre-training and downstream tasks on both experimental and predicted structures to enable the systematic evaluation of the quality of the learned structural representation and their usefulness in capturing functional relations…

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: ICLR 2024

  21. arXiv:2406.12793  [pdf, other]

    cs.CL

    ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools

    Authors: Team GLM, :, Aohan Zeng, Bin Xu, Bowen Wang, Chenhui Zhang, Da Yin, Diego Rojas, Guanyu Feng, Hanlin Zhao, Hanyu Lai, Hao Yu, Hongning Wang, Jiadai Sun, Jiajie Zhang, Jiale Cheng, Jiayi Gui, Jie Tang, Jing Zhang, Juanzi Li, Lei Zhao, Lindong Wu, Lucen Zhong, Mingdao Liu, Minlie Huang, et al. (32 additional authors not shown)

    Abstract: We introduce ChatGLM, an evolving family of large language models that we have been developing over time. This report primarily focuses on the GLM-4 language series, which includes GLM-4, GLM-4-Air, and GLM-4-9B. They represent our most capable models that are trained with all the insights and lessons gained from the preceding three generations of ChatGLM. To date, the GLM-4 models are pre-trained…

    Submitted 18 June, 2024; originally announced June 2024.

  22. arXiv:2406.10744  [pdf, other]

    cs.CV

    Technique Report of CVPR 2024 PBDL Challenges

    Authors: Ying Fu, Yu Li, Shaodi You, Boxin Shi, Linwei Chen, Yunhao Zou, Zichun Wang, Yichen Li, Yuze Han, Yingkai Zhang, Jianan Wang, Qinglin Liu, Wei Yu, Xiaoqian Lv, Jianing Li, Shengping Zhang, Xiangyang Ji, Yuanpei Chen, Yuhan Zhang, Weihang Peng, Liwen Zhang, Zhe Xu, Dingyong Gou, Cong Li, Senyan Xu, et al. (75 additional authors not shown)

    Abstract: The intersection of physics-based vision and deep learning presents an exciting frontier for advancing computer vision technologies. By leveraging the principles of physics to inform and enhance deep learning models, we can develop more robust and accurate vision systems. Physics-based vision aims to invert the processes to recover scene properties such as shape, reflectance, light distribution, a…

    Submitted 12 July, 2024; v1 submitted 15 June, 2024; originally announced June 2024.

    Comments: CVPR 2024 PBDL Challenges: https://pbdl-ws.github.io/pbdl2024/challenge/index.html

  23. arXiv:2406.10175  [pdf, other]

    cs.CV

    Enhancing Incomplete Multi-modal Brain Tumor Segmentation with Intra-modal Asymmetry and Inter-modal Dependency

    Authors: Weide Liu, Jingwen Hou, Xiaoyang Zhong, Huijing Zhan, Jun Cheng, Yuming Fang, Guanghui Yue

    Abstract: Deep learning-based brain tumor segmentation (BTS) models for multi-modal MRI images have seen significant advancements in recent years. However, a common problem in practice is the unavailability of some modalities due to varying scanning protocols and patient conditions, making segmentation from incomplete MRI modalities a challenging issue. Previous methods have attempted to address this by fus…

    Submitted 14 June, 2024; originally announced June 2024.

  24. arXiv:2406.08634  [pdf, other]

    eess.IV cs.CV cs.LG

    Unveiling Incomplete Modality Brain Tumor Segmentation: Leveraging Masked Predicted Auto-Encoder and Divergence Learning

    Authors: Zhongao Sun, Jiameng Li, Yuhan Wang, Jiarong Cheng, Qing Zhou, Chun Li

    Abstract: Brain tumor segmentation remains a significant challenge, particularly in the context of multi-modal magnetic resonance imaging (MRI) where missing modality images are common in clinical settings, leading to reduced segmentation accuracy. To address this issue, we propose a novel strategy, which is called masked predicted pre-training, enabling robust feature learning from incomplete modality data…

    Submitted 12 June, 2024; originally announced June 2024.

  25. arXiv:2406.07955  [pdf, other]

    cs.LG stat.ML

    How Interpretable Are Interpretable Graph Neural Networks?

    Authors: Yongqiang Chen, Yatao Bian, Bo Han, James Cheng

    Abstract: Interpretable graph neural networks (XGNNs) are widely adopted in various scientific applications involving graph-structured data. Existing XGNNs predominantly adopt the attention-based mechanism to learn edge or node importance for extracting and making predictions with the interpretable subgraph. However, the representational properties and limitations of these methods remain inadequately explo…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: ICML2024, 44 pages, 21 figures, 12 tables

  26. arXiv:2406.07471  [pdf, other]

    cs.CV

    OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding

    Authors: Ming Hu, Peng Xia, Lin Wang, Siyuan Yan, Feilong Tang, Zhongxing Xu, Yimin Luo, Kaimin Song, Jurgen Leitner, Xuelian Cheng, Jun Cheng, Chi Liu, Kaijing Zhou, Zongyuan Ge

    Abstract: Surgical scene perception via videos are critical for advancing robotic surgery, telesurgery, and AI-assisted surgery, particularly in ophthalmology. However, the scarcity of diverse and richly annotated video datasets has hindered the development of intelligent systems for surgical workflow analysis. Existing datasets for surgical workflow analysis, which typically face challenges such as small s…

    Submitted 13 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: Version 1

  27. arXiv:2406.07177  [pdf, other]

    cs.LG

    TernaryLLM: Ternarized Large Language Model

    Authors: Tianqi Chen, Zhe Li, Weixiang Xu, Zeyu Zhu, Dong Li, Lu Tian, Emad Barsoum, Peisong Wang, Jian Cheng

    Abstract: Large language models (LLMs) have achieved remarkable performance on Natural Language Processing (NLP) tasks, but they are hindered by high computational costs and memory requirements. Ternarization, an extreme form of quantization, offers a solution by reducing memory usage and enabling energy-efficient floating-point additions. However, applying ternarization to LLMs faces challenges stemming fr…

    Submitted 11 June, 2024; originally announced June 2024.

  28. arXiv:2406.05654  [pdf, other]

    cs.CL cs.IR

    DomainRAG: A Chinese Benchmark for Evaluating Domain-specific Retrieval-Augmented Generation

    Authors: Shuting Wang, Jiongnan Liu, Shiren Song, Jiehan Cheng, Yuqi Fu, Peidong Guo, Kun Fang, Yutao Zhu, Zhicheng Dou

    Abstract: Retrieval-Augmented Generation (RAG) offers a promising solution to address various limitations of Large Language Models (LLMs), such as hallucination and difficulties in keeping up with real-time updates. This approach is particularly critical in expert and domain-specific applications where LLMs struggle to cover expert knowledge. Therefore, evaluating RAG models in such scenarios is crucial, ye…

    Submitted 16 June, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

  29. arXiv:2406.05320  [pdf, other]

    stat.ML cs.LG

    Deep Neural Networks are Adaptive to Function Regularity and Data Distribution in Approximation and Estimation

    Authors: Hao Liu, Jiahui Cheng, Wenjing Liao

    Abstract: Deep learning has exhibited remarkable results across diverse areas. To understand its success, substantial research has been directed towards its theoretical foundations. Nevertheless, the majority of these studies examine how well deep neural networks can model functions with uniform regularity. In this paper, we explore a different angle: how deep neural networks can adapt to different regulari…

    Submitted 7 June, 2024; originally announced June 2024.

  30. arXiv:2406.04451  [pdf, other]

    cs.RO

    RiskMap: A Unified Driving Context Representation for Autonomous Motion Planning in Urban Driving Environment

    Authors: Ren Xin, Sheng Wang, Yingbing Chen, Jie Cheng, Ming Liu

    Abstract: Planning is complicated by the combination of perception and map information, particularly when driving in heavy traffic. Developing an extendable and efficient representation that visualizes sensor noise and provides constraints to real-time planning tasks is desirable. We aim to develop an extendable map representation offering prior to cost in planning tasks to simplify the planning process of…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Submission to ICRA 2023 was not accepted. This paper is now available just for public reference

  31. arXiv:2406.03944  [pdf, other]

    cs.LG

    Provably Neural Active Learning Succeeds via Prioritizing Perplexing Samples

    Authors: Dake Bu, Wei Huang, Taiji Suzuki, Ji Cheng, Qingfu Zhang, Zhiqiang Xu, Hau-San Wong

    Abstract: Neural Network-based active learning (NAL) is a cost-effective data selection technique that utilizes neural networks to select and train on a small subset of samples. While existing work successfully develops various effective or theory-justified NAL algorithms, the understanding of the two commonly used query criteria of NAL: uncertainty-based and diversity-based, remains in its infancy. In this…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Accepted by the 41st International Conference on Machine Learning (ICML 2024)

  32. arXiv:2406.03088  [pdf, other]

    cs.AR cs.LG

    HASS: Hardware-Aware Sparsity Search for Dataflow DNN Accelerator

    Authors: Zhewen Yu, Sudarshan Sreeram, Krish Agrawal, Junyi Wu, Alexander Montgomerie-Corcoran, Cheng Zhang, Jianyi Cheng, Christos-Savvas Bouganis, Yiren Zhao

    Abstract: Deep Neural Networks (DNNs) excel in learning hierarchical representations from raw data, such as images, audio, and text. To compute these DNN models with high performance and energy efficiency, these models are usually deployed onto customized hardware accelerators. Among various accelerator designs, dataflow architecture has shown promising performance due to its layer-pipelined structure and i…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: accepted to FPL2024

  33. arXiv:2406.01843  [pdf, other]

    cs.CV

    L-MAGIC: Language Model Assisted Generation of Images with Coherence

    Authors: Zhipeng Cai, Matthias Mueller, Reiner Birkl, Diana Wofk, Shao-Yen Tseng, JunDa Cheng, Gabriela Ben-Melech Stan, Vasudev Lal, Michael Paulitsch

    Abstract: In the current era of generative AI breakthroughs, generating panoramic scenes from a single input image remains a key challenge. Most existing methods use diffusion-based iterative or simultaneous multi-view inpainting. However, the lack of global scene layout priors leads to subpar outputs with duplicated objects (e.g., multiple beds in a bedroom) or requires time-consuming human text inputs for…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: accepted to CVPR 2024

  34. arXiv:2406.01388  [pdf, other]

    cs.CV

    AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation

    Authors: Junhao Cheng, Xi Lu, Hanhui Li, Khun Loun Zai, Baiqiao Yin, Yuhao Cheng, Yiqiang Yan, Xiaodan Liang

    Abstract: As cutting-edge Text-to-Image (T2I) generation models already excel at producing remarkable single images, an even more challenging task, i.e., multi-turn interactive image generation begins to attract the attention of related research communities. This task requires models to interact with users over multiple turns to generate a coherent sequence of images. However, since users may switch subject…

    Submitted 10 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: Multi-turn interactive image generation

  35. arXiv:2406.01007  [pdf, other]

    hep-ex

    Measurement of Electron Antineutrino Oscillation Amplitude and Frequency via Neutron Capture on Hydrogen at Daya Bay

    Authors: Daya Bay collaboration, F. P. An, W. D. Bai, A. B. Balantekin, M. Bishai, S. Blyth, G. F. Cao, J. Cao, J. F. Chang, Y. Chang, H. S. Chen, H. Y. Chen, S. M. Chen, Y. Chen, Y. X. Chen, Z. Y. Chen, J. Cheng, J. Cheng, Y. -C. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, J. P. Cummings, O. Dalager, F. S. Deng, et al. (177 additional authors not shown)

    Abstract: This Letter reports the first measurement of the oscillation amplitude and frequency of reactor antineutrinos at Daya Bay via neutron capture on hydrogen using 1958 days of data. With over 3.6 million signal candidates, an optimized candidate selection, improved treatment of backgrounds and efficiencies, refined energy calibration, and an energy response model for the capture-on-hydrogen sensitive…

    Submitted 3 June, 2024; originally announced June 2024.

  36. arXiv:2406.00953  [pdf, ps, other]

    math.AP math.DG

    Viscosity solution to complex Hessian equations on compact Hermitian manifolds

    Authors: Jingrui Cheng, Yulun Xu

    Abstract: We prove the existence of viscosity solutions to complex Hessian equations on a compact Hermitian manifold that satisfy a determinant domination condition. This viscosity solution is shown to be unique when the right hand is strictly monotone increasing in terms of the solution. When the right hand side does not depend on the solution, we reduces it to the strict monotonicity of the solvability co…

    Submitted 2 June, 2024; originally announced June 2024.

    MSC Class: 32W20; 35J96

  37. arXiv:2406.00949  [pdf, ps, other]

    math.AP

    Sharp dispersive estimates for the wave equation on the 5-dimensional lattice graph

    Authors: Cheng Bi, Jiawei Cheng, Bobo Hua

    Abstract: Schultz \cite{S98} proved dispersive estimates for the wave equation on lattice graphs $\mathbb{Z}^d$ for $d=2,3,$ which was extended to $d=4$ in \cite{BCH23}. By Newton polyhedra and the algorithm introduced by Karpushkin \cite{K83}, we further extend the result to $d=5:$ the sharp decay rate of the fundamental solution of the wave equation on $\mathbb{Z}^5$ is $|t|^{-\frac{11}{6}}.$ Moreover, we…

    Submitted 2 June, 2024; originally announced June 2024.

  38. arXiv:2405.19952  [pdf, ps, other]

    nlin.SI math-ph

    Generalized Bigraded Toda Hierarchy

    Authors: Yue Liu, Xingjie Yan, Jinbiao Wang, Jipeng Cheng

    Abstract: Bigraded Toda hierarchy $L_1^M(n)=L_2^N(n)$ is generalized to $L_1^M(n)=L_2^{N}(n)+\sum_{j\in \mathbb Z}\sum_{i=1}^{m}q^{(i)}_nΛ^jr^{(i)}_{n+1}$, which is the analogue of the famous constrained KP hierarchy $L^{k}= (L^{k})_{\geq0}+\sum_{i=1}^{m}q_{i}\partial^{-1}r_i$. It is known that different bosonizations of fermionic KP hierarchy will give rise to different kinds of integrable hierarchies. Sta…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 16 pages

    MSC Class: 35Q53; 37K10; 35Q51

  39. arXiv:2405.17792  [pdf, other]

    hep-ex hep-ph

    JUNO Sensitivity to Invisible Decay Modes of Neutrons

    Authors: JUNO Collaboration, Angel Abusleme, Thomas Adam, Kai Adamowicz, Shakeel Ahmad, Rizwan Ahmed, Sebastiano Aiello, Fengpeng An, Qi An, Giuseppe Andronico, Nikolay Anfimov, Vito Antonelli, Tatiana Antoshkina, João Pedro Athayde Marcondes de André, Didier Auguste, Weidong Bai, Nikita Balashov, Wander Baldini, Andrea Barresi, Davide Basilico, Eric Baussan, Marco Bellato, Marco Beretta, Antonio Bergnoli, Daniel Bick, et al. (635 additional authors not shown)

    Abstract: We explore the bound neutrons decay into invisible particles (e.g., $n\rightarrow 3 ν$ or $nn \rightarrow 2 ν$) in the JUNO liquid scintillator detector. The invisible decay includes two decay modes: $ n \rightarrow { inv} $ and $ nn \rightarrow { inv} $. The invisible decays of $s$-shell neutrons in $^{12}{\rm C}$ will leave a highly excited residual nucleus. Subsequently, some de-excitation mode…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 28 pages, 7 figures, 4 tables

  40. arXiv:2405.17579  [pdf, other]

    cs.RO

    Harnessing Natural Oscillations for High-Speed, Efficient Asymmetrical Locomotion in Quadrupedal Robots

    Authors: Jing Cheng, Yasser G. Alqaham, Zhenyu Gan

    Abstract: This study explores the dynamics of asymmetrical bounding gaits in quadrupedal robots, focusing on the integration of torso pitching and hip motion to enhance speed and stability. Traditional control strategies often enforce a fixed posture, minimizing natural body movements to simplify the control problem. However, this approach may overlook the inherent dynamical advantages found in natural loco…

    Submitted 27 May, 2024; originally announced May 2024.

  41. arXiv:2405.16698  [pdf, other]

    hep-ph hep-ex

    The Phase Space Distance Between Collider Events

    Authors: Tianji Cai, Junyi Cheng, Nathaniel Craig, Giacomo Koszegi, Andrew J. Larkoski

    Abstract: How can one fully harness the power of physics encoded in relativistic $N$-body phase space? Topologically, phase space is isomorphic to the product space of a simplex and a hypersphere and can be equipped with explicit coordinates and a Riemannian metric. This natural structure that scaffolds the space on which all collider physics events live opens up new directions for machine learning applicat…

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: 39 pages, 15 figures

  42. arXiv:2405.16562  [pdf, ps, other]

    math.AP

    Global existence and nonexistence analyses for a magnetic fractional pseudo-parabolic equation

    Authors: Jiazhuo Cheng, Qiru Wang

    Abstract: In this paper, we study the initial-boundary value problem for a pseudo-parabolic equation in magnetic fractional Orlicz-Sobolev spaces. First, by employing the imbedding theorems, the theory of potential wells and the Galerkin method, we prove the existence and uniqueness of global solutions with subcritical initial energy, critical initial energy and supercritical initial energy, respectively. F…

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: 55 pages

    MSC Class: 35R11; 26A33; 35K58; 35B44; 35D30

  43. arXiv:2405.16099  [pdf, other]

    cs.CV

    Improving 3D Occupancy Prediction through Class-balancing Loss and Multi-scale Representation

    Authors: Huizhou Chen, Jiangyi Wang, Yuxin Li, Na Zhao, Jun Cheng, Xulei Yang

    Abstract: 3D environment recognition is essential for autonomous driving systems, as autonomous vehicles require a comprehensive understanding of surrounding scenes. Recently, the predominant approach to define this real-life problem is through 3D occupancy prediction. It attempts to predict the occupancy states and semantic labels for all voxels in 3D space, which enhances the perception capability. Birds-…

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 5 pages, 3 figures, accepted by IEEE CAI 2024

  44. arXiv:2405.15317  [pdf, other]

    cs.LG cs.AI

    NuwaTS: a Foundation Model Mending Every Incomplete Time Series

    Authors: Jinguo Cheng, Chunwei Yang, Wanlin Cai, Yuxuan Liang, Yuankai Wu

    Abstract: Time series imputation plays a crucial role in various real-world systems and has been extensively explored. Models for time series imputation often require specialization, necessitating distinct designs for different domains and missing patterns. In this study, we introduce NuwaTS, a framework to repurpose a Pre-trained Language Model (PLM) for general time series imputation. Once trained, this mod…

    Submitted 27 May, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: 22 pages, 13 figures

  45. arXiv:2405.15210  [pdf]

    cond-mat.str-el

    Spin chirality engineering induced giant topological Hall effect in a kagome magnet

    Authors: Wei Xia, Shihao Zhang, Jian Yuan, Yurui Wei, Haonan Wang, Hong Du, Xiangqi Liu, Jiangteng Guo, Zicheng Tao, Ke Qu, Xia Wang, Xuerong Liu, Wenbo Wang, Jinguang Cheng, Yulin Chen, Jianpeng Liu, Ruidan Zhong, Xuewen Fu, Zhenzhong Yang, Yanfeng Guo

    Abstract: The ferrimagnet TbMn6Sn6 has attracted vast attention, because its pristine Mn kagome lattice with strong spin-orbit coupling and out-of-plane Tb-Mn exchange supports quantum-limit Chern topological magnetism which can be described by the simple spinless Haldane model. We unveil herein that engineering the pristine kagome lattice through partial replacement of Mn by nonmagnetic Cr which tends to c…

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 33 pages, 4 main figures and 16 SI figures

  46. arXiv:2405.14202  [pdf]

    physics.class-ph cond-mat.mes-hall

    Giant Acoustic Geometric Spin and Orbital Hall Effect

    Authors: Wei Wang, Yang Tan, Jingjing Liu, Bin Liang, Jianchun Cheng

    Abstract: Acoustic waves in fluid with spin-0 nature have long been believed not to support spin Hall effect and strong orbital Hall effect that enables experimental observation. Here we report the first theoretical explication and experimental demonstration of giant acoustic geometric spin and orbital Hall effect characterized by a large transverse shift. We reveal that this effect occurs when a vortex bea…

    Submitted 23 May, 2024; originally announced May 2024.

  47. arXiv:2405.14108  [pdf, other]

    cs.LG cs.AI q-bio.BM q-bio.QM

    Deep Learning for Protein-Ligand Docking: Are We There Yet?

    Authors: Alex Morehead, Nabin Giri, Jian Liu, Jianlin Cheng

    Abstract: The effects of ligand binding on protein structures and their in vivo functions carry numerous implications for modern biomedical research and biotechnology development efforts such as drug discovery. Although several deep learning (DL) methods and benchmarks designed for protein-ligand docking have recently been introduced, to date no prior works have systematically studied the behavior of dockin…

    Submitted 7 July, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

    Comments: 31 pages, 2 tables, 27 figures. Under review. Code, data, tutorials, and benchmark results are available at https://github.com/BioinfoMachineLearning/PoseBench

    ACM Class: I.2.1; J.3

  48. Superconductivity near 70 K in boron-carbon clathrates MB$_2$C$_8$ (M = Na, K, Rb, Cs) at ambient pressure

    Authors: Bin Li, Yulan Cheng, Cong Zhu, Jie Cheng, Shengli Liu

    Abstract: Inspired by the first boron-carbon (B-C) clathrate SrB$_3$C$_3$ and the ternary borohydride KB$_2$H$_8$ [Miao et al., Phys. Rev. B 104 L100504 (2021)], we have performed first-principles density functional theory calculations of the electronic and phonon band structures for B-C compounds MB$_2$C$_8$ (M = Na, K, Rb, Cs). Our calculations reveal that these materials are dynamically stable and can po…

    Submitted 22 May, 2024; originally announced May 2024.

    Journal ref: Phys. Rev. B 109, 184517 (2024)

  49. arXiv:2405.12483  [pdf]

    physics.optics physics.app-ph

    Molecule-induced surface second-order nonlinearity in an inversion symmetric microcavity

    Authors: Ru Wang, Yue Dai, Jinsong Cheng, Ruoyu Wang, Xiaoqin Shen

    Abstract: Inversion symmetry eliminates the second-order nonlinear responses in materials commonly used in silicon photonics with electric-dipole approximation. The lack of effective methods to induce the second-order nonlinearity in silicon photonic materials prevents their applications in second-order nonlinear integrated photonics. Here, we experimentally demonstrate a surface second-order nonlinear opti…

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 22

  50. arXiv:2405.12274  [pdf, other]

    astro-ph.SR astro-ph.GA

    A Model for Eruptive Mass Loss in Massive Stars

    Authors: Shelley J. Cheng, Jared A. Goldberg, Matteo Cantiello, Evan B. Bauer, Mathieu Renzo, Charlie Conroy

    Abstract: Eruptive mass loss in massive stars is known to occur, but the mechanism(s) are not yet well-understood. One proposed physical explanation appeals to opacity-driven super-Eddington luminosities in stellar envelopes. Here, we present a 1D model for eruptive mass loss and implement this model in the MESA stellar evolution code. The model identifies regions in the star where the energy associated wit…

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: 16 pages, 9 figures, submitted to ApJ