Skip to main content

Showing 1–50 of 444 results for author: Jin, P

  1. arXiv:2407.10528  [pdf, other

    cs.CV

    Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation

    Authors: Peng Jin, Hao Li, Zesen Cheng, Kehan Li, Runyi Yu, Chang Liu, Xiangyang Ji, Li Yuan, Jie Chen

    Abstract: Text-to-motion generation requires not only grounding local actions in language but also seamlessly blending these individual actions to synthesize diverse and realistic global motions. However, existing motion generation methods primarily focus on the direct synthesis of global motions while neglecting the importance of generating and controlling local actions. In this paper, we propose the local… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: Accepted by ECCV 2024

  2. arXiv:2407.10424  [pdf, other

    cs.PL cs.AI

    CodeV: Empowering LLMs for Verilog Generation through Multi-Level Summarization

    Authors: Yang Zhao, Di Huang, Chongxiao Li, Pengwei Jin, Ziyuan Nan, Tianyun Ma, Lei Qi, Yansong Pan, Zhenxing Zhang, Rui Zhang, Xishan Zhang, Zidong Du, Qi Guo, Xing Hu, Yunji Chen

    Abstract: The increasing complexity and high costs associated with modern processor design have led to a surge in demand for processor design automation. Instruction-tuned large language models (LLMs) have demonstrated remarkable performance in automatically generating code for general-purpose programming languages like Python. However, these methods fail on hardware description languages (HDLs) like Verilo… ▽ More

    Submitted 15 July, 2024; v1 submitted 14 July, 2024; originally announced July 2024.

    Comments: 16 pages, 8 figures, conference

  3. arXiv:2407.08903  [pdf, other

    cs.CR cs.AI cs.AR

    TensorTEE: Unifying Heterogeneous TEE Granularity for Efficient Secure Collaborative Tensor Computing

    Authors: Husheng Han, Xinyao Zheng, Yuanbo Wen, Yifan Hao, Erhu Feng, Ling Liang, Jianan Mu, Xiaqing Li, Tianyun Ma, Pengwei Jin, Xinkai Song, Zidong Du, Qi Guo, Xing Hu

    Abstract: Heterogeneous collaborative computing with NPU and CPU has received widespread attention due to its substantial performance benefits. To ensure data confidentiality and integrity during computing, Trusted Execution Environments (TEE) is considered a promising solution because of its comparatively lower overhead. However, existing heterogeneous TEE designs are inefficient for collaborative computin… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: Accepted by ASPLOS 2024

  4. arXiv:2407.04872  [pdf, ps, other

    cs.DS

    Faster single-source shortest paths with negative real weights via proper hop distance

    Authors: Yufan Huang, Peter Jin, Kent Quanrud

    Abstract: The textbook algorithm for single-source shortest paths with real-valued edge weights runs in $O(m n)$ time on a graph with $m$ edges and $n$ vertices. A recent breakthrough algorithm by Fineman [Fin24] takes $\tilde O(m n^{8/9})$ randomized time. We present an $\tilde O(m n^{4/5})$ randomized time algorithm building on ideas from [Fin24].

    Submitted 5 July, 2024; originally announced July 2024.

  5. arXiv:2407.04162  [pdf, other

    eess.IV cs.CV

    Measurement Embedded Schrödinger Bridge for Inverse Problems

    Authors: Yuang Wang, Pengfei Jin, Siyeop Yoon, Matthew Tivnan, Quanzheng Li, Li Zhang, Dufan Wu

    Abstract: Score-based diffusion models are frequently employed as structural priors in inverse problems. However, their iterative denoising process, initiated from Gaussian noise, often results in slow inference speeds. The Image-to-Image Schrödinger Bridge (I$^2$SB), which begins with the corrupted image, presents a promising alternative as a prior for addressing inverse problems. In this work, we introduc… ▽ More

    Submitted 22 May, 2024; originally announced July 2024.

    Comments: 14 pages, 2 figures, Neurips preprint

  6. arXiv:2406.18139  [pdf, other

    cs.CL cs.CV

    LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference

    Authors: Zhongwei Wan, Ziang Wu, Che Liu, Jinfa Huang, Zhihong Zhu, Peng Jin, Longyue Wang, Li Yuan

    Abstract: Long-context Multimodal Large Language Models (MLLMs) demand substantial computational resources for inference as the growth of their multimodal Key-Value (KV) cache, in response to increasing input lengths, challenges memory and time efficiency. Unlike single-modality LLMs that manage only textual contexts, the KV cache of long-context MLLMs includes representations from multiple images with temp… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  7. arXiv:2406.04481  [pdf, other

    cs.AI

    Optimizing Autonomous Driving for Safety: A Human-Centric Approach with LLM-Enhanced RLHF

    Authors: Yuan Sun, Navid Salami Pargoo, Peter J. Jin, Jorge Ortiz

    Abstract: Reinforcement Learning from Human Feedback (RLHF) is popular in large language models (LLMs), whereas traditional Reinforcement Learning (RL) often falls short. Current autonomous driving methods typically utilize either human feedback in machine learning, including RL, or LLMs. Most feedback guides the car agent's learning process (e.g., controlling the car). RLHF is usually applied in the fine-t… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  8. arXiv:2406.02862  [pdf, other

    cs.CV

    Rethinking Guidance Information to Utilize Unlabeled Samples:A Label Encoding Perspective

    Authors: Yulong Zhang, Yuan Yao, Shuhao Chen, Pengrong Jin, Yu Zhang, Jian Jin, Jiangang Lu

    Abstract: Empirical Risk Minimization (ERM) is fragile in scenarios with insufficient labeled samples. A vanilla extension of ERM to unlabeled samples is Entropy Minimization (EntMin), which employs the soft-labels of unlabeled samples to guide their learning. However, EntMin emphasizes prediction discriminability while neglecting prediction diversity. To alleviate this issue, in this paper, we rethink the… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted to ICML 2024

  9. arXiv:2405.19465  [pdf, other

    cs.CV

    RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter

    Authors: Meng Cao, Haoran Tang, Jinfa Huang, Peng Jin, Can Zhang, Ruyang Liu, Long Chen, Xiaodan Liang, Li Yuan, Ge Li

    Abstract: Text-Video Retrieval (TVR) aims to align relevant video content with natural language queries. To date, most state-of-the-art TVR methods learn image-to-video transfer learning based on large-scale pre-trained visionlanguage models (e.g., CLIP). However, fully fine-tuning these pre-trained models for TVR incurs prohibitively expensive computation costs. To this end, we propose to conduct efficient… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: Accepted by ACL 2024 Findings

  10. arXiv:2405.00002  [pdf, ps, other

    physics.app-ph

    Reconfigurable nonreciprocal heat transport with natural bulk materials

    Authors: Min Lei, Peng Jin, Liujun Xu, Jiping Huang

    Abstract: Non-reciprocity is increasingly scrutinised in contemporary physics and engineering, especially in the realm of heat transport. This concept opens up novel avenues for directional heat transport and thermal regulation. Nonetheless, the development of non-reciprocal thermal metamaterials confronts three primary challenges: a constrained operational temperature range and structural scale, considerab… ▽ More

    Submitted 6 January, 2024; originally announced May 2024.

    Comments: 34 pages, 15 figures

  11. arXiv:2404.14125  [pdf, ps, other

    math.GR math.RT

    Weights for $π$-partial characters of $π$-separable groups

    Authors: Xuewu Chang, Ping Jin

    Abstract: The aim of this paper is to confirm an inequality predicted by Isaacs and Navarro in 1995, which asserts that for any $π'$-subgroup $Q$ of a $π$-separable group $G$, the number of $π'$-weights of $G$ with $Q$ as the first component always exceeds that of irreducible $π$-partial characters of $G$ with $Q$ as their vertex. We also give some sufficient condition to guarantee that these two numbers ar… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    MSC Class: 20C15; 20C20

  12. arXiv:2404.08916  [pdf, other

    cs.CV cs.LG

    Meply: A Large-scale Dataset and Baseline Evaluations for Metastatic Perirectal Lymph Node Detection and Segmentation

    Authors: Weidong Guo, Hantao Zhang, Shouhong Wan, Bingbing Zou, Wanqin Wang, Chenyang Qiu, Jun Li, Peiquan Jin

    Abstract: Accurate segmentation of metastatic lymph nodes in rectal cancer is crucial for the staging and treatment of rectal cancer. However, existing segmentation approaches face challenges due to the absence of pixel-level annotated datasets tailored for lymph nodes around the rectum. Additionally, metastatic lymph nodes are characterized by their relatively small size, irregular shapes, and lower contra… ▽ More

    Submitted 13 April, 2024; originally announced April 2024.

    Comments: 13 pages

  13. arXiv:2404.08446  [pdf

    cond-mat.mtrl-sci

    Growth of two-inch free-standing heteroepitaxial diamond on Ir/YSZ/Si (001) substrates via laser-patterned templates

    Authors: Pengfei Qu, Peng Jin, Guangdi Zhou, Zhen Wang, Zhanguo Wang

    Abstract: In this paper, 2-inch free-standing diamonds were prepared by using heteroepitaxy on composite Ir/YSZ/Si (001) substrates. To release stress, patterned templates were fabricated using laser etching after the initial growth of 50-nm-diamond. Then, the subsequent growth was completed on a patterned template. The full width at half maximum of the diamond (400) and (311) X-ray rocking curves were 313.… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: 13 pages, 5 figures

  14. arXiv:2403.06069  [pdf, other

    eess.IV cs.CV cs.LG

    Implicit Image-to-Image Schrodinger Bridge for CT Super-Resolution and Denoising

    Authors: Yuang Wang, Siyeop Yoon, Pengfei Jin, Matthew Tivnan, Zhennong Chen, Rui Hu, Li Zhang, Zhiqiang Chen, Quanzheng Li, Dufan Wu

    Abstract: Conditional diffusion models have gained recognition for their effectiveness in image restoration tasks, yet their iterative denoising process, starting from Gaussian noise, often leads to slow inference speeds. As a promising alternative, the Image-to-Image Schrödinger Bridge (I2SB) initializes the generative process from corrupted images and integrates training techniques from conditional diffus… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

  15. arXiv:2403.05809  [pdf, other

    math.NA cs.LG

    Shallow ReLU neural networks and finite elements

    Authors: Pengzhan Jin

    Abstract: We point out that (continuous or discontinuous) piecewise linear functions on a convex polytope mesh can be represented by two-hidden-layer ReLU neural networks in a weak sense. In addition, the numbers of neurons of the two hidden layers required to weakly represent are accurately given based on the numbers of polytopes and hyperplanes involved in this mesh. The results naturally hold for constan… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

  16. arXiv:2403.02874  [pdf, other

    astro-ph.HE

    The bright black hole X-ray binary 4U 1543-47 during 2021 outburst. A clear state transition from super-Eddington to sub-Eddington accretion revealed by Insight-HXMT

    Authors: Pei Jin, Guobao Zhang, Yuexin Zhang, Mariano Méndez, Jinlu Qu, David M. Russell, Jiancheng Wang, Shuangnan Zhang, Yi-Jung Yang, Shumei Jia, Zixu Yang, Hexin Liu

    Abstract: We present a detailed analysis of the observations with the Hard X-ray Modulation Telescope of the black hole X-ray transient 4U~1543-47 during its outburst in 2021. We find a clear state transition during the outburst decay of the source. Using previous measurements of the black-hole mass and distance to the source, the source luminosity during this transition is close to the Eddington limit. The… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  17. arXiv:2402.15097  [pdf, other

    cs.LG math.NA

    Learning solution operators of PDEs defined on varying domains via MIONet

    Authors: Shanshan Xiao, Pengzhan Jin, Yifa Tang

    Abstract: In this work, we propose a method to learn the solution operators of PDEs defined on varying domains via MIONet, and theoretically justify this method. We first extend the approximation theory of MIONet to further deal with metric spaces, establishing that MIONet can approximate mappings with multiple inputs in metric spaces. Subsequently, we construct a set consisting of some appropriate regions… ▽ More

    Submitted 16 March, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

  18. arXiv:2402.14891  [pdf, other

    cs.CL cs.AI

    LLMBind: A Unified Modality-Task Integration Framework

    Authors: Bin Zhu, Munan Ning, Peng Jin, Bin Lin, Jinfa Huang, Qi Song, Junwu Zhang, Zhenyu Tang, Mingjun Pan, Xing Zhou, Li Yuan

    Abstract: In the multi-modal domain, the dependence of various models on specific input formats leads to user confusion and hinders progress. To address this challenge, we introduce \textbf{LLMBind}, a novel framework designed to unify a diverse array of multi-modal tasks. By harnessing a Mixture-of-Experts (MoE) Large Language Model (LLM), LLMBind processes multi-modal inputs and generates task-specific to… ▽ More

    Submitted 18 April, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

  19. arXiv:2402.07156  [pdf, other

    math.NA cs.LG

    A hybrid iterative method based on MIONet for PDEs: Theory and numerical examples

    Authors: Jun Hu, Pengzhan Jin

    Abstract: We propose a hybrid iterative method based on MIONet for PDEs, which combines the traditional numerical iterative solver and the recent powerful machine learning method of neural operator, and further systematically analyze its theoretical properties, including the convergence condition, the spectral behavior, as well as the convergence rate, in terms of the errors of the discretization and the mo… ▽ More

    Submitted 11 February, 2024; originally announced February 2024.

  20. arXiv:2402.05935  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models

    Authors: Dongyang Liu, Renrui Zhang, Longtian Qiu, Siyuan Huang, Weifeng Lin, Shitian Zhao, Shijie Geng, Ziyi Lin, Peng Jin, Kaipeng Zhang, Wenqi Shao, Chao Xu, Conghui He, Junjun He, Hao Shao, Pan Lu, Hongsheng Li, Yu Qiao, Peng Gao

    Abstract: We propose SPHINX-X, an extensive Multimodality Large Language Model (MLLM) series developed upon SPHINX. To improve the architecture and training efficiency, we modify the SPHINX framework by removing redundant visual encoders, bypassing fully-padded sub-images with skip tokens, and simplifying multi-stage training into a one-stage all-in-one paradigm. To fully unleash the potential of MLLMs, we… ▽ More

    Submitted 26 June, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: Accepted by ICML 2024. Code and models are released at https://github.com/Alpha-VLLM/LLaMA2-Accessory

  21. arXiv:2401.15947  [pdf, other

    cs.CV

    MoE-LLaVA: Mixture of Experts for Large Vision-Language Models

    Authors: Bin Lin, Zhenyu Tang, Yang Ye, Jiaxi Cui, Bin Zhu, Peng Jin, Jinfa Huang, Junwu Zhang, Yatian Pang, Munan Ning, Li Yuan

    Abstract: Recent advances demonstrate that scaling Large Vision-Language Models (LVLMs) effectively improves downstream task performances. However, existing scaling methods enable all model parameters to be active for each token in the calculation, which brings massive training and inferring costs. In this work, we propose a simple yet effective training strategy MoE-Tuning for LVLMs. This strategy innovati… ▽ More

    Submitted 6 July, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: K = P + N represents the length of the output sequence in the formula (8)

  22. arXiv:2401.09732  [pdf, other

    cs.CV

    Instance Brownian Bridge as Texts for Open-vocabulary Video Instance Segmentation

    Authors: Zesen Cheng, Kehan Li, Hao Li, Peng Jin, Chang Liu, Xiawu Zheng, Rongrong Ji, Jie Chen

    Abstract: Temporally locating objects with arbitrary class texts is the primary pursuit of open-vocabulary Video Instance Segmentation (VIS). Because of the insufficient vocabulary of video data, previous methods leverage image-text pretraining model for recognizing object instances by separately aligning each frame and class texts, ignoring the correlation between frames. As a result, the separation breaks… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

  23. arXiv:2401.04148  [pdf, other

    cs.LG cs.AI eess.SP

    Online Test-Time Adaptation of Spatial-Temporal Traffic Flow Forecasting

    Authors: Pengxin Guo, Pengrong Jin, Ziyue Li, Lei Bai, Yu Zhang

    Abstract: Accurate spatial-temporal traffic flow forecasting is crucial in aiding traffic managers in implementing control measures and assisting drivers in selecting optimal travel routes. Traditional deep-learning based methods for traffic flow forecasting typically rely on historical data to train their models, which are then used to make predictions on future data. However, the performance of the traine… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

  24. arXiv:2401.03364  [pdf, other

    physics.app-ph

    A dynamic thermal sensing mechanism with reconfigurable expanded-plane structures

    Authors: Haohan Tan, Haoyang Cai, Peng Jin, Jiping Huang

    Abstract: The precise measurement of temperature is crucial in various fields such as biology, medicine, industrial automation, energy management, and daily life applications. While in most scenarios, sensors with a fixed thermal conductivity inevitably mismatch the analogous parameter of the medium being measured, thus causing the distortion and inaccurate detection of original temperature fields. Despite… ▽ More

    Submitted 6 January, 2024; originally announced January 2024.

  25. Developing Flying Explorer for Autonomous Digital Modelling in Wild Unknowns

    Authors: Naizhong Zhang. Yaoqiang Pan, Yangwen Jin, Peiqi Jin, Kewei Hu, Xiao Huang, Hanwen Kang

    Abstract: This work presents an innovative solution for robotic odometry, path planning and exploration in wild unknown environments, focusing on digital modelling. The approach uses a minimum cost formulation with pseudo-randomly generated objectives, integrating multi-path planning and evaluation, with emphasis on full coverage of unknown maps based on feasible boundaries of interest. The evaluation carri… ▽ More

    Submitted 29 December, 2023; originally announced December 2023.

  26. arXiv:2312.13271  [pdf, other

    cs.CV

    Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting

    Authors: Junwu Zhang, Zhenyu Tang, Yatian Pang, Xinhua Cheng, Peng Jin, Yida Wei, Munan Ning, Li Yuan

    Abstract: Recent one image to 3D generation methods commonly adopt Score Distillation Sampling (SDS). Despite the impressive results, there are multiple deficiencies including multi-view inconsistency, over-saturated and over-smoothed textures, as well as the slow generation speed. To address these deficiencies, we present Repaint123 to alleviate multi-view bias as well as texture degradation and speed up t… ▽ More

    Submitted 27 December, 2023; v1 submitted 20 December, 2023; originally announced December 2023.

    Comments: Project page: https://pku-yuangroup.github.io/repaint123/

  27. arXiv:2312.02428  [pdf, other

    cs.CV cs.IR

    FreestyleRet: Retrieving Images from Style-Diversified Queries

    Authors: Hao Li, Curise Jia, Peng Jin, Zesen Cheng, Kehan Li, Jialu Sui, Chang Liu, Li Yuan

    Abstract: Image Retrieval aims to retrieve corresponding images based on a given query. In application scenarios, users intend to express their retrieval intent through various query styles. However, current retrieval tasks predominantly focus on text-query retrieval exploration, leading to limited retrieval query options and potential ambiguity or bias in user intention. In this paper, we propose the Style… ▽ More

    Submitted 8 December, 2023; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: 16 pages, 7 figures

  28. arXiv:2311.10122  [pdf, other

    cs.CV

    Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

    Authors: Bin Lin, Yang Ye, Bin Zhu, Jiaxi Cui, Munan Ning, Peng Jin, Li Yuan

    Abstract: The Large Vision-Language Model (LVLM) has enhanced the performance of various downstream tasks in visual-language understanding. Most existing approaches encode images and videos into separate feature spaces, which are then fed as inputs to large language models. However, due to the lack of unified tokenization for images and videos, namely misalignment before projection, it becomes challenging f… ▽ More

    Submitted 21 November, 2023; v1 submitted 16 November, 2023; originally announced November 2023.

  29. arXiv:2311.08046  [pdf, other

    cs.CV

    Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

    Authors: Peng Jin, Ryuichi Takanobu, Wancai Zhang, Xiaochun Cao, Li Yuan

    Abstract: Large language models have demonstrated impressive universal capabilities across a wide range of open-ended tasks and have extended their utility to encompass multimodal conversations. However, existing methods encounter challenges in effectively handling both image and video understanding, particularly with limited visual tokens. In this work, we introduce Chat-UniVi, a Unified Vision-language mo… ▽ More

    Submitted 5 April, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: Accepted by CVPR 2024 (Highlight)

  30. arXiv:2311.01015  [pdf, other

    cs.CV

    Act As You Wish: Fine-Grained Control of Motion Diffusion Model with Hierarchical Semantic Graphs

    Authors: Peng Jin, Yang Wu, Yanbo Fan, Zhongqian Sun, Yang Wei, Li Yuan

    Abstract: Most text-driven human motion generation methods employ sequential modeling approaches, e.g., transformer, to extract sentence-level text representations automatically and implicitly for human motion synthesis. However, these compact text representations may overemphasize the action names at the expense of other important properties and lack fine-grained details to guide the synthesis of subtly di… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: Accepted by NeurIPS 2023

  31. arXiv:2309.13282  [pdf, other

    physics.app-ph

    Convective Heat Transfer in Porous Materials

    Authors: Peng Jin, Gaole Dai, Fubao Yang

    Abstract: Thermal convection stands out as an exceptionally efficient thermal transport mechanism, distinctly separate from conduction and radiation. Yet, the inherently elusive nature of fluid motion poses challenges in accurately controlling convective heat flow. While recent innovations have harnessed thermal convection to achieve effective thermal conductivity, fusing thermal convection in liquids and t… ▽ More

    Submitted 23 September, 2023; originally announced September 2023.

  32. Controlling mass and energy diffusion with metamaterials

    Authors: Fubao Yang, Zeren Zhang, Liujun Xu, Zhoufei Liu, Peng Jin, Pengfei Zhuang, Min Lei, Jinrong Liu, Jian-Hua Jiang, Xiaoping Ouyang, Fabio Marchesoni, Jiping Huang

    Abstract: Diffusion driven by temperature or concentration gradients is a fundamental mechanism of energy and mass transport, which inherently differs from wave propagation in both physical foundations and application prospects. Compared with conventional schemes, metamaterials provide an unprecedented potential for governing diffusion processes, based on emerging theories like the transformation and the sc… ▽ More

    Submitted 15 February, 2024; v1 submitted 9 September, 2023; originally announced September 2023.

    Comments: This review article has been published in Reviews of Modern Physics, volume 96, 015002 (2024)

    Journal ref: Reviews of Modern Physics, volume 96, 015002 (2024)

  33. arXiv:2308.16057  [pdf, other

    physics.app-ph

    Click Metamaterials: Fast Acquisition of Thermal Conductivity and Functionality Diversities

    Authors: Chengmeng Wang, Peng Jin, Fubao Yang, Liujun Xu, Jiping Huang

    Abstract: Material science is an important foundation of modern society development, covering significant areas like chemosynthesis and metamaterials. Click chemistry provides a simple and efficient paradigm for achieving molecular diversity by incorporating modified building blocks into compounds. In contrast, most metamaterial designs are still case by case due to lacking a fundamental mechanism for achie… ▽ More

    Submitted 6 January, 2024; v1 submitted 30 August, 2023; originally announced August 2023.

    Comments: Here, click metamaterials have been proposed and swiftly generate variable thermal conductivities and functionalities by using tunable hollow-filled cells akin to the modified building blocks in click chemistry. This breakthrough holds the promise to transform applications in a range of diffusion and wave systems, thereby having a profound impact on the development of materials science

  34. arXiv:2308.11355  [pdf, ps, other

    math.AG cs.LG math.RT

    Machine learning assisted exploration for affine Deligne-Lusztig varieties

    Authors: Bin Dong, Xuhua He, Pengfei Jin, Felix Schremmer, Qingchao Yu

    Abstract: This paper presents a novel, interdisciplinary study that leverages a Machine Learning (ML) assisted framework to explore the geometry of affine Deligne-Lusztig varieties (ADLV). The primary objective is to investigate the nonemptiness pattern, dimension and enumeration of irreducible components of ADLV. Our proposed framework demonstrates a recursive pipeline of data generation, model training, p… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

    Comments: 36 pages

    MSC Class: 22E35; 22E67

  35. arXiv:2308.08283  [pdf, other

    eess.IV cs.CV cs.LG

    CARE: A Large Scale CT Image Dataset and Clinical Applicable Benchmark Model for Rectal Cancer Segmentation

    Authors: Hantao Zhang, Weidong Guo, Chenyang Qiu, Shouhong Wan, Bingbing Zou, Wanqin Wang, Peiquan Jin

    Abstract: Rectal cancer segmentation of CT image plays a crucial role in timely clinical diagnosis, radiotherapy treatment, and follow-up. Although current segmentation methods have shown promise in delineating cancerous tissues, they still encounter challenges in achieving high segmentation precision. These obstacles arise from the intricate anatomical structures of the rectum and the difficulties in perfo… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

    Comments: 8 pages

  36. arXiv:2308.04020  [pdf, other

    cs.CV

    Synthetic Augmentation with Large-scale Unconditional Pre-training

    Authors: Jiarong Ye, Haomiao Ni, Peng Jin, Sharon X. Huang, Yuan Xue

    Abstract: Deep learning based medical image recognition systems often require a substantial amount of training data with expert annotations, which can be expensive and time-consuming to obtain. Recently, synthetic augmentation techniques have been proposed to mitigate the issue by generating realistic images conditioned on class labels. However, the effectiveness of these methods heavily depends on the repr… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: MICCAI 2023

  37. arXiv:2307.15388  [pdf, other

    cs.LG eess.SP physics.geo-ph

    An Empirical Study of Large-Scale Data-Driven Full Waveform Inversion

    Authors: Peng Jin, Yinan Feng, Shihang Feng, Hanchen Wang, Yinpeng Chen, Benjamin Consolvo, Zicheng Liu, Youzuo Lin

    Abstract: This paper investigates the impact of big data on deep learning models to help solve the full waveform inversion (FWI) problem. While it is well known that big data can boost the performance of deep learning models in many tasks, its effectiveness has not been validated for FWI. To address this gap, we present an empirical study that investigates how deep learning models in FWI behave when trained… ▽ More

    Submitted 24 April, 2024; v1 submitted 28 July, 2023; originally announced July 2023.

  38. arXiv:2307.00458  [pdf

    physics.app-ph

    13.56MHz Rectifying Diodes Based on Metal Halide Perovskite

    Authors: Peng Jin, Xuehui Xu, Zeng Chen, Xu Chen, Tianyu Liu, Hanbo Zhu, Xinya Chen, Yang, Yang

    Abstract: The increasing use of portable and wireless technologies has led to a growing focus on radio-frequency identification (RFID) tags. Among the various devices in RFID tags, rectifying diodes are the most demanding in terms of high-frequency performance, and these diodes are dominated by organic materials. However, their intrinsic low carrier mobility largely limits the rectifying ability of organic… ▽ More

    Submitted 1 July, 2023; originally announced July 2023.

    Comments: 19pages, 8 figures, research article, not published

  39. The mHz quasi-regular modulations of 4U 1630--47 during its 1998 outburst

    Authors: Qingchang Zhao, Hongxing Yin, Lian Tao, Zixu Yang, Jinlu Qu, Liang Zhang, Shu Zhang, Erlin Qiao, Qingcui Bu, Shujie Zhao, Panping Li, Yiming Huang, Ruican Ma, Ruijing Tang, Pei Jin, Wei Yu, Hexin Liu, Yue Huang, Xiang Ma, Jingyu Xiao, Xuan Zhang, Kang Zhao

    Abstract: We present the results of a detailed timing and spectral analysis of the quasi-regular modulation (QRM) phenomenon in the black hole X-ray binary 4U 1630--47 during its 1998 outburst observed by Rossi X-ray Timing Explore (RXTE). We find that the $\sim$ 50-110 mHz QRM is flux dependent, and the QRM is detected with simultaneous low frequency quasi-periodic oscillations (LFQPOs). According to the b… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: 14pages, 15 figures

  40. arXiv:2306.12456  [pdf, other

    cs.AI cs.AR

    Pushing the Limits of Machine Design: Automated CPU Design with AI

    Authors: Shuyao Cheng, Pengwei Jin, Qi Guo, Zidong Du, Rui Zhang, Yunhao Tian, Xing Hu, Yongwei Zhao, Yifan Hao, Xiangtao Guan, Husheng Han, Zhengyue Zhao, Ximing Liu, Ling Li, Xishan Zhang, Yuejie Chu, Weilong Mao, Tianshi Chen, Yunji Chen

    Abstract: Design activity -- constructing an artifact description satisfying given goals and constraints -- distinguishes humanity from other animals and traditional machines, and endowing machines with design abilities at the human level or beyond has been a long-term pursuit. Though machines have already demonstrated their abilities in designing new materials, proteins, and computer programs with advanced… ▽ More

    Submitted 27 June, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

    Comments: 28 pages

  41. arXiv:2306.12386  [pdf, other

    physics.geo-ph cs.LG

    $\mathbf{\mathbb{E}^{FWI}}$: Multi-parameter Benchmark Datasets for Elastic Full Waveform Inversion of Geophysical Properties

    Authors: Shihang Feng, Hanchen Wang, Chengyuan Deng, Yinan Feng, Yanhua Liu, Min Zhu, Peng Jin, Yinpeng Chen, Youzuo Lin

    Abstract: Elastic geophysical properties (such as P- and S-wave velocities) are of great importance to various subsurface applications like CO$_2$ sequestration and energy exploration (e.g., hydrogen and geothermal). Elastic full waveform inversion (FWI) is widely applied for characterizing reservoir properties. In this paper, we introduce $\mathbf{\mathbb{E}^{FWI}}$, a comprehensive benchmark dataset that… ▽ More

    Submitted 7 September, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

    Comments: 20 pages, 11 figures

  42. arXiv:2306.10750  [pdf, other

    cs.CV cs.MM

    WiCo: Win-win Cooperation of Bottom-up and Top-down Referring Image Segmentation

    Authors: Zesen Cheng, Peng Jin, Hao Li, Kehan Li, Siheng Li, Xiangyang Ji, Chang Liu, Jie Chen

    Abstract: The top-down and bottom-up methods are two mainstreams of referring segmentation, while both methods have their own intrinsic weaknesses. Top-down methods are chiefly disturbed by Polar Negative (PN) errors owing to the lack of fine-grained cross-modal alignment. Bottom-up methods are mainly perturbed by Inferior Positive (IP) errors due to the lack of prior object information. Nevertheless, we di… ▽ More

    Submitted 19 June, 2023; originally announced June 2023.

    Comments: Accepted to IJCAI2023

  43. arXiv:2306.05445  [pdf, other

    physics.chem-ph cs.LG q-bio.BM

    Towards Predicting Equilibrium Distributions for Molecular Systems with Deep Learning

    Authors: Shuxin Zheng, Jiyan He, Chang Liu, Yu Shi, Ziheng Lu, Weitao Feng, Fusong Ju, Jiaxi Wang, Jianwei Zhu, Yaosen Min, He Zhang, Shidi Tang, Hongxia Hao, Peiran Jin, Chi Chen, Frank Noé, Haiguang Liu, Tie-Yan Liu

    Abstract: Advances in deep learning have greatly improved structure prediction of molecules. However, many macroscopic observations that are important for real-world applications are not functions of a single molecular structure, but rather determined from the equilibrium distribution of structures. Traditional methods for obtaining these distributions, such as molecular dynamics simulation, are computation… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: 80 pages, 11 figures

  44. arXiv:2306.04565  [pdf, ps, other

    math.NT math.CO

    Prime sum graphs and the induced trees they contain

    Authors: Ernie Croot, Patrick Jin

    Abstract: In this paper we show that prime sum graphs on $n$ vertices -- which are graphs on vertex set $\{1,2,...,n\}$ where $ij$ is an edge when $i+j$ is prime -- contain all trees with at most $\exp( c \log n / \log\log n)$ vertices as induced subgraphs. We also prove some results for related graphs, and end with some unsolved problems.

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: This was part of an undergrad research project that Patrick did with me last year

  45. arXiv:2305.18498  [pdf, other

    cs.PL cs.AI cs.CL cs.LG

    ANPL: Towards Natural Programming with Interactive Decomposition

    Authors: Di Huang, Ziyuan Nan, Xing Hu, Pengwei Jin, Shaohui Peng, Yuanbo Wen, Rui Zhang, Zidong Du, Qi Guo, Yewen Pu, Yunji Chen

    Abstract: Though LLMs are capable of generating plausible programs, it's challenging to interact with the LLMs further to revise the program, especially if the user's specific requirements are different from the initial proposal. In this paper, we introduce ANPL, an interactive programming system that ensures users can always refine the generated code towards their specific programmatic intents via structur… ▽ More

    Submitted 30 November, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

  46. arXiv:2305.18084  [pdf, other

    cs.SE

    Assess and Summarize: Improve Outage Understanding with Large Language Models

    Authors: Pengxiang Jin, Shenglin Zhang, Minghua Ma, Haozhe Li, Yu Kang, Liqun Li, Yudong Liu, Bo Qiao, Chaoyun Zhang, Pu Zhao, Shilin He, Federica Sarro, Yingnong Dang, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang

    Abstract: Cloud systems have become increasingly popular in recent years due to their flexibility and scalability. Each time cloud computing applications and services hosted on the cloud are affected by a cloud outage, users can experience slow response times, connection issues or total service disruption, resulting in a significant negative business impact. Outages are usually comprised of several concurri… ▽ More

    Submitted 29 May, 2023; originally announced May 2023.

  47. arXiv:2305.13314  [pdf, other

    physics.geo-ph cs.LG eess.SP

    Auto-Linear Phenomenon in Subsurface Imaging

    Authors: Yinan Feng, Yinpeng Chen, Peng Jin, Shihang Feng, Zicheng Liu, Youzuo Lin

    Abstract: Subsurface imaging involves solving full waveform inversion (FWI) to predict geophysical properties from measurements. This problem can be reframed as an image-to-image translation, with the usual approach being to train an encoder-decoder network using paired data from two domains: geophysical property and measurement. A recent seminal work (InvLINT) demonstrates there is only a linear mapping be… ▽ More

    Submitted 21 May, 2024; v1 submitted 27 April, 2023; originally announced May 2023.

  48. arXiv:2305.12218  [pdf, other

    cs.CV cs.AI cs.IR

    Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set Alignment

    Authors: Peng Jin, Hao Li, Zesen Cheng, Jinfa Huang, Zhennan Wang, Li Yuan, Chang Liu, Jie Chen

    Abstract: Text-video retrieval is a challenging cross-modal task, which aims to align visual entities with natural language descriptions. Current methods either fail to leverage the local details or are computationally expensive. What's worse, they fail to leverage the heterogeneous concepts in data. In this paper, we propose the Disentangled Conceptualization and Set-to-set Alignment (DiCoSA) to simulate t… ▽ More

    Submitted 20 May, 2023; originally announced May 2023.

    Comments: IJCAI 2023

  49. arXiv:2305.10049  [pdf, other

    cs.CV cs.MM

    TG-VQA: Ternary Game of Video Question Answering

    Authors: Hao Li, Peng Jin, Zesen Cheng, Songyang Zhang, Kai Chen, Zhennan Wang, Chang Liu, Jie Chen

    Abstract: Video question answering aims at answering a question about the video content by reasoning the alignment semantics within them. However, since relying heavily on human instructions, i.e., annotations or priors, current contrastive learning-based VideoQA methods remains challenging to perform fine-grained visual-linguistic alignments. In this work, we innovatively resort to game theory, which can s… ▽ More

    Submitted 18 May, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: IJCAI 2023

  50. arXiv:2304.07143  [pdf, other

    eess.SY cs.AI

    Car-Following Models: A Multidisciplinary Review

    Authors: Tianya Terry Zhang, Ph. D., Peter J. Jin, Ph. D., Sean T. McQuade, Ph. D., Alexandre Bayen, Ph. D., Benedetto Piccoli

    Abstract: Car-following (CF) algorithms are crucial components of traffic simulations and have been integrated into many production vehicles equipped with Advanced Driving Assistance Systems (ADAS). Insights from the model of car-following behavior help us understand the causes of various macro phenomena that arise from interactions between pairs of vehicles. Car-following models encompass multiple discipli… ▽ More

    Submitted 5 March, 2024; v1 submitted 14 April, 2023; originally announced April 2023.