Skip to main content

Showing 1–50 of 1,317 results for author: Yang, R

  1. arXiv:2407.11401  [pdf, other

    cs.CV cs.IR

    EndoFinder: Online Image Retrieval for Explainable Colorectal Polyp Diagnosis

    Authors: Ruijie Yang, Yan Zhu, Peiyao Fu, Yizhe Zhang, Zhihua Wang, Quanlin Li, Pinghong Zhou, Xian Yang, Shuo Wang

    Abstract: Determining the necessity of resecting malignant polyps during colonoscopy screen is crucial for patient outcomes, yet challenging due to the time-consuming and costly nature of histopathology examination. While deep learning-based classification models have shown promise in achieving optical biopsy with endoscopic images, they often suffer from a lack of explainability. To overcome this limitatio… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: MICCAI 2024

  2. arXiv:2407.10794  [pdf, other

    cs.CL cs.AI

    Graphusion: Leveraging Large Language Models for Scientific Knowledge Graph Fusion and Construction in NLP Education

    Authors: Rui Yang, Boming Yang, Sixun Ouyang, Tianwei She, Aosong Feng, Yuang Jiang, Freddy Lecue, Jinghui Lu, Irene Li

    Abstract: Knowledge graphs (KGs) are crucial in the field of artificial intelligence and are widely applied in downstream tasks, such as enhancing Question Answering (QA) systems. The construction of KGs typically requires significant effort from domain experts. Recently, Large Language Models (LLMs) have been used for knowledge graph construction (KGC), however, most existing approaches focus on a local pe… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 24 pages, 11 figures, 13 tables. arXiv admin note: substantial text overlap with arXiv:2402.14293

  3. arXiv:2407.10175  [pdf, other

    stat.AP econ.EM q-fin.PM q-fin.ST

    Low Volatility Stock Portfolio Through High Dimensional Bayesian Cointegration

    Authors: Parley R Yang, Alexander Y Shestopaloff

    Abstract: We employ a Bayesian modelling technique for high dimensional cointegration estimation to construct low volatility portfolios from a large number of stocks. The proposed Bayesian framework effectively identifies sparse and important cointegration relationships amongst large baskets of stocks across various asset spaces, resulting in portfolios with reduced volatility. Such cointegration relationsh… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

  4. arXiv:2407.08586  [pdf, other

    nucl-ex

    Centrality dependence of Lévy-stable two-pion Bose-Einstein correlations in $\sqrt{s_{_{NN}}}=200$ GeV Au$+$Au collisions

    Authors: PHENIX Collaboration, N. J. Abdulameer, U. Acharya, A. Adare, C. Aidala, N. N. Ajitanand, Y. Akiba, R. Akimoto, H. Al-Ta'ani, J. Alexander, A. Angerami, K. Aoki, N. Apadula, Y. Aramaki, H. Asano, E. C. Aschenauer, E. T. Atomssa, T. C. Awes, B. Azmoun, V. Babintsev, M. Bai, B. Bannier, K. N. Barish, B. Bassalleck, S. Bathe , et al. (377 additional authors not shown)

    Abstract: The PHENIX experiment measured the centrality dependence of two-pion Bose-Einstein correlation functions in $\sqrt{s_{_{NN}}}=200$~GeV Au$+$Au collisions at the Relativistic Heavy Ion Collider at Brookhaven National Laboratory. The data are well represented by Lévy-stable source distributions. The extracted source parameters are the correlation-strength parameter $λ$, the Lévy index of stability… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 401 authors from 75 institutions, 20 pages, 15 figures, 2 tables. v1 is version submitted to Physical Review C. HEPdata tables for the points plotted in figures for this and previous PHENIX publications are (or will be) publicly available at http://www.phenix.bnl.gov/papers.html

  5. arXiv:2407.07913  [pdf, other

    cs.IR cs.AI

    CaseGPT: a case reasoning framework based on language models and retrieval-augmented generation

    Authors: Rui Yang

    Abstract: This paper presents CaseGPT, an innovative approach that combines Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG) technology to enhance case-based reasoning in the healthcare and legal sectors. The system addresses the challenges of traditional database queries by enabling fuzzy searches based on imprecise descriptions, thereby improving data searchability and usability. Case… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: Submitted to ICCBR

  6. arXiv:2407.07059  [pdf, other

    q-bio.NC cs.LG

    Differentiable Optimization of Similarity Scores Between Models and Brains

    Authors: Nathan Cloos, Moufan Li, Markus Siegel, Scott L. Brincat, Earl K. Miller, Guangyu Robert Yang, Christopher J. Cueva

    Abstract: What metrics should guide the development of more realistic models of the brain? One proposal is to quantify the similarity between models and brains using methods such as linear regression, Centered Kernel Alignment (CKA), and angular Procrustes distance. To better understand the limitations of these similarity measures we analyze neural activity recorded in five experiments on nonhuman primates,… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 16 pages, 6 figures

  7. arXiv:2407.04285  [pdf, other

    cs.LG cs.AI

    Robust Decision Transformer: Tackling Data Corruption in Offline RL via Sequence Modeling

    Authors: Jiawei Xu, Rui Yang, Feng Luo, Meng Fang, Baoxiang Wang, Lei Han

    Abstract: Learning policies from offline datasets through offline reinforcement learning (RL) holds promise for scaling data-driven decision-making and avoiding unsafe and costly online interactions. However, real-world data collected from sensors or humans often contains noise and errors, posing a significant challenge for existing offline RL methods. Our study indicates that traditional offline RL methods… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  8. arXiv:2407.04232  [pdf

    q-bio.QM physics.bio-ph q-bio.BM q-bio.SC

    A Unified Intracellular pH Landscape with SITE-pHorin: a Quantum-Entanglement-Enhanced pH Probe

    Authors: Shu-Ang Li, Xiao-Yan Meng, Su Zhang, Ying-Jie Zhang, Run-Zhou Yang, Dian-Dian Wang, Yang Yang, Pei-Pei Liu, Jian-Sheng Kang

    Abstract: An accurate map of intracellular organelle pH is crucial for comprehending cellular metabolism and organellar functions. However, a unified intracellular pH spectrum using a single probe is still lack. Here, we developed a novel quantum entanglement-enhanced pH-sensitive probe called SITE-pHorin, which featured a wide pH-sensitive range and ratiometric quantitative measurement capabilities. Subseq… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 64 pages, 7 figures, the supplemental material contains 13 supplemental figures and 4 supplemental tables

  9. arXiv:2407.03162  [pdf, other

    cs.RO cs.CV cs.LG

    Bunny-VisionPro: Real-Time Bimanual Dexterous Teleoperation for Imitation Learning

    Authors: Runyu Ding, Yuzhe Qin, Jiyue Zhu, Chengzhe Jia, Shiqi Yang, Ruihan Yang, Xiaojuan Qi, Xiaolong Wang

    Abstract: Teleoperation is a crucial tool for collecting human demonstrations, but controlling robots with bimanual dexterous hands remains a challenge. Existing teleoperation systems struggle to handle the complexity of coordinating two hands for intricate manipulations. We introduce Bunny-VisionPro, a real-time bimanual dexterous teleoperation system that leverages a VR headset. Unlike previous vision-bas… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: project page: https://dingry.github.io/projects/bunny_visionpro.html

  10. arXiv:2406.19414  [pdf, other

    q-fin.ST cs.LG q-fin.PR stat.AP stat.ML stat.OT

    Stock Volume Forecasting with Advanced Information by Conditional Variational Auto-Encoder

    Authors: Parley R Yang, Alexander Y Shestopaloff

    Abstract: We demonstrate the use of Conditional Variational Encoder (CVAE) to improve the forecasts of daily stock volume time series in both short and long term forecasting tasks, with the use of advanced information of input variables such as rebalancing dates. CVAE generates non-linear time series as out-of-sample forecasts, which have better accuracy and closer fit of correlation to the actual data, com… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  11. arXiv:2406.19034  [pdf, other

    astro-ph.HE

    Extended GeV $γ$-ray emission around the star forming region of the W3 complex

    Authors: Qihang Wu, Xiaona Sun, Ruizhi Yang, Tingting Ge, Yunfeng Liang, Enwei Liang

    Abstract: We analyze the GeV $γ$-ray emission from the W3 complex using about 14 years of Pass 8 data recorded by the $\it Fermi$ Large Area Telescope (\textit{Fermi}-LAT). We resolve the $γ$-ray emissions around W3 into two components: an elliptical Gaussian overlapping with the molecular gas and a point-like source near the cluster W3 Main. The pion-bump feature of SED for the elliptical Gaussian together… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  12. arXiv:2406.17624  [pdf, other

    cs.CL cs.AI

    Self-assessment, Exhibition, and Recognition: a Review of Personality in Large Language Models

    Authors: Zhiyuan Wen, Yu Yang, Jiannong Cao, Haoming Sun, Ruosong Yang, Shuaiqi Liu

    Abstract: As large language models (LLMs) appear to behave increasingly human-like in text-based interactions, more and more researchers become interested in investigating personality in LLMs. However, the diversity of psychological personality research and the rapid development of LLMs have led to a broad yet fragmented landscape of studies in this interdisciplinary field. Extensive studies across differen… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  13. arXiv:2406.17274  [pdf, other

    cs.CL cs.LG

    Can We Trust the Performance Evaluation of Uncertainty Estimation Methods in Text Summarization?

    Authors: Jianfeng He, Runing Yang, Linlin Yu, Changbin Li, Ruoxi Jia, Feng Chen, Ming Jin, Chang-Tien Lu

    Abstract: Text summarization, a key natural language generation (NLG) task, is vital in various domains. However, the high cost of inaccurate summaries in risk-critical applications, particularly those involving human-in-the-loop decision-making, raises concerns about the reliability of uncertainty estimation on text summarization (UE-TS) evaluation methods. This concern stems from the dependency of uncerta… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 63 pages, 41 figures, 11 tables

  14. arXiv:2406.14128  [pdf, ps, other

    astro-ph.HE

    Identifying Three New AGNs Among Fermi Unidentified Gigaelectronvolt Sources

    Authors: Shunhao Ji, Zhongxiang Wang, Qiangmeng Huang, Ruoheng Yang

    Abstract: We report our identification of three gigaelectronvolt $γ$-ray sources, 4FGL J0502.6+0036, 4FGL J1055.9+6507, and 4FGL J1708.2+5519, as Active Galactic Nuclei (AGNs). They are listed in the latest Fermi-LAT source catalog as unidentified ones. We find that the sources all showed $γ$-ray flux variations in recent years. Using different survey catalogs, we are able to find a radio source within the… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 15 pages, 7 figures, 2 tables, accepted to be published in RAA

  15. arXiv:2406.13369  [pdf, other

    cs.LG cs.SI

    Effective Edge-wise Representation Learning in Edge-Attributed Bipartite Graphs

    Authors: Hewen Wang, Renchi Yang, Xiaokui Xiao

    Abstract: Graph representation learning (GRL) is to encode graph elements into informative vector representations, which can be used in downstream tasks for analyzing graph-structured data and has seen extensive applications in various domains. However, the majority of extant studies on GRL are geared towards generating node representations, which cannot be readily employed to perform edge-based analytics t… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 11 pages. Full version of the research paper accepted to KDD 2024

  16. arXiv:2406.12556  [pdf, other

    cs.NI

    Towards Deep Application-Network Integration: Architectures, Progress and Opportunities

    Authors: Berta Serracanta, Kai Gao, Jordi Ros-Giralt, Alberto Rodriguez-Natal, Luis M. Contreras, Richard Yang, Albert Cabellos

    Abstract: With the rise of a new generation of applications (e.g., virtual and augmented reality, artificial intelligence, etc) demanding stringent performance requirements, the need for networking solutions and architectures that can enable a higher Quality of Experience (QoE) is becoming increasingly important. While jointly optimizing application and network may increase the applications' QoE and simul… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  17. arXiv:2406.12449  [pdf

    cs.AI

    Retrieval-Augmented Generation for Generative Artificial Intelligence in Medicine

    Authors: Rui Yang, Yilin Ning, Emilia Keppo, Mingxuan Liu, Chuan Hong, Danielle S Bitterman, Jasmine Chiat Ling Ong, Daniel Shu Wei Ting, Nan Liu

    Abstract: Generative artificial intelligence (AI) has brought revolutionary innovations in various fields, including medicine. However, it also exhibits limitations. In response, retrieval-augmented generation (RAG) provides a potential solution, enabling models to generate more accurate contents by leveraging the retrieval of external knowledge. With the rapid advancement of generative AI, RAG can pave the… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  18. arXiv:2406.12367  [pdf, other

    cs.CV cs.LG cs.MM

    Competitive Learning for Achieving Content-specific Filters in Video Coding for Machines

    Authors: Honglei Zhang, Jukka I. Ahonen, Nam Le, Ruiying Yang, Francesco Cricri

    Abstract: This paper investigates the efficacy of jointly optimizing content-specific post-processing filters to adapt a human oriented video/image codec into a codec suitable for machine vision tasks. By observing that artifacts produced by video/image codecs are content-dependent, we propose a novel training strategy based on competitive learning principles. This strategy assigns training samples to filte… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Accepted to be preseneted in ICIP 2024

  19. Pluriharmonic solutions to Yang-Mills equations: a $C^*$-algebras approach

    Authors: Marius Beceanu, Sachin Munshi, Rongwei Yang

    Abstract: This partially expository paper provides a view of Yang-Mills equations from the perspective of complex variables, operator theory, and $C^{*}$-algebras. Through operator-valued pluriharmonic and skew-Hermitian differential forms, it constructs a new class of instanton solutions. Furthermore, it provides a complex variable version of the Yang-Mills Lagrangian and the Belavin-Polyakov-Schwartz-Tyup… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  20. arXiv:2406.12053  [pdf, other

    cs.CL

    InternalInspector $I^2$: Robust Confidence Estimation in LLMs through Internal States

    Authors: Mohammad Beigi, Ying Shen, Runing Yang, Zihao Lin, Qifan Wang, Ankith Mohan, Jianfeng He, Ming Jin, Chang-Tien Lu, Lifu Huang

    Abstract: Despite their vast capabilities, Large Language Models (LLMs) often struggle with generating reliable outputs, frequently producing high-confidence inaccuracies known as hallucinations. Addressing this challenge, our research introduces InternalInspector, a novel framework designed to enhance confidence estimation in LLMs by leveraging contrastive learning on internal states including attention st… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 8 pages

  21. arXiv:2406.10983  [pdf, ps, other

    quant-ph

    Expressibility of linear combination of ansatz circuits

    Authors: Peng Wang, Ruyu Yang

    Abstract: Variational Quantum Eigensolver is considered promising for medium-scale noisy quantum computers. Expressibility is an important metric for measuring the capability of a variational quantum Ansatz circuit. A commonly used method to increase expressibility is to increase the circuit depth. However, increasing the circuit depth also introduces more noise. We propose to use a linear combination of an… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: 10pages, 9figures

  22. arXiv:2406.10216  [pdf, other

    cs.CL cs.AI

    Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs

    Authors: Rui Yang, Ruomeng Ding, Yong Lin, Huan Zhang, Tong Zhang

    Abstract: Reward models trained on human preference data have been proven to be effective for aligning Large Language Models (LLMs) with human intent within the reinforcement learning from human feedback (RLHF) framework. However, the generalization capabilities of current reward models to unseen prompts and responses are limited. This limitation can lead to an unexpected phenomenon known as reward over-opt… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 21 pages

  23. arXiv:2406.08698  [pdf, other

    astro-ph.HE hep-ph

    Constraints on Ultra Heavy Dark Matter Properties from Dwarf Spheroidal Galaxies with LHAASO Observations

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

    Abstract: In this work we try to search for signals generated by ultra-heavy dark matter at the Large High Altitude Air Shower Observatory (LHAASO) data. We look for possible gamma-ray by dark matter annihilation or decay from 16 dwarf spheroidal galaxies in the field of view of LHAASO. Dwarf spheroidal galaxies are among the most promising targets for indirect detection of dark matter which have low fluxes… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 17 pages, 12 figures, accepted by PRL

  24. arXiv:2406.08301  [pdf, other

    nucl-ex

    Jet modification via $π^0$-hadron correlations in Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV

    Authors: PHENIX Collaboration, N. J. Abdulameer, U. Acharya, A. Adare, S. Afanasiev, C. Aidala, N. N. Ajitanand, Y. Akiba, H. Al-Bataineh, J. Alexander, M. Alfred, K. Aoki, N. Apadula, L. Aphecetche, J. Asai, H. Asano, E. T. Atomssa, R. Averbeck, T. C. Awes, B. Azmoun, V. Babintsev, M. Bai, G. Baksay, L. Baksay, A. Baldisseri , et al. (510 additional authors not shown)

    Abstract: High-momentum two-particle correlations are a useful tool for studying jet-quenching effects in the quark-gluon plasma. Angular correlations between neutral-pion triggers and charged hadrons with transverse momenta in the range 4--12~GeV/$c$ and 0.5--7~GeV/$c$, respectively, have been measured by the PHENIX experiment in 2014 for Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$~GeV. Suppression is obs… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 534 authors from 83 institutions, 12 pages, 7 figures. v1 is version submitted to Physical Review C. HEPdata tables for the points plotted in figures for this and previous PHENIX publications are (or will be) publicly available at http://www.phenix.bnl.gov/papers.html

  25. arXiv:2406.07801  [pdf, other

    cs.CL cs.SD eess.AS

    PolySpeech: Exploring Unified Multitask Speech Models for Competitiveness with Single-task Models

    Authors: Runyan Yang, Huibao Yang, Xiqing Zhang, Tiantian Ye, Ying Liu, Yingying Gao, Shilei Zhang, Chao Deng, Junlan Feng

    Abstract: Recently, there have been attempts to integrate various speech processing tasks into a unified model. However, few previous works directly demonstrated that joint optimization of diverse tasks in multitask speech models has positive influence on the performance of individual tasks. In this paper we present a multitask speech model -- PolySpeech, which supports speech recognition, speech synthesis,… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 5 pages, 2 figures

  26. arXiv:2406.05521  [pdf, other

    math.RT math.AG math.QA

    Gaiotto Conjecture for $\mathrm{Rep}_q(\mathrm{F}(4))$

    Authors: Michael Finkelberg, Roman Travkin, Ruotao Yang

    Abstract: This paper is a part of the series proving the Gaiotto conjecture for basic classical quantum supergroups. The previous part arXiv:2107.02653 [math.RT] , arXiv:2306.09556 [math.RT], proved the Gaiotto conjecture for the general linear quantum supergroups $U_q(\mathfrak{gl}(N|M))$. Here we deal with the exceptional quantum supergroup $U_q(\mathfrak{f}(4))$.

    Submitted 8 June, 2024; originally announced June 2024.

    Comments: Comments welcome! 50 pages

  27. arXiv:2406.05482  [pdf, other

    cs.LG

    Efficient Topology-aware Data Augmentation for High-Degree Graph Neural Networks

    Authors: Yurui Lai, Xiaoyang Lin, Renchi Yang, Hongtao Wang

    Abstract: In recent years, graph neural networks (GNNs) have emerged as a potent tool for learning on graph-structured data and won fruitful successes in varied fields. The majority of GNNs follow the message-passing paradigm, where representations of each node are learned by recursively aggregating features of its neighbors. However, this mechanism brings severe over-smoothing and efficiency issues over hi… ▽ More

    Submitted 17 June, 2024; v1 submitted 8 June, 2024; originally announced June 2024.

    Comments: This is the technical report for the paper accepted to KDD 2024. 16 pages

  28. arXiv:2406.04784  [pdf, other

    cs.CL cs.AI

    SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals

    Authors: Ruihan Yang, Jiangjie Chen, Yikai Zhang, Siyu Yuan, Aili Chen, Kyle Richardson, Yanghua Xiao, Deqing Yang

    Abstract: Language agents powered by large language models (LLMs) are increasingly valuable as decision-making tools in domains such as gaming and programming. However, these agents often face challenges in achieving high-level goals without detailed instructions and in adapting to environments where feedback is delayed. In this paper, we present SelfGoal, a novel automatic approach designed to enhance agen… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Preprint

  29. arXiv:2406.03691  [pdf, other

    astro-ph.HE

    Precise measurement of pion-bump structure using future MeV gamma-ray detectors

    Authors: Jiahao Liu, Bing Liu, Ruizhi Yang

    Abstract: The pion-bump structure in the gamma-ray spectrum is a direct proof for the hadronic origin of the gamma rays, and thus the decisive evidence for the acceleration of hadronic cosmic rays in astrophysical objects. However, the identification of such a spectral feature is limited by the resolution and energy coverage of current gamma-ray instruments. Furthermore, there are unavoidable bremsstrahlung… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 7 pages, 4 figures, submitted to PRD

  30. arXiv:2406.03320  [pdf, other

    astro-ph.HE

    Detection of extended gamma-ray emission in the vicinity of Cl Danks 1 and 2

    Authors: Jiahao Liu, Bing Liu, Ruizhi Yang

    Abstract: We report the detection of high-energy gamma-ray emission towards the G305 star-forming region. Using almost 15 years of observation data from {\sl Fermi} Large Area Telescope, we detected an extended gamma-ray source in this region with a significance of $\sim 13 σ$. The gamma-ray radiation reveals a clear pion-bump feature and can be fitted with the power law parent proton spectrum with an index… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 9 pages, 4 figures, submitted to APJL

  31. arXiv:2406.02222  [pdf, other

    cs.SE

    Towards an Extensible Model-Based Digital Twin Framework for Space Launch Vehicles

    Authors: Ran Wei, Ruizhe Yang, Shijun Liu, Chongsheng Fan, Rong Zhou, Zekun Wu, Haochi Wang, Yifan Cai, Zhe Jiang

    Abstract: The concept of Digital Twin (DT) is increasingly applied to systems on different levels of abstraction across domains, to support monitoring, analysis, diagnosis, decision making and automated control. Whilst the interest in applying DT is growing, the definition of DT is unclear, neither is there a clear pathway to develop DT to fully realise its capacities. In this paper, we revise the concept o… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  32. arXiv:2406.02143  [pdf, other

    cs.CL

    Reinforcement Tuning for Detecting Stances and Debunking Rumors Jointly with Large Language Models

    Authors: Ruichao Yang, Wei Gao, Jing Ma, Hongzhan Lin, Bo Wang

    Abstract: Learning multi-task models for jointly detecting stance and verifying rumors poses challenges due to the need for training data of stance at post level and rumor veracity at claim level, which are difficult to obtain. To address this issue, we leverage large language models (LLMs) as the foundation annotators for the joint stance detection (SD) and rumor verification (RV) tasks, dubbed as JSDRV. W… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: ACL 2024 (Findings)

  33. arXiv:2406.02038  [pdf, other

    cs.CV

    Leveraging Predicate and Triplet Learning for Scene Graph Generation

    Authors: Jiankai Li, Yunhong Wang, Xiefan Guo, Ruijie Yang, Weixin Li

    Abstract: Scene Graph Generation (SGG) aims to identify entities and predict the relationship triplets \textit{\textless subject, predicate, object\textgreater } in visual scenes. Given the prevalence of large visual variations of subject-object pairs even in the same predicate, it can be quite challenging to model and refine predicate representations directly across such pairs, which is however a common st… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: CVPR 2024

  34. arXiv:2406.01584  [pdf, other

    cs.CV

    SpatialRGPT: Grounded Spatial Reasoning in Vision Language Model

    Authors: An-Chieh Cheng, Hongxu Yin, Yang Fu, Qiushan Guo, Ruihan Yang, Jan Kautz, Xiaolong Wang, Sifei Liu

    Abstract: Vision Language Models (VLMs) have demonstrated remarkable performance in 2D vision and language tasks. However, their ability to reason about spatial arrangements remains limited. In this work, we introduce Spatial Region GPT (SpatialRGPT) to enhance VLMs' spatial perception and reasoning capabilities. SpatialRGPT advances VLMs' spatial understanding through two key innovations: (1) a data curati… ▽ More

    Submitted 18 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: Project Page: https://www.anjiecheng.me/SpatialRGPT

  35. arXiv:2406.01069  [pdf, other

    cs.CV

    UniQA: Unified Vision-Language Pre-training for Image Quality and Aesthetic Assessment

    Authors: Hantao Zhou, Longxiang Tang, Rui Yang, Guanyi Qin, Yan Zhang, Runze Hu, Xiu Li

    Abstract: Image Quality Assessment (IQA) and Image Aesthetic Assessment (IAA) aim to simulate human subjective perception of image visual quality and aesthetic appeal. Existing methods typically address these tasks independently due to distinct learning objectives. However, they neglect the underlying interconnectedness of both tasks, which hinders the learning of task-agnostic shared representations for hu… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  36. arXiv:2405.19624  [pdf, other

    hep-ph

    Single spin asymmetry $ A _{ U L } ^ { \sin ( 2 φ_ { h } ) }$ in dihadron production in SIDIS

    Authors: Ren Yang, Yangyang Yu, Qihang Zhou, Gang Li, Mao Song, Xuan Luo

    Abstract: The paper calculates the helicity-dependent dihadron fragmentation function (DiFF), by extending the dihadron spectator model and examine the single longitudinal spin asymmetry $A^{\sin(2φ_h)}_{UL}$ from dihadron in semi-inclusive inelastic scattering (SIDIS). This function elucidates the relationship between the longitudinal polarization of the fragmented quark and the transverse momentum of the… ▽ More

    Submitted 24 June, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

    Comments: 10 pages,10 figures. Version appearing in PRD

    Journal ref: Phys. Rev. D 109, 114038 (2024)

  37. arXiv:2405.19372  [pdf, other

    math.FA

    On a conjecture about generalized integration operators on Hardy spaces

    Authors: Rong Yang, Songxiao Li

    Abstract: A conjecture posed by Chalmoukis in 2020 states that if $T_{g,a}:H^p\to H^q(0<q<p<\infty)$ is bounded, then $g$ must be in $H^{\frac{pq}{p-q}}$. In this article, we provide a positive answer to the aforementioned conjecture. We also consider the compactness of $T_{g,a}:H^p\to H^q(0<q<p<\infty)$.

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: This paper was finished and submitted to manuscripta mathematica on April 24, 2024. In May 24, we found that Nikolaos Chalmoukis and Georgios Nikolaidis have also independently proven this conjecture on arXiv. See arXiv:2405.13920. arXiv admin note: substantial text overlap with arXiv:2405.16278

  38. arXiv:2405.18959  [pdf, other

    cs.CV cs.MM

    Transcending Fusion: A Multi-Scale Alignment Method for Remote Sensing Image-Text Retrieval

    Authors: Rui Yang, Shuang Wang, Yingping Han, Yuanheng Li, Dong Zhao, Dou Quan, Yanhe Guo, Licheng Jiao

    Abstract: Remote Sensing Image-Text Retrieval (RSITR) is pivotal for knowledge services and data mining in the remote sensing (RS) domain. Considering the multi-scale representations in image content and text vocabulary can enable the models to learn richer representations and enhance retrieval. Current multi-scale RSITR approaches typically align multi-scale fused image features with text features, but ove… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 16 pages, 9 figures

  39. arXiv:2405.18525  [pdf, other

    cs.CV

    REPARO: Compositional 3D Assets Generation with Differentiable 3D Layout Alignment

    Authors: Haonan Han, Rui Yang, Huan Liao, Jiankai Xing, Zunnan Xu, Xiaoming Yu, Junwei Zha, Xiu Li, Wanhua Li

    Abstract: Traditional image-to-3D models often struggle with scenes containing multiple objects due to biases and occlusion complexities. To address this challenge, we present REPARO, a novel approach for compositional 3D asset generation from single images. REPARO employs a two-step process: first, it extracts individual objects from the scene and reconstructs their 3D meshes using off-the-shelf image-to-3… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  40. arXiv:2405.17673  [pdf, other

    cs.CV cs.LG stat.ML

    Fast Samplers for Inverse Problems in Iterative Refinement Models

    Authors: Kushagra Pandey, Ruihan Yang, Stephan Mandt

    Abstract: Constructing fast samplers for unconditional diffusion and flow-matching models has received much attention recently; however, existing methods for solving inverse problems, such as super-resolution, inpainting, or deblurring, still require hundreds to thousands of iterative steps to obtain high-quality results. We propose a plug-and-play framework for constructing efficient samplers for inverse p… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  41. arXiv:2405.17212  [pdf, ps, other

    gr-qc astro-ph.CO

    A new parametrization of Hubble function and Hubble tension

    Authors: Tong-Yu He, Jia-Jun Yin, Zhen-Yu Wang, Zhan-Wen Han, Rong-Jia Yang

    Abstract: We present a new Hubble parameterization method and employ observational data from Hubble, Pantheon, and Baryon Acoustic Oscillations to constrain model parameters. The proposed method is thoroughly validated against these datasets, demonstrating a robust fit to the observational data. The obtained best-fit values are $H_0 = 67.5^{+1.3}_{-1.6}$ $\text{km s}^{-1} \text{Mpc}^{-1}$,… ▽ More

    Submitted 16 June, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

    Comments: 11 pages, 5 figures

  42. arXiv:2405.16850  [pdf, other

    eess.IV cs.CV cs.LG

    UniCompress: Enhancing Multi-Data Medical Image Compression with Knowledge Distillation

    Authors: Runzhao Yang, Yinda Chen, Zhihong Zhang, Xiaoyu Liu, Zongren Li, Kunlun He, Zhiwei Xiong, Jinli Suo, Qionghai Dai

    Abstract: In the field of medical image compression, Implicit Neural Representation (INR) networks have shown remarkable versatility due to their flexible compression ratios, yet they are constrained by a one-to-one fitting approach that results in lengthy encoding times. Our novel method, ``\textbf{UniCompress}'', innovatively extends the compression capabilities of INR by being the first to compress multi… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  43. arXiv:2405.16726  [pdf, other

    cs.LG

    Exploring Edge Probability Graph Models Beyond Edge Independency: Concepts, Analyses, and Algorithms

    Authors: Fanchen Bu, Ruochen Yang, Paul Bogdan, Kijung Shin

    Abstract: Desirable random graph models (RGMs) should (i) be tractable so that we can compute and control graph statistics, and (ii) generate realistic structures such as high clustering (i.e., high subgraph densities). A popular category of RGMs (e.g., Erdos-Renyi and stochastic Kronecker) outputs edge probabilities, and we need to realize (i.e., sample from) the edge probabilities to generate graphs. Typi… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  44. arXiv:2405.16376  [pdf, other

    cs.CL cs.GT

    STRIDE: A Tool-Assisted LLM Agent Framework for Strategic and Interactive Decision-Making

    Authors: Chuanhao Li, Runhan Yang, Tiankai Li, Milad Bafarassat, Kourosh Sharifi, Dirk Bergemann, Zhuoran Yang

    Abstract: Large Language Models (LLMs) like GPT-4 have revolutionized natural language processing, showing remarkable linguistic proficiency and reasoning capabilities. However, their application in strategic multi-agent decision-making environments is hampered by significant limitations including poor mathematical reasoning, difficulty in following instructions, and a tendency to generate incorrect informa… ▽ More

    Submitted 27 May, 2024; v1 submitted 25 May, 2024; originally announced May 2024.

    Comments: 39 pages, 4 figures

  45. arXiv:2405.16278  [pdf, ps, other

    math.CV math.FA

    Generalized integration operators on analytic tent spaces

    Authors: Rong Yang, Lian Hu, Songxiao Li

    Abstract: In this paper, the boundedness and compactness of generalized integration operators $T_g^{n,k}$ between different analytic tent spaces in the unit disc are completely characterized.

    Submitted 25 May, 2024; originally announced May 2024.

  46. arXiv:2405.16209  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Analytical photoresponses of Schottky contact MoS2 phototransistors

    Authors: Jianyong Wei, Yumeng Liu, Yizhuo Wang, Kai Li, Zhentao Lian, Maosong Xie, Xinhan Yang, Seyed Saleh Mousavi Khaleghi, Fuxing Dai, Weida Hu, Xuejiao Gao, Rui Yang, Yaping Dan

    Abstract: High-gain photodetectors based on two-dimensional (2D) semiconductors, in particular those in photoconductive mode, have been extensively investigated in the past decade. However, the classical photoconductive theory was derived on two misplaced assumptions. In this work, we established an explicit analytical device model for Schottky contact MoS2 phototransistors that fits well with experimental… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 15 pages, 6 figures

  47. arXiv:2405.16030  [pdf, other

    cs.LG

    Constrained Ensemble Exploration for Unsupervised Skill Discovery

    Authors: Chenjia Bai, Rushuai Yang, Qiaosheng Zhang, Kang Xu, Yi Chen, Ting Xiao, Xuelong Li

    Abstract: Unsupervised Reinforcement Learning (RL) provides a promising paradigm for learning useful behaviors via reward-free per-training. Existing methods for unsupervised RL mainly conduct empowerment-driven skill discovery or entropy-based exploration. However, empowerment often leads to static skills, and pure exploration only maximizes the state coverage rather than learning useful behaviors. In this… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: Accepted by ICML 2024

  48. arXiv:2405.15385  [pdf, other

    cs.CV physics.med-ph

    CPT-Interp: Continuous sPatial and Temporal Motion Modeling for 4D Medical Image Interpolation

    Authors: Xia Li, Runzhao Yang, Xiangtai Li, Antony Lomax, Ye Zhang, Joachim Buhmann

    Abstract: Motion information from 4D medical imaging offers critical insights into dynamic changes in patient anatomy for clinical assessments and radiotherapy planning and, thereby, enhances the capabilities of 3D image analysis. However, inherent physical and technical constraints of imaging hardware often necessitate a compromise between temporal resolution and image quality. Frame interpolation emerges… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  49. arXiv:2405.11922  [pdf, other

    cs.SI cs.LG

    Effective Clustering on Large Attributed Bipartite Graphs

    Authors: Renchi Yang, Yidu Wu, Xiaoyang Lin, Qichen Wang, Tsz Nam Chan, Jieming Shi

    Abstract: Attributed bipartite graphs (ABGs) are an expressive data model for describing the interactions between two sets of heterogeneous nodes that are associated with rich attributes, such as customer-product purchase networks and author-paper authorship graphs. Partitioning the target node set in such graphs into k disjoint clusters (referred to as k-ABGC) finds widespread use in various domains, inclu… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: The technical report for the paper was accepted to KDD 2024. 14 pages

  50. arXiv:2405.11921  [pdf, other

    cs.CV

    MirrorGaussian: Reflecting 3D Gaussians for Reconstructing Mirror Reflections

    Authors: Jiayue Liu, Xiao Tang, Freeman Cheng, Roy Yang, Zhihao Li, Jianzhuang Liu, Yi Huang, Jiaqi Lin, Shiyong Liu, Xiaofei Wu, Songcen Xu, Chun Yuan

    Abstract: 3D Gaussian Splatting showcases notable advancements in photo-realistic and real-time novel view synthesis. However, it faces challenges in modeling mirror reflections, which exhibit substantial appearance variations from different viewpoints. To tackle this problem, we present MirrorGaussian, the first method for mirror scene reconstruction with real-time rendering based on 3D Gaussian Splatting.… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.