Skip to main content

Showing 1–50 of 553 results for author: Chen, E

  1. arXiv:2407.11505  [pdf, other

    cs.CV

    Haze-Aware Attention Network for Single-Image Dehazing

    Authors: Lihan Tong, Yun Liu, Weijia Li, Liyuan Chen, Erkang Chen

    Abstract: Single-image dehazing is a pivotal challenge in computer vision that seeks to remove haze from images and restore clean background details. Recognizing the limitations of traditional physical model-based methods and the inefficiencies of current attention-based solutions, we propose a new dehazing network combining an innovative Haze-Aware Attention Module (HAAM) with a Multiscale Frequency Enhanc… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 13 pages, 6 figures

    Report number: applsci-3022856 MSC Class: 68I1C; 68I8P ACM Class: I.4.3; I.4.9

  2. arXiv:2407.11311  [pdf, ps, other

    physics.flu-dyn cond-mat.soft nlin.CD physics.app-ph

    Harnessing an elastic flow instability to improve the kinetic performance of chromatographic columns

    Authors: Fabrice Gritti, Emily Y. Chen, Sujit S. Datta

    Abstract: Despite decades of research and development, the optimal efficiency of slurry-packed HPLC columns is still hindered by inherent long-range flow heterogeneity from the wall to the central bulk region of these columns. Here, we show an example of how this issue can be addressed through the straightforward addition of a semidilute amount (500~ppm) of a large, flexible, synthetic polymer (18~MDa parti… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

  3. arXiv:2407.09484  [pdf

    cs.HC cs.CY

    GPTutor: Great Personalized Tutor with Large Language Models for Personalized Learning Content Generation

    Authors: Eason Chen, Jia-En Lee, Jionghao Lin, Kenneth Koedinger

    Abstract: We developed GPTutor, a pioneering web application designed to revolutionize personalized learning by leveraging the capabilities of Generative AI at scale. GPTutor adapts educational content and practice exercises to align with individual students' interests and career goals, enhancing their engagement and understanding of critical academic concepts. The system uses a serverless architecture to d… ▽ More

    Submitted 16 May, 2024; originally announced July 2024.

  4. arXiv:2407.08967  [pdf, other

    cs.CL cs.AI

    Empowering Few-Shot Relation Extraction with The Integration of Traditional RE Methods and Large Language Models

    Authors: Ye Liu, Kai Zhang, Aoran Gan, Linan Yue, Feng Hu, Qi Liu, Enhong Chen

    Abstract: Few-Shot Relation Extraction (FSRE), a subtask of Relation Extraction (RE) that utilizes limited training instances, appeals to more researchers in Natural Language Processing (NLP) due to its capability to extract textual information in extremely low-resource scenarios. The primary methodologies employed for FSRE have been fine-tuning or prompt tuning techniques based on Pre-trained Language Mode… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  5. arXiv:2407.08952  [pdf, other

    cs.CL cs.AI

    Detect, Investigate, Judge and Determine: A Novel LLM-based Framework for Few-shot Fake News Detection

    Authors: Ye Liu, Jiajun Zhu, Kai Zhang, Haoyu Tang, Yanghai Zhang, Xukai Liu, Qi Liu, Enhong Chen

    Abstract: Few-Shot Fake News Detection (FS-FND) aims to distinguish inaccurate news from real ones in extremely low-resource scenarios. This task has garnered increased attention due to the widespread dissemination and harmful impact of fake news on social media. Large Language Models (LLMs) have demonstrated competitive performance with the help of their rich prior knowledge and excellent in-context learni… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  6. arXiv:2407.06645  [pdf, other

    cs.LG cs.CL

    Entropy Law: The Story Behind Data Compression and LLM Performance

    Authors: Mingjia Yin, Chuhan Wu, Yufei Wang, Hao Wang, Wei Guo, Yasheng Wang, Yong Liu, Ruiming Tang, Defu Lian, Enhong Chen

    Abstract: Data is the cornerstone of large language models (LLMs), but not all data is useful for model learning. Carefully selected data can better elicit the capabilities of LLMs with much less computational overhead. Most methods concentrate on evaluating the quality of individual samples in data selection, while the combinatorial effects among samples are neglected. Even if each sample is of perfect qua… ▽ More

    Submitted 10 July, 2024; v1 submitted 9 July, 2024; originally announced July 2024.

  7. arXiv:2407.05458  [pdf, other

    cs.AI

    A Survey of Models for Cognitive Diagnosis: New Developments and Future Directions

    Authors: Fei Wang, Weibo Gao, Qi Liu, Jiatong Li, Guanhao Zhao, Zheng Zhang, Zhenya Huang, Mengxiao Zhu, Shijin Wang, Wei Tong, Enhong Chen

    Abstract: Cognitive diagnosis has been developed for decades as an effective measurement tool to evaluate human cognitive status such as ability level and knowledge mastery. It has been applied to a wide range of fields including education, sport, psychological diagnosis, etc. By providing better awareness of cognitive status, it can serve as the basis for personalized services such as well-designed medical… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

  8. arXiv:2407.03125  [pdf, other

    cs.LG cs.AI

    Foundations and Frontiers of Graph Learning Theory

    Authors: Yu Huang, Min Zhou, Menglin Yang, Zhen Wang, Muhan Zhang, Jie Wang, Hong Xie, Hao Wang, Defu Lian, Enhong Chen

    Abstract: Recent advancements in graph learning have revolutionized the way to understand and analyze data with complex structures. Notably, Graph Neural Networks (GNNs), i.e. neural network architectures designed for learning graph representations, have become a popular paradigm. With these models being usually characterized by intuition-driven design or highly intricate components, placing them within the… ▽ More

    Submitted 7 July, 2024; v1 submitted 3 July, 2024; originally announced July 2024.

    Comments: 35pages,273references. Github link: https://github.com/minehly/awesome-paper-for-graph-learning-theory

  9. arXiv:2407.01770  [pdf, other

    stat.ME

    Exploring causal effects of hormone- and radio-treatments in an observational study of breast cancer using copula-based semi-competing risks models

    Authors: Tonghui Yu, Mengjiao Peng, Yifan Cui, Elynn Chen, Chixiang Chen

    Abstract: Breast cancer patients may experience relapse or death after surgery during the follow-up period, leading to dependent censoring of relapse. This phenomenon, known as semi-competing risk, imposes challenges in analyzing treatment effects on breast cancer and necessitates advanced statistical tools for unbiased analysis. Despite progress in estimation and inference within semi-competing risks regre… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Contact: chixiang.chen@som.umaryland.edu

  10. arXiv:2407.00778  [pdf, ps, other

    physics.flu-dyn cond-mat.mtrl-sci cond-mat.soft nlin.CD physics.app-ph

    Influence of fluid rheology on multistability in the unstable flow of polymer solutions through pore constriction arrays

    Authors: Emily Y. Chen, Sujit S. Datta

    Abstract: Diverse chemical, energy, environmental, and industrial processes involve the flow of polymer solutions in porous media. The accumulation and dissipation of elastic stresses as the polymers are transported through the tortuous, confined pore space can lead to the development of an elastic flow instability above a threshold flow rate. This flow instability can generate complex flows with strong spa… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  11. arXiv:2407.00561  [pdf, ps, other

    stat.ME stat.AP

    Advancing Information Integration through Empirical Likelihood: Selective Reviews and a New Idea

    Authors: Chixiang Chen, Jia Liang, Elynn Chen, Ming Wang

    Abstract: Information integration plays a pivotal role in biomedical studies by facilitating the combination and analysis of independent datasets from multiple studies, thereby uncovering valuable insights that might otherwise remain obscured due to the limited sample size in individual studies. However, sharing raw data from independent studies presents significant challenges, primarily due to the need to… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  12. arXiv:2406.19622  [pdf, other

    cs.LG cs.AI

    Data-Driven Lipschitz Continuity: A Cost-Effective Approach to Improve Adversarial Robustness

    Authors: Erh-Chung Chen, Pin-Yu Chen, I-Hsin Chung, Che-Rung Lee

    Abstract: The security and robustness of deep neural networks (DNNs) have become increasingly concerning. This paper aims to provide both a theoretical foundation and a practical solution to ensure the reliability of DNNs. We explore the concept of Lipschitz continuity to certify the robustness of DNNs against adversarial attacks, which aim to mislead the network with adding imperceptible perturbations into… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  13. arXiv:2406.14979  [pdf, other

    cs.CL

    Retrieve-Plan-Generation: An Iterative Planning and Answering Framework for Knowledge-Intensive LLM Generation

    Authors: Yuanjie Lyu, Zihan Niu, Zheyong Xie, Chao Zhang, Tong Xu, Yang Wang, Enhong Chen

    Abstract: Despite the significant progress of large language models (LLMs) in various tasks, they often produce factual errors due to their limited internal knowledge. Retrieval-Augmented Generation (RAG), which enhances LLMs with external knowledge sources, offers a promising solution. However, these methods can be misled by irrelevant paragraphs in retrieved documents. Due to the inherent uncertainty in L… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  14. arXiv:2406.13618  [pdf, other

    cs.CL

    In-Context Former: Lightning-fast Compressing Context for Large Language Model

    Authors: Xiangfeng Wang, Zaiyi Chen, Zheyong Xie, Tong Xu, Yongyi He, Enhong Chen

    Abstract: With the rising popularity of Transformer-based large language models (LLMs), reducing their high inference costs has become a significant research focus. One effective approach is to compress the long input contexts. Existing methods typically leverage the self-attention mechanism of the LLM itself for context compression. While these methods have achieved notable results, the compression process… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  15. arXiv:2406.12020  [pdf, other

    cs.IR cs.AI

    When Box Meets Graph Neural Network in Tag-aware Recommendation

    Authors: Fake Lin, Ziwei Zhao, Xi Zhu, Da Zhang, Shitian Shen, Xueying Li, Tong Xu, Suojuan Zhang, Enhong Chen

    Abstract: Last year has witnessed the re-flourishment of tag-aware recommender systems supported by the LLM-enriched tags. Unfortunately, though large efforts have been made, current solutions may fail to describe the diversity and uncertainty inherent in user preferences with only tag-driven profiles. Recently, with the development of geometry-based techniques, e.g., box embedding, diversity of user prefer… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  16. arXiv:2406.11094  [pdf, other

    math.HO

    Report on the 12th Annual USA Junior Mathematical Olympiad

    Authors: Bela Bajnok, Evan Chen

    Abstract: We present the problems and solutions to the 12th Annual USA Junior Mathematical Olympiad.

    Submitted 28 April, 2024; originally announced June 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2406.09518

    MSC Class: 00

    Journal ref: College Journal of Mathematics, v. 53, no. 1, 2022, pp.13-20

  17. arXiv:2406.09517  [pdf, other

    math.HO

    Report on the 61st Annual International Mathematical Olympiad

    Authors: Bela Bajnok, Evan Chen

    Abstract: We present the problems and solutions to the 61st Annual International Mathematical Olympiad

    Submitted 28 April, 2024; originally announced June 2024.

    MSC Class: 00

    Journal ref: Mathematics Magazine, v. 94, no. 3, 2021, pp 215-224

  18. arXiv:2406.08698  [pdf, other

    astro-ph.HE hep-ph

    Constraints on Ultra Heavy Dark Matter Properties from Dwarf Spheroidal Galaxies with LHAASO Observations

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

    Abstract: In this work we try to search for signals generated by ultra-heavy dark matter at the Large High Altitude Air Shower Observatory (LHAASO) data. We look for possible gamma-ray by dark matter annihilation or decay from 16 dwarf spheroidal galaxies in the field of view of LHAASO. Dwarf spheroidal galaxies are among the most promising targets for indirect detection of dark matter which have low fluxes… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 17 pages, 12 figures, accepted by PRL

  19. arXiv:2406.08358  [pdf, other

    cs.CV cs.AI

    From a Social Cognitive Perspective: Context-aware Visual Social Relationship Recognition

    Authors: Shiwei Wu, Chao Zhang, Joya Chen, Tong Xu, Likang Wu, Yao Hu, Enhong Chen

    Abstract: People's social relationships are often manifested through their surroundings, with certain objects or interactions acting as symbols for specific relationships, e.g., wedding rings, roses, hugs, or holding hands. This brings unique challenges to recognizing social relationships, requiring understanding and capturing the essence of these contexts from visual appearances. However, current methods o… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  20. arXiv:2406.03085  [pdf, other

    cs.LG cs.IR

    Exploring User Retrieval Integration towards Large Language Models for Cross-Domain Sequential Recommendation

    Authors: Tingjia Shen, Hao Wang, Jiaqing Zhang, Sirui Zhao, Liangyue Li, Zulong Chen, Defu Lian, Enhong Chen

    Abstract: Cross-Domain Sequential Recommendation (CDSR) aims to mine and transfer users' sequential preferences across different domains to alleviate the long-standing cold-start issue. Traditional CDSR models capture collaborative information through user and item modeling while overlooking valuable semantic information. Recently, Large Language Model (LLM) has demonstrated powerful semantic reasoning capa… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 10 pages, 5 figures

    ACM Class: I.2.7

  21. arXiv:2406.01276  [pdf, other

    cs.CL

    EduNLP: Towards a Unified and Modularized Library for Educational Resources

    Authors: Zhenya Huang, Yuting Ning, Longhu Qin, Shiwei Tong, Shangzi Xue, Tong Xiao, Xin Lin, Jiayu Liu, Qi Liu, Enhong Chen, Shijing Wang

    Abstract: Educational resource understanding is vital to online learning platforms, which have demonstrated growing applications recently. However, researchers and developers always struggle with using existing general natural language toolkits or domain-specific models. The issue raises a need to develop an effective and easy-to-use one that benefits AI education-related research and applications. To bridg… ▽ More

    Submitted 4 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  22. arXiv:2405.21075  [pdf, other

    cs.CV cs.CL

    Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

    Authors: Chaoyou Fu, Yuhan Dai, Yongdong Luo, Lei Li, Shuhuai Ren, Renrui Zhang, Zihan Wang, Chenyu Zhou, Yunhang Shen, Mengdan Zhang, Peixian Chen, Yanwei Li, Shaohui Lin, Sirui Zhao, Ke Li, Tong Xu, Xiawu Zheng, Enhong Chen, Rongrong Ji, Xing Sun

    Abstract: In the quest for artificial general intelligence, Multi-modal Large Language Models (MLLMs) have emerged as a focal point in recent advancements. However, the predominant focus remains on developing their capabilities in static image understanding. The potential of MLLMs in processing sequential visual data is still insufficiently explored, highlighting the absence of a comprehensive, high-quality… ▽ More

    Submitted 16 June, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

    Comments: Project Page: https://video-mme.github.io

  23. arXiv:2405.18708  [pdf, other

    cs.AI cs.IR cs.NE

    Cognitive Evolutionary Learning to Select Feature Interactions for Recommender Systems

    Authors: Runlong Yu, Qixiang Shao, Qi Liu, Huan Liu, Enhong Chen

    Abstract: Feature interaction selection is a fundamental problem in commercial recommender systems. Most approaches equally enumerate all features and interactions by the same pre-defined operation under expert guidance. Their recommendation is unsatisfactory sometimes due to the following issues: (1)~They cannot ensure the learning abilities of models because their architectures are poorly adaptable to tas… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  24. arXiv:2405.18231  [pdf, ps, other

    math.NT math.AG

    Relative Langlands Duality of Toric Periods

    Authors: Eric Y. Chen

    Abstract: The relative Langlands program introduced by Ben-Zvi--Sakellaridis--Venkatesh posits a duality structure exchanging automorphic periods and L-functions, which can be encoded by pairs of dual Hamiltonian actions. In work of the author and Venkatesh, an extension of the definitions to certain singular spaces was made with the objective of restoring duality in some well-known automorphic integrals. I… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  25. arXiv:2405.18212  [pdf, ps, other

    math.NT math.AG

    Some Singular Examples of Relative Langlands Duality

    Authors: Eric Y. Chen, Akshay Venkatesh

    Abstract: Relative Langlands duality structures the study of automorphic periods around a putative duality between certain group actions of Langlands dual reductive groups. In this article, after giving a self-contained exposition of the relevant ingredients from relative Langlands duality, we examine this proposal for some interesting pairs of singular spaces: one pair arising from the cone of nilpotent… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  26. arXiv:2405.17795  [pdf, other

    cs.IR

    Dataset Regeneration for Sequential Recommendation

    Authors: Mingjia Yin, Hao Wang, Wei Guo, Yong Liu, Suojuan Zhang, Sirui Zhao, Defu Lian, Enhong Chen

    Abstract: The sequential recommender (SR) system is a crucial component of modern recommender systems, as it aims to capture the evolving preferences of users. Significant efforts have been made to enhance the capabilities of SR systems. These methods typically follow the model-centric paradigm, which involves developing effective models based on fixed datasets. However, this approach often overlooks potent… ▽ More

    Submitted 3 June, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

  27. arXiv:2405.17744  [pdf, other

    stat.ME

    Factor Augmented Matrix Regression

    Authors: Elynn Chen, Jianqing Fan, Xiaonan Zhu

    Abstract: We introduce \underline{F}actor-\underline{A}ugmented \underline{Ma}trix \underline{R}egression (FAMAR) to address the growing applications of matrix-variate data and their associated challenges, particularly with high-dimensionality and covariate correlations. FAMAR encompasses two key algorithms. The first is a novel non-iterative approach that efficiently estimates the factors and loadings of t… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  28. arXiv:2405.16789  [pdf, other

    cs.IR

    NoteLLM-2: Multimodal Large Representation Models for Recommendation

    Authors: Chao Zhang, Haoxin Zhang, Shiwei Wu, Di Wu, Tong Xu, Yan Gao, Yao Hu, Enhong Chen

    Abstract: Large Language Models (LLMs) have demonstrated exceptional text understanding. Existing works explore their application in text embedding tasks. However, there are few works utilizing LLMs to assist multimodal representation tasks. In this work, we investigate the potential of LLMs to enhance multimodal representation in multimodal item-to-item (I2I) recommendations. One feasible method is the tra… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: 19 pages, 5 figures

  29. arXiv:2405.15212  [pdf, ps, other

    math.DS

    Packing topological pressure for amenable group actions

    Authors: Ziqing Ding, Ercai Chen, Xiaoyao Zhou

    Abstract: In this paper, we first prove the variational principle for amenable packing topological pressure. Then we obtain an inequality concerning amenable packing pressure for factor maps. Finally, we show that the equality about packing topological pressure of the set of generic points when the system satisfies the almost specification property, or $μ$ is ergodic.

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 27 pages

  30. arXiv:2405.12473  [pdf, other

    cs.IR cs.AI

    Learning Partially Aligned Item Representation for Cross-Domain Sequential Recommendation

    Authors: Mingjia Yin, Hao Wang, Wei Guo, Yong Liu, Zhi Li, Sirui Zhao, Defu Lian, Enhong Chen

    Abstract: Cross-domain sequential recommendation (CDSR) aims to uncover and transfer users' sequential preferences across multiple recommendation domains. While significant endeavors have been made, they primarily concentrated on developing advanced transfer modules and aligning user representations using self-supervised learning techniques. However, the problem of aligning item representations has received… ▽ More

    Submitted 3 June, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

  31. arXiv:2405.11826  [pdf, other

    astro-ph.IM hep-ex physics.ins-det

    Data quality control system and long-term performance monitor of the LHAASO-KM2A

    Authors: Zhen Cao, F. Aharonian, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, W. Bian, A. V. Bukevich, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, H. X. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. Chen , et al. (263 additional authors not shown)

    Abstract: The KM2A is the largest sub-array of the Large High Altitude Air Shower Observatory (LHAASO). It consists of 5216 electromagnetic particle detectors (EDs) and 1188 muon detectors (MDs). The data recorded by the EDs and MDs are used to reconstruct primary information of cosmic ray and gamma-ray showers. This information is used for physical analysis in gamma-ray astronomy and cosmic ray physics. To… ▽ More

    Submitted 13 June, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: 15 pages, 9 figures

  32. arXiv:2405.11681  [pdf, other

    stat.ME math.ST

    Distributed Tensor Principal Component Analysis

    Authors: Elynn Chen, Xi Chen, Wenbo Jing, Yichen Zhang

    Abstract: As tensors become widespread in modern data analysis, Tucker low-rank Principal Component Analysis (PCA) has become essential for dimensionality reduction and structural discovery in tensor datasets. Motivated by the common scenario where large-scale tensors are distributed across diverse geographic locations, this paper investigates tensor PCA within a distributed framework where direct data pool… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

  33. arXiv:2405.11531  [pdf, other

    cs.IR cs.AI

    Knowledge Graph Pruning for Recommendation

    Authors: Fake Lin, Xi Zhu, Ziwei Zhao, Deqiang Huang, Yu Yu, Xueying Li, Zhi Zheng, Tong Xu, Enhong Chen

    Abstract: Recent years have witnessed the prosperity of knowledge graph based recommendation system (KGRS), which enriches the representation of users, items, and entities by structural knowledge with striking improvement. Nevertheless, its unaffordable computational cost still limits researchers from exploring more sophisticated models. We observe that the bottleneck for training efficiency arises from the… ▽ More

    Submitted 9 July, 2024; v1 submitted 19 May, 2024; originally announced May 2024.

  34. arXiv:2405.08193  [pdf

    cond-mat.mtrl-sci

    Topological grain boundary segregation transitions

    Authors: Vivek Devulapalli, Enze Chen, Tobias Brink, Timofey Frolov, Christian H. Liebscher

    Abstract: Engineering structure of grain boundaries (GBs) by solute segregation is a promising strategy to tailor the properties of polycrystalline materials. Theoretically it has been suggested that solute segregation can trigger phase transitions at GBs offering novel pathways to design interfaces. However, an understanding of their intrinsic atomistic nature is missing. Here, we combine atomic resolution… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 23 pages, 13 figures

  35. arXiv:2405.07691  [pdf, other

    astro-ph.HE

    Discovery of Very-high-energy Gamma-ray Emissions from the Low Luminosity AGN NGC 4278 by LHAASO

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

    Abstract: The first source catalog of Large High Altitude Air Shower Observatory reported the detection of a very-high-energy gamma ray source, 1LHAASO J1219+2915. In this paper a further detailed study of the spectral and temporal behavior of this point-like source have been carried. The best-fit position of the TeV source ($\rm{RA}=185.05^{\circ}\pm0.04^{\circ}$, $\rm{Dec}=29.25^{\circ}\pm0.03^{\circ}$) i… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 11 pages, 5 figures

  36. arXiv:2405.07580  [pdf, other

    cs.IR cs.AI

    DynLLM: When Large Language Models Meet Dynamic Graph Recommendation

    Authors: Ziwei Zhao, Fake Lin, Xi Zhu, Zhi Zheng, Tong Xu, Shitian Shen, Xueying Li, Zikai Yin, Enhong Chen

    Abstract: Last year has witnessed the considerable interest of Large Language Models (LLMs) for their potential applications in recommender systems, which may mitigate the persistent issue of data sparsity. Though large efforts have been made for user-item graph augmentation with better graph-based recommendation performance, they may fail to deal with the dynamic graph recommendation task, which involves b… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 11 pages, 5 figures

  37. arXiv:2405.07518  [pdf, other

    cs.AR cs.AI

    SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts

    Authors: Raghu Prabhakar, Ram Sivaramakrishnan, Darshan Gandhi, Yun Du, Mingran Wang, Xiangyu Song, Kejie Zhang, Tianren Gao, Angela Wang, Karen Li, Yongning Sheng, Joshua Brot, Denis Sokolov, Apurv Vivek, Calvin Leung, Arjun Sabnis, Jiayu Bai, Tuowen Zhao, Mark Gottscho, David Jackson, Mark Luttrell, Manish K. Shah, Edison Chen, Kaizhao Liang, Swayambhoo Jain , et al. (5 additional authors not shown)

    Abstract: Monolithic large language models (LLMs) like GPT-4 have paved the way for modern generative AI applications. Training, serving, and maintaining monolithic LLMs at scale, however, remains prohibitively expensive and challenging. The disproportionate increase in compute-to-memory ratio of modern AI accelerators have created a memory wall, necessitating new methods to deploy AI. Composition of Expert… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  38. arXiv:2405.06866  [pdf, other

    stat.ME

    Dynamic Contextual Pricing with Doubly Non-Parametric Random Utility Models

    Authors: Elynn Chen, Xi Chen, Lan Gao, Jiayu Li

    Abstract: In the evolving landscape of digital commerce, adaptive dynamic pricing strategies are essential for gaining a competitive edge. This paper introduces novel {\em doubly nonparametric random utility models} that eschew traditional parametric assumptions used in estimating consumer demand's mean utility function and noise distribution. Existing nonparametric methods like multi-scale {\em Distributio… ▽ More

    Submitted 10 June, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

  39. arXiv:2405.06232  [pdf, other

    cs.AI

    Learning to Solve Geometry Problems via Simulating Human Dual-Reasoning Process

    Authors: Tong Xiao, Jiayu Liu, Zhenya Huang, Jinze Wu, Jing Sha, Shijin Wang, Enhong Chen

    Abstract: Geometry Problem Solving (GPS), which is a classic and challenging math problem, has attracted much attention in recent years. It requires a solver to comprehensively understand both text and diagram, master essential geometry knowledge, and appropriately apply it in reasoning. However, existing works follow a paradigm of neural machine translation and only focus on enhancing the capability of enc… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: IJCAI 2024 Accepted

  40. arXiv:2405.05811  [pdf, other

    cs.CV

    Parallel Cross Strip Attention Network for Single Image Dehazing

    Authors: Lihan Tong, Yun Liu, Tian Ye, Weijia Li, Liyuan Chen, Erkang Chen

    Abstract: The objective of single image dehazing is to restore hazy images and produce clear, high-quality visuals. Traditional convolutional models struggle with long-range dependencies due to their limited receptive field size. While Transformers excel at capturing such dependencies, their quadratic computational complexity in relation to feature map resolution makes them less suitable for pixel-to-pixel… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: 10 pages , 4 figures, CTISC'24

    Report number: C052

  41. arXiv:2405.04763  [pdf

    quant-ph physics.ins-det

    Room-temperature photonic quantum computing in integrated silicon photonics with germanium-silicon single-photon avalanche diodes

    Authors: Neil Na, Chou-Yun Hsu, Erik Chen, Richard Soref

    Abstract: Most, if not all, photonic quantum computing (PQC) relies upon superconducting nanowire single-photon detectors (SNSPDs) based on Nb operated at a temperature < 4 K. This paper proposes and analyzes 300 K Si-waveguide-integrated GeSi single-photon avalanche diodes (SPADs) based on the recently demonstrated normal-incidence GeSi SPADs operated at room temperature, and shows that their performance i… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  42. arXiv:2405.03418  [pdf, ps, other

    quant-ph gr-qc physics.hist-ph

    The Decoherent Arrow of Time and the Entanglement Past Hypothesis

    Authors: Jim Al-Khalili, Eddy Keming Chen

    Abstract: If an asymmetry in time does not arise from the fundamental dynamical laws of physics, it may be found in special boundary conditions. The argument normally goes that since thermodynamic entropy in the past is lower than in the future according to the Second Law of Thermodynamics, then tracing this back to the time around the Big Bang means the universe must have started off in a state of very low… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 10 pages, no figures

  43. arXiv:2405.02287  [pdf, other

    cs.CL cs.AI cs.CV

    Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models

    Authors: Piotr Padlewski, Max Bain, Matthew Henderson, Zhongkai Zhu, Nishant Relan, Hai Pham, Donovan Ong, Kaloyan Aleksiev, Aitor Ormazabal, Samuel Phua, Ethan Yeo, Eugenie Lamprecht, Qi Liu, Yuqi Wang, Eric Chen, Deyu Fu, Lei Li, Che Zheng, Cyprien de Masson d'Autume, Dani Yogatama, Mikel Artetxe, Yi Tay

    Abstract: We introduce Vibe-Eval: a new open benchmark and framework for evaluating multimodal chat models. Vibe-Eval consists of 269 visual understanding prompts, including 100 of hard difficulty, complete with gold-standard responses authored by experts. Vibe-Eval is open-ended and challenging with dual objectives: (i) vibe checking multimodal chat models for day-to-day tasks and (ii) rigorously testing a… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  44. arXiv:2405.01025  [pdf, ps, other

    quant-ph cond-mat.stat-mech physics.hist-ph

    Density Matrix Realism

    Authors: Eddy Keming Chen

    Abstract: Realism about quantum theory naturally leads to realism about the quantum state of the universe. It leaves open whether it is a pure state represented by a wave function, or an impure one represented by a density matrix. I characterize and elaborate on Density Matrix Realism, the thesis that the universal quantum state is objective but can be impure. To clarify the thesis, I compare it with Wave F… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: 22 pages

  45. How Can I Improve? Using GPT to Highlight the Desired and Undesired Parts of Open-ended Responses

    Authors: Jionghao Lin, Eason Chen, Zeifei Han, Ashish Gurung, Danielle R. Thomas, Wei Tan, Ngoc Dang Nguyen, Kenneth R. Koedinger

    Abstract: Automated explanatory feedback systems play a crucial role in facilitating learning for a large cohort of learners by offering feedback that incorporates explanations, significantly enhancing the learning process. However, delivering such explanatory feedback in real-time poses challenges, particularly when high classification accuracy for domain-specific, nuanced responses is essential. Our study… ▽ More

    Submitted 30 April, 2024; originally announced May 2024.

    Comments: 11 pages, full research paper, EDM 2024

    Journal ref: A&A 687, A227 (2024)

  46. arXiv:2404.16587  [pdf, other

    cs.CL cs.AI

    Understanding Privacy Risks of Embeddings Induced by Large Language Models

    Authors: Zhihao Zhu, Ninglu Shao, Defu Lian, Chenwang Wu, Zheng Liu, Yi Yang, Enhong Chen

    Abstract: Large language models (LLMs) show early signs of artificial general intelligence but struggle with hallucinations. One promising solution to mitigate these hallucinations is to store external knowledge as embeddings, aiding LLMs in retrieval-augmented generation. However, such a solution risks compromising privacy, as recent studies experimentally showed that the original text can be partially rec… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

  47. arXiv:2404.15881  [pdf, other

    cs.CV cs.AI

    Steal Now and Attack Later: Evaluating Robustness of Object Detection against Black-box Adversarial Attacks

    Authors: Erh-Chung Chen, Pin-Yu Chen, I-Hsin Chung, Che-Rung Lee

    Abstract: Latency attacks against object detection represent a variant of adversarial attacks that aim to inflate the inference time by generating additional ghost objects in a target image. However, generating ghost objects in the black-box scenario remains a challenge since information about these unqualified objects remains opaque. In this study, we demonstrate the feasibility of generating ghost objects… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  48. arXiv:2404.15209  [pdf, other

    cs.LG stat.ME stat.ML

    Data-Driven Knowledge Transfer in Batch $Q^*$ Learning

    Authors: Elynn Chen, Xi Chen, Wenbo Jing

    Abstract: In data-driven decision-making in marketing, healthcare, and education, it is desirable to utilize a large amount of data from existing ventures to navigate high-dimensional feature spaces and address data scarcity in new ventures. We explore knowledge transfer in dynamic decision-making by concentrating on batch stationary environments and formally defining task discrepancies through the lens of… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

  49. arXiv:2404.12387  [pdf, other

    cs.CL cs.CV

    Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models

    Authors: Reka Team, Aitor Ormazabal, Che Zheng, Cyprien de Masson d'Autume, Dani Yogatama, Deyu Fu, Donovan Ong, Eric Chen, Eugenie Lamprecht, Hai Pham, Isaac Ong, Kaloyan Aleksiev, Lei Li, Matthew Henderson, Max Bain, Mikel Artetxe, Nishant Relan, Piotr Padlewski, Qi Liu, Ren Chen, Samuel Phua, Yazheng Yang, Yi Tay, Yuqi Wang, Zhongkai Zhu , et al. (1 additional authors not shown)

    Abstract: We introduce Reka Core, Flash, and Edge, a series of powerful multimodal language models trained from scratch by Reka. Reka models are able to process and reason with text, images, video, and audio inputs. This technical report discusses details of training some of these models and provides comprehensive evaluation results. We show that Reka Edge and Reka Flash are not only state-of-the-art but al… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  50. arXiv:2404.07456  [pdf, other

    cs.AI cs.MA

    WESE: Weak Exploration to Strong Exploitation for LLM Agents

    Authors: Xu Huang, Weiwen Liu, Xiaolong Chen, Xingmei Wang, Defu Lian, Yasheng Wang, Ruiming Tang, Enhong Chen

    Abstract: Recently, large language models (LLMs) have demonstrated remarkable potential as an intelligent agent. However, existing researches mainly focus on enhancing the agent's reasoning or decision-making abilities through well-designed prompt engineering or task-specific fine-tuning, ignoring the procedure of exploration and exploitation. When addressing complex tasks within open-world interactive envi… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.