Skip to main content

Showing 51–100 of 412 results for author: Wei, C

  1. arXiv:2401.07475  [pdf, other

    cs.CL

    GWPT: A Green Word-Embedding-based POS Tagger

    Authors: Chengwei Wei, Runqi Pang, C. -C. Jay Kuo

    Abstract: As a fundamental tool for natural language processing (NLP), the part-of-speech (POS) tagger assigns the POS label to each word in a sentence. A novel lightweight POS tagger based on word embeddings is proposed and named GWPT (green word-embedding-based POS tagger) in this work. Following the green learning (GL) methodology, GWPT contains three modules in cascade: 1) representation learning, 2) fe… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

  2. arXiv:2312.16660  [pdf, other

    cond-mat.str-el cond-mat.stat-mech nlin.SI

    Unveiling chiral states in the XXZ chain: Finite-size scaling probing symmetry-enriched $c=1$ conformal field theories

    Authors: Chenan Wei, Vagharsh V. Mkhitaryan, Tigran A. Sedrakyan

    Abstract: We study the low-energy properties of the one-dimensional spin-1/2 XXZ chain with time-reversal symmetry-breaking pseudo-scalar chiral interaction and propose a phase diagram for the model. In the integrable case of the isotropic Heisenberg model with the chiral interaction, we employ the thermodynamic Bethe ansatz to find "chiralization", the response of the ground state versus the strength of th… ▽ More

    Submitted 20 June, 2024; v1 submitted 27 December, 2023; originally announced December 2023.

    Comments: 33 pages, 8 figures

    Journal ref: J. High Energ. Phys. 2024, 125 (2024)

  3. arXiv:2312.16430  [pdf, other

    cs.LG cs.AI

    Preference as Reward, Maximum Preference Optimization with Importance Sampling

    Authors: Zaifan Jiang, Xing Huang, Chao Wei

    Abstract: Preference learning is a key technology for aligning language models with human values. Reinforcement Learning from Human Feedback (RLHF) is a model-based algorithm to optimize preference learning, which first fits a reward model for preference scores and then optimizes the generating policy with an on-policy PPO algorithm to maximize the reward. The processing of RLHF is complex, time-consuming,… ▽ More

    Submitted 25 March, 2024; v1 submitted 27 December, 2023; originally announced December 2023.

  4. arXiv:2312.16374  [pdf, other

    cs.CL cs.AI

    LLM Factoscope: Uncovering LLMs' Factual Discernment through Inner States Analysis

    Authors: Jinwen He, Yujia Gong, Kai Chen, Zijin Lin, Chengan Wei, Yue Zhao

    Abstract: Large Language Models (LLMs) have revolutionized various domains with extensive knowledge and creative capabilities. However, a critical issue with LLMs is their tendency to produce outputs that diverge from factual reality. This phenomenon is particularly concerning in sensitive applications such as medical consultation and legal advice, where accuracy is paramount. In this paper, we introduce th… ▽ More

    Submitted 29 December, 2023; v1 submitted 26 December, 2023; originally announced December 2023.

  5. arXiv:2312.14867  [pdf, other

    cs.CV cs.AI cs.CL cs.MM

    VIEScore: Towards Explainable Metrics for Conditional Image Synthesis Evaluation

    Authors: Max Ku, Dongfu Jiang, Cong Wei, Xiang Yue, Wenhu Chen

    Abstract: In the rapidly advancing field of conditional image generation research, challenges such as limited explainability lie in effectively evaluating the performance and capabilities of various models. This paper introduces VIEScore, a Visual Instruction-guided Explainable metric for evaluating any conditional image generation tasks. VIEScore leverages general knowledge from Multimodal Large Language M… ▽ More

    Submitted 3 June, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

    Comments: Accepted to ACL2024 main

  6. arXiv:2312.11420  [pdf, other

    cs.CL cs.AI cs.CV

    Tuning LayerNorm in Attention: Towards Efficient Multi-Modal LLM Finetuning

    Authors: Bingchen Zhao, Haoqin Tu, Chen Wei, Jieru Mei, Cihang Xie

    Abstract: This paper introduces an efficient strategy to transform Large Language Models (LLMs) into Multi-Modal Large Language Models (MLLMs). By conceptualizing this transformation as a domain adaptation process, i.e., transitioning from text understanding to embracing multiple modalities, we intriguingly note that, within each attention block, tuning LayerNorm suffices to yield strong performance. Moreov… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: The first two authors contributed equally

  7. arXiv:2312.04547  [pdf, other

    cs.CV cs.AI cs.GR cs.HC

    Digital Life Project: Autonomous 3D Characters with Social Intelligence

    Authors: Zhongang Cai, Jianping Jiang, Zhongfei Qing, Xinying Guo, Mingyuan Zhang, Zhengyu Lin, Haiyi Mei, Chen Wei, Ruisi Wang, Wanqi Yin, Xiangyu Fan, Han Du, Liang Pan, Peng Gao, Zhitao Yang, Yang Gao, Jiaqi Li, Tianxiang Ren, Yukun Wei, Xiaogang Wang, Chen Change Loy, Lei Yang, Ziwei Liu

    Abstract: In this work, we present Digital Life Project, a framework utilizing language as the universal medium to build autonomous 3D characters, who are capable of engaging in social interactions and expressing with articulated body motions, thereby simulating life in a digital environment. Our framework comprises two primary components: 1) SocioMind: a meticulously crafted digital brain that models perso… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: Homepage: https://digital-life-project.com/

  8. arXiv:2311.18320  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci

    BN-embedded monolayer graphene with tunable electronic and topological properties

    Authors: Chih-Piao Chuu, Wei-En Tseng, Kuan-Hung Liu, Ching-Ming Wei, Mei-Yin Chou

    Abstract: Finding an effective and controllable way to create a sizable energy gap in graphene-based systems has been a challenging topic of intensive research. We propose that the hybrid of boron nitride and graphene (h-BNC) at low BN doping serves as an ideal platform for band-gap engineering and valleytronic applications. We report a systematic first-principles study of the atomic configurations and band… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

  9. arXiv:2311.17136  [pdf, other

    cs.CV cs.AI cs.CL cs.IR

    UniIR: Training and Benchmarking Universal Multimodal Information Retrievers

    Authors: Cong Wei, Yang Chen, Haonan Chen, Hexiang Hu, Ge Zhang, Jie Fu, Alan Ritter, Wenhu Chen

    Abstract: Existing information retrieval (IR) models often assume a homogeneous format, limiting their applicability to diverse user needs, such as searching for images with text descriptions, searching for a news article with a headline image, or finding a similar photo with a query image. To approach such different information-seeking demands, we introduce UniIR, a unified instruction-guided multimodal re… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: Our code and dataset are available on this project page: https://tiger-ai-lab.github.io/UniIR/

  10. arXiv:2311.16502  [pdf, other

    cs.CL cs.AI cs.CV

    MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI

    Authors: Xiang Yue, Yuansheng Ni, Kai Zhang, Tianyu Zheng, Ruoqi Liu, Ge Zhang, Samuel Stevens, Dongfu Jiang, Weiming Ren, Yuxuan Sun, Cong Wei, Botao Yu, Ruibin Yuan, Renliang Sun, Ming Yin, Boyuan Zheng, Zhenzhu Yang, Yibo Liu, Wenhao Huang, Huan Sun, Yu Su, Wenhu Chen

    Abstract: We introduce MMMU: a new benchmark designed to evaluate multimodal models on massive multi-discipline tasks demanding college-level subject knowledge and deliberate reasoning. MMMU includes 11.5K meticulously collected multimodal questions from college exams, quizzes, and textbooks, covering six core disciplines: Art & Design, Business, Science, Health & Medicine, Humanities & Social Science, and… ▽ More

    Submitted 13 June, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: CVPR 2024 Oral

  11. arXiv:2311.15551  [pdf, other

    cs.CV cs.AI cs.CR cs.LG eess.IV

    Instruct2Attack: Language-Guided Semantic Adversarial Attacks

    Authors: Jiang Liu, Chen Wei, Yuxiang Guo, Heng Yu, Alan Yuille, Soheil Feizi, Chun Pong Lau, Rama Chellappa

    Abstract: We propose Instruct2Attack (I2A), a language-guided semantic attack that generates semantically meaningful perturbations according to free-form language instructions. We make use of state-of-the-art latent diffusion models, where we adversarially guide the reverse diffusion process to search for an adversarial latent code conditioned on the input image and text instruction. Compared to existing no… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: under submission, code coming soon

  12. arXiv:2311.12666  [pdf, other

    cs.LG eess.SP

    SSVEP-DAN: A Data Alignment Network for SSVEP-based Brain Computer Interfaces

    Authors: Sung-Yu Chen, Chi-Min Chang, Kuan-Jung Chiang, Chun-Shu Wei

    Abstract: Steady-state visual-evoked potential (SSVEP)-based brain-computer interfaces (BCIs) offer a non-invasive means of communication through high-speed speller systems. However, their efficiency heavily relies on individual training data obtained during time-consuming calibration sessions. To address the challenge of data insufficiency in SSVEP-based BCIs, we present SSVEP-DAN, the first dedicated neur… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  13. arXiv:2311.08994  [pdf

    cond-mat.mtrl-sci

    Thickness dependent mechanical properties of soft ferromagnetic two-dimensional CoTe2

    Authors: Surbhi Slathia, Cencen Wei, Manoj Tripathi, Raphael Tromer, Solomon Demiss Negedu, Conor Boland, Suman Sarkar, Douglas S. Galvao, Alan Dalton, Chandra Sekhar Tiwary

    Abstract: Two dimensional (2D) layered transition-metal-based tellurides (chalcogens) are known to harness their surface atoms characteristics to enhance topographical activities for energy conversion, storage, and magnetic applications. High surface energy due to unsaturated dangling bonds and larger lateral size than the thickness (volume) makes them a potential candidate for emerging electronics. Neverth… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  14. arXiv:2311.00618  [pdf, other

    cs.CV

    De-Diffusion Makes Text a Strong Cross-Modal Interface

    Authors: Chen Wei, Chenxi Liu, Siyuan Qiao, Zhishuai Zhang, Alan Yuille, Jiahui Yu

    Abstract: We demonstrate text as a strong cross-modal interface. Rather than relying on deep embeddings to connect image and language as the interface representation, our approach represents an image as text, from which we enjoy the interpretability and flexibility inherent to natural language. We employ an autoencoder that uses a pre-trained text-to-image diffusion model for decoding. The encoder is traine… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: Technical report. Project page: https://dediffusion.github.io

  15. arXiv:2310.18891  [pdf, other

    cs.HC cs.CY cs.RO eess.SY

    Social Interaction-Aware Dynamical Models and Decision Making for Autonomous Vehicles

    Authors: Luca Crosato, Kai Tian, Hubert P. H Shum, Edmond S. L. Ho, Yafei Wang, Chongfeng Wei

    Abstract: Interaction-aware Autonomous Driving (IAAD) is a rapidly growing field of research that focuses on the development of autonomous vehicles (AVs) that are capable of interacting safely and efficiently with human road users. This is a challenging task, as it requires the autonomous vehicle to be able to understand and predict the behaviour of human road users. In this literature review, the current s… ▽ More

    Submitted 30 October, 2023; v1 submitted 28 October, 2023; originally announced October 2023.

  16. arXiv:2310.17380  [pdf, ps, other

    math.AG

    Bott Vanishing via Hodge Theory

    Authors: Chuanhao Wei

    Abstract: In this paper, we revise the Bott Vanishing on projective toric varieties by giving it an alternative proof with a condition that is compatible with the condition of Kawamata-Viehweg Vanishing. This proof can also be adapted to generalize Bott Vanishing to the setting using mixed Hodge modules. Lastly, we give a counter-example towards the relative Bott Vanishing for birational morphisms.

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: 8 pages; suggestions are welcome

    MSC Class: 14M25; 14F17; 14C30

  17. arXiv:2310.11550  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Towards Optimal Regret in Adversarial Linear MDPs with Bandit Feedback

    Authors: Haolin Liu, Chen-Yu Wei, Julian Zimmert

    Abstract: We study online reinforcement learning in linear Markov decision processes with adversarial losses and bandit feedback, without prior knowledge on transitions or access to simulators. We introduce two algorithms that achieve improved regret performance compared to existing approaches. The first algorithm, although computationally inefficient, ensures a regret of… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

  18. arXiv:2310.04642  [pdf, ps, other

    math.CO

    On two conjectural series involving Riemann zeta function

    Authors: Chuanan Wei, Ce Xu

    Abstract: Riemann zeta function is important in a lot of branches of number theory. With the help of the operator method and several transformation formulas for hypergeometric series, we prove four series involving Riemann zeta function. Two of them are series expansions for $ζ(7)$ and $ζ(3)^2$ recently conjectured by Z.-W. Sun.

    Submitted 6 October, 2023; originally announced October 2023.

  19. arXiv:2309.17448  [pdf, other

    cs.CV

    SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation

    Authors: Zhongang Cai, Wanqi Yin, Ailing Zeng, Chen Wei, Qingping Sun, Yanjun Wang, Hui En Pang, Haiyi Mei, Mingyuan Zhang, Lei Zhang, Chen Change Loy, Lei Yang, Ziwei Liu

    Abstract: Expressive human pose and shape estimation (EHPS) unifies body, hands, and face motion capture with numerous applications. Despite encouraging progress, current state-of-the-art methods still depend largely on a confined set of training datasets. In this work, we investigate scaling up EHPS towards the first generalist foundation model (dubbed SMPLer-X), with up to ViT-Huge as the backbone and tra… ▽ More

    Submitted 30 October, 2023; v1 submitted 29 September, 2023; originally announced September 2023.

    Comments: Homepage: https://caizhongang.github.io/projects/SMPLer-X/

  20. arXiv:2309.10670  [pdf, other

    cond-mat.str-el

    Symmetry considerations in exact diagonalization: spin-1/2 pyrochlore magnets

    Authors: C. Wei, S. H. Curnoe

    Abstract: We describe how the methods of group theory (symmetry) are used to optimize the problem of exact diagonalization of a quantum system on a 16-site pyrochlore lattice. By analytically constructing a complete set of symmetrized states, we completely block-diagonalize the Hamiltonian. As an example, we consider a spin-1/2 system with nearest neighbour exchange interactions.

    Submitted 19 September, 2023; originally announced September 2023.

  21. arXiv:2309.08836  [pdf, other

    cs.CL cs.AI cs.CY

    Bias and Fairness in Chatbots: An Overview

    Authors: Jintang Xue, Yun-Cheng Wang, Chengwei Wei, Xiaofeng Liu, Jonghye Woo, C. -C. Jay Kuo

    Abstract: Chatbots have been studied for more than half a century. With the rapid development of natural language processing (NLP) technologies in recent years, chatbots using large language models (LLMs) have received much attention nowadays. Compared with traditional ones, modern chatbots are more powerful and have been used in real-world applications. There are however, bias and fairness concerns in mode… ▽ More

    Submitted 10 December, 2023; v1 submitted 15 September, 2023; originally announced September 2023.

  22. arXiv:2309.07120  [pdf, other

    cs.CL cs.AI cs.CV cs.CY cs.LG

    Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics

    Authors: Haoqin Tu, Bingchen Zhao, Chen Wei, Cihang Xie

    Abstract: Multi-modal large language models (MLLMs) are trained based on large language models (LLM), with an enhanced capability to comprehend multi-modal inputs and generate textual responses. While they excel in multi-modal tasks, the pure NLP abilities of MLLMs are often underestimated and left untested. In this study, we get out of the box and unveil an intriguing characteristic of MLLMs -- our prelimi… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

  23. arXiv:2309.01560  [pdf

    physics.app-ph cond-mat.mtrl-sci

    Quasi 1D Nanobelts from the Sustainable Liquid Exfoliation of Terrestrial Minerals for Future Martian based Electronics

    Authors: Cencen Wei, Abhijit Roy, Adel K. A. Aljarid, Yi Hu, S. Mark Roe, Dimitrios G. Papageorgiou, Raul Arenal, Conor S. Boland

    Abstract: The sky is the limit with regards to the societal impact nanomaterials can have on our lives. However, in this study we show that their potential is out of this world. The planet Mars has an abundant source of calcium sulfate minerals and in our work, we show that these deposits can be the basis of transformative nanomaterials to potentially support future space endeavors. Through a scalable eco-f… ▽ More

    Submitted 4 September, 2023; originally announced September 2023.

  24. arXiv:2309.00814  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Bypassing the Simulator: Near-Optimal Adversarial Linear Contextual Bandits

    Authors: Haolin Liu, Chen-Yu Wei, Julian Zimmert

    Abstract: We consider the adversarial linear contextual bandit problem, where the loss vectors are selected fully adversarially and the per-round action set (i.e. the context) is drawn from a fixed distribution. Existing methods for this problem either require access to a simulator to generate free i.i.d. contexts, achieve a sub-optimal regret no better than $\widetilde{O}(T^{\frac{5}{6}})$, or are computat… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

  25. MuraNet: Multi-task Floor Plan Recognition with Relation Attention

    Authors: Lingxiao Huang, Jung-Hsuan Wu, Chiching Wei, Wilson Li

    Abstract: The recognition of information in floor plan data requires the use of detection and segmentation models. However, relying on several single-task models can result in ineffective utilization of relevant information when there are multiple tasks present simultaneously. To address this challenge, we introduce MuraNet, an attention-based multi-task model for segmentation and detection tasks in floor p… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

    Comments: Document Analysis and Recognition - ICDAR 2023 Workshops. ICDAR 2023. Lecture Notes in Computer Science, vol 14193. Springer, Cham

  26. arXiv:2309.00218  [pdf, other

    hep-th gr-qc

    Catastrophic Emission of Charges from Near-Extremal Nariai Black Holes

    Authors: Chiang-Mei Chen, Chun-Chih Huang, Sang Pyo Kim, Chun-Yu Wei

    Abstract: Using the in-out formalism and also the monodromy method, we study the emission of charges from near-extremal charged Nariai black holes with the black hole and cosmological horizons close to each other. The emission becomes catastrophic for a charge with energy greater than its chemical potential, whose leading exponential factor increases inversely proportional to the separation of two horizons.… ▽ More

    Submitted 31 August, 2023; originally announced September 2023.

    Comments: 15 pages

  27. arXiv:2308.14492  [pdf, other

    cs.CV

    PointHPS: Cascaded 3D Human Pose and Shape Estimation from Point Clouds

    Authors: Zhongang Cai, Liang Pan, Chen Wei, Wanqi Yin, Fangzhou Hong, Mingyuan Zhang, Chen Change Loy, Lei Yang, Ziwei Liu

    Abstract: Human pose and shape estimation (HPS) has attracted increasing attention in recent years. While most existing studies focus on HPS from 2D images or videos with inherent depth ambiguity, there are surging need to investigate HPS from 3D point clouds as depth sensors have been frequently employed in commercial devices. However, real-world sensory 3D points are usually noisy and incomplete, and also… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

  28. arXiv:2308.13904  [pdf, other

    cs.CL cs.CR cs.LG

    LMSanitator: Defending Prompt-Tuning Against Task-Agnostic Backdoors

    Authors: Chengkun Wei, Wenlong Meng, Zhikun Zhang, Min Chen, Minghu Zhao, Wenjing Fang, Lei Wang, Zihui Zhang, Wenzhi Chen

    Abstract: Prompt-tuning has emerged as an attractive paradigm for deploying large-scale language models due to its strong downstream task performance and efficient multitask serving ability. Despite its wide adoption, we empirically show that prompt-tuning is vulnerable to downstream task-agnostic backdoors, which reside in the pretrained models and can affect arbitrary downstream tasks. The state-of-the-ar… ▽ More

    Submitted 14 October, 2023; v1 submitted 26 August, 2023; originally announced August 2023.

    Comments: To Appear in the Network and Distributed System Security (NDSS) Symposium 2024, 26 February - 1 March 2024, San Diego, CA, USA; typos corrected

  29. arXiv:2308.13465  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall

    Ptychographic nanoscale imaging of the magnetoelectric coupling in freestanding BiFeO$_3$

    Authors: Tim A. Butcher, Nicholas W. Phillips, Chun-Chien Chiu, Chia-Chun Wei, Sheng-Zhu Ho, Yi-Chun Chen, Erik Fröjdh, Filippo Baruffaldi, Maria Carulla, Jiaguo Zhang, Anna Bergamaschi, Carlos A. F. Vaz, Armin Kleibert, Simone Finizio, Jan-Chi Yang, Shih-Wen Huang, Jörg Raabe

    Abstract: Understanding the magnetic and ferroelectric ordering of magnetoelectric multiferroic materials at the nanoscale necessitates a versatile imaging method with high spatial resolution. Here, soft X-ray ptychography is employed to simultaneously image the ferroelectric and antiferromagnetic domains in an 80 nm thin freestanding film of the room-temperature multiferroic BiFeO$_3$ (BFO). The antiferrom… ▽ More

    Submitted 29 June, 2024; v1 submitted 25 August, 2023; originally announced August 2023.

    Comments: Supporting information available with published version: https://doi.org/10.1002/adma.202311157

    Journal ref: Adv. Mater. 2024, 36, 2311157

  30. DeSCo: Towards Generalizable and Scalable Deep Subgraph Counting

    Authors: Tianyu Fu, Chiyue Wei, Yu Wang, Rex Ying

    Abstract: We introduce DeSCo, a scalable neural deep subgraph counting pipeline, designed to accurately predict both the count and occurrence position of queries on target graphs post single training. Firstly, DeSCo uses a novel canonical partition and divides the large target graph into small neighborhood graphs, greatly reducing the count variation while guaranteeing no missing or double-counting. Secondl… ▽ More

    Submitted 19 December, 2023; v1 submitted 16 August, 2023; originally announced August 2023.

    Comments: 8 pages main text, 2 pages references, 11 pages appendix; open source at https://github.com/fuvty/DeSCo

    ACM Class: I.2.8

    Journal ref: WSDM'24, March 4-8, 2024, Merida, Mexico

  31. arXiv:2308.06440  [pdf, ps, other

    math.CO

    On some conjectural series containing harmonic numbers of 3-order

    Authors: Chuanan Wei, Ce Xu

    Abstract: Harmonic numbers are important in a lot of branches of number theory. By means of the derivative operator, the integral operator, and several summation and transformation formulas for hypergeometric series, we prove four series containing harmonic numbers of 3-order. Three of them are conjectures which were recently proposed by Z.-W. Sun.

    Submitted 11 August, 2023; originally announced August 2023.

  32. arXiv:2307.14624  [pdf, other

    cs.CV

    FS-Depth: Focal-and-Scale Depth Estimation from a Single Image in Unseen Indoor Scene

    Authors: Chengrui Wei, Meng Yang, Lei He, Nanning Zheng

    Abstract: It has long been an ill-posed problem to predict absolute depth maps from single images in real (unseen) indoor scenes. We observe that it is essentially due to not only the scale-ambiguous problem but also the focal-ambiguous problem that decreases the generalization ability of monocular depth estimation. That is, images may be captured by cameras of different focal lengths in scenes of different… ▽ More

    Submitted 27 July, 2023; originally announced July 2023.

  33. arXiv:2307.10455  [pdf, other

    cs.CV cs.AI cs.LG

    A Step Towards Worldwide Biodiversity Assessment: The BIOSCAN-1M Insect Dataset

    Authors: Zahra Gharaee, ZeMing Gong, Nicholas Pellegrino, Iuliia Zarubiieva, Joakim Bruslund Haurum, Scott C. Lowe, Jaclyn T. A. McKeown, Chris C. Y. Ho, Joschka McLeod, Yi-Yun C Wei, Jireh Agda, Sujeevan Ratnasingham, Dirk Steinke, Angel X. Chang, Graham W. Taylor, Paul Fieguth

    Abstract: In an effort to catalog insect biodiversity, we propose a new large dataset of hand-labelled insect images, the BIOSCAN-Insect Dataset. Each record is taxonomically classified by an expert, and also has associated genetic information including raw nucleotide barcode sequences and assigned barcode index numbers, which are genetically-based proxies for species classification. This paper presents a c… ▽ More

    Submitted 13 November, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

  34. arXiv:2307.07930  [pdf, other

    cs.CL cs.AI

    GeoGPT: Understanding and Processing Geospatial Tasks through An Autonomous GPT

    Authors: Yifan Zhang, Cheng Wei, Shangyou Wu, Zhengting He, Wenhao Yu

    Abstract: Decision-makers in GIS need to combine a series of spatial algorithms and operations to solve geospatial tasks. For example, in the task of facility siting, the Buffer tool is usually first used to locate areas close or away from some specific entities; then, the Intersect or Erase tool is used to select candidate areas satisfied multiple requirements. Though professionals can easily understand an… ▽ More

    Submitted 15 July, 2023; originally announced July 2023.

    Comments: 23 pages, 4 figures

  35. arXiv:2306.17170  [pdf, other

    cs.DC cs.AI eess.SY

    An Overview on Generative AI at Scale with Edge-Cloud Computing

    Authors: Yun-Cheng Wang, Jintang Xue, Chengwei Wei, C. -C. Jay Kuo

    Abstract: As a specific category of artificial intelligence (AI), generative artificial intelligence (GenAI) generates new content that resembles what is created by humans. The rapid development of GenAI systems has created a huge amount of new data on the Internet, posing new challenges to current computing and communication frameworks. Currently, GenAI services rely on the traditional cloud computing fram… ▽ More

    Submitted 9 July, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

  36. arXiv:2306.12624  [pdf, other

    cs.CV

    DreamEdit: Subject-driven Image Editing

    Authors: Tianle Li, Max Ku, Cong Wei, Wenhu Chen

    Abstract: Subject-driven image generation aims at generating images containing customized subjects, which has recently drawn enormous attention from the research community. However, the previous works cannot precisely control the background and position of the target subject. In this work, we aspire to fill the void and propose two novel subject-driven sub-tasks, i.e., Subject Replacement and Subject Additi… ▽ More

    Submitted 16 August, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

  37. arXiv:2306.11700  [pdf, other

    math.OC cs.LG eess.SY

    Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs

    Authors: Dongsheng Ding, Chen-Yu Wei, Kaiqing Zhang, Alejandro Ribeiro

    Abstract: We study the problem of computing an optimal policy of an infinite-horizon discounted constrained Markov decision process (constrained MDP). Despite the popularity of Lagrangian-based policy search methods used in practice, the oscillation of policy iterates in these methods has not been fully understood, bringing out issues such as violation of constraints and sensitivity to hyper-parameters. To… ▽ More

    Submitted 16 January, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: 65 pages, 17 figures, and 1 table; NeurIPS 2023

  38. arXiv:2306.11189  [pdf

    cs.CL

    BioREx: Improving Biomedical Relation Extraction by Leveraging Heterogeneous Datasets

    Authors: Po-Ting Lai, Chih-Hsuan Wei, Ling Luo, Qingyu Chen, Zhiyong Lu

    Abstract: Biomedical relation extraction (RE) is the task of automatically identifying and characterizing relations between biomedical concepts from free text. RE is a central task in biomedical natural language processing (NLP) research and plays a critical role in many downstream applications, such as literature-based discovery and knowledge graph construction. State-of-the-art methods were used primarily… ▽ More

    Submitted 19 June, 2023; originally announced June 2023.

  39. arXiv:2306.06659  [pdf

    cond-mat.supr-con cond-mat.mtrl-sci

    Ferromagnetic Superconductivity in Two-dimensional Niobium Diselenide

    Authors: Tingyu Qu, Shangjian Jin, Fuchen Hou, Deyi Fu, Junye Huang, Darryl Foo Chuan Wei, Xiao Chang, Kenji Watanabe, Takashi Taniguchi, Junhao Lin, Shaffique Adam, Barbaros Özyilmaz

    Abstract: The co-existence of ferromagnetism and superconductivity becomes possible through unconventional pairing in the superconducting state. Such materials are exceedingly rare in solid-state systems but are promising platforms to explore topological phases, such as Majorana bound states. Theoretical investigations date back to the late 1950s, but only a few systems have so far been experimentally ident… ▽ More

    Submitted 11 June, 2023; originally announced June 2023.

    Comments: 26 pages, 13 figures

  40. arXiv:2306.02641  [pdf, ps, other

    math.CO

    On some conjectural series containing binomial coefficients and harmonic numbers

    Authors: Chuanan Wei

    Abstract: Binomial coefficients and harmonic numbers are important in many branches of number theory. With the help of the operator method and several summation and transformation formulas for hypergeometric series, we prove eight conjectural series of Z.-W. Sun containing binomial coefficients and harmonic numbers in this paper.

    Submitted 5 June, 2023; originally announced June 2023.

  41. arXiv:2306.01747  [pdf, other

    cs.CV

    UMDFood: Vision-language models boost food composition compilation

    Authors: Peihua Ma, Yixin Wu, Ning Yu, Yang Zhang, Michael Backes, Qin Wang, Cheng-I Wei

    Abstract: Nutrition information is crucial in precision nutrition and the food industry. The current food composition compilation paradigm relies on laborious and experience-dependent methods. However, these methods struggle to keep up with the dynamic consumer market, resulting in delayed and incomplete nutrition data. In addition, earlier machine learning methods overlook the information in food ingredien… ▽ More

    Submitted 6 November, 2023; v1 submitted 17 May, 2023; originally announced June 2023.

    Comments: 13 pages, 9 figures

  42. arXiv:2306.00989  [pdf, other

    cs.CV cs.LG

    Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles

    Authors: Chaitanya Ryali, Yuan-Ting Hu, Daniel Bolya, Chen Wei, Haoqi Fan, Po-Yao Huang, Vaibhav Aggarwal, Arkabandhu Chowdhury, Omid Poursaeed, Judy Hoffman, Jitendra Malik, Yanghao Li, Christoph Feichtenhofer

    Abstract: Modern hierarchical vision transformers have added several vision-specific components in the pursuit of supervised classification performance. While these components lead to effective accuracies and attractive FLOP counts, the added complexity actually makes these transformers slower than their vanilla ViT counterparts. In this paper, we argue that this additional bulk is unnecessary. By pretraini… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: ICML 2023 Oral version. Code+Models: https://github.com/facebookresearch/hiera

  43. arXiv:2305.17380  [pdf, ps, other

    cs.LG stat.ML

    No-Regret Online Reinforcement Learning with Adversarial Losses and Transitions

    Authors: Tiancheng Jin, Junyan Liu, Chloé Rouyer, William Chang, Chen-Yu Wei, Haipeng Luo

    Abstract: Existing online learning algorithms for adversarial Markov Decision Processes achieve ${O}(\sqrt{T})$ regret after $T$ rounds of interactions even if the loss functions are chosen arbitrarily by an adversary, with the caveat that the transition function has to be fixed. This is because it has been shown that adversarial transition functions make no-regret learning impossible. Despite such impossib… ▽ More

    Submitted 26 October, 2023; v1 submitted 27 May, 2023; originally announced May 2023.

    Comments: Update the camera-ready version for NeurIPS 2023

    ACM Class: I.2.6

  44. arXiv:2305.07861  [pdf

    physics.optics physics.app-ph

    Ultra-wideband Waveguide-coupled Photodiodes Heterogeneously Integrated on a Thin-film Lithium Niobate Platform

    Authors: Chao Wei, Youren Yu, Ziyun Wang, Lin Jiang, Zhongming Zeng, Jia Ye, Xihua Zou, Wei Pan, Xiaojun Xie, Lianshan Yan

    Abstract: With the advantages of large electro-optical coefficient, wide transparency window, and strong optical confinement, thin-film lithium niobate (TFLN) technique has enabled the development of various high-performance optoelectronics devices, ranging from the ultra-wideband electro-optic modulators to the high-efficient quantum sources. However, the TFLN platform does not natively promise lasers and… ▽ More

    Submitted 1 July, 2023; v1 submitted 13 May, 2023; originally announced May 2023.

    Comments: 17 pages, 8 figures

    Journal ref: Light: Advanced Manufacturing, 4, Article number: 30 (2023)

  45. arXiv:2305.05900  [pdf, other

    cs.LG cs.CR cs.CV

    DPMLBench: Holistic Evaluation of Differentially Private Machine Learning

    Authors: Chengkun Wei, Minghu Zhao, Zhikun Zhang, Min Chen, Wenlong Meng, Bo Liu, Yuan Fan, Wenzhi Chen

    Abstract: Differential privacy (DP), as a rigorous mathematical definition quantifying privacy leakage, has become a well-accepted standard for privacy protection. Combined with powerful machine learning techniques, differentially private machine learning (DPML) is increasingly important. As the most classic DPML algorithm, DP-SGD incurs a significant loss of utility, which hinders DPML's deployment in prac… ▽ More

    Submitted 14 October, 2023; v1 submitted 10 May, 2023; originally announced May 2023.

    Comments: To appear in the ACM Conference on Computer and Communications Security (CCS), November 2023, Tivoli Congress Center, Copenhagen, Denmark

  46. The chromatic Point Spread Function of weak lensing measurement in Chinese Space Station survey Telescope

    Authors: Q. Y. Liu, X. Z. Er, Z. H. Fan, D. Z. Liu, G. L. Li, C. L. Wei, Z. Ban, X. B. Li, D. Yue

    Abstract: The weak gravitational lensing is a powerful tool in modern cosmology. To accurately measure the weak lensing signal, one has to control the systematic bias to a small level. One of the most difficult problems is how to correct the smearing effect of the Point Spread Function (PSF) on the shape of the galaxies. The chromaticity of PSF for a broad-band observation can lead to new subtle effects. Si… ▽ More

    Submitted 6 May, 2023; originally announced May 2023.

  47. arXiv:2305.03563  [pdf, other

    cs.MA

    Cooperative Driving of Connected Autonomous Vehicles in Heterogeneous Mixed Traffic: A Game Theoretic Approach

    Authors: Shiyu Fang, Peng Hang, Chongfeng Wei, Yang Xing, Jian Sun

    Abstract: High-density, unsignalized intersection has always been a bottleneck of efficiency and safety. The emergence of Connected Autonomous Vehicles (CAVs) results in a mixed traffic condition, further increasing the complexity of the transportation system. Against this background, this paper aims to study the intricate and heterogeneous interaction of vehicles and conflict resolution at the high-density… ▽ More

    Submitted 5 May, 2023; originally announced May 2023.

  48. arXiv:2305.00832  [pdf, ps, other

    cs.LG stat.ML

    First- and Second-Order Bounds for Adversarial Linear Contextual Bandits

    Authors: Julia Olkhovskaya, Jack Mayo, Tim van Erven, Gergely Neu, Chen-Yu Wei

    Abstract: We consider the adversarial linear contextual bandit setting, which allows for the loss functions associated with each of $K$ arms to change over time without restriction. Assuming the $d$-dimensional contexts are drawn from a fixed known distribution, the worst-case expected regret over the course of $T$ rounds is known to scale as $\tilde O(\sqrt{Kd T})$. Under the additional assumption that the… ▽ More

    Submitted 24 May, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

  49. arXiv:2304.09753  [pdf, ps, other

    math.CO math.NT

    On some conjectures of Z.-W. Sun involving harmonic numbers

    Authors: Chuanan Wei

    Abstract: Harmonic numbers are significant in various branches of number theory. With the help of the digamma function, we prove ten conjectural series of Z.-W. Sun involving harmonic numbers. Several ones of them are also series expansions of $\log2/π^2$.

    Submitted 18 April, 2023; originally announced April 2023.

  50. arXiv:2304.07745  [pdf, other

    cs.CV eess.IV

    Framework for Quality Evaluation of Smart Roadside Infrastructure Sensors for Automated Driving Applications

    Authors: Laurent Kloeker, Chenghua Liu, Chao Wei, Lutz Eckstein

    Abstract: The use of smart roadside infrastructure sensors is highly relevant for future applications of connected and automated vehicles. External sensor technology in the form of intelligent transportation system stations (ITS-Ss) can provide safety-critical real-time information about road users in the form of a digital twin. The choice of sensor setups has a major influence on the downstream function as… ▽ More

    Submitted 16 April, 2023; originally announced April 2023.

    Comments: Accepted to be published as part of the 34th IEEE Intelligent Vehicles Symposium (IV), Anchorage, Alaska, USA, June 4-7, 2023