Skip to main content

Showing 1–50 of 210 results for author: Ouyang, Y

  1. arXiv:2406.16004  [pdf, other

    cs.CV

    RepNeXt: A Fast Multi-Scale CNN using Structural Reparameterization

    Authors: Mingshu Zhao, Yi Luo, Yong Ouyang

    Abstract: In the realm of resource-constrained mobile vision tasks, the pursuit of efficiency and performance consistently drives innovation in lightweight Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs). While ViTs excel at capturing global context through self-attention mechanisms, their deployment in resource-limited environments is hindered by computational complexity and latency. Co… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: Tech report

  2. arXiv:2406.15699  [pdf, other

    cs.CV

    Self-Supervised Alignment Learning for Medical Image Segmentation

    Authors: Haofeng Li, Yiming Ouyang, Xiang Wan

    Abstract: Recently, self-supervised learning (SSL) methods have been used in pre-training the segmentation models for 2D and 3D medical images. Most of these methods are based on reconstruction, contrastive learning and consistency regularization. However, the spatial correspondence of 2D slices from a 3D medical image has not been fully exploited. In this paper, we propose a novel self-supervised alignment… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: Accepted by (ISBI 2024) 2024 IEEE International Symposium on Biomedical Imaging

  3. arXiv:2406.13960  [pdf, other

    cs.CL cs.AI

    Evolving to be Your Soulmate: Personalized Dialogue Agents with Dynamically Adapted Personas

    Authors: Yi Cheng, Wenge Liu, Kaishuai Xu, Wenjun Hou, Yi Ouyang, Chak Tou Leong, Xian Wu, Yefeng Zheng

    Abstract: Previous research on persona-based dialogue agents typically preset the agent's persona before deployment, which remains static thereafter. In this paper, we take a step further and explore a new paradigm called Self-evolving Personalized Dialogue Agents (SPDA), where the agent continuously evolves during the conversation to better align with the user's anticipation by dynamically adapting its per… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: Work in progress

  4. arXiv:2406.12174  [pdf, other

    math.OC

    Expected Bipartite Matching Distance in A $D$-dimensional $L^p$ Space: Approximate Closed-form Formulas and Applications to Mobility Services

    Authors: Shiyu Shen, Yuhui Zhai, Yanfeng Ouyang

    Abstract: Although many well-known algorithms can solve the bipartite matching problem instance efficiently, it remains an open question how one could estimate the expected optimal matching distance for arbitrary numbers of randomly distributed vertices in a $D$-dimensional $L^p$ space (referred to as a random bipartite matching problem, or RBMP). This paper proposes an analytical model with closed-form for… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  5. arXiv:2406.10870  [pdf, other

    cs.CL

    COOL: Comprehensive Knowledge Enhanced Prompt Learning for Domain Adaptive Few-shot Fake News Detection

    Authors: Yi Ouyang, Peng Wu, Li Pan

    Abstract: Most Fake News Detection (FND) methods often struggle with data scarcity for emerging news domain. Recently, prompt learning based on Pre-trained Language Models (PLM) has emerged as a promising approach in domain adaptive few-shot learning, since it greatly reduces the need for labeled data by bridging the gap between pre-training and downstream task. Furthermore, external knowledge is also helpf… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  6. arXiv:2406.05975  [pdf, ps, other

    math.NT

    Divisibility of class numbers of quadratic fields and a conjecture of Iizuka

    Authors: Yi Ouyang, Qimin Song

    Abstract: Assume $x,\ y,\ n$ are positive integers and $n$ is odd. In this note, we show that the class number of the imaginary quadratic field $\mathbb{Q}(\sqrt{x^{2}-y^{n}})$ is divisible by $n$ for fixed $x, n$ if $\gcd(2x,y)=1$ and $y>C$ where $C$ is a constant depending only on $x$ and $n$. Based on this result, for any odd integer $n$ and any positive integer $m$, we construct an infinite family of… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  7. arXiv:2405.16876  [pdf, other

    cs.LG cs.AI

    Transfer Learning for Diffusion Models

    Authors: Yidong Ouyang, Liyan Xie, Hongyuan Zha, Guang Cheng

    Abstract: Diffusion models, a specific type of generative model, have achieved unprecedented performance in recent years and consistently produce high-quality synthetic samples. A critical prerequisite for their notable success lies in the presence of a substantial number of training samples, which can be impractical in real-world applications due to high collection costs or associated risks. Consequently,… ▽ More

    Submitted 27 May, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

    Comments: 24 pages

  8. arXiv:2405.14391  [pdf, other

    cs.AI cs.CL cs.CY

    Explainable Few-shot Knowledge Tracing

    Authors: Haoxuan Li, Jifan Yu, Yuanxin Ouyang, Zhuang Liu, Wenge Rong, Juanzi Li, Zhang Xiong

    Abstract: Knowledge tracing (KT), aiming to mine students' mastery of knowledge by their exercise records and predict their performance on future test questions, is a critical task in educational assessment. While researchers achieved tremendous success with the rapid development of deep learning techniques, current knowledge tracing tasks fall into the cracks from real-world teaching scenarios. Relying hea… ▽ More

    Submitted 25 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

  9. arXiv:2405.12260  [pdf, ps, other

    math.ST math.PR

    On an upper bound of the set of copulas with a given curvilinear section

    Authors: Yao Ouyang, Yonghui Sun, Hua-Peng Zhang

    Abstract: The characterizations when two natural upper bounds of the set of copulas with a given diagonal section are copulas have been well studied in the literature. Given a curvilinear section, however, there is only a partial result concerning the characterization when a natural upper bound of the set of copulas is a copula. In this paper, we completely solve the characterization problem for this natura… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

  10. Potential Surface Ice Distribution on Close-in Terrestrial Exoplanets around M dwarfs

    Authors: Yueyun Ouyang, Feng Ding

    Abstract: Previous studies suggested that surface ice could be distributed on close-in terrestrial exoplanets around M-dwarfs if heat redistribution on the planets is very inefficient. In general, orbital and atmospheric parameters play an important role in the climate on terrestrial planets, including the cold-trap region where the permanent surface water reservoir can potentially be distributed. Here, we… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: Accepted at Monthly Notices of the Royal Astronomical Society

  11. arXiv:2404.18620  [pdf, other

    cs.CV

    FlexiFilm: Long Video Generation with Flexible Conditions

    Authors: Yichen Ouyang, jianhao Yuan, Hao Zhao, Gaoang Wang, Bo zhao

    Abstract: Generating long and consistent videos has emerged as a significant yet challenging problem. While most existing diffusion-based video generation models, derived from image generation models, demonstrate promising performance in generating short videos, their simple conditioning mechanism and sampling strategy-originally designed for image generation-cause severe performance degradation when adapte… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 9 pages, 9 figures

  12. arXiv:2404.05962  [pdf, other

    cs.IR cs.IT

    Wasserstein Dependent Graph Attention Network for Collaborative Filtering with Uncertainty

    Authors: Haoxuan Li, Yuanxin Ouyang, Zhuang Liu, Wenge Rong, Zhang Xiong

    Abstract: Collaborative filtering (CF) is an essential technique in recommender systems that provides personalized recommendations by only leveraging user-item interactions. However, most CF methods represent users and items as fixed points in the latent space, lacking the ability to capture uncertainty. While probabilistic embedding is proposed to intergrate uncertainty, they suffer from several limitation… ▽ More

    Submitted 29 June, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

    Comments: Accepted by IEEE TCSS

  13. arXiv:2404.05291  [pdf, other

    cs.RO

    Long-horizon Locomotion and Manipulation on a Quadrupedal Robot with Large Language Models

    Authors: Yutao Ouyang, Jinhan Li, Yunfei Li, Zhongyu Li, Chao Yu, Koushil Sreenath, Yi Wu

    Abstract: We present a large language model (LLM) based system to empower quadrupedal robots with problem-solving abilities for long-horizon tasks beyond short-term motions. Long-horizon tasks for quadrupeds are challenging since they require both a high-level understanding of the semantics of the problem for task planning and a broad range of locomotion and manipulation skills to interact with the environm… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  14. arXiv:2404.02936  [pdf, other

    cs.CL cs.LG

    Min-K%++: Improved Baseline for Detecting Pre-Training Data from Large Language Models

    Authors: Jingyang Zhang, Jingwei Sun, Eric Yeats, Yang Ouyang, Martin Kuo, Jianyi Zhang, Hao Frank Yang, Hai Li

    Abstract: The problem of pre-training data detection for large language models (LLMs) has received growing attention due to its implications in critical issues like copyright violation and test data contamination. Despite improved performance, existing methods (including the state-of-the-art, Min-K%) are mostly developed upon simple heuristics and lack solid, reasonable foundations. In this work, we propose… ▽ More

    Submitted 23 May, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: Project page and code is available at https://zjysteven.github.io/mink-plus-plus/

  15. arXiv:2404.00639  [pdf, other

    cs.AR cs.LG

    RL-MUL: Multiplier Design Optimization with Deep Reinforcement Learning

    Authors: Dongsheng Zuo, Jiadong Zhu, Yikang Ouyang, Yuzhe Ma

    Abstract: Multiplication is a fundamental operation in many applications, and multipliers are widely adopted in various circuits. However, optimizing multipliers is challenging and non-trivial due to the huge design space. In this paper, we propose RL-MUL, a multiplier design optimization framework based on reinforcement learning. Specifically, we utilize matrix and tensor representations for the compressor… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: Extension of DAC 2023 version

  16. arXiv:2403.16702  [pdf, other

    cs.CL cs.IR cs.SE

    ProCQA: A Large-scale Community-based Programming Question Answering Dataset for Code Search

    Authors: Zehan Li, Jianfei Zhang, Chuantao Yin, Yuanxin Ouyang, Wenge Rong

    Abstract: Retrieval-based code question answering seeks to match user queries in natural language to relevant code snippets. Previous approaches typically rely on pretraining models using crafted bi-modal and uni-modal datasets to align text and code representations. In this paper, we introduce ProCQA, a large-scale programming question answering dataset extracted from the StackOverflow community, offering… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Accepted to LREC-COLING 2024

  17. arXiv:2403.16029  [pdf, other

    math.OC

    Planning Charging Stations and Service Operations of Dockless Electric Micromobility Systems

    Authors: Yining Liu, Yanfeng Ouyang

    Abstract: Dockless electric micro-mobility services (e.g., shared e-scooters and e-bikes) have been increasingly popular in the recent decade, and a variety of charging technologies have emerged for these services. The use of charging stations, to/from which service vehicles are transported by the riders for charging, poses as a promising approach because it reduces the need for dedicated staff or contracto… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  18. arXiv:2403.10339  [pdf, other

    cs.LG

    Generation is better than Modification: Combating High Class Homophily Variance in Graph Anomaly Detection

    Authors: Rui Zhang, Dawei Cheng, Xin Liu, Jie Yang, Yi Ouyang, Xian Wu, Yefeng Zheng

    Abstract: Graph-based anomaly detection is currently an important research topic in the field of graph neural networks (GNNs). We find that in graph anomaly detection, the homophily distribution differences between different classes are significantly greater than those in homophilic and heterophilic graphs. For the first time, we introduce a new metric called Class Homophily Variance, which quantitatively d… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  19. arXiv:2403.00803  [pdf, other

    cs.IR cs.AI cs.LG

    LiMAML: Personalization of Deep Recommender Models via Meta Learning

    Authors: Ruofan Wang, Prakruthi Prabhakar, Gaurav Srivastava, Tianqi Wang, Zeinab S. Jalali, Varun Bharill, Yunbo Ouyang, Aastha Nigam, Divya Venugopalan, Aman Gupta, Fedor Borisyuk, Sathiya Keerthi, Ajith Muralidharan

    Abstract: In the realm of recommender systems, the ubiquitous adoption of deep neural networks has emerged as a dominant paradigm for modeling diverse business objectives. As user bases continue to expand, the necessity of personalization and frequent model updates have assumed paramount significance to ensure the delivery of relevant and refreshed experiences to a diverse array of members. In this work, we… ▽ More

    Submitted 23 February, 2024; originally announced March 2024.

  20. A Review of Data Mining in Personalized Education: Current Trends and Future Prospects

    Authors: Zhang Xiong, Haoxuan Li, Zhuang Liu, Zhuofan Chen, Hao Zhou, Wenge Rong, Yuanxin Ouyang

    Abstract: Personalized education, tailored to individual student needs, leverages educational technology and artificial intelligence (AI) in the digital age to enhance learning effectiveness. The integration of AI in educational platforms provides insights into academic performance, learning preferences, and behaviors, optimizing the personal learning process. Driven by data mining techniques, it not only b… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 25 pages, 5 figures

    Journal ref: Frontiers of Digital Education, 2024 ,1(1): 26-50

  21. arXiv:2402.11572  [pdf, other

    cs.CL

    Cobra Effect in Reference-Free Image Captioning Metrics

    Authors: Zheng Ma, Changxin Wang, Yawen Ouyang, Fei Zhao, Jianbing Zhang, Shujian Huang, Jiajun Chen

    Abstract: Evaluating the compatibility between textual descriptions and corresponding images represents a core endeavor within multi-modal research. In recent years, a proliferation of reference-free methods, leveraging visual-language pre-trained models (VLMs), has emerged. Empirical evidence has substantiated that these innovative approaches exhibit a higher correlation with human judgment, marking a sign… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: pre-print version

  22. arXiv:2402.11139  [pdf, other

    cs.LG cs.AI

    LiGNN: Graph Neural Networks at LinkedIn

    Authors: Fedor Borisyuk, Shihai He, Yunbo Ouyang, Morteza Ramezani, Peng Du, Xiaochen Hou, Chengming Jiang, Nitin Pasumarthy, Priya Bannur, Birjodh Tiwana, Ping Liu, Siddharth Dangi, Daqi Sun, Zhoutao Pei, Xiao Shi, Sirou Zhu, Qianqi Shen, Kuang-Hsuan Lee, David Stein, Baolei Li, Haichao Wei, Amol Ghoting, Souvik Ghosh

    Abstract: In this paper, we present LiGNN, a deployed large-scale Graph Neural Networks (GNNs) Framework. We share our insight on developing and deployment of GNNs at large scale at LinkedIn. We present a set of algorithmic improvements to the quality of GNN representation learning including temporal graph architectures with long term losses, effective cold start solutions via graph densification, ID embedd… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  23. arXiv:2402.08813  [pdf, other

    math.OC cs.LG eess.SY

    Model approximation in MDPs with unbounded per-step cost

    Authors: Berk Bozkurt, Aditya Mahajan, Ashutosh Nayyar, Yi Ouyang

    Abstract: We consider the problem of designing a control policy for an infinite-horizon discounted cost Markov decision process $\mathcal{M}$ when we only have access to an approximate model $\hat{\mathcal{M}}$. How well does an optimal policy $\hatπ^{\star}$ of the approximate model perform when used in the original model $\mathcal{M}$? We answer this question by bounding a weighted norm of the difference… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

  24. arXiv:2402.06859  [pdf, other

    cs.LG cs.AI cs.IR

    LiRank: Industrial Large Scale Ranking Models at LinkedIn

    Authors: Fedor Borisyuk, Mingzhou Zhou, Qingquan Song, Siyu Zhu, Birjodh Tiwana, Ganesh Parameswaran, Siddharth Dangi, Lars Hertel, Qiang Xiao, Xiaochen Hou, Yunbo Ouyang, Aman Gupta, Sheallika Singh, Dan Liu, Hailing Cheng, Lei Le, Jonathan Hung, Sathiya Keerthi, Ruoyan Wang, Fengyu Zhang, Mohit Kothari, Chen Zhu, Daqi Sun, Yun Dai, Xun Luan , et al. (9 additional authors not shown)

    Abstract: We present LiRank, a large-scale ranking framework at LinkedIn that brings to production state-of-the-art modeling architectures and optimization methods. We unveil several modeling improvements, including Residual DCN, which adds attention and residual connections to the famous DCNv2 architecture. We share insights into combining and tuning SOTA architectures to create a unified model, including… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

    ACM Class: H.3.3

  25. arXiv:2402.04093  [pdf, other

    quant-ph

    Robust projective measurements through measuring code-inspired observables

    Authors: Yingkai Ouyang

    Abstract: Quantum measurements are ubiquitous in quantum information processing tasks, but errors can render their outputs unreliable. Here, we present a scheme that implements a robust projective measurement through measuring code-inspired observables. Namely, given a projective POVM, a classical code and a constraint on the number of measurement outcomes each observable can have, we construct commuting ob… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: 7 pages, 1 figure, 2 columns

  26. arXiv:2401.14291  [pdf, other

    math.RA

    On the Algebraic Classification of Non-singular Flexible Kokotsakis Polyhedra

    Authors: Yang Liu, Yi Ouyang, Dominik L. Michels

    Abstract: Across various scientific and engineering domains, a growing interest in flexible and deployable structures is becoming evident. These structures facilitate seamless transitions between distinct states of shape and find broad applicability ranging from robotics and solar cells to meta-materials and architecture. In this contribution, we study a class of mechanisms known as Kokotsakis polyhedra wit… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

    MSC Class: 12D05; 12F05; 52C25

  27. arXiv:2401.12087  [pdf, other

    cs.CL

    Revisiting Demonstration Selection Strategies in In-Context Learning

    Authors: Keqin Peng, Liang Ding, Yancheng Yuan, Xuebo Liu, Min Zhang, Yuanxin Ouyang, Dacheng Tao

    Abstract: Large language models (LLMs) have shown an impressive ability to perform a wide range of tasks using in-context learning (ICL), where a few examples are used to describe a task to the model. However, the performance of ICL varies significantly with the choice of demonstrations, and it is still unclear why this happens or what factors will influence its choice. In this work, we first revisit the fa… ▽ More

    Submitted 23 June, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

    Comments: ACL 2024

  28. arXiv:2401.05886  [pdf, other

    quant-ph

    Finding the optimal probe state for multiparameter quantum metrology using conic programming

    Authors: Masahito Hayashi, Yingkai Ouyang

    Abstract: The aim of the channel estimation is to estimate the parameters encoded in a quantum channel. For this aim, it is allowed to choose the input state as well as the measurement to get the outcome. Various precision bounds are known for the state estimation. For the channel estimation, the respective bounds are determined depending on the choice of the input state. However, determining the optimal in… ▽ More

    Submitted 26 January, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

    Comments: 36 pages, 2 columns, 5 figures. Title change, added references, and edited introduction

  29. arXiv:2312.11792  [pdf, other

    cs.CL

    COOPER: Coordinating Specialized Agents towards a Complex Dialogue Goal

    Authors: Yi Cheng, Wenge Liu, Jian Wang, Chak Tou Leong, Yi Ouyang, Wenjie Li, Xian Wu, Yefeng Zheng

    Abstract: In recent years, there has been a growing interest in exploring dialogues with more complex goals, such as negotiation, persuasion, and emotional support, which go beyond traditional service-focused dialogue systems. Apart from the requirement for much more sophisticated strategic reasoning and communication skills, a significant challenge of these tasks lies in the difficulty of objectively measu… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI 2024

  30. arXiv:2310.14605  [pdf, other

    cs.CL cs.MM

    M2DF: Multi-grained Multi-curriculum Denoising Framework for Multimodal Aspect-based Sentiment Analysis

    Authors: Fei Zhao, Chunhui Li, Zhen Wu, Yawen Ouyang, Jianbing Zhang, Xinyu Dai

    Abstract: Multimodal Aspect-based Sentiment Analysis (MABSA) is a fine-grained Sentiment Analysis task, which has attracted growing research interests recently. Existing work mainly utilizes image information to improve the performance of MABSA task. However, most of the studies overestimate the importance of images since there are many noise images unrelated to the text in the dataset, which will have a ne… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: Accepted by EMNLP 2023

  31. arXiv:2310.12139  [pdf, ps, other

    math.OC stat.CO

    Optimal and parameter-free gradient minimization methods for convex and nonconvex optimization

    Authors: Guanghui Lan, Yuyuan Ouyang, Zhe Zhang

    Abstract: We propose novel optimal and parameter-free algorithms for computing an approximate solution with small (projected) gradient norm. Specifically, for computing an approximate solution such that the norm of its (projected) gradient does not exceed $\varepsilon$, we obtain the following results: a) for the convex case, the total number of gradient evaluations is bounded by… ▽ More

    Submitted 29 November, 2023; v1 submitted 18 October, 2023; originally announced October 2023.

  32. arXiv:2310.08439  [pdf, other

    physics.comp-ph cs.DC

    TensorMD: Scalable Tensor-Diagram based Machine Learning Interatomic Potential on Heterogeneous Many-Core Processors

    Authors: Xin Chen, Yucheng Ouyang, Xin Chen, Zhenchuan Chen, Rongfen Lin, Xingyu Gao, Lifang Wang, Fang Li, Yin Liu, Honghui Shang, Haifeng Song

    Abstract: Molecular dynamics simulations have emerged as a potent tool for investigating the physical properties and kinetic behaviors of materials at the atomic scale, particularly in extreme conditions. Ab initio accuracy is now achievable with machine learning based interatomic potentials. With recent advancements in high-performance computing, highly accurate and large-scale simulations become feasible.… ▽ More

    Submitted 12 October, 2023; v1 submitted 12 October, 2023; originally announced October 2023.

  33. arXiv:2309.14963  [pdf, ps, other

    math.NT math.AG

    Neighborhood of vertices in the isogeny graph of principally polarized superspecial abelian surfaces

    Authors: Zheng Xu, Yi Ouyang, Zijian Zhou

    Abstract: For two supersingular elliptic curves $E$ and $E'$ defined over $\mathbb{F}_{p^2}$, let $[E \times E']$ be the superspecial abelian surface with the principal polarization $\{0\} \times E' + E \times \{0\}$. We determine local structure of the vertices $[E \times E']$ in the $(\ell, \ell)$-isogeny graph of principally polarized superspecial abelian surfaces where either $E$ or $E'$ is defined over… ▽ More

    Submitted 12 March, 2024; v1 submitted 26 September, 2023; originally announced September 2023.

  34. arXiv:2309.11868  [pdf, ps, other

    math.FA

    A Radon-Nikodym theorem for monotone measures

    Authors: Yao Ouyang, Jun Li

    Abstract: A version of Radon-Nikodym theorem for the Choquet integral w.r.t. monotone measures is proved. Without any presumptive condition, we obtain a necessary and sufficient condition for the ordered pair $(μ, ν)$ of finite monotone measures to have the so-called Radon-Nikodym property related to a nonnegative measurable function $f$. If $ν$ is null-continuous and weakly null-additive, then $f$ is uniqu… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

  35. Towards Better Modeling with Missing Data: A Contrastive Learning-based Visual Analytics Perspective

    Authors: Laixin Xie, Yang Ouyang, Longfei Chen, Ziming Wu, Quan Li

    Abstract: Missing data can pose a challenge for machine learning (ML) modeling. To address this, current approaches are categorized into feature imputation and label prediction and are primarily focused on handling missing data to enhance ML performance. These approaches rely on the observed data to estimate the missing values and therefore encounter three main shortcomings in imputation, including the need… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: 18 pages, 11 figures. This paper is accepted by IEEE Transactions on Visualization and Computer Graphics (TVCG)

    ACM Class: I.1.2; H.1.2; H.4.2

  36. arXiv:2309.03599  [pdf, other

    cs.CV

    Chasing Consistency in Text-to-3D Generation from a Single Image

    Authors: Yichen Ouyang, Wenhao Chai, Jiayi Ye, Dapeng Tao, Yibing Zhan, Gaoang Wang

    Abstract: Text-to-3D generation from a single-view image is a popular but challenging task in 3D vision. Although numerous methods have been proposed, existing works still suffer from the inconsistency issues, including 1) semantic inconsistency, 2) geometric inconsistency, and 3) saturation inconsistency, resulting in distorted, overfitted, and over-saturated generations. In light of the above issues, we p… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: 9 pages, 11 figures

  37. arXiv:2308.15030  [pdf, other

    cs.AI

    SwapMoE: Serving Off-the-shelf MoE-based Large Language Models with Tunable Memory Budget

    Authors: Rui Kong, Yuanchun Li, Qingtian Feng, Weijun Wang, Xiaozhou Ye, Ye Ouyang, Linghe Kong, Yunxin Liu

    Abstract: Mixture of experts (MoE) is a popular technique to improve capacity of Large Language Models (LLMs) with conditionally-activated parallel experts. However, serving MoE models on memory-constrained devices is challenging due to the large parameter size. Typical solutions such as memory swapping or expert pruning may lead to significantly higher latency or severe accuracy loss. In this paper, we int… ▽ More

    Submitted 29 May, 2024; v1 submitted 29 August, 2023; originally announced August 2023.

    Comments: Accepted at ACL 2024

  38. arXiv:2308.07248  [pdf

    stat.ME stat.AP

    Maintaining the validity of inference from linear mixed models in stepped-wedge cluster randomized trials under misspecified random-effects structures

    Authors: Yongdong Ouyang, Monica Taljaard, Andrew B Forbes, Fan Li

    Abstract: Linear mixed models are commonly used in analyzing stepped-wedge cluster randomized trials (SW-CRTs). A key consideration for analyzing a SW-CRT is accounting for the potentially complex correlation structure, which can be achieved by specifying a random effects structure. Common random effects structures for a SW-CRT include random intercept, random cluster-by-period, and discrete-time decay. Rec… ▽ More

    Submitted 14 February, 2024; v1 submitted 14 August, 2023; originally announced August 2023.

  39. arXiv:2308.02229  [pdf, ps, other

    math.AP

    A priori estimates for higher-order fractional Laplace equations

    Authors: Yugao Ouyang, Meiqing Xu, Ran Zhuo

    Abstract: In this paper, we establish a priori estimates for the positive solutions to a higher-order fractional Laplace equation on a bounded domain by a blowing-up and rescaling argument. To overcome the technical difficulty due to the high-order and fractional order mixed operators, we divide the high-order fractional Laplacian equation into a system, and provide uniform estimates for each equation in th… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

  40. arXiv:2307.12227  [pdf, other

    cs.HC

    FSLens: A Visual Analytics Approach to Evaluating and Optimizing the Spatial Layout of Fire Stations

    Authors: Longfei Chen, He Wang, Yang Ouyang, Yang Zhou, Naiyu Wang, Quan Li

    Abstract: The provision of fire services plays a vital role in ensuring the safety of residents' lives and property. The spatial layout of fire stations is closely linked to the efficiency of fire rescue operations. Traditional approaches have primarily relied on mathematical planning models to generate appropriate layouts by summarizing relevant evaluation criteria. However, this optimization process prese… ▽ More

    Submitted 25 July, 2023; v1 submitted 23 July, 2023; originally announced July 2023.

    Comments: Accepted by IEEE VIS 2023

  41. arXiv:2307.12199  [pdf, other

    cs.HC

    Leveraging Historical Medical Records as a Proxy via Multimodal Modeling and Visualization to Enrich Medical Diagnostic Learning

    Authors: Yang Ouyang, Yuchen Wu, He Wang, Chenyang Zhang, Furui Cheng, Chang Jiang, Lixia Jin, Yuanwu Cao, Quan Li

    Abstract: Simulation-based Medical Education (SBME) has been developed as a cost-effective means of enhancing the diagnostic skills of novice physicians and interns, thereby mitigating the need for resource-intensive mentor-apprentice training. However, feedback provided in most SBME is often directed towards improving the operational proficiency of learners, rather than providing summative medical diagnose… ▽ More

    Submitted 22 July, 2023; originally announced July 2023.

    Comments: Accepted by IEEE VIS 2023

  42. arXiv:2307.11449  [pdf

    cs.AI

    AIGC Empowering Telecom Sector White Paper_chinese

    Authors: Ye Ouyang, Yaqin Zhang, Xiaozhou Ye, Yunxin Liu, Yong Song, Yang Liu, Sen Bian, Zhiyong Liu

    Abstract: In the global craze of GPT, people have deeply realized that AI, as a transformative technology and key force in economic and social development, will bring great leaps and breakthroughs to the global industry and profoundly influence the future world competition pattern. As the builder and operator of information and communication infrastructure, the telecom sector provides infrastructure support… ▽ More

    Submitted 23 July, 2023; v1 submitted 21 July, 2023; originally announced July 2023.

  43. arXiv:2307.10004  [pdf

    cs.AI

    6G Network Business Support System

    Authors: Ye Ouyang, Yaqin Zhang, Peng Wang, Yunxin Liu, Wen Qiao, Jun Zhu, Yang Liu, Feng Zhang, Shuling Wang, Xidong Wang

    Abstract: 6G is the next-generation intelligent and integrated digital information infrastructure, characterized by ubiquitous interconnection, native intelligence, multi-dimensional perception, global coverage, green and low-carbon, native network security, etc. 6G will realize the transition from serving people and people-things communication to supporting the efficient connection of intelligent agents, a… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.

  44. arXiv:2307.09045  [pdf

    cs.NI

    6G Network Operation Support System

    Authors: Ye Ouyang, Yaqin Zhang, Xiaozhou Ye, Yunxin Liu, Xidong Wang, Jie Sun, Yang Liu, Shoufeng Wang, Sen Bian, Yun Li

    Abstract: 6G is the next-generation intelligent and integrated digital information infrastructure, characterized by ubiquitous interconnection, native intelligence, multi-dimensional perception, global coverage, green and low-carbon, native network security, etc. 6G will realize the transition from serving people and people-things communication to supporting the efficient connection of intelligent agents, a… ▽ More

    Submitted 25 July, 2023; v1 submitted 18 July, 2023; originally announced July 2023.

    Comments: 103 pages, 20 figures, 52 references (chinese version)

  45. arXiv:2307.00467  [pdf, other

    cs.LG stat.ML

    MissDiff: Training Diffusion Models on Tabular Data with Missing Values

    Authors: Yidong Ouyang, Liyan Xie, Chongxuan Li, Guang Cheng

    Abstract: The diffusion model has shown remarkable performance in modeling data distributions and synthesizing data. However, the vanilla diffusion model requires complete or fully observed data for training. Incomplete data is a common issue in various real-world applications, including healthcare and finance, particularly when dealing with tabular datasets. This work presents a unified and principled diff… ▽ More

    Submitted 1 July, 2023; originally announced July 2023.

    Comments: 22 pages, short version is accepted by ICML workshop on Structured Probabilistic Inference & Generative Modeling 2023

    Report number: 22

  46. arXiv:2306.10518  [pdf, other

    cs.RO

    LAGOON: Language-Guided Motion Control

    Authors: Shusheng Xu, Huaijie Wang, Jiaxuan Gao, Yutao Ouyang, Chao Yu, Yi Wu

    Abstract: We aim to control a robot to physically behave in the real world following any high-level language command like "cartwheel" or "kick". Although human motion datasets exist, this task remains particularly challenging since generative models can produce physically unrealistic motions, which will be more severe for robots due to different body structures and physical properties. Deploying such a moti… ▽ More

    Submitted 19 May, 2024; v1 submitted 18 June, 2023; originally announced June 2023.

    Comments: 6 pages, 5 figures, 2 tables

    Journal ref: 2024 IEEE International Conference on Robotics and Automation (ICRA 2024)

  47. arXiv:2304.10455  [pdf

    cond-mat.mtrl-sci

    An extreme value statistics model of heterogeneous ice nucleation for quantifying the stability of supercooled aqueous systems

    Authors: Anthony N. Consiglio, Yu Ouyang, Matthew J. Powell-Palm, Boris Rubinsky

    Abstract: The propensity of water to remain in a metastable liquid state at temperatures below its equilibrium melting point holds significant potential for cryopreserving biological material such as tissues and organs. The benefits conferred are a direct result of progressively reducing metabolic expenditure due to colder temperatures while simultaneously avoiding the irreversible damage caused by the crys… ▽ More

    Submitted 20 April, 2023; originally announced April 2023.

    Report number: 064511

    Journal ref: J. Chem. Phys. 159, 064511 (2023)

  48. arXiv:2304.04233  [pdf, other

    cs.CR

    ODDFUZZ: Discovering Java Deserialization Vulnerabilities via Structure-Aware Directed Greybox Fuzzing

    Authors: Sicong Cao, Biao He, Xiaobing Sun, Yu Ouyang, Chao Zhang, Xiaoxue Wu, Ting Su, Lili Bo, Bin Li, Chuanlei Ma, Jiajia Li, Tao Wei

    Abstract: Java deserialization vulnerability is a severe threat in practice. Researchers have proposed static analysis solutions to locate candidate vulnerabilities and fuzzing solutions to generate proof-of-concept (PoC) serialized objects to trigger them. However, existing solutions have limited effectiveness and efficiency. In this paper, we propose a novel hybrid solution ODDFUZZ to efficiently discover… ▽ More

    Submitted 9 April, 2023; originally announced April 2023.

    Comments: To appear in the Main Track of IEEE S&P 2023

  49. arXiv:2303.14457  [pdf, other

    cs.CV cs.AI cs.GR

    Diverse Motion In-betweening with Dual Posture Stitching

    Authors: Tianxiang Ren, Jubo Yu, Shihui Guo, Ying Ma, Yutao Ouyang, Zijiao Zeng, Yazhan Zhang, Yipeng Qin

    Abstract: In-betweening is a technique for generating transitions given initial and target character states. The majority of existing works require multiple (often $>$10) frames as input, which are not always accessible. Our work deals with a focused yet challenging problem: to generate the transition when given exactly two frames (only the first and last). To cope with this challenging scenario, we impleme… ▽ More

    Submitted 25 March, 2023; originally announced March 2023.

    Comments: 10 pages, 5 figures

  50. arXiv:2303.13780  [pdf, other

    cs.CL

    Towards Making the Most of ChatGPT for Machine Translation

    Authors: Keqin Peng, Liang Ding, Qihuang Zhong, Li Shen, Xuebo Liu, Min Zhang, Yuanxin Ouyang, Dacheng Tao

    Abstract: ChatGPT shows remarkable capabilities for machine translation (MT). Several prior studies have shown that it achieves comparable results to commercial systems for high-resource languages, but lags behind in complex tasks, e.g., low-resource and distant-language-pairs translation. However, they usually adopt simple prompts which can not fully elicit the capability of ChatGPT. In this paper, we aim… ▽ More

    Submitted 20 October, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: EMNLP 2023 (findings)