Skip to main content

Showing 1–50 of 149 results for author: Hao, M

  1. arXiv:2407.10268  [pdf

    cond-mat.supr-con

    Weakly Coupled Type-II Superconductivity in a Laves compound ZrRe2

    Authors: Yingpeng Yu, Zhaolong Liu, Qi Li, Zhaoxu Chen, Yulong Wang, Munan Hao, Yaling Yang, Chunsheng Gong, Long Chen, Zhenkai Xie, Kaiyao Zhou, Huifen Ren, Xu Chen, Shifeng Jin

    Abstract: We present a comprehensive investigation of the superconducting properties of ZrRe2, a Re-based hexagonal Laves compounds. ZrRe2 crystallizes in a C14-type structure (space group P63/mmc), with cell parameters a=b=5.2682(5) and c=8.63045 . Resistivity and magnetic susceptibility data both suggest that ZrRe2 exhibits a sharp superconducting transition above 6.1 K. The measured lower and upper criti… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: 14 pages,7 figures, 2 tables

  2. arXiv:2406.19531  [pdf, other

    stat.ML cs.LG

    Forward and Backward State Abstractions for Off-policy Evaluation

    Authors: Meiling Hao, Pingfan Su, Liyuan Hu, Zoltan Szabo, Qingyuan Zhao, Chengchun Shi

    Abstract: Off-policy evaluation (OPE) is crucial for evaluating a target policy's impact offline before its deployment. However, achieving accurate OPE in large state spaces remains challenging.This paper studies state abstractions-originally designed for policy learning-in the context of OPE. Our contributions are three-fold: (i) We define a set of irrelevance conditions central to learning state abstracti… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 42 pages, 5 figures

    ACM Class: G.3; I.2.6; G.1.2

  3. arXiv:2406.14844  [pdf, other

    cs.LG cs.AI

    DN-CL: Deep Symbolic Regression against Noise via Contrastive Learning

    Authors: Jingyi Liu, Yanjie Li, Lina Yu, Min Wu, Weijun Li, Wenqiang Li, Meilan Hao, Yusong Deng, Shu Wei

    Abstract: Noise ubiquitously exists in signals due to numerous factors including physical, electronic, and environmental effects. Traditional methods of symbolic regression, such as genetic programming or deep learning models, aim to find the most fitting expressions for these signals. However, these methods often overlook the noise present in real-world data, leading to reduced fitting accuracy. To tackle… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  4. arXiv:2406.11208  [pdf

    cs.NI

    Privacy-preserving Pseudonym Schemes for Personalized 3D Avatars in Mobile Social Metaverses

    Authors: Cheng Su, Xiaofeng Luo, Zhenmou Liu, Jiawen Kang, Min Hao, Zehui Xiong, Zhaohui Yang, Chongwen Huang

    Abstract: The emergence of mobile social metaverses, a novel paradigm bridging physical and virtual realms, has led to the widespread adoption of avatars as digital representations for Social Metaverse Users (SMUs) within virtual spaces. Equipped with immersive devices, SMUs leverage Edge Servers (ESs) to deploy their avatars and engage with other SMUs in virtual spaces. To enhance immersion, SMUs incline t… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 6pages, 4 figures

  5. arXiv:2406.05874  [pdf, other

    cs.CR

    Stealthy Targeted Backdoor Attacks against Image Captioning

    Authors: Wenshu Fan, Hongwei Li, Wenbo Jiang, Meng Hao, Shui Yu, Xiao Zhang

    Abstract: In recent years, there has been an explosive growth in multimodal learning. Image captioning, a classical multimodal task, has demonstrated promising applications and attracted extensive research attention. However, recent studies have shown that image caption models are vulnerable to some security threats such as backdoor attacks. Existing backdoor attacks against image captioning typically pair… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  6. arXiv:2405.20710  [pdf, other

    cs.IR

    Information Maximization via Variational Autoencoders for Cross-Domain Recommendation

    Authors: Xuying Ning, Wujiang Xu, Xiaolei Liu, Mingming Ha, Qiongxu Ma, Youru Li, Linxun Chen, Yongfeng Zhang

    Abstract: Cross-Domain Sequential Recommendation (CDSR) methods aim to address the data sparsity and cold-start problems present in Single-Domain Sequential Recommendation (SDSR). Existing CDSR methods typically rely on overlapping users, designing complex cross-domain modules to capture users' latent interests that can propagate across different domains. However, their propagated informative information is… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

  7. arXiv:2405.15403  [pdf, other

    cs.LG stat.ML

    Fine-Grained Dynamic Framework for Bias-Variance Joint Optimization on Data Missing Not at Random

    Authors: Mingming Ha, Xuewen Tao, Wenfang Lin, Qionxu Ma, Wujiang Xu, Linxun Chen

    Abstract: In most practical applications such as recommendation systems, display advertising, and so forth, the collected data often contains missing values and those missing values are generally missing-not-at-random, which deteriorates the prediction performance of models. Some existing estimators and regularizers attempt to achieve unbiased estimation to improve the predictive performance. However, varia… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  8. arXiv:2405.14620  [pdf, other

    cs.LG

    Closed-form Symbolic Solutions: A New Perspective on Solving Partial Differential Equations

    Authors: Shu Wei, Yanjie Li, Lina Yu, Min Wu, Weijun Li, Meilan Hao, Wenqiang Li, Jingyi Liu, Yusong Deng

    Abstract: Solving partial differential equations (PDEs) in Euclidean space with closed-form symbolic solutions has long been a dream for mathematicians. Inspired by deep learning, Physics-Informed Neural Networks (PINNs) have shown great promise in numerically solving PDEs. However, since PINNs essentially approximate solutions within the continuous function space, their numerical solutions fall short in bo… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  9. arXiv:2404.14687  [pdf, other

    cs.MM cs.AI cs.CL cs.CV

    Pegasus-v1 Technical Report

    Authors: Raehyuk Jung, Hyojun Go, Jaehyuk Yi, Jiho Jang, Daniel Kim, Jay Suh, Aiden Lee, Cooper Han, Jae Lee, Jeff Kim, Jin-Young Kim, Junwan Kim, Kyle Park, Lucas Lee, Mars Ha, Minjoon Seo, Abraham Jo, Ed Park, Hassan Kianinejad, SJ Kim, Tony Moon, Wade Jeong, Andrei Popescu, Esther Kim, EK Yoon , et al. (19 additional authors not shown)

    Abstract: This technical report introduces Pegasus-1, a multimodal language model specialized in video content understanding and interaction through natural language. Pegasus-1 is designed to address the unique challenges posed by video data, such as interpreting spatiotemporal information, to offer nuanced video content comprehension across various lengths. This technical report overviews Pegasus-1's archi… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  10. arXiv:2404.11816  [pdf, other

    cs.LG

    Tailoring Generative Adversarial Networks for Smooth Airfoil Design

    Authors: Joyjit Chattoraj, Jian Cheng Wong, Zhang Zexuan, Manna Dai, Xia Yingzhi, Li Jichao, Xu Xinxing, Ooi Chin Chun, Yang Feng, Dao My Ha, Liu Yong

    Abstract: In the realm of aerospace design, achieving smooth curves is paramount, particularly when crafting objects such as airfoils. Generative Adversarial Network (GAN), a widely employed generative AI technique, has proven instrumental in synthesizing airfoil designs. However, a common limitation of GAN is the inherent lack of smoothness in the generated airfoil surfaces. To address this issue, we prese… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  11. arXiv:2404.06330  [pdf, other

    cs.LG cs.AI

    Generative Pre-Trained Transformer for Symbolic Regression Base In-Context Reinforcement Learning

    Authors: Yanjie Li, Weijun Li, Lina Yu, Min Wu, Jingyi Liu, Wenqiang Li, Meilan Hao, Shu Wei, Yusong Deng

    Abstract: The mathematical formula is the human language to describe nature and is the essence of scientific research. Finding mathematical formulas from observational data is a major demand of scientific research and a major challenge of artificial intelligence. This area is called symbolic regression. Originally symbolic regression was often formulated as a combinatorial optimization problem and solved us… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 21 pages

  12. arXiv:2404.04175  [pdf, other

    physics.soc-ph cond-mat.dis-nn cond-mat.stat-mech

    Interplay of network structure and talent configuration on wealth dynamics

    Authors: Jaeseok Hur, Meesoon Ha, Hawoong Jeong

    Abstract: The economic success of individuals is often determined by a combination of talent, luck, and assistance from others. We introduce a new agent-based model that simultaneously considers talent, luck, and social interaction. This model allows us to explore how network structure (how agents interact) and talent distribution among agents affect the dynamics of capital accumulation through analytical a… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: 11 pages, 7 figures, and 1 table (+ combined with Supplemental Material: 8 pages, 8 figures, and 1 table)

  13. arXiv:2403.04264  [pdf, other

    cs.AI

    Competitive Facility Location under Random Utilities and Routing Constraints

    Authors: Hoang Giang Pham, Tien Thanh Dam, Ngan Ha Duong, Tien Mai, Minh Hoang Ha

    Abstract: In this paper, we study a facility location problem within a competitive market context, where customer demand is predicted by a random utility choice model. Unlike prior research, which primarily focuses on simple constraints such as a cardinality constraint on the number of selected locations, we introduce routing constraints that necessitate the selection of locations in a manner that guarantee… ▽ More

    Submitted 9 March, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

  14. arXiv:2402.18603  [pdf, other

    cs.LG cs.AI cs.CL

    MMSR: Symbolic Regression is a Multimodal Task

    Authors: Yanjie Li, Jingyi Liu, Weijun Li, Lina Yu, Min Wu, Wenqiang Li, Meilan Hao, Su Wei, Yusong Deng

    Abstract: Mathematical formulas are the crystallization of human wisdom in exploring the laws of nature for thousands of years. Describing the complex laws of nature with a concise mathematical formula is a constant pursuit of scientists and a great challenge for artificial intelligence. This field is called symbolic regression. Symbolic regression was originally formulated as a combinatorial optimization p… ▽ More

    Submitted 14 March, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: 12 page

  15. arXiv:2402.14497  [pdf, other

    cond-mat.mtrl-sci cond-mat.soft

    A Sparse Bayesian Committee Machine Potential for Hydrocarbons

    Authors: Soohaeng Yoo Willow, Gyung Su Kim, Miran Ha, Amir Hajibabaei, Chang Woo Myung

    Abstract: Accurate and scalable universal interatomic potentials are key for understanding material properties at the atomic level, a task often hindered by the steep computational scaling. Although recent developments of machine learning potential has made significant progress, the flexibility and expansion to a wide range of compounds within a single model seems still challenging to build in particular fo… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: 8 pages, 6 figures

  16. arXiv:2402.13718  [pdf, other

    cs.CL

    $\infty$Bench: Extending Long Context Evaluation Beyond 100K Tokens

    Authors: Xinrong Zhang, Yingfa Chen, Shengding Hu, Zihang Xu, Junhao Chen, Moo Khai Hao, Xu Han, Zhen Leng Thai, Shuo Wang, Zhiyuan Liu, Maosong Sun

    Abstract: Processing and reasoning over long contexts is crucial for many practical applications of Large Language Models (LLMs), such as document comprehension and agent construction. Despite recent strides in making LLMs process contexts with more than 100K tokens, there is currently a lack of a standardized benchmark to evaluate this long-context capability. Existing public benchmarks typically focus on… ▽ More

    Submitted 24 February, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Journal ref: 2023.12.15ARR

  17. arXiv:2402.12175  [pdf, other

    cs.LG cs.NE

    Learning Discretized Bayesian Networks with GOMEA

    Authors: Damy M. F. Ha, Tanja Alderliesten, Peter A. N. Bosman

    Abstract: Bayesian networks model relationships between random variables under uncertainty and can be used to predict the likelihood of events and outcomes while incorporating observed evidence. From an eXplainable AI (XAI) perspective, such models are interesting as they tend to be compact. Moreover, captured relations can be directly inspected by domain experts. In practice, data is often real-valued. Unl… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: The code is available at: https://github.com/damyha/dbn_gomea

  18. arXiv:2402.10937  [pdf

    cs.AR cs.AI cs.CE cs.GT cs.LG

    A Lightweight Inception Boosted U-Net Neural Network for Routability Prediction

    Authors: Hailiang Li, Yan Huo, Yan Wang, Xu Yang, Miaohui Hao, Xiao Wang

    Abstract: As the modern CPU, GPU, and NPU chip design complexity and transistor counts keep increasing, and with the relentless shrinking of semiconductor technology nodes to nearly 1 nanometer, the placement and routing have gradually become the two most pivotal processes in modern very-large-scale-integrated (VLSI) circuit back-end design. How to evaluate routability efficiently and accurately in advance… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: The paper is submitted to the International Symposium of EDA (2024, XiAn, China)

  19. arXiv:2401.15103  [pdf, other

    cs.LG cs.AI

    PruneSymNet: A Symbolic Neural Network and Pruning Algorithm for Symbolic Regression

    Authors: Min Wu, Weijun Li, Lina Yu, Wenqiang Li, Jingyi Liu, Yanjie Li, Meilan Hao

    Abstract: Symbolic regression aims to derive interpretable symbolic expressions from data in order to better understand and interpret data. %which plays an important role in knowledge discovery and interpretable machine learning. In this study, a symbolic network called PruneSymNet is proposed for symbolic regression. This is a novel neural network whose activation function consists of common elementary f… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

  20. arXiv:2401.14424  [pdf, other

    cs.LG cs.AI

    Discovering Mathematical Formulas from Data via GPT-guided Monte Carlo Tree Search

    Authors: Yanjie Li, Weijun Li, Lina Yu, Min Wu, Jingyi Liu, Wenqiang Li, Meilan Hao, Shu Wei, Yusong Deng

    Abstract: Finding a concise and interpretable mathematical formula that accurately describes the relationship between each variable and the predicted value in the data is a crucial task in scientific research, as well as a significant challenge in artificial intelligence. This problem is referred to as symbolic regression, which is an NP-hard problem. In the previous year, a novel symbolic regression method… ▽ More

    Submitted 30 January, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: 24 pages

  21. arXiv:2401.04246  [pdf, other

    cs.LG q-bio.BM

    Scalable Normalizing Flows Enable Boltzmann Generators for Macromolecules

    Authors: Joseph C. Kim, David Bloore, Karan Kapoor, Jun Feng, Ming-Hong Hao, Mengdi Wang

    Abstract: The Boltzmann distribution of a protein provides a roadmap to all of its functional states. Normalizing flows are a promising tool for modeling this distribution, but current methods are intractable for typical pharmacological targets; they become computationally intractable due to the size of the system, heterogeneity of intra-molecular potential energy, and long-range interactions. To remedy the… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

  22. arXiv:2401.03968  [pdf, other

    q-bio.QM cs.LG q-bio.GN

    scDiffusion: conditional generation of high-quality single-cell data using diffusion model

    Authors: Erpai Luo, Minsheng Hao, Lei Wei, Xuegong Zhang

    Abstract: Single-cell RNA sequencing (scRNA-seq) data are important for studying the laws of life at single-cell level. However, it is still challenging to obtain enough high-quality scRNA-seq data. To mitigate the limited availability of data, generative models have been proposed to computationally generate synthetic scRNA-seq data. Nevertheless, the data generated with current models are not very realisti… ▽ More

    Submitted 4 March, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

  23. arXiv:2401.01772  [pdf, other

    cs.AI cs.NI

    A Novel Paradigm for Neural Computation: X-Net with Learnable Neurons and Adaptable Structure

    Authors: Yanjie Li, Weijun Li, Lina Yu, Min Wu, Jinyi Liu, Wenqiang Li, Meilan Hao, Shu Wei, Yusong Deng, Liping Zhang, Xiaoli Dong, Hong Qin, Xin Ning, Yugui Zhang, Baoli Lu, Jian Xu, Shuang Li

    Abstract: Multilayer perception (MLP) has permeated various disciplinary domains, ranging from bioinformatics to financial analytics, where their application has become an indispensable facet of contemporary scientific research endeavors. However, MLP has obvious drawbacks. 1), The type of activation function is single and relatively fixed, which leads to poor `representation ability' of the network, and it… ▽ More

    Submitted 12 July, 2024; v1 submitted 3 January, 2024; originally announced January 2024.

    Comments: 35 pages

  24. arXiv:2311.15156  [pdf, other

    cs.LG cs.AI q-bio.GN

    xTrimoGene: An Efficient and Scalable Representation Learner for Single-Cell RNA-Seq Data

    Authors: Jing Gong, Minsheng Hao, Xingyi Cheng, Xin Zeng, Chiming Liu, Jianzhu Ma, Xuegong Zhang, Taifeng Wang, Le Song

    Abstract: Advances in high-throughput sequencing technology have led to significant progress in measuring gene expressions at the single-cell level. The amount of publicly available single-cell RNA-seq (scRNA-seq) data is already surpassing 50M records for humans with each record measuring 20,000 genes. This highlights the need for unsupervised representation learning to fully ingest these data, yet classic… ▽ More

    Submitted 24 February, 2024; v1 submitted 25 November, 2023; originally announced November 2023.

    Comments: Accepted by NeurIPS 2023

  25. arXiv:2311.07326  [pdf, other

    cs.LG cs.AI

    MetaSymNet: A Dynamic Symbolic Regression Network Capable of Evolving into Arbitrary Formulations

    Authors: Yanjie Li, Weijun Li, Lina Yu, Min Wu, Jinyi Liu, Wenqiang Li, Meilan Hao, Shu Wei, Yusong Deng

    Abstract: Mathematical formulas serve as the means of communication between humans and nature, encapsulating the operational laws governing natural phenomena. The concise formulation of these laws is a crucial objective in scientific research and an important challenge for artificial intelligence (AI). While traditional artificial neural networks (MLP) excel at data fitting, they often yield uninterpretable… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: 16 pages

  26. arXiv:2311.04760  [pdf, other

    cs.IR cs.LG

    Towards Open-world Cross-Domain Sequential Recommendation: A Model-Agnostic Contrastive Denoising Approach

    Authors: Wujiang Xu, Xuying Ning, Wenfang Lin, Mingming Ha, Qiongxu Ma, Qianqiao Liang, Xuewen Tao, Linxun Chen, Bing Han, Minnan Luo

    Abstract: Cross-domain sequential recommendation (CDSR) aims to address the data sparsity problems that exist in traditional sequential recommendation (SR) systems. The existing approaches aim to design a specific cross-domain unit that can transfer and propagate information across multiple domains by relying on overlapping users with abundant behaviors. However, in real-world recommender systems, CDSR sc… ▽ More

    Submitted 5 June, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

  27. Rethinking Cross-Domain Sequential Recommendation under Open-World Assumptions

    Authors: Wujiang Xu, Qitian Wu, Runzhong Wang, Mingming Ha, Qiongxu Ma, Linxun Chen, Bing Han, Junchi Yan

    Abstract: Cross-Domain Sequential Recommendation (CDSR) methods aim to tackle the data sparsity and cold-start problems present in Single-Domain Sequential Recommendation (SDSR). Existing CDSR works design their elaborate structures relying on overlapping users to propagate the cross-domain information. However, current CDSR methods make closed-world assumptions, assuming fully overlapping users across mult… ▽ More

    Submitted 12 April, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

    Journal ref: Proceedings of the ACM Web Conference 2024 (WWW '24)

  28. arXiv:2309.13705  [pdf, other

    cs.LG cs.AI

    A Neural-Guided Dynamic Symbolic Network for Exploring Mathematical Expressions from Data

    Authors: Wenqiang Li, Weijun Li, Lina Yu, Min Wu, Linjun Sun, Jingyi Liu, Yanjie Li, Shu Wei, Yusong Deng, Meilan Hao

    Abstract: Symbolic regression (SR) is a powerful technique for discovering the underlying mathematical expressions from observed data. Inspired by the success of deep learning, recent deep generative SR methods have shown promising results. However, these methods face difficulties in processing high-dimensional problems and learning constants due to the large search space, and they don't scale well to unsee… ▽ More

    Submitted 1 June, 2024; v1 submitted 24 September, 2023; originally announced September 2023.

    Comments: This paper has been accepted by ICML 2024

  29. arXiv:2309.10361  [pdf, other

    cs.CV cs.LG cs.MM

    Improving CLIP Robustness with Knowledge Distillation and Self-Training

    Authors: Clement Laroudie, Andrei Bursuc, Mai Lan Ha, Gianni Franchi

    Abstract: This paper examines the robustness of a multi-modal computer vision model, CLIP (Contrastive Language-Image Pretraining), in the context of unsupervised learning. The main objective is twofold: first, to evaluate the robustness of CLIP, and second, to explore strategies for augmenting its robustness. To achieve this, we introduce a novel approach named LP-CLIP. This technique involves the distilla… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

  30. arXiv:2308.04823  [pdf

    cs.CL

    Evaluating the Generation Capabilities of Large Chinese Language Models

    Authors: Hui Zeng, Jingyuan Xue, Meng Hao, Chen Sun, Bin Ning, Na Zhang

    Abstract: This paper unveils CG-Eval, the first-ever comprehensive and automated evaluation framework designed for assessing the generative capabilities of large Chinese language models across a spectrum of academic disciplines. CG-Eval stands out for its automated process, which critically assesses models based on their proficiency in generating precise and contextually relevant responses to a diverse arra… ▽ More

    Submitted 29 January, 2024; v1 submitted 9 August, 2023; originally announced August 2023.

  31. arXiv:2308.02870  [pdf, other

    cs.CL cs.SD eess.AS

    ApproBiVT: Lead ASR Models to Generalize Better Using Approximated Bias-Variance Tradeoff Guided Early Stopping and Checkpoint Averaging

    Authors: Fangyuan Wang, Ming Hao, Yuhai Shi, Bo Xu

    Abstract: The conventional recipe for Automatic Speech Recognition (ASR) models is to 1) train multiple checkpoints on a training set while relying on a validation set to prevent overfitting using early stopping and 2) average several last checkpoints or that of the lowest validation losses to obtain the final model. In this paper, we rethink and update the early stopping and checkpoint averaging from the p… ▽ More

    Submitted 5 August, 2023; originally announced August 2023.

  32. arXiv:2307.01293  [pdf, other

    physics.soc-ph cond-mat.dis-nn cond-mat.stat-mech

    Hidden multiscale organization and robustness of real multiplex networks

    Authors: Gangmin Son, Meesoon Ha, Hawoong Jeong

    Abstract: Hidden geometry enables the investigation of complex networks at different scales. Extending this framework to multiplex networks, we uncover a novel kind of mesoscopic organization in real multiplex systems, named $\textit{clan}$, a group of nodes that preserve their local geometric arrangement across layers. Furthermore, we reveal the intimate relationship between the unfolding of clan structure… ▽ More

    Submitted 6 February, 2024; v1 submitted 3 July, 2023; originally announced July 2023.

    Journal ref: Phys. Rev. E 109, 024301 (2024)

  33. arXiv:2306.04192  [pdf, other

    cs.CR

    Extracting Cloud-based Model with Prior Knowledge

    Authors: Shiqian Zhao, Kangjie Chen, Meng Hao, Jian Zhang, Guowen Xu, Hongwei Li, Tianwei Zhang

    Abstract: Machine Learning-as-a-Service, a pay-as-you-go business pattern, is widely accepted by third-party users and developers. However, the open inference APIs may be utilized by malicious customers to conduct model extraction attacks, i.e., attackers can replicate a cloud-based black-box model merely via querying malicious examples. Existing model extraction attacks mainly depend on the posterior knowl… ▽ More

    Submitted 13 June, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

  34. arXiv:2305.19569  [pdf

    cs.LG cs.AI cs.CY eess.SP

    Domain knowledge-informed Synthetic fault sample generation with Health Data Map for cross-domain Planetary Gearbox Fault Diagnosis

    Authors: Jong Moon Ha, Olga Fink

    Abstract: Extensive research has been conducted on fault diagnosis of planetary gearboxes using vibration signals and deep learning (DL) approaches. However, DL-based methods are susceptible to the domain shift problem caused by varying operating conditions of the gearbox. Although domain adaptation and data synthesis methods have been proposed to overcome such domain shifts, they are often not directly app… ▽ More

    Submitted 26 November, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: Under review / added arXiv identifier / Updated to revised version

    Journal ref: Published in Mechanical Systems and Signal Processing Volume 202, 1 November 2023, 110680

  35. Blockchain-enabled Parametric Solar Energy Insurance via Remote Sensing

    Authors: Mingyu Hao, Keyang Qian, Sid Chi-Kin Chau

    Abstract: Despite its popularity, the nature of solar energy is highly uncertain and weather dependent, affecting the business viability and investment of solar energy generation, especially for household users. To stabilize the income from solar energy generation, there have been limited traditional options, such as using energy storage to pool excessive solar energy in off-peak periods or financial deriva… ▽ More

    Submitted 17 May, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: To appear in ACM e-Energy 2023

  36. arXiv:2305.08384  [pdf, other

    cs.CR cs.NI

    Privacy-preserving Blockchain-enabled Parametric Insurance via Remote Sensing and IoT

    Authors: Mingyu Hao, Keyang Qian, Sid Chi-Kin Chau

    Abstract: Traditional Insurance, a popular approach of financial risk management, has suffered from the issues of high operational costs, opaqueness, inefficiency and a lack of trust. Recently, blockchain-enabled "parametric insurance" through authorized data sources (e.g., remote sensing and IoT) aims to overcome these issues by automating the underwriting and claim processes of insurance policies on a blo… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

  37. arXiv:2305.07862  [pdf

    eess.SY

    Research on Cooperative Search Technology of Heterogeneous UAVs in Complex Environments

    Authors: Zhenchang Liu, Mingrui Hao

    Abstract: This paper studies heterogeneous UAVs cooperative search technology suitable for complex environments. In the application, a fixed-wing UAV drops rotor UAVs to deploy the cluster rapidly. Meanwhile, the fixed-wing UAV works as a communication relay node to improve the cooperative search performance of the cluster further. Aiming at the cooperative search requirements of heterogeneous UAVs, a jumpi… ▽ More

    Submitted 10 November, 2023; v1 submitted 13 May, 2023; originally announced May 2023.

    Comments: 26 pages, 26 figures

    ACM Class: F.2.1

  38. arXiv:2305.05263  [pdf

    astro-ph.EP cond-mat.mtrl-sci physics.geo-ph

    Evidence of a hydrated mineral enriched in water and ammonium molecules in the Chang'e-5 lunar sample

    Authors: Shifeng Jin, Munan Hao, Zhongnan Guo, Bohao Yin, Yuxin Ma, Lijun Deng, Xu Chen, Yanpeng Song, Cheng Cao, Congcong Chai, Yunqi Ma, Jiangang Guo, Xiaolong Chen

    Abstract: The presence and distribution of water on the Moon are fundamental to our understanding of the Earth-Moon system. Despite extensive research and remote detection, the origin and chemical form of lunar water (H2O) have remained elusive. In this study, we present the discovery of a hydrated mineral, (NH4)MgCl3*6H2O, in lunar soil samples returned by the Chang'e-5 mission, containing approximately 41… ▽ More

    Submitted 28 June, 2024; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: 28 pages, 4 figures

  39. arXiv:2304.02472  [pdf, other

    q-fin.RM cs.LG q-fin.TR

    Learning to Predict Short-Term Volatility with Order Flow Image Representation

    Authors: Artem Lensky, Mingyu Hao

    Abstract: Introduction: The paper addresses the challenging problem of predicting the short-term realized volatility of the Bitcoin price using order flow information. The inherent stochastic nature and anti-persistence of price pose difficulties in accurate prediction. Methods: To address this, we propose a method that transforms order flow data over a fixed time interval (snapshots) into images. The ord… ▽ More

    Submitted 20 March, 2024; v1 submitted 4 April, 2023; originally announced April 2023.

  40. arXiv:2303.05565  [pdf, other

    cs.RO eess.SY

    Towards Generalized Robot Assembly through Compliance-Enabled Contact Formations

    Authors: Andrew S. Morgan, Quentin Bateux, Mei Hao, Aaron M. Dollar

    Abstract: Contact can be conceptualized as a set of constraints imposed on two bodies that are interacting with one another in some way. The nature of a contact, whether a point, line, or surface, dictates how these bodies are able to move with respect to one another given a force, and a set of contacts can provide either partial or full constraint on a body's motion. Decades of work have explored how to ex… ▽ More

    Submitted 9 March, 2023; originally announced March 2023.

    Journal ref: IEEE International Conference on Robotics and Automation (ICRA), 2023

  41. arXiv:2302.05919  [pdf, other

    cs.IR

    Neural Node Matching for Multi-Target Cross Domain Recommendation

    Authors: Wujiang Xu, Shaoshuai Li, Mingming Ha, Xiaobo Guo, Qiongxu Ma, Xiaolei Liu, Linxun Chen, Zhenfeng Zhu

    Abstract: Multi-Target Cross Domain Recommendation(CDR) has attracted a surge of interest recently, which intends to improve the recommendation performance in multiple domains (or systems) simultaneously. Most existing multi-target CDR frameworks primarily rely on the existence of the majority of overlapped users across domains. However, general practical CDR scenarios cannot meet the strictly overlapping r… ▽ More

    Submitted 12 February, 2023; originally announced February 2023.

    Comments: 13pages

    Journal ref: The IEEE International Conference on Data Engineering 2023

  42. arXiv:2302.05114  [pdf

    cs.CV

    Exploiting Neighborhood Structural Features for Change Detection

    Authors: Mengmeng Wang, Zhiqiang Han, Peizhen Yang, Bai Zhu, Ming Hao, Jianwei Fan, Yuanxin Ye

    Abstract: In this letter, a novel method for change detection is proposed using neighborhood structure correlation. Because structure features are insensitive to the intensity differences between bi-temporal images, we perform the correlation analysis on structure features rather than intensity information. First, we extract the structure feature maps by using multi-orientated gradient information. Then, th… ▽ More

    Submitted 10 February, 2023; originally announced February 2023.

  43. arXiv:2302.03731  [pdf, other

    cs.LG q-bio.QM

    MMA-RNN: A Multi-level Multi-task Attention-based Recurrent Neural Network for Discrimination and Localization of Atrial Fibrillation

    Authors: Yifan Sun, Jingyan Shen, Yunfan Jiang, Zhaohui Huang, Minsheng Hao, Xuegong Zhang

    Abstract: The automatic detection of atrial fibrillation based on electrocardiograph (ECG) signals has received wide attention both clinically and practically. It is challenging to process ECG signals with cyclical pattern, varying length and unstable quality due to noise and distortion. Besides, there has been insufficient research on separating persistent atrial fibrillation from paroxysmal atrial fibrill… ▽ More

    Submitted 8 February, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: 9 pages, 5 figures

  44. Primal-Dual Cops and Robber

    Authors: Minh Tuan Ha, Paul Jungeblut, Torsten Ueckerdt, Paweł Żyliński

    Abstract: Cops and Robber is a family of two-player games played on graphs in which one player controls a number of cops and the other player controls a robber. In alternating turns, each player moves (all) their figures. The cops try to capture the robber while the latter tries to flee indefinitely. In this paper we consider a variant of the game played on a planar graph where the robber moves between adja… ▽ More

    Submitted 10 January, 2024; v1 submitted 13 January, 2023; originally announced January 2023.

    Comments: Equal to the published version

    Journal ref: Computing in Geometry and Topology, 3(2), 4:1-4:12 (2024)

  45. arXiv:2301.02494  [pdf, other

    cs.LG cs.AI

    Adaptive Pattern Extraction Multi-Task Learning for Multi-Step Conversion Estimations

    Authors: Xuewen Tao, Mingming Ha, Xiaobo Guo, Qiongxu Ma, Hongwei Cheng, Wenfang Lin

    Abstract: Multi-task learning (MTL) has been successfully used in many real-world applications, which aims to simultaneously solve multiple tasks with a single model. The general idea of multi-task learning is designing kinds of global parameter sharing mechanism and task-specific feature extractor to improve the performance of all tasks. However, challenge still remains in balancing the trade-off of variou… ▽ More

    Submitted 23 January, 2023; v1 submitted 6 January, 2023; originally announced January 2023.

    Comments: 18 pages, 9 figures

  46. arXiv:2212.00024  [pdf, other

    cs.LG cs.AI

    Semi-Supervised Heterogeneous Graph Learning with Multi-level Data Augmentation

    Authors: Ying Chen, Siwei Qiang, Mingming Ha, Xiaolei Liu, Shaoshuai Li, Lingfeng Yuan, Xiaobo Guo, Zhenfeng Zhu

    Abstract: In recent years, semi-supervised graph learning with data augmentation (DA) is currently the most commonly used and best-performing method to enhance model robustness in sparse scenarios with few labeled samples. Differing from homogeneous graph, DA in heterogeneous graph has greater challenges: heterogeneity of information requires DA strategies to effectively handle heterogeneous relations, whic… ▽ More

    Submitted 30 November, 2022; originally announced December 2022.

  47. arXiv:2211.07166  [pdf, other

    cs.LG cs.CR cs.DC

    Optimal Privacy Preserving for Federated Learning in Mobile Edge Computing

    Authors: Hai M. Nguyen, Nam H. Chu, Diep N. Nguyen, Dinh Thai Hoang, Van-Dinh Nguyen, Minh Hoang Ha, Eryk Dutkiewicz, Marwan Krunz

    Abstract: Federated Learning (FL) with quantization and deliberately added noise over wireless networks is a promising approach to preserve user differential privacy (DP) while reducing wireless resources. Specifically, an FL process can be fused with quantized Binomial mechanism-based updates contributed by multiple users. However, optimizing quantization parameters, communication resources (e.g., transmit… ▽ More

    Submitted 20 May, 2023; v1 submitted 14 November, 2022; originally announced November 2022.

    Comments: 16 pages, 10 figures

  48. arXiv:2211.05405  [pdf, other

    cs.CV cs.CL

    VieCap4H-VLSP 2021: ObjectAoA-Enhancing performance of Object Relation Transformer with Attention on Attention for Vietnamese image captioning

    Authors: Nghia Hieu Nguyen, Duong T. D. Vo, Minh-Quan Ha

    Abstract: Image captioning is currently a challenging task that requires the ability to both understand visual information and use human language to describe this visual information in the image. In this paper, we propose an efficient way to improve the image understanding ability of transformer-based method by extending Object Relation Transformer architecture with Attention on Attention mechanism. Experim… ▽ More

    Submitted 20 March, 2023; v1 submitted 10 November, 2022; originally announced November 2022.

    Comments: Accepted for publishing at the VNU Journal of Science: Computer Science and Communication Engineering

  49. arXiv:2207.14753  [pdf, other

    stat.ME

    Estimating Causal Effects with Hidden Confounding using Instrumental Variables and Environments

    Authors: James P. Long, Hongxu Zhu, Kim-Anh Do, Min Jin Ha

    Abstract: Recent works have proposed regression models which are invariant across data collection environments. These estimators often have a causal interpretation under conditions on the environments and type of invariance imposed. One recent example, the Causal Dantzig (CD), is consistent under hidden confounding and represents an alternative to classical instrumental variable estimators such as Two Stage… ▽ More

    Submitted 9 November, 2023; v1 submitted 29 July, 2022; originally announced July 2022.

    Comments: 32 pages, 7 figures, 4 tables

  50. arXiv:2207.08221  [pdf, ps, other

    math.CO

    Expanders on matrices over a finite chain ring, I

    Authors: Dung M. Ha, Hieu T. Ngo

    Abstract: In this work and its sequel, we study the expanding phenomenon of matrices over a finite chain ring of large residue field. A sum-product estimate is proved. It is showed that $x+yz$ is a moderate expander on $n\times n$ matrices with exponent $\frac{n+1}{6}$. These results generalise the main theorems in a recent work of Xie and Ge. The proofs use spectral graph theory and elementary divisor theo… ▽ More

    Submitted 17 July, 2022; originally announced July 2022.

    Comments: 20 pages

    MSC Class: 11T30; 05C50