Skip to main content

Showing 1–50 of 74 results for author: Bu, Y

  1. arXiv:2406.12243  [pdf, other

    cs.IR cs.AI

    CherryRec: Enhancing News Recommendation Quality via LLM-driven Framework

    Authors: Shaohuang Wang, Lun Wang, Yunhan Bu, Tianwei Huang

    Abstract: Large Language Models (LLMs) have achieved remarkable progress in language understanding and generation. Custom LLMs leveraging textual features have been applied to recommendation systems, demonstrating improvements across various recommendation scenarios. However, most existing methods perform untrained recommendation based on pre-trained knowledge (e.g., movie recommendation), and the auto-regr… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2404.15799  [pdf

    cs.DL

    Towards the relationship between AIGC in manuscript writing and author profiles: evidence from preprints in LLMs

    Authors: Jialin Liu, Yi Bu

    Abstract: AIGC tools such as ChatGPT have profoundly changed scientific research, leading to widespread attention on its use on academic writing. Leveraging preprints from large language models, this study examined the use of AIGC in manuscript writing and its correlation with author profiles. We found that: (1) since the release of ChatGPT, the likelihood of abstracts being AI-generated has gradually incre… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: 8 pages, 4 figures, 1 table

    MSC Class: J.0

  3. Joint Sparsity Pattern Learning Based Channel Estimation for Massive MIMO-OTFS Systems

    Authors: Kuo Meng, Shaoshi Yang, Xiao-Yang Wang, Yan Bu, Yurong Tang, Jianhua Zhang, Lajos Hanzo

    Abstract: We propose a channel estimation scheme based on joint sparsity pattern learning (JSPL) for massive multi-input multi-output (MIMO) orthogonal time-frequency-space (OTFS) modulation aided systems. By exploiting the potential joint sparsity of the delay-Doppler-angle (DDA) domain channel, the channel estimation problem is transformed into a sparse recovery problem. To solve it, we first apply the sp… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 6 pages, 6 figures, accepted to appear on IEEE Transactions on Vehicular Technology, Mar. 2024

  4. arXiv:2402.08936  [pdf, other

    cs.CV

    Predictive Temporal Attention on Event-based Video Stream for Energy-efficient Situation Awareness

    Authors: Yiming Bu, Jiayang Liu, Qinru Qiu

    Abstract: The Dynamic Vision Sensor (DVS) is an innovative technology that efficiently captures and encodes visual information in an event-driven manner. By combining it with event-driven neuromorphic processing, the sparsity in DVS camera output can result in high energy efficiency. However, similar to many embedded systems, the off-chip communication between the camera and processor presents a bottleneck… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

  5. arXiv:2402.06160  [pdf, other

    cs.LG stat.ML

    Are Uncertainty Quantification Capabilities of Evidential Deep Learning a Mirage?

    Authors: Maohao Shen, J. Jon Ryu, Soumya Ghosh, Yuheng Bu, Prasanna Sattigeri, Subhro Das, Gregory W. Wornell

    Abstract: This paper questions the effectiveness of a modern predictive uncertainty quantification approach, called \emph{evidential deep learning} (EDL), in which a single neural network model is trained to learn a meta distribution over the predictive distribution by minimizing a specific objective function. Despite their perceived strong empirical performance on downstream tasks, a line of recent studies… ▽ More

    Submitted 12 June, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: 29 pages, 12 figures

  6. arXiv:2402.03655  [pdf, other

    cs.LG math.NA stat.ML

    Operator SVD with Neural Networks via Nested Low-Rank Approximation

    Authors: J. Jon Ryu, Xiangxiang Xu, H. S. Melihcan Erol, Yuheng Bu, Lizhong Zheng, Gregory W. Wornell

    Abstract: Computing eigenvalue decomposition (EVD) of a given linear operator, or finding its leading eigenvalues and eigenfunctions, is a fundamental task in many machine learning and scientific computing problems. For high-dimensional eigenvalue problems, training neural networks to parameterize the eigenfunctions is considered as a promising alternative to the classical numerical linear algebra technique… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 44 pages, 7 figures

  7. arXiv:2401.13927  [pdf, other

    cs.CL

    Adaptive Text Watermark for Large Language Models

    Authors: Yepeng Liu, Yuheng Bu

    Abstract: The advancement of Large Language Models (LLMs) has led to increasing concerns about the misuse of AI-generated text, and watermarking for LLM-generated text has emerged as a potential solution. However, it is challenging to generate high-quality watermarked text while maintaining strong security, robustness, and the ability to detect watermarks without prior knowledge of the prompt or model. This… ▽ More

    Submitted 8 June, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: ICML2024

  8. arXiv:2401.04900  [pdf, other

    astro-ph.SR astro-ph.IM cs.LG stat.ML

    SPT: Spectral Transformer for Red Giant Stars Age and Mass Estimation

    Authors: Mengmeng Zhang, Fan Wu, Yude Bu, Shanshan Li, Zhenping Yi, Meng Liu, Xiaoming Kong

    Abstract: The age and mass of red giants are essential for understanding the structure and evolution of the Milky Way. Traditional isochrone methods for these estimations are inherently limited due to overlapping isochrones in the Hertzsprung-Russell diagram, while asteroseismology, though more precise, requires high-precision, long-term observations. In response to these challenges, we developed a novel fr… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

    Comments: Accepted by A&A

  9. arXiv:2401.02904  [pdf, other

    cs.LG stat.ML

    Class-wise Generalization Error: an Information-Theoretic Analysis

    Authors: Firas Laakom, Yuheng Bu, Moncef Gabbouj

    Abstract: Existing generalization theories of supervised learning typically take a holistic approach and provide bounds for the expected generalization over the whole data distribution, which implicitly assumes that the model generalizes similarly for all the classes. In practice, however, there are significant variations in generalization performance among different classes, which cannot be captured by the… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: 26 pages

  10. arXiv:2310.09453   

    cs.SI

    Effects of Same-Race Mentorship Preferences on Academic Performance and Survival

    Authors: Meijun Liu, Yi Bu, Daifeng Li, Ying Ding, Daniel E. Acuna

    Abstract: Same-race mentorship preference refers to mentors or mentees forming connections significantly influenced by a shared race. Although racial diversity in science has been well-studied and linked to favorable outcomes, the extent and effects of same-race mentorship preferences remain largely underexplored. Here, we analyze 465,355 mentor-mentee pairs from more than 60 research areas over the last 70… ▽ More

    Submitted 4 May, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: 1. After further evaluating the race prediction method, we observed unsatisfactory accuracy and F1 scores. The study's findings could be impacted by these subpar predictions. 2. Our study incorporates both US and non-US samples, revealing that non-US samples may introduce outliers and distort the results. We recognize that the study's findings and conclusions might be affected by data quality

  11. arXiv:2310.04945  [pdf, other

    cs.CL cs.AI

    Balancing Specialized and General Skills in LLMs: The Impact of Modern Tuning and Data Strategy

    Authors: Zheng Zhang, Chen Zheng, Da Tang, Ke Sun, Yukun Ma, Yingtong Bu, Xun Zhou, Liang Zhao

    Abstract: This paper introduces a multifaceted methodology for fine-tuning and evaluating large language models (LLMs) for specialized monetization tasks. The goal is to balance general language proficiency with domain-specific skills. The methodology has three main components: 1) Carefully blending in-domain and general-purpose data during fine-tuning to achieve an optimal balance between general and speci… ▽ More

    Submitted 7 October, 2023; originally announced October 2023.

  12. arXiv:2307.10198  [pdf

    cs.AI

    Has China caught up to the US in AI research? An exploration of mimetic isomorphism as a model for late industrializers

    Authors: Chao Min, Yi Zhao, Yi Bu, Ying Ding, Caroline S. Wagner

    Abstract: Artificial Intelligence (AI), a cornerstone of 21st-century technology, has seen remarkable growth in China. In this paper, we examine China's AI development process, demonstrating that it is characterized by rapid learning and differentiation, surpassing the export-oriented growth propelled by Foreign Direct Investment seen in earlier Asian industrializers. Our data indicates that China current… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

  13. What makes a successful rebuttal in computer science conferences? : A perspective on social interaction

    Authors: Junjie Huang, Win-bin Huang, Yi Bu, Qi Cao, Huawei Shen, Xueqi Cheng

    Abstract: With an exponential increase in submissions to top-tier Computer Science (CS) conferences, more and more conferences have introduced a rebuttal stage to the conference peer review process. The rebuttal stage can be modeled as social interactions between authors and reviewers. A successful rebuttal often results in an increased review score after the rebuttal stage. In this paper, we conduct an emp… ▽ More

    Submitted 21 July, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

    Journal ref: Volume 17, Issue 3, August 2023, 101427, Journal of Informetrics

  14. arXiv:2306.15804  [pdf

    physics.soc-ph cs.CY

    The Impact of Heterogeneous Shared Leadership in Scientific Teams

    Authors: Huimin Xu, Meijun Liu, Yi Bu, Shujing Sun, Yi Zhang, Chenwei Zhang, Daniel E. Acuna, Steven Gray, Eric Meyer, Ying Ding

    Abstract: Leadership is evolving dynamically from an individual endeavor to shared efforts. This paper aims to advance our understanding of shared leadership in scientific teams. We define three kinds of leaders, junior (10-15), mid (15-20), and senior (20+) based on career age. By considering the combinations of any two leaders, we distinguish shared leadership as heterogeneous when leaders are in differen… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

  15. arXiv:2306.05583  [pdf, other

    cs.LG cs.IT

    Gibbs-Based Information Criteria and the Over-Parameterized Regime

    Authors: Haobo Chen, Yuheng Bu, Gregory W. Wornell

    Abstract: Double-descent refers to the unexpected drop in test loss of a learning algorithm beyond an interpolating threshold with over-parameterization, which is not predicted by information criteria in their classical forms due to the limitations in the standard asymptotic approach. We update these analyses using the information risk minimization framework and provide Akaike Information Criterion (AIC) an… ▽ More

    Submitted 13 November, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

  16. arXiv:2305.20074  [pdf, other

    cs.CV cs.AI cs.IT cs.LG

    Feature Learning in Image Hierarchies using Functional Maximal Correlation

    Authors: Bo Hu, Yuheng Bu, José C. Príncipe

    Abstract: This paper proposes the Hierarchical Functional Maximal Correlation Algorithm (HFMCA), a hierarchical methodology that characterizes dependencies across two hierarchical levels in multiview systems. By framing view similarities as dependencies and ensuring contrastivity by imposing orthonormality, HFMCA achieves faster convergence and increased stability in self-supervised learning. HFMCA defines… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

  17. arXiv:2305.08207  [pdf, other

    eess.SP cs.IT math.ST

    A Bilateral Bound on the Mean-Square Error for Estimation in Model Mismatch

    Authors: Amir Weiss, Alejandro Lancho, Yuheng Bu, Gregory W. Wornell

    Abstract: A bilateral (i.e., upper and lower) bound on the mean-square error under a general model mismatch is developed. The bound, which is derived from the variational representation of the chi-square divergence, is applicable in the Bayesian and nonBayesian frameworks to biased and unbiased estimators. Unlike other classical MSE bounds that depend only on the model, our bound is also estimator-dependent… ▽ More

    Submitted 14 May, 2023; originally announced May 2023.

    Comments: Accepted for publication in Proc. of ISIT 2023

  18. arXiv:2305.00593  [pdf, other

    cs.LG cs.CL

    Reliable Gradient-free and Likelihood-free Prompt Tuning

    Authors: Maohao Shen, Soumya Ghosh, Prasanna Sattigeri, Subhro Das, Yuheng Bu, Gregory Wornell

    Abstract: Due to privacy or commercial constraints, large pre-trained language models (PLMs) are often offered as black-box APIs. Fine-tuning such models to downstream tasks is challenging because one can neither access the model's internal representations nor propagate gradients through it. This paper addresses these challenges by developing techniques for adapting PLMs with only API access. Building on re… ▽ More

    Submitted 30 April, 2023; originally announced May 2023.

    Comments: EACL 2023 (Findings)

  19. arXiv:2304.14332  [pdf, other

    cs.LG cs.IT

    On the Generalization Error of Meta Learning for the Gibbs Algorithm

    Authors: Yuheng Bu, Harsha Vardhan Tetali, Gholamali Aminian, Miguel Rodrigues, Gregory Wornell

    Abstract: We analyze the generalization ability of joint-training meta learning algorithms via the Gibbs algorithm. Our exact characterization of the expected meta generalization error for the meta Gibbs algorithm is based on symmetrized KL information, which measures the dependence between all meta-training datasets and the output parameters, including task-specific and meta parameters. Additionally, we de… ▽ More

    Submitted 27 April, 2023; originally announced April 2023.

    Comments: Accepted at ISIT 2023

  20. Lightweight Machine Learning for Digital Cross-Link Interference Cancellation with RF Chain Characteristics in Flexible Duplex MIMO Systems

    Authors: Jing-Sheng Tan, Shaoshi Yang, Kuo Meng, Jianhua Zhang, Yurong Tang, Yan Bu, Guizhen Wang

    Abstract: The flexible duplex (FD) technique, including dynamic time-division duplex (D-TDD) and dynamic frequency-division duplex (D-FDD), is regarded as a promising solution to achieving a more flexible uplink/downlink transmission in 5G-Advanced or 6G mobile communication systems. However, it may introduce serious cross-link interference (CLI). For better mitigating the impact of CLI, we first present a… ▽ More

    Submitted 23 April, 2023; originally announced April 2023.

    Comments: 5 pages, 6 figures

  21. arXiv:2302.08077  [pdf, other

    cs.LG

    Group Fairness with Uncertainty in Sensitive Attributes

    Authors: Abhin Shah, Maohao Shen, Jongha Jon Ryu, Subhro Das, Prasanna Sattigeri, Yuheng Bu, Gregory W. Wornell

    Abstract: Learning a fair predictive model is crucial to mitigate biased decisions against minority groups in high-stakes applications. A common approach to learn such a model involves solving an optimization problem that maximizes the predictive power of the model under an appropriate group fairness constraint. However, in practice, sensitive attributes are often missing or noisy resulting in uncertainty.… ▽ More

    Submitted 7 June, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

  22. arXiv:2302.03242  [pdf, other

    cs.CV cs.MM cs.SI

    Combating Online Misinformation Videos: Characterization, Detection, and Future Directions

    Authors: Yuyan Bu, Qiang Sheng, Juan Cao, Peng Qi, Danding Wang, Jintao Li

    Abstract: With information consumption via online video streaming becoming increasingly popular, misinformation video poses a new threat to the health of the online information ecosystem. Though previous studies have made much progress in detecting misinformation in text and image formats, video-based misinformation brings new and unique challenges to automatic detection systems: 1) high information heterog… ▽ More

    Submitted 6 August, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

    Comments: Accepted at ACM Multimedia 2023 (MM 2023). 11 pages, 4 figures, and 89 references

  23. arXiv:2212.07359  [pdf, other

    cs.LG

    Post-hoc Uncertainty Learning using a Dirichlet Meta-Model

    Authors: Maohao Shen, Yuheng Bu, Prasanna Sattigeri, Soumya Ghosh, Subhro Das, Gregory Wornell

    Abstract: It is known that neural networks have the problem of being over-confident when directly using the output label distribution to generate uncertainty measures. Existing methods mainly resolve this issue by retraining the entire model to impose the uncertainty quantification capability so that the learned model can achieve desired performance in accuracy and uncertainty prediction simultaneously. How… ▽ More

    Submitted 14 December, 2022; originally announced December 2022.

    Comments: Accepted by AAAI 2023

  24. arXiv:2211.10973  [pdf, other

    cs.MM

    FakeSV: A Multimodal Benchmark with Rich Social Context for Fake News Detection on Short Video Platforms

    Authors: Peng Qi, Yuyan Bu, Juan Cao, Wei Ji, Ruihao Shui, Junbin Xiao, Danding Wang, Tat-Seng Chua

    Abstract: Short video platforms have become an important channel for news sharing, but also a new breeding ground for fake news. To mitigate this problem, research of fake news video detection has recently received a lot of attention. Existing works face two roadblocks: the scarcity of comprehensive and largescale datasets and insufficient utilization of multimodal information. Therefore, in this paper, we… ▽ More

    Submitted 2 December, 2022; v1 submitted 20 November, 2022; originally announced November 2022.

    Comments: To appear in AAAI 2023 AISI track. This version contains appendix with additional details

  25. arXiv:2210.09864  [pdf, ps, other

    cs.IT

    Information-theoretic Characterizations of Generalization Error for the Gibbs Algorithm

    Authors: Gholamali Aminian, Yuheng Bu, Laura Toni, Miguel R. D. Rodrigues, Gregory W. Wornell

    Abstract: Various approaches have been developed to upper bound the generalization error of a supervised learning algorithm. However, existing bounds are often loose and even vacuous when evaluated in practice. As a result, they may fail to characterize the exact generalization ability of a learning algorithm. Our main contributions are exact characterizations of the expected generalization error of the wel… ▽ More

    Submitted 18 October, 2022; originally announced October 2022.

    Comments: under review. arXiv admin note: text overlap with arXiv:2107.13656, arXiv:2111.01635

  26. arXiv:2210.08188  [pdf, ps, other

    cs.IT cs.LG

    How Does Pseudo-Labeling Affect the Generalization Error of the Semi-Supervised Gibbs Algorithm?

    Authors: Haiyun He, Gholamali Aminian, Yuheng Bu, Miguel Rodrigues, Vincent Y. F. Tan

    Abstract: We provide an exact characterization of the expected generalization error (gen-error) for semi-supervised learning (SSL) with pseudo-labeling via the Gibbs algorithm. The gen-error is expressed in terms of the symmetrized KL information between the output hypothesis, the pseudo-labeled dataset, and the labeled dataset. Distribution-free upper and lower bounds on the gen-error can also be obtained.… ▽ More

    Submitted 15 June, 2023; v1 submitted 15 October, 2022; originally announced October 2022.

    Comments: 30 pages, 4 figures

  27. Data-Driven Blind Synchronization and Interference Rejection for Digital Communication Signals

    Authors: Alejandro Lancho, Amir Weiss, Gary C. F. Lee, Jennifer Tang, Yuheng Bu, Yury Polyanskiy, Gregory W. Wornell

    Abstract: We study the potential of data-driven deep learning methods for separation of two communication signals from an observation of their mixture. In particular, we assume knowledge on the generation process of one of the signals, dubbed signal of interest (SOI), and no knowledge on the generation process of the second signal, referred to as interference. This form of the single-channel source separati… ▽ More

    Submitted 11 September, 2022; originally announced September 2022.

    Comments: 9 pages, 6 figures, accepted at IEEE GLOBECOM 2022 (this version contains extended proofs)

  28. Exploring the Distribution Regularities of User Attention and Sentiment toward Product Aspects in Online Reviews

    Authors: Chenglei Qin, Chengzhi Zhang, Yi Bu

    Abstract: [Purpose] To better understand the online reviews and help potential consumers, businessmen, and product manufacturers effectively obtain users' evaluation on product aspects, this paper explores the distribution regularities of user attention and sentiment toward product aspects from the temporal perspective of online reviews. [Design/methodology/approach] Temporal characteristics of online revie… ▽ More

    Submitted 8 September, 2022; originally announced September 2022.

  29. Exploiting Temporal Structures of Cyclostationary Signals for Data-Driven Single-Channel Source Separation

    Authors: Gary C. F. Lee, Amir Weiss, Alejandro Lancho, Jennifer Tang, Yuheng Bu, Yury Polyanskiy, Gregory W. Wornell

    Abstract: We study the problem of single-channel source separation (SCSS), and focus on cyclostationary signals, which are particularly suitable in a variety of application domains. Unlike classical SCSS approaches, we consider a setting where only examples of the sources are available rather than their models, inspiring a data-driven approach. For source models with underlying cyclostationary Gaussian cons… ▽ More

    Submitted 22 August, 2022; originally announced August 2022.

  30. arXiv:2205.08756  [pdf

    cs.DL

    Team formation and team performance: The balance between team freshness and repeat collaboration

    Authors: Meijun Liu, Ajay Jaiswal, Yi Bu, Chao Min, Sijie Yang, Zhibo Liu, Daniel Daniel Acuña, Ying Ding

    Abstract: Incorporating fresh members in teams is considered a pathway to team creativity. However, whether freshness improves team performance or not remains unclear, as well as the optimal involvement of fresh members for team performance. This study uses a group of authors on the byline of a publication as a proxy for a scientific team. We extend an indicator, i.e., team freshness, to measure the extent… ▽ More

    Submitted 18 May, 2022; originally announced May 2022.

  31. arXiv:2203.10839  [pdf, other

    cs.CL cs.AI cs.CY

    TCM-SD: A Benchmark for Probing Syndrome Differentiation via Natural Language Processing

    Authors: Mucheng Ren, Heyan Huang, Yuxiang Zhou, Qianwen Cao, Yuan Bu, Yang Gao

    Abstract: Traditional Chinese Medicine (TCM) is a natural, safe, and effective therapy that has spread and been applied worldwide. The unique TCM diagnosis and treatment system requires a comprehensive analysis of a patient's symptoms hidden in the clinical record written in free text. Prior studies have shown that this system can be informationized and intelligentized with the aid of artificial intelligenc… ▽ More

    Submitted 2 August, 2022; v1 submitted 21 March, 2022; originally announced March 2022.

    Comments: 10 main pages + 2 reference pages, to appear at CCL2022

  32. arXiv:2202.12150  [pdf, ps, other

    cs.IT cs.LG

    Tighter Expected Generalization Error Bounds via Convexity of Information Measures

    Authors: Gholamali Aminian, Yuheng Bu, Gregory Wornell, Miguel Rodrigues

    Abstract: Generalization error bounds are essential to understanding machine learning algorithms. This paper presents novel expected generalization error upper bounds based on the average joint distribution between the output hypothesis and each input training sample. Multiple generalization error upper bounds based on different information measures are provided, including Wasserstein distance, total variat… ▽ More

    Submitted 24 February, 2022; originally announced February 2022.

    Comments: 10 pages, 1 figure

  33. arXiv:2202.08461  [pdf, other

    cs.DL cs.AI

    The Gene of Scientific Success

    Authors: Xiangjie Kong, Jun Zhang, Da Zhang, Yi Bu, Ying Ding, Feng Xia

    Abstract: This paper elaborates how to identify and evaluate causal factors to improve scientific impact. Currently, analyzing scientific impact can be beneficial to various academic activities including funding application, mentor recommendation, and discovering potential cooperators etc. It is universally acknowledged that high-impact scholars often have more opportunities to receive awards as an encourag… ▽ More

    Submitted 17 February, 2022; originally announced February 2022.

    Journal ref: ACM Transactions on Knowledge Discovery from Data. 14, no. 4 (2020): 41

  34. arXiv:2202.00796  [pdf, other

    cs.LG cs.IT

    On Balancing Bias and Variance in Unsupervised Multi-Source-Free Domain Adaptation

    Authors: Maohao Shen, Yuheng Bu, Gregory Wornell

    Abstract: Due to privacy, storage, and other constraints, there is a growing need for unsupervised domain adaptation techniques in machine learning that do not require access to the data used to train a collection of source models. Existing methods for multi-source-free domain adaptation (MSFDA) typically train a target model using pseudo-labeled data produced by the source models, which focus on improving… ▽ More

    Submitted 31 May, 2023; v1 submitted 1 February, 2022; originally announced February 2022.

    Comments: ICML 2023

  35. arXiv:2111.01635  [pdf, ps, other

    cs.LG cs.IT stat.ML

    Characterizing and Understanding the Generalization Error of Transfer Learning with Gibbs Algorithm

    Authors: Yuheng Bu, Gholamali Aminian, Laura Toni, Miguel Rodrigues, Gregory Wornell

    Abstract: We provide an information-theoretic analysis of the generalization ability of Gibbs-based transfer learning algorithms by focusing on two popular transfer learning approaches, $α$-weighted-ERM and two-stage-ERM. Our key result is an exact characterization of the generalization behaviour using the conditional symmetrized KL information between the output hypothesis and the target training samples g… ▽ More

    Submitted 2 November, 2021; originally announced November 2021.

  36. arXiv:2110.15403  [pdf, other

    cs.LG stat.ML

    Selective Regression Under Fairness Criteria

    Authors: Abhin Shah, Yuheng Bu, Joshua Ka-Wing Lee, Subhro Das, Rameswar Panda, Prasanna Sattigeri, Gregory W. Wornell

    Abstract: Selective regression allows abstention from prediction if the confidence to make an accurate prediction is not sufficient. In general, by allowing a reject option, one expects the performance of a regression model to increase at the cost of reducing coverage (i.e., by predicting on fewer samples). However, as we show, in some cases, the performance of a minority subgroup can decrease while we redu… ▽ More

    Submitted 14 July, 2022; v1 submitted 28 October, 2021; originally announced October 2021.

  37. arXiv:2108.13246  [pdf, other

    cs.CV

    LUAI Challenge 2021 on Learning to Understand Aerial Images

    Authors: Gui-Song Xia, Jian Ding, Ming Qian, Nan Xue, Jiaming Han, Xiang Bai, Michael Ying Yang, Shengyang Li, Serge Belongie, Jiebo Luo, Mihai Datcu, Marcello Pelillo, Liangpei Zhang, Qiang Zhou, Chao-hui Yu, Kaixuan Hu, Yingjia Bu, Wenming Tan, Zhe Yang, Wei Li, Shang Liu, Jiaxuan Zhao, Tianzhi Ma, Zi-han Gao, Lingqi Wang , et al. (11 additional authors not shown)

    Abstract: This report summarizes the results of Learning to Understand Aerial Images (LUAI) 2021 challenge held on ICCV 2021, which focuses on object detection and semantic segmentation in aerial images. Using DOTA-v2.0 and GID-15 datasets, this challenge proposes three tasks for oriented object detection, horizontal object detection, and semantic segmentation of common categories in aerial images. This cha… ▽ More

    Submitted 17 September, 2021; v1 submitted 30 August, 2021; originally announced August 2021.

    Comments: 7 pages, 2 figures, accepted by ICCVW 2021

  38. arXiv:2108.04108  [pdf

    physics.soc-ph cs.AI

    Team Power Dynamics and Team Impact: New Perspectives on Scientific Collaboration using Career Age as a Proxy for Team Power

    Authors: Huimin Xu, Yi Bu, Meijun Liu, Chenwei Zhang, Mengyi Sun, Yi Zhang, Eric Meyer, Eduardo Salas, Ying Ding

    Abstract: Power dynamics influence every aspect of scientific collaboration. Team power dynamics can be measured by team power level and team power hierarchy. Team power level is conceptualized as the average level of the possession of resources, expertise, or decision-making authorities of a team. Team power hierarchy represents the vertical differences of the possessions of resources in a team. In Science… ▽ More

    Submitted 14 April, 2022; v1 submitted 9 August, 2021; originally announced August 2021.

  39. arXiv:2107.13656  [pdf, ps, other

    cs.LG cs.IT math.ST stat.ML

    Characterizing the Generalization Error of Gibbs Algorithm with Symmetrized KL information

    Authors: Gholamali Aminian, Yuheng Bu, Laura Toni, Miguel R. D. Rodrigues, Gregory Wornell

    Abstract: Bounding the generalization error of a supervised learning algorithm is one of the most important problems in learning theory, and various approaches have been developed. However, existing bounds are often loose and lack of guarantees. As a result, they may fail to characterize the exact generalization ability of a learning algorithm. Our main contribution is an exact characterization of the expec… ▽ More

    Submitted 28 July, 2021; originally announced July 2021.

    Comments: The first and second author have contributed equally to the paper. This paper is accepted in the ICML-21 Workshop on Information-Theoretic Methods for Rigorous, Responsible, and Reliable Machine Learning: https://sites.google.com/view/itr3/schedule

  40. arXiv:2105.05182  [pdf, other

    cs.HC

    PTeacher: a Computer-Aided Personalized Pronunciation Training System with Exaggerated Audio-Visual Corrective Feedback

    Authors: Yaohua Bu, Tianyi Ma, Weijun Li, Hang Zhou, Jia Jia, Shengqi Chen, Kaiyuan Xu, Dachuan Shi, Haozhe Wu, Zhihan Yang, Kun Li, Zhiyong Wu, Yuanchun Shi, Xiaobo Lu, Ziwei Liu

    Abstract: Second language (L2) English learners often find it difficult to improve their pronunciations due to the lack of expressive and personalized corrective feedback. In this paper, we present Pronunciation Teacher (PTeacher), a Computer-Aided Pronunciation Training (CAPT) system that provides personalized exaggerated audio-visual corrective feedback for mispronunciations. Though the effectiveness of e… ▽ More

    Submitted 11 May, 2021; v1 submitted 11 May, 2021; originally announced May 2021.

  41. arXiv:2104.02856   

    cs.IT cs.PF

    Irregular-Mapped Protograph LDPC-Coded Modulation: A Bandwidth-Efficient Solution for $5$G Networks with Massive Data-Storage Requirement

    Authors: Yi Fang, Yingcheng Bu, Pingping Chen, Shahid Mumtaz, Francis C. M. Lau, Sattam Al Otaibi

    Abstract: The huge amount of data produced in the fifth-generation (5G) networks not only brings new challenges to the reliability and efficiency of mobile devices but also drives rapid development of new storage techniques. With the benefits of fast access speed and high reliability, NAND flash memory has become a promising storage solution for the 5G networks. In this paper, we investigate a protograph-co… ▽ More

    Submitted 20 July, 2021; v1 submitted 6 April, 2021; originally announced April 2021.

    Comments: More research effort should be made to improve the quality of this paper with the help of other collegues. The paper must be withdrawed at this stage as some content should be revised and changed

  42. arXiv:2101.08577  [pdf

    cs.DL

    References of References: How Far is the Knowledge Ancestry

    Authors: Chao Min, Jiawei Xu, Tao Han, Yi Bu

    Abstract: Scientometrics studies have extended from direct citations to high-order citations, as simple citation count is found to tell only part of the story regarding scientific impact. This extension is deemed to be beneficial in scenarios like research evaluation, science history modeling, and information retrieval. In contrast to citations of citations (forward citation generations), references of refe… ▽ More

    Submitted 1 April, 2021; v1 submitted 21 January, 2021; originally announced January 2021.

  43. arXiv:2012.15259  [pdf, other

    cs.LG cs.AI cs.IT stat.ML

    A Maximal Correlation Approach to Imposing Fairness in Machine Learning

    Authors: Joshua Lee, Yuheng Bu, Prasanna Sattigeri, Rameswar Panda, Gregory Wornell, Leonid Karlinsky, Rogerio Feris

    Abstract: As machine learning algorithms grow in popularity and diversify to many industries, ethical and legal concerns regarding their fairness have become increasingly relevant. We explore the problem of algorithmic fairness, taking an information-theoretic view. The maximal correlation framework is introduced for expressing fairness constraints and shown to be capable of being used to derive regularizer… ▽ More

    Submitted 30 December, 2020; originally announced December 2020.

    Comments: 9 Pages 4 Figures

  44. PolarDet: A Fast, More Precise Detector for Rotated Target in Aerial Images

    Authors: Pengbo Zhao, Zhenshen Qu, Yingjia Bu, Wenming Tan, Ye Ren, Shiliang Pu

    Abstract: Fast and precise object detection for high-resolution aerial images has been a challenging task over the years. Due to the sharp variations on object scale, rotation, and aspect ratio, most existing methods are inefficient and imprecise. In this paper, we represent the oriented objects by polar method in polar coordinate and propose PolarDet, a fast and accurate one-stage object detector based on… ▽ More

    Submitted 17 October, 2020; originally announced October 2020.

    Comments: 11 pages, 10 figures, 5 tables

  45. Pandemics are catalysts of scientific novelty: Evidence from COVID-19

    Authors: Meijun Liu, Yi Bu, Chongyan Chen, Jian Xu, Daifeng Li, Yan Leng, Richard Barry Freeman, Eric Meyer, Wonjin Yoon, Mujeen Sung, Minbyul Jeong, Jinhyuk Lee, Jaewoo Kang, Chao Min, Min Song, Yujia Zhai, Ying Ding

    Abstract: Scientific novelty drives the efforts to invent new vaccines and solutions during the pandemic. First-time collaboration and international collaboration are two pivotal channels to expand teams' search activities for a broader scope of resources required to address the global challenge, which might facilitate the generation of novel ideas. Our analysis of 98,981 coronavirus papers suggests that sc… ▽ More

    Submitted 14 November, 2021; v1 submitted 25 September, 2020; originally announced September 2020.

    Comments: 19 pages, 3 figures

    ACM Class: J.4

  46. ChoreoNet: Towards Music to Dance Synthesis with Choreographic Action Unit

    Authors: Zijie Ye, Haozhe Wu, Jia Jia, Yaohua Bu, Wei Chen, Fanbo Meng, Yanfeng Wang

    Abstract: Dance and music are two highly correlated artistic forms. Synthesizing dance motions has attracted much attention recently. Most previous works conduct music-to-dance synthesis via directly music to human skeleton keypoints mapping. Meanwhile, human choreographers design dance motions from music in a two-stage manner: they firstly devise multiple choreographic dance units (CAUs), each with a serie… ▽ More

    Submitted 16 September, 2020; originally announced September 2020.

    Comments: 10 pages, 5 figures, Accepted by ACM MM 2020

  47. arXiv:2009.05748  [pdf, other

    eess.AS cs.AI

    Visual-speech Synthesis of Exaggerated Corrective Feedback

    Authors: Yaohua Bu, Weijun Li, Tianyi Ma, Shengqi Chen, Jia Jia, Kun Li, Xiaobo Lu

    Abstract: To provide more discriminative feedback for the second language (L2) learners to better identify their mispronunciation, we propose a method for exaggerated visual-speech feedback in computer-assisted pronunciation training (CAPT). The speech exaggeration is realized by an emphatic speech generation neural network based on Tacotron, while the visual exaggeration is accomplished by ADC Viseme Blend… ▽ More

    Submitted 15 December, 2020; v1 submitted 12 September, 2020; originally announced September 2020.

  48. arXiv:2009.01812  [pdf

    cs.DL physics.soc-ph

    The Pace of Artificial Intelligence Innovations: Speed, Talent, and Trial-and-Error

    Authors: Xuli Tang, Xin Li, Ying Ding, Min Song, Yi Bu

    Abstract: Innovations in artificial intelligence (AI) are occurring at speeds faster than ever witnessed before. However, few studies have managed to measure or depict this increasing velocity of innovations in the field of AI. In this paper, we combine data on AI from arXiv and Semantic Scholar to explore the pace of AI innovations from three perspectives: AI publications, AI players, and AI updates (trial… ▽ More

    Submitted 3 September, 2020; originally announced September 2020.

    Comments: Journal of Informetrics 2021

  49. arXiv:2008.12007  [pdf

    cs.DL

    An empirical review of the different variants of the Probabilistic Affinity Index as applied to scientific collaboration

    Authors: Zaida Chinchilla-Rodríguez, Yi Bu, Nicolás Robinson-García, Cassidy R. Sugimoto

    Abstract: Responsible indicators are crucial for research assessment and monitoring. Transparency and accuracy of indicators are required to make research assessment fair and ensure reproducibility. However, sometimes it is difficult to conduct or replicate studies based on indicators due to the lack of transparency in conceptualization and operationalization. In this paper, we review the different variants… ▽ More

    Submitted 27 August, 2020; originally announced August 2020.

    Comments: 35 pages, 3 figures, 5 tables

  50. arXiv:2007.10287  [pdf, other

    cs.AI cs.CL

    Coronavirus Knowledge Graph: A Case Study

    Authors: Chongyan Chen, Islam Akef Ebeid, Yi Bu, Ying Ding

    Abstract: The emergence of the novel COVID-19 pandemic has had a significant impact on global healthcare and the economy over the past few months. The virus's rapid widespread has led to a proliferation in biomedical research addressing the pandemic and its related topics. One of the essential Knowledge Discovery tools that could help the biomedical research community understand and eventually find a cure f… ▽ More

    Submitted 3 July, 2020; originally announced July 2020.

    Comments: 8 pages; Accepted by ACM KDD 2020