Skip to main content

Showing 1–50 of 91 results for author: Lyu, H

  1. arXiv:2407.11409  [pdf, other

    cs.CL

    Representation Bias in Political Sample Simulations with Large Language Models

    Authors: Weihong Qi, Hanjia Lyu, Jiebo Luo

    Abstract: This study seeks to identify and quantify biases in simulating political samples with Large Language Models, specifically focusing on vote choice and public opinion. Using the GPT-3.5-Turbo model, we leverage data from the American National Election Studies, German Longitudinal Election Study, Zuobiao Dataset, and China Family Panel Studies to simulate voting behaviors and public opinions. This me… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

  2. arXiv:2406.09105  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    INS-MMBench: A Comprehensive Benchmark for Evaluating LVLMs' Performance in Insurance

    Authors: Chenwei Lin, Hanjia Lyu, Xian Xu, Jiebo Luo

    Abstract: Large Vision-Language Models (LVLMs) have demonstrated outstanding performance in various general multimodal applications such as image recognition and visual reasoning, and have also shown promising potential in specialized domains. However, the application potential of LVLMs in the insurance domain-characterized by rich application scenarios and abundant multimodal data-has not been effectively… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  3. arXiv:2405.01314  [pdf, other

    eess.SY cs.LG

    Non-iterative Optimization of Trajectory and Radio Resource for Aerial Network

    Authors: Hyeonsu Lyu, Jonggyu Jang, Harim Lee, Hyun Jong Yang

    Abstract: We address a joint trajectory planning, user association, resource allocation, and power control problem to maximize proportional fairness in the aerial IoT network, considering practical end-to-end quality-of-service (QoS) and communication schedules. Though the problem is rather ancient, apart from the fact that the previous approaches have never considered user- and time-specific QoS, we point… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  4. arXiv:2404.18017  [pdf

    q-fin.PM cs.LG q-fin.CP

    Application of Deep Learning for Factor Timing in Asset Management

    Authors: Prabhu Prasad Panda, Maysam Khodayari Gharanchaei, Xilin Chen, Haoshu Lyu

    Abstract: The paper examines the performance of regression models (OLS linear regression, Ridge regression, Random Forest, and Fully-connected Neural Network) on the prediction of CMA (Conservative Minus Aggressive) factor premium and the performance of factor timing investment with them. Out-of-sample R-squared shows that more flexible models have better performance in explaining the variance in factor pre… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

  5. arXiv:2404.13265  [pdf

    q-bio.GN cs.AI cs.LG

    F5C-finder: An Explainable and Ensemble Biological Language Model for Predicting 5-Formylcytidine Modifications on mRNA

    Authors: Guohao Wang, Ting Liu, Hongqiang Lyu, Ze Liu

    Abstract: As a prevalent and dynamically regulated epigenetic modification, 5-formylcytidine (f5C) is crucial in various biological processes. However, traditional experimental methods for f5C detection are often laborious and time-consuming, limiting their ability to map f5C sites across the transcriptome comprehensively. While computational approaches offer a cost-effective and high-throughput alternative… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: 34 pages, 10 figures, journal

  6. arXiv:2404.12186  [pdf, other

    cs.LG cs.CR

    Privacy-Preserving UCB Decision Process Verification via zk-SNARKs

    Authors: Xikun Jiang, He Lyu, Chenhao Ying, Yibin Xu, Boris Düdder, Yuan Luo

    Abstract: With the increasingly widespread application of machine learning, how to strike a balance between protecting the privacy of data and algorithm parameters and ensuring the verifiability of machine learning has always been a challenge. This study explores the intersection of reinforcement learning and data privacy, specifically addressing the Multi-Armed Bandit (MAB) problem with the Upper Confidenc… ▽ More

    Submitted 6 June, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

  7. arXiv:2404.09690  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Harnessing GPT-4V(ision) for Insurance: A Preliminary Exploration

    Authors: Chenwei Lin, Hanjia Lyu, Jiebo Luo, Xian Xu

    Abstract: The emergence of Large Multimodal Models (LMMs) marks a significant milestone in the development of artificial intelligence. Insurance, as a vast and complex discipline, involves a wide variety of data forms in its operational processes, including text, images, and videos, thereby giving rise to diverse multimodal tasks. Despite this, there has been limited systematic exploration of multimodal tas… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  8. arXiv:2404.01855  [pdf, other

    cs.IR cs.AI

    Where to Move Next: Zero-shot Generalization of LLMs for Next POI Recommendation

    Authors: Shanshan Feng, Haoming Lyu, Caishun Chen, Yew-Soon Ong

    Abstract: Next Point-of-interest (POI) recommendation provides valuable suggestions for users to explore their surrounding environment. Existing studies rely on building recommendation models from large-scale users' check-in data, which is task-specific and needs extensive computational resources. Recently, the pretrained large language models (LLMs) have achieved significant advancements in various NLP tas… ▽ More

    Submitted 22 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

  9. FARPLS: A Feature-Augmented Robot Trajectory Preference Labeling System to Assist Human Labelers' Preference Elicitation

    Authors: Hanfang Lyu, Yuanchen Bai, Xin Liang, Ujaan Das, Chuhan Shi, Leiliang Gong, Yingchi Li, Mingfei Sun, Ming Ge, Xiaojuan Ma

    Abstract: Preference-based learning aims to align robot task objectives with human values. One of the most common methods to infer human preferences is by pairwise comparisons of robot task trajectories. Traditional comparison-based preference labeling systems seldom support labelers to digest and identify critical differences between complex trajectories recorded in videos. Our formative study (N = 12) sug… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

    Comments: Accepted to ACM Conference on Intelligent User Interfaces (IUI) 2024, March 18-21, 2024, Greenville, SC, USA

  10. arXiv:2402.13022  [pdf, other

    cs.CL cs.MM

    SoMeLVLM: A Large Vision Language Model for Social Media Processing

    Authors: Xinnong Zhang, Haoyu Kuang, Xinyi Mou, Hanjia Lyu, Kun Wu, Siming Chen, Jiebo Luo, Xuanjing Huang, Zhongyu Wei

    Abstract: The growth of social media, characterized by its multimodal nature, has led to the emergence of diverse phenomena and challenges, which calls for an effective approach to uniformly solve automated tasks. The powerful Large Vision Language Models make it possible to handle a variety of tasks simultaneously, but even with carefully designed prompting methods, the general domain models often fall sho… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  11. arXiv:2402.07101  [pdf, ps, other

    math.OC cs.LG

    On the Complexity of First-Order Methods in Stochastic Bilevel Optimization

    Authors: Jeongyeol Kwon, Dohyun Kwon, Hanbaek Lyu

    Abstract: We consider the problem of finding stationary points in Bilevel optimization when the lower-level problem is unconstrained and strongly convex. The problem has been extensively studied in recent years; the main technical challenge is to keep track of lower-level solutions $y^*(x)$ in response to the changes in the upper-level variables $x$. Subsequently, all existing approaches tie their analyses… ▽ More

    Submitted 10 February, 2024; originally announced February 2024.

  12. arXiv:2401.08733  [pdf, other

    cs.SI

    In the Eyes of the Bystander: Are the Stances on Different Conflicts Correlated?

    Authors: Yiyao Tao, Hengyu Zhang, Babli Dey, Selenge Tulga, Hanjia Lyu, Jiebo Luo

    Abstract: Public opinion on international conflicts, such as the concurrent Russia-Ukraine and Israel-Palestine crises, often reflects a society's values, beliefs, and history. These simultaneous conflicts have sparked heated global online discussions, offering a unique opportunity to explore the dynamics of public opinion in multiple international crises. This study investigates how public opinions toward… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

  13. arXiv:2401.08212  [pdf, other

    cs.CV

    Human vs. LMMs: Exploring the Discrepancy in Emoji Interpretation and Usage in Digital Communication

    Authors: Hanjia Lyu, Weihong Qi, Zhongyu Wei, Jiebo Luo

    Abstract: Leveraging Large Multimodal Models (LMMs) to simulate human behaviors when processing multimodal information, especially in the context of social media, has garnered immense interest due to its broad potential and far-reaching implications. Emojis, as one of the most unique aspects of digital communication, are pivotal in enriching and often clarifying the emotional and tonal dimensions. Yet, ther… ▽ More

    Submitted 15 April, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: Accepted for publication in ICWSM 2024

  14. arXiv:2401.07694  [pdf, other

    math.OC cs.LG stat.ML

    Stochastic optimization with arbitrary recurrent data sampling

    Authors: William G. Powell, Hanbaek Lyu

    Abstract: For obtaining optimal first-order convergence guarantee for stochastic optimization, it is necessary to use a recurrent data sampling algorithm that samples every data point with sufficient frequency. Most commonly used data sampling algorithms (e.g., i.i.d., MCMC, random reshuffling) are indeed recurrent under mild assumptions. In this work, we show that for a particular class of stochastic optim… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: 41 pages, 3 figures, 1 table

  15. arXiv:2401.02810  [pdf, other

    cs.LG cs.AI math.NA

    Physics-Informed Neural Networks for High-Frequency and Multi-Scale Problems using Transfer Learning

    Authors: Abdul Hannan Mustajab, Hao Lyu, Zarghaam Rizvi, Frank Wuttke

    Abstract: Physics-informed neural network (PINN) is a data-driven solver for partial and ordinary differential equations(ODEs/PDEs). It provides a unified framework to address both forward and inverse problems. However, the complexity of the objective function often leads to training failures. This issue is particularly prominent when solving high-frequency and multi-scale problems. We proposed using transf… ▽ More

    Submitted 15 January, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

    Comments: 18 pages

  16. arXiv:2401.02582  [pdf, other

    cs.CV

    CoCoT: Contrastive Chain-of-Thought Prompting for Large Multimodal Models with Multiple Image Inputs

    Authors: Daoan Zhang, Junming Yang, Hanjia Lyu, Zijian Jin, Yuan Yao, Mingkai Chen, Jiebo Luo

    Abstract: When exploring the development of Artificial General Intelligence (AGI), a critical task for these models involves interpreting and processing information from multiple image inputs. However, Large Multimodal Models (LMMs) encounter two issues in such scenarios: (1) a lack of fine-grained perception, and (2) a tendency to blend information across multiple images. We first extensively investigate t… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

  17. arXiv:2312.07040  [pdf, ps, other

    cs.AI cs.CR

    Patch-MI: Enhancing Model Inversion Attacks via Patch-Based Reconstruction

    Authors: Jonggyu Jang, Hyeonsu Lyu, Hyun Jong Yang

    Abstract: Model inversion (MI) attacks aim to reveal sensitive information in training datasets by solely accessing model weights. Generative MI attacks, a prominent strand in this field, utilize auxiliary datasets to recreate target data attributes, restricting the images to remain photo-realistic, but their success often depends on the similarity between auxiliary and target datasets. If the distributions… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: 11 pages

  18. arXiv:2312.05586  [pdf, other

    cs.LG cs.AI

    Deeper Understanding of Black-box Predictions via Generalized Influence Functions

    Authors: Hyeonsu Lyu, Jonggyu Jang, Sehyun Ryu, Hyun Jong Yang

    Abstract: Influence functions (IFs) elucidate how training data changes model behavior. However, the increasing size and non-convexity in large-scale models make IFs inaccurate. We suspect that the fragility comes from the first-order approximation which may cause nuisance changes in parameters irrelevant to the examined data. However, simply computing influence from the chosen parameters can be misleading,… ▽ More

    Submitted 6 May, 2024; v1 submitted 9 December, 2023; originally announced December 2023.

    Comments: 16 pages, 6 figures, and 2 tables

    ACM Class: I.2.0

  19. arXiv:2311.14930  [pdf, other

    cs.HC

    StreamFunnel: Facilitating Communication Between a VR Streamer and Many Spectators

    Authors: Haohua Lyu, Cyrus Vachha, Qianyi Chen, Balasaravanan Thoravi Kumaravel, Bjöern Hartmann

    Abstract: The increasing adoption of Virtual Reality (VR) systems in different domains have led to a need to support interaction between many spectators and a VR user. This is common in game streaming, live performances, and webinars. Prior CSCW systems for VR environments are limited to small groups of users. In this work, we identify problems associated with interaction carried out with large groups of us… ▽ More

    Submitted 24 November, 2023; originally announced November 2023.

    Comments: 12 pages, 7 figures

  20. arXiv:2311.14910  [pdf, other

    math.DS cs.LG stat.ML

    A latent linear model for nonlinear coupled oscillators on graphs

    Authors: Agam Goyal, Zhaoxing Wu, Richard P. Yim, Binhao Chen, Zihong Xu, Hanbaek Lyu

    Abstract: A system of coupled oscillators on an arbitrary graph is locally driven by the tendency to mutual synchronization between nearby oscillators, but can and often exhibit nonlinear behavior on the whole graph. Understanding such nonlinear behavior has been a key challenge in predicting whether all oscillators in such a system will eventually synchronize. In this paper, we demonstrate that, surprising… ▽ More

    Submitted 24 November, 2023; originally announced November 2023.

    Comments: 23 pages, 14 figures

  21. arXiv:2311.13892  [pdf, other

    cs.CL cs.AI

    General Phrase Debiaser: Debiasing Masked Language Models at a Multi-Token Level

    Authors: Bingkang Shi, Xiaodan Zhang, Dehan Kong, Yulei Wu, Zongzhen Liu, Honglei Lyu, Longtao Huang

    Abstract: The social biases and unwelcome stereotypes revealed by pretrained language models are becoming obstacles to their application. Compared to numerous debiasing methods targeting word level, there has been relatively less attention on biases present at phrase level, limiting the performance of debiasing in discipline domains. In this paper, we propose an automatic multi-token debiasing pipeline call… ▽ More

    Submitted 25 January, 2024; v1 submitted 23 November, 2023; originally announced November 2023.

    Comments: Accepted by ICASSP 2024 as mian conference paper

  22. arXiv:2311.13712  [pdf, other

    cs.AI

    Data Acquisition: A New Frontier in Data-centric AI

    Authors: Lingjiao Chen, Bilge Acun, Newsha Ardalani, Yifan Sun, Feiyang Kang, Hanrui Lyu, Yongchan Kwon, Ruoxi Jia, Carole-Jean Wu, Matei Zaharia, James Zou

    Abstract: As Machine Learning (ML) systems continue to grow, the demand for relevant and comprehensive datasets becomes imperative. There is limited study on the challenges of data acquisition due to ad-hoc processes and lack of consistent methodologies. We first present an investigation of current data marketplaces, revealing lack of platforms offering detailed information about datasets, transparent prici… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

  23. arXiv:2311.11182  [pdf, other

    stat.ML cs.LG

    Exponentially Convergent Algorithms for Supervised Matrix Factorization

    Authors: Joowon Lee, Hanbaek Lyu, Weixin Yao

    Abstract: Supervised matrix factorization (SMF) is a classical machine learning method that simultaneously seeks feature extraction and classification tasks, which are not necessarily a priori aligned objectives. Our goal is to use SMF to learn low-rank latent factors that offer interpretable, data-reconstructive, and class-discriminative features, addressing challenges posed by high-dimensional data. Train… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

    Comments: 33 pages, 3 figures. arXiv admin note: substantial text overlap with arXiv:2206.06774

    Journal ref: Neural Information Processing Systems 2023

  24. Supervised low-rank semi-nonnegative matrix factorization with frequency regularization for forecasting spatio-temporal data

    Authors: Keunsu Kim, Hanbaek Lyu, Jinsu Kim, Jae-Hun Jung

    Abstract: We propose a novel methodology for forecasting spatio-temporal data using supervised semi-nonnegative matrix factorization (SSNMF) with frequency regularization. Matrix factorization is employed to decompose spatio-temporal data into spatial and temporal components. To improve clarity in the temporal patterns, we introduce a nonnegativity constraint on the time domain along with regularization in… ▽ More

    Submitted 19 June, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: 35 pages, Final version

    MSC Class: 65F22; 65F55 and 86A04

    Journal ref: Journal of Scientific Computing (2024)

  25. arXiv:2311.07547  [pdf, other

    cs.CV cs.AI cs.CL cs.MM

    GPT-4V(ision) as A Social Media Analysis Engine

    Authors: Hanjia Lyu, Jinfa Huang, Daoan Zhang, Yongsheng Yu, Xinyi Mou, Jinsheng Pan, Zhengyuan Yang, Zhongyu Wei, Jiebo Luo

    Abstract: Recent research has offered insights into the extraordinary capabilities of Large Multimodal Models (LMMs) in various general vision and language tasks. There is growing interest in how LMMs perform in more specialized domains. Social media content, inherently multimodal, blends text, images, videos, and sometimes audio. Understanding social multimedia content remains a challenging problem for con… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  26. arXiv:2311.05185  [pdf, other

    cs.LG cs.AI

    Mixture of Weak & Strong Experts on Graphs

    Authors: Hanqing Zeng, Hanjia Lyu, Diyi Hu, Yinglong Xia, Jiebo Luo

    Abstract: Realistic graphs contain both (1) rich self-features of nodes and (2) informative structures of neighborhoods, jointly handled by a Graph Neural Network (GNN) in the typical setup. We propose to decouple the two modalities by Mixture of weak and strong experts (Mowst), where the weak expert is a light-weight Multi-layer Perceptron (MLP), and the strong expert is an off-the-shelf GNN. To adapt the… ▽ More

    Submitted 22 June, 2024; v1 submitted 9 November, 2023; originally announced November 2023.

    Comments: Accepted for publication in ICLR 2024

  27. arXiv:2310.10951  [pdf, other

    eess.IV cs.CV

    FusionU-Net: U-Net with Enhanced Skip Connection for Pathology Image Segmentation

    Authors: Zongyi Li, Hongbing Lyu, Jun Wang

    Abstract: In recent years, U-Net and its variants have been widely used in pathology image segmentation tasks. One of the key designs of U-Net is the use of skip connections between the encoder and decoder, which helps to recover detailed information after upsampling. While most variations of U-Net adopt the original skip connection design, there is semantic gap between the encoder and decoder that can nega… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: 9 pages, 4 figures and 4 tables

  28. arXiv:2310.08084  [pdf, other

    cs.CV

    Volumetric Medical Image Segmentation via Scribble Annotations and Shape Priors

    Authors: Qiuhui Chen, Haiying Lyu, Xinyue Hu, Yong Lu, Yi Hong

    Abstract: Recently, weakly-supervised image segmentation using weak annotations like scribbles has gained great attention in computer vision and medical image analysis, since such annotations are much easier to obtain compared to time-consuming and labor-intensive labeling at the pixel/voxel level. However, due to a lack of structure supervision on regions of interest (ROIs), existing scribble-based methods… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: arXiv admin note: text overlap with arXiv:2205.06779

  29. arXiv:2309.09508  [pdf, other

    cs.CL cs.SI

    Understanding Divergent Framing of the Supreme Court Controversies: Social Media vs. News Outlets

    Authors: Jinsheng Pan, Zichen Wang, Weihong Qi, Hanjia Lyu, Jiebo Luo

    Abstract: Understanding the framing of political issues is of paramount importance as it significantly shapes how individuals perceive, interpret, and engage with these matters. While prior research has independently explored framing within news media and by social media users, there remains a notable gap in our comprehension of the disparities in framing political issues between these two distinct groups.… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

  30. arXiv:2308.14508  [pdf, other

    cs.CL

    LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding

    Authors: Yushi Bai, Xin Lv, Jiajie Zhang, Hongchang Lyu, Jiankai Tang, Zhidian Huang, Zhengxiao Du, Xiao Liu, Aohan Zeng, Lei Hou, Yuxiao Dong, Jie Tang, Juanzi Li

    Abstract: Although large language models (LLMs) demonstrate impressive performance for many language tasks, most of them can only handle texts a few thousand tokens long, limiting their applications on longer sequence inputs, such as books, reports, and codebases. Recent works have proposed methods to improve LLMs' long context capabilities by extending context windows and more sophisticated memory mechanis… ▽ More

    Submitted 19 June, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: ACL 2024

  31. arXiv:2307.15780  [pdf, other

    cs.CL cs.AI cs.IR

    LLM-Rec: Personalized Recommendation via Prompting Large Language Models

    Authors: Hanjia Lyu, Song Jiang, Hanqing Zeng, Yinglong Xia, Qifan Wang, Si Zhang, Ren Chen, Christopher Leung, Jiajie Tang, Jiebo Luo

    Abstract: Text-based recommendation holds a wide range of practical applications due to its versatility, as textual descriptions can represent nearly any type of item. However, directly employing the original item descriptions may not yield optimal recommendation performance due to the lack of comprehensive information to align with user preferences. Recent advances in large language models (LLMs) have show… ▽ More

    Submitted 2 April, 2024; v1 submitted 24 July, 2023; originally announced July 2023.

  32. arXiv:2307.05809  [pdf, other

    cs.SI

    Excitements and Concerns in the Post-ChatGPT Era: Deciphering Public Perception of AI through Social Media Analysis

    Authors: Weihong Qi, Jinsheng Pan, Hanjia Lyu, Jiebo Luo

    Abstract: As AI systems become increasingly prevalent in various aspects of daily life, gaining a comprehensive understanding of public perception towards these AI systems has become increasingly essential for several reasons such as ethical considerations, user experience, fear, disinformation, regulation, collaboration, and co-creation. In this study, we investigate how mass social media users perceive th… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

  33. arXiv:2306.11550  [pdf, other

    cs.IR cs.AI

    Query Encoder Distillation via Embedding Alignment is a Strong Baseline Method to Boost Dense Retriever Online Efficiency

    Authors: Yuxuan Wang, Hong Lyu

    Abstract: The information retrieval community has made significant progress in improving the efficiency of Dual Encoder (DE) dense passage retrieval systems, making them suitable for latency-sensitive settings. However, many proposed procedures are often too complex or resource-intensive, which makes it difficult for practitioners to adopt them or identify sources of empirical gains. Therefore, in this work… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted by the 4th SustaiNLP workshop at ACL 2023

  34. arXiv:2306.04181  [pdf, other

    cs.CL cs.LG

    Benchmarking Foundation Models with Language-Model-as-an-Examiner

    Authors: Yushi Bai, Jiahao Ying, Yixin Cao, Xin Lv, Yuze He, Xiaozhi Wang, Jifan Yu, Kaisheng Zeng, Yijia Xiao, Haozhe Lyu, Jiayin Zhang, Juanzi Li, Lei Hou

    Abstract: Numerous benchmarks have been established to assess the performance of foundation models on open-ended question answering, which serves as a comprehensive test of a model's ability to understand and generate language in a manner similar to humans. Most of these works focus on proposing new datasets, however, we see two main issues within previous benchmarking pipelines, namely testing leakage and… ▽ More

    Submitted 4 November, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023 Datasets and Benchmarks

  35. arXiv:2306.03086  [pdf, other

    cs.SI

    Dismantling Hate: Understanding Hate Speech Trends Against NBA Athletes

    Authors: Edinam Kofi Klutse, Samuel Nuamah-Amoabeng, Hanjia Lyu, Jiebo Luo

    Abstract: Social media has emerged as a popular platform for sports fans to express their opinions regarding athletes' performance. Fans consistently hold high expectations for athletes, anticipating exceptional performances week after week. This ongoing phenomenon sometimes gives rise to highly negative sentiments, with the worst-case scenario involving the occurrence of hate speech. The National Basketbal… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

  36. arXiv:2306.02420  [pdf, other

    cs.LG cs.AI math.NA math.OC

    Complexity of Block Coordinate Descent with Proximal Regularization and Applications to Wasserstein CP-dictionary Learning

    Authors: Dohyun Kwon, Hanbaek Lyu

    Abstract: We consider the block coordinate descent methods of Gauss-Seidel type with proximal regularization (BCD-PR), which is a classical method of minimizing general nonconvex objectives under constraints that has a wide range of practical applications. We theoretically establish the worst-case complexity bound for this algorithm. Namely, we show that for general nonconvex smooth objectives with block-wi… ▽ More

    Submitted 4 June, 2023; originally announced June 2023.

    Comments: Proceedings of the 40th International Conference on Machine Learning

  37. arXiv:2305.04923  [pdf, other

    cs.CV cs.AI

    Learning to Evaluate the Artness of AI-generated Images

    Authors: Junyu Chen, Jie An, Hanjia Lyu, Christopher Kanan, Jiebo Luo

    Abstract: Assessing the artness of AI-generated images continues to be a challenge within the realm of image generation. Most existing metrics cannot be used to perform instance-level and reference-free artness evaluation. This paper presents ArtScore, a metric designed to evaluate the degree to which an image resembles authentic artworks by artists (or conversely photographs), thereby offering a novel appr… ▽ More

    Submitted 9 June, 2024; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: Published in IEEE Transactions on Multimedia

  38. arXiv:2303.15708  [pdf, other

    cs.CL

    Bias or Diversity? Unraveling Fine-Grained Thematic Discrepancy in U.S. News Headlines

    Authors: Jinsheng Pan, Weihong Qi, Zichen Wang, Hanjia Lyu, Jiebo Luo

    Abstract: There is a broad consensus that news media outlets incorporate ideological biases in their news articles. However, prior studies on measuring the discrepancies among media outlets and further dissecting the origins of thematic differences suffer from small sample sizes and limited scope and granularity. In this study, we use a large dataset of 1.8 million news headlines from major U.S. media outle… ▽ More

    Submitted 5 May, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

    Comments: Accepted for publication in Proceedings of the Workshop on News Media and Computational Journalism (MEDIATE), AAAI International Conference on Web and Social Media (ICWSM), 2023

  39. arXiv:2303.15656  [pdf, other

    cs.LG q-bio.QM

    Predicting Adverse Neonatal Outcomes for Preterm Neonates with Multi-Task Learning

    Authors: Jingyang Lin, Junyu Chen, Hanjia Lyu, Igor Khodak, Divya Chhabra, Colby L Day Richardson, Irina Prelipcean, Andrew M Dylag, Jiebo Luo

    Abstract: Diagnosis of adverse neonatal outcomes is crucial for preterm survival since it enables doctors to provide timely treatment. Machine learning (ML) algorithms have been demonstrated to be effective in predicting adverse neonatal outcomes. However, most previous ML-based methods have only focused on predicting a single outcome, ignoring the potential correlations between different outcomes, and pote… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

  40. arXiv:2303.13452  [pdf, other

    cs.CY cs.LG

    Human Behavior in the Time of COVID-19: Learning from Big Data

    Authors: Hanjia Lyu, Arsal Imtiaz, Yufei Zhao, Jiebo Luo

    Abstract: Since the World Health Organization (WHO) characterized COVID-19 as a pandemic in March 2020, there have been over 600 million confirmed cases of COVID-19 and more than six million deaths as of October 2022. The relationship between the COVID-19 pandemic and human behavior is complicated. On one hand, human behavior is found to shape the spread of the disease. On the other hand, the pandemic has i… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: Accepted for publication in the Horizons in Big Data 2022 article collection of Frontiers in Big Data

  41. arXiv:2302.00849  [pdf, other

    cs.LG math.OC

    Implicit regularization in Heavy-ball momentum accelerated stochastic gradient descent

    Authors: Avrajit Ghosh, He Lyu, Xitong Zhang, Rongrong Wang

    Abstract: It is well known that the finite step-size ($h$) in Gradient Descent (GD) implicitly regularizes solutions to flatter minima. A natural question to ask is "Does the momentum parameter $β$ play a role in implicit regularization in Heavy-ball (H.B) momentum accelerated gradient descent (GD+M)?". To answer this question, first, we show that the discrete H.B momentum update (GD+M) follows a continuous… ▽ More

    Submitted 1 February, 2023; originally announced February 2023.

    Journal ref: International Conference on Learning Representations (ICLR-2023)

  42. arXiv:2301.06270  [pdf, other

    cs.CL

    Computational Assessment of Hyperpartisanship in News Titles

    Authors: Hanjia Lyu, Jinsheng Pan, Zichen Wang, Jiebo Luo

    Abstract: We first adopt a human-guided machine learning framework to develop a new dataset for hyperpartisan news title detection with 2,200 manually labeled and 1.8 million machine-labeled titles that were posted from 2014 to the present by nine representative media organizations across three media bias groups - Left, Central, and Right in an active learning manner. A fine-tuned transformer-based language… ▽ More

    Submitted 21 April, 2024; v1 submitted 16 January, 2023; originally announced January 2023.

    Comments: Accepted for publication in ICWSM 2024

  43. arXiv:2212.03412  [pdf, other

    cs.CR cs.AI cs.CV cs.LG

    Artificial Intelligence Security Competition (AISC)

    Authors: Yinpeng Dong, Peng Chen, Senyou Deng, Lianji L, Yi Sun, Hanyu Zhao, Jiaxing Li, Yunteng Tan, Xinyu Liu, Yangyi Dong, Enhui Xu, Jincai Xu, Shu Xu, Xuelin Fu, Changfeng Sun, Haoliang Han, Xuchong Zhang, Shen Chen, Zhimin Sun, Junyi Cao, Taiping Yao, Shouhong Ding, Yu Wu, Jian Lin, Tianpeng Wu , et al. (27 additional authors not shown)

    Abstract: The security of artificial intelligence (AI) is an important research area towards safe, reliable, and trustworthy AI systems. To accelerate the research on AI security, the Artificial Intelligence Security Competition (AISC) was organized by the Zhongguancun Laboratory, China Industrial Control Systems Cyber Emergency Response Team, Institute for Artificial Intelligence, Tsinghua University, and… ▽ More

    Submitted 6 December, 2022; originally announced December 2022.

    Comments: Technical report of AISC

  44. arXiv:2211.12981  [pdf, other

    cs.CV cs.MM

    Holistic Visual-Textual Sentiment Analysis with Prior Models

    Authors: Junyu Chen, Jie An, Hanjia Lyu, Christopher Kanan, Jiebo Luo

    Abstract: Visual-textual sentiment analysis aims to predict sentiment with the input of a pair of image and text, which poses a challenge in learning effective features for diverse input images. To address this, we propose a holistic method that achieves robust visual-textual sentiment analysis by exploiting a rich set of powerful pre-trained visual and textual prior models. The proposed method consists of… ▽ More

    Submitted 9 June, 2024; v1 submitted 23 November, 2022; originally announced November 2022.

    Comments: Published in MIPR 2024

  45. arXiv:2210.09475  [pdf, other

    cs.LG

    FIMP: Foundation Model-Informed Message Passing for Graph Neural Networks

    Authors: Syed Asad Rizvi, Nazreen Pallikkavaliyaveetil, David Zhang, Zhuoyang Lyu, Nhi Nguyen, Haoran Lyu, Benjamin Christensen, Josue Ortega Caro, Antonio H. O. Fonseca, Emanuele Zappala, Maryam Bagherian, Christopher Averill, Chadi G. Abdallah, Amin Karbasi, Rex Ying, Maria Brbic, Rahul Madhav Dhodapkar, David van Dijk

    Abstract: Foundation models have achieved remarkable success across many domains, relying on pretraining over vast amounts of data. Graph-structured data often lacks the same scale as unstructured data, making the development of graph foundation models challenging. In this work, we propose Foundation-Informed Message Passing (FIMP), a Graph Neural Network (GNN) message-passing framework that leverages pretr… ▽ More

    Submitted 1 July, 2024; v1 submitted 17 October, 2022; originally announced October 2022.

    Comments: 16 pages (12 + 4 pages appendix). 5 figures and 4 tables

  46. arXiv:2210.00476  [pdf

    cs.LG cs.AI math.OC

    Robust Bayesian optimization with reinforcement learned acquisition functions

    Authors: Zijing Liu, Xiyao Qu, Xuejun Liu, Hongqiang Lyu

    Abstract: In Bayesian optimization (BO) for expensive black-box optimization tasks, acquisition function (AF) guides sequential sampling and plays a pivotal role for efficient convergence to better optima. Prevailing AFs usually rely on artificial experiences in terms of preferences for exploration or exploitation, which runs a risk of a computational waste or traps in local optima and resultant re-optimiza… ▽ More

    Submitted 2 October, 2022; originally announced October 2022.

  47. arXiv:2209.14975  [pdf, other

    cs.LG

    Causal Inference via Nonlinear Variable Decorrelation for Healthcare Applications

    Authors: Junda Wang, Weijian Li, Han Wang, Hanjia Lyu, Caroline Thirukumaran, Addisu Mesfin, Jiebo Luo

    Abstract: Causal inference and model interpretability research are gaining increasing attention, especially in the domains of healthcare and bioinformatics. Despite recent successes in this field, decorrelating features under nonlinear environments with human interpretable representations has not been adequately investigated. To address this issue, we introduce a novel method with a variable decorrelation r… ▽ More

    Submitted 29 September, 2022; originally announced September 2022.

  48. arXiv:2209.04874  [pdf, other

    cs.SI cs.CY

    Doctors vs. Nurses: Understanding the Great Divide in Vaccine Hesitancy among Healthcare Workers

    Authors: Sajid Hussain Rafi Ahamed, Shahid Shakil, Hanjia Lyu, Xinping Zhang, Jiebo Luo

    Abstract: Healthcare workers such as doctors and nurses are expected to be trustworthy and creditable sources of vaccine-related information. Their opinions toward the COVID-19 vaccines may influence the vaccine uptake among the general population. However, vaccine hesitancy is still an important issue even among the healthcare workers. Therefore, it is critical to understand their opinions to help reduce t… ▽ More

    Submitted 18 November, 2022; v1 submitted 11 September, 2022; originally announced September 2022.

    Comments: Accepted for publication in Proceedings of the 8th Special Session on Intelligent Data Mining of 2022 IEEE International Conference on Big Data, 2022

  49. arXiv:2206.06774  [pdf, other

    stat.ML cs.LG math.ST

    Supervised Dictionary Learning with Auxiliary Covariates

    Authors: Joowon Lee, Hanbaek Lyu, Weixin Yao

    Abstract: Supervised dictionary learning (SDL) is a classical machine learning method that simultaneously seeks feature extraction and classification tasks, which are not necessarily a priori aligned objectives. The goal of SDL is to learn a class-discriminative dictionary, which is a set of latent feature vectors that can well-explain both the features as well as labels of observed data. In this paper, we… ▽ More

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: 61 pages, 12 figures, 5 tables

  50. arXiv:2204.12680  [pdf, other

    cs.CV

    Improving the Transferability of Adversarial Examples with Restructure Embedded Patches

    Authors: Huipeng Zhou, Yu-an Tan, Yajie Wang, Haoran Lyu, Shangbo Wu, Yuanzhang Li

    Abstract: Vision transformers (ViTs) have demonstrated impressive performance in various computer vision tasks. However, the adversarial examples generated by ViTs are challenging to transfer to other networks with different structures. Recent attack methods do not consider the specificity of ViTs architecture and self-attention mechanism, which leads to poor transferability of the generated adversarial sam… ▽ More

    Submitted 26 April, 2022; originally announced April 2022.