Skip to main content

Showing 1–50 of 87 results for author: Ji, C

  1. arXiv:2407.09642  [pdf, other

    cs.LG

    Seq-to-Final: A Benchmark for Tuning from Sequential Distributions to a Final Time Point

    Authors: Christina X Ji, Ahmed M Alaa, David Sontag

    Abstract: Distribution shift over time occurs in many settings. Leveraging historical data is necessary to learn a model for the last time point when limited data is available in the final period, yet few methods have been developed specifically for this purpose. In this work, we construct a benchmark with different sequences of synthetic shifts to evaluate the effectiveness of 3 classes of methods that 1)… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

  2. arXiv:2407.00615  [pdf, other

    cs.LG

    GC-Bench: An Open and Unified Benchmark for Graph Condensation

    Authors: Qingyun Sun, Ziying Chen, Beining Yang, Cheng Ji, Xingcheng Fu, Sheng Zhou, Hao Peng, Jianxin Li, Philip S. Yu

    Abstract: Graph condensation (GC) has recently garnered considerable attention due to its ability to reduce large-scale graph datasets while preserving their essential properties. The core concept of GC is to create a smaller, more manageable graph that retains the characteristics of the original graph. Despite the proliferation of graph condensation methods developed in recent years, there is no comprehens… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: The Thirty-eight Conference on Neural Information Processing Systems Datasets and Benchmarks Track (Preprint, under review)

  3. arXiv:2406.18179  [pdf, other

    cs.LG cs.DB

    DeepExtremeCubes: Integrating Earth system spatio-temporal data for impact assessment of climate extremes

    Authors: Chaonan Ji, Tonio Fincke, Vitus Benson, Gustau Camps-Valls, Miguel-Angel Fernandez-Torres, Fabian Gans, Guido Kraemer, Francesco Martinuzzi, David Montero, Karin Mora, Oscar J. Pellicer-Valero, Claire Robin, Maximilian Soechting, Melanie Weynants, Miguel D. Mahecha

    Abstract: With climate extremes' rising frequency and intensity, robust analytical tools are crucial to predict their impacts on terrestrial ecosystems. Machine learning techniques show promise but require well-structured, high-quality, and curated analysis-ready datasets. Earth observation datasets comprehensively monitor ecosystem dynamics and responses to climatic extremes, yet the data complexity can ch… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  4. arXiv:2406.18129  [pdf, other

    cs.CV cs.LG

    CTS: Sim-to-Real Unsupervised Domain Adaptation on 3D Detection

    Authors: Meiying Zhang, Weiyuan Peng, Guangyao Ding, Chenyang Lei, Chunlin Ji, Qi Hao

    Abstract: Simulation data can be accurately labeled and have been expected to improve the performance of data-driven algorithms, including object detection. However, due to the various domain inconsistencies from simulation to reality (sim-to-real), cross-domain object detection algorithms usually suffer from dramatic performance drops. While numerous unsupervised domain adaptation (UDA) methods have been d… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  5. arXiv:2406.13922  [pdf, ps, other

    cs.IT

    Explicit Performance Bound of Finite Blocklength Coded MIMO: Time-Domain versus Spatiotemporal Channel Coding

    Authors: Feng Ye, Xiaohu You, Jiamin Li, Chuan Zhang, Chen Ji

    Abstract: In the sixth generation (6G), ultra-reliable low-latency communications (URLLC) will further develop to achieve TKu extreme connectivity, and multiple-input multiple-output (MIMO) is expected to be a key enabler for its realization. Since the latency constraint can be represented by the blocklength of a codeword, it is essential to analyze different coded MIMO schemes under finite blocklength regi… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 9 pages, 5 figures

  6. arXiv:2406.05980  [pdf, other

    cs.CV

    Causality-inspired Latent Feature Augmentation for Single Domain Generalization

    Authors: Jian Xu, Chaojie Ji, Yankai Cao, Ye Li, Ruxin Wang

    Abstract: Single domain generalization (Single-DG) intends to develop a generalizable model with only one single training domain to perform well on other unknown target domains. Under the domain-hungry configuration, how to expand the coverage of source domain and find intrinsic causal features across different distributions is the key to enhancing the models' generalization ability. Existing methods mainly… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  7. arXiv:2405.17894  [pdf, other

    cs.CV cs.AI

    White-box Multimodal Jailbreaks Against Large Vision-Language Models

    Authors: Ruofan Wang, Xingjun Ma, Hanxu Zhou, Chuanjun Ji, Guangnan Ye, Yu-Gang Jiang

    Abstract: Recent advancements in Large Vision-Language Models (VLMs) have underscored their superiority in various multimodal tasks. However, the adversarial robustness of VLMs has not been fully explored. Existing methods mainly assess robustness through unimodal adversarial attacks that perturb images, while assuming inherent resilience against text-based attacks. Different from existing attacks, in this… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  8. arXiv:2404.13105  [pdf, other

    cs.DB cs.CV cs.LG

    On-Demand Earth System Data Cubes

    Authors: David Montero, César Aybar, Chaonan Ji, Guido Kraemer, Maximilian Söchting, Khalil Teber, Miguel D. Mahecha

    Abstract: Advancements in Earth system science have seen a surge in diverse datasets. Earth System Data Cubes (ESDCs) have been introduced to efficiently handle this influx of high-dimensional data. ESDCs offer a structured, intuitive framework for data analysis, organising information within spatio-temporal grids. The structured nature of ESDCs unlocks significant opportunities for Artificial Intelligence… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: Accepted at IGARSS24

  9. arXiv:2404.12006  [pdf, other

    cs.CL

    Variational Multi-Modal Hypergraph Attention Network for Multi-Modal Relation Extraction

    Authors: Qian Li, Cheng Ji, Shu Guo, Yong Zhao, Qianren Mao, Shangguang Wang, Yuntao Wei, Jianxin Li

    Abstract: Multi-modal relation extraction (MMRE) is a challenging task that aims to identify relations between entities in text leveraging image information. Existing methods are limited by their neglect of the multiple entity pairs in one sentence sharing very similar contextual information (ie, the same text and image), resulting in increased difficulty in the MMRE task. To address this limitation, we pro… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  10. "That's Not Good Science!": An Argument for the Thoughtful Use of Formative Situations in Research through Design

    Authors: Raquel B Robinson, Anya Osborne, Chen Ji, James Collin Fey, Ella Dagan, Katherine Isbister

    Abstract: Most currently accepted approaches to evaluating Research through Design (RtD) presume that design prototypes are finalized and ready for robust testing in laboratory or in-the-wild settings. However, it is also valuable to assess designs at intermediate phases with mid-fidelity prototypes, not just to inform an ongoing design process, but also to glean knowledge of broader use to the research com… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 8 pages, 1 figure

  11. arXiv:2404.00340  [pdf, other

    cs.RO eess.SY

    Deep Reinforcement Learning in Autonomous Car Path Planning and Control: A Survey

    Authors: Yiyang Chen, Chao Ji, Yunrui Cai, Tong Yan, Bo Su

    Abstract: Combining data-driven applications with control systems plays a key role in recent Autonomous Car research. This thesis offers a structured review of the latest literature on Deep Reinforcement Learning (DRL) within the realm of autonomous vehicle Path Planning and Control. It collects a series of DRL methodologies and algorithms and their applications in the field, focusing notably on their roles… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

  12. arXiv:2403.16481  [pdf, other

    cs.CV

    REFRAME: Reflective Surface Real-Time Rendering for Mobile Devices

    Authors: Chaojie Ji, Yufeng Li, Yiyi Liao

    Abstract: This work tackles the challenging task of achieving real-time novel view synthesis on various scenes, including highly reflective objects and unbounded outdoor scenes. Existing real-time rendering methods, especially those based on meshes, often have subpar performance in modeling surfaces with rich view-dependent appearances. Our key idea lies in leveraging meshes for rendering acceleration while… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Project Page:https://xdimlab.github.io/REFRAME/

  13. arXiv:2403.04521  [pdf, other

    cs.CL

    Uncertainty-Aware Relational Graph Neural Network for Few-Shot Knowledge Graph Completion

    Authors: Qian Li, Shu Guo, Yinjia Chen, Cheng Ji, Jiawei Sheng, Jianxin Li

    Abstract: Few-shot knowledge graph completion (FKGC) aims to query the unseen facts of a relation given its few-shot reference entity pairs. The side effect of noises due to the uncertainty of entities and triples may limit the few-shot learning, but existing FKGC works neglect such uncertainty, which leads them more susceptible to limited reference samples with noises. In this paper, we propose a novel unc… ▽ More

    Submitted 21 March, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

  14. arXiv:2402.14309  [pdf, other

    cs.CV

    YOLO-TLA: An Efficient and Lightweight Small Object Detection Model based on YOLOv5

    Authors: Peng Gao, Chun-Lin Ji, Tao Yu, Ru-Yue Yuan

    Abstract: Object detection, a crucial aspect of computer vision, has seen significant advancements in accuracy and robustness. Despite these advancements, practical applications still face notable challenges, primarily the inaccurate detection or missed detection of small objects. In this paper, we propose YOLO-TLA, an advanced object detection model building on YOLOv5. We first introduce an additional dete… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: 11 pages, 11 figures, 7 tables

  15. arXiv:2402.06716  [pdf, other

    cs.LG cs.AI

    Dynamic Graph Information Bottleneck

    Authors: Haonan Yuan, Qingyun Sun, Xingcheng Fu, Cheng Ji, Jianxin Li

    Abstract: Dynamic Graphs widely exist in the real world, which carry complicated spatial and temporal feature patterns, challenging their representation learning. Dynamic Graph Neural Networks (DGNNs) have shown impressive predictive abilities by exploiting the intrinsic dynamics. However, DGNNs exhibit limited robustness, prone to adversarial attacks. This paper presents the novel Dynamic Graph Information… ▽ More

    Submitted 6 April, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: Accepted by the research tracks of The Web Conference 2024 (WWW 2024)

  16. arXiv:2401.02013  [pdf, other

    cs.LG

    SwitchTab: Switched Autoencoders Are Effective Tabular Learners

    Authors: Jing Wu, Suiyao Chen, Qi Zhao, Renat Sergazinov, Chen Li, Shengjie Liu, Chongchao Zhao, Tianpei Xie, Hanqing Guo, Cheng Ji, Daniel Cociorva, Hakan Brunzel

    Abstract: Self-supervised representation learning methods have achieved significant success in computer vision and natural language processing, where data samples exhibit explicit spatial or semantic dependencies. However, applying these methods to tabular data is challenging due to the less pronounced dependencies among data samples. In this paper, we address this limitation by introducing SwitchTab, a nov… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

    Journal ref: Association for the Advancement of Artificial Intelligence (AAAI), 2024

  17. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  18. arXiv:2312.11521  [pdf, other

    cs.CL cs.AI

    Large Language Models are Complex Table Parsers

    Authors: Bowen Zhao, Changkai Ji, Yuejie Zhang, Wen He, Yingwen Wang, Qing Wang, Rui Feng, Xiaobo Zhang

    Abstract: With the Generative Pre-trained Transformer 3.5 (GPT-3.5) exhibiting remarkable reasoning and comprehension abilities in Natural Language Processing (NLP), most Question Answering (QA) research has primarily centered around general QA tasks based on GPT, neglecting the specific challenges posed by Complex Table QA. In this paper, we propose to incorporate GPT-3.5 to address such challenges, in whi… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: EMNLP 2023 Main

  19. arXiv:2312.07956  [pdf, other

    cs.DC

    Topology-Dependent Privacy Bound For Decentralized Federated Learning

    Authors: Qiongxiu Li, Wenrui Yu, Changlong Ji, Richard Heusdens

    Abstract: Decentralized Federated Learning (FL) has attracted significant attention due to its enhanced robustness and scalability compared to its centralized counterpart. It pivots on peer-to-peer communication rather than depending on a central server for model aggregation. While prior research has delved into various factors of decentralized FL such as aggregation methods and privacy-preserving technique… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

  20. arXiv:2311.11114  [pdf, other

    cs.LG cs.AI

    Environment-Aware Dynamic Graph Learning for Out-of-Distribution Generalization

    Authors: Haonan Yuan, Qingyun Sun, Xingcheng Fu, Ziwei Zhang, Cheng Ji, Hao Peng, Jianxin Li

    Abstract: Dynamic graph neural networks (DGNNs) are increasingly pervasive in exploiting spatio-temporal patterns on dynamic graphs. However, existing works fail to generalize under distribution shifts, which are common in real-world scenarios. As the generation of dynamic graphs is heavily influenced by latent environments, investigating their impacts on the out-of-distribution (OOD) generalization is crit… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

    Comments: Accepted by the 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  21. arXiv:2311.06952  [pdf, other

    cs.LG math.OC

    A GPU-Accelerated Moving-Horizon Algorithm for Training Deep Classification Trees on Large Datasets

    Authors: Jiayang Ren, Valentín Osuna-Enciso, Morimasa Okamoto, Qiangqiang Mao, Chaojie Ji, Liang Cao, Kaixun Hua, Yankai Cao

    Abstract: Decision trees are essential yet NP-complete to train, prompting the widespread use of heuristic methods such as CART, which suffers from sub-optimal performance due to its greedy nature. Recently, breakthroughs in finding optimal decision trees have emerged; however, these methods still face significant computational costs and struggle with continuous features in large-scale datasets and deep tre… ▽ More

    Submitted 12 November, 2023; originally announced November 2023.

    Comments: 36 pages (13 pages for the main body, 23 pages for the appendix), 7 figures

  22. arXiv:2311.01862  [pdf, other

    cs.CL cs.DB

    $R^3$-NL2GQL: A Model Coordination and Knowledge Graph Alignment Approach for NL2GQL

    Authors: Yuhang Zhou, Yu He, Siyu Tian, Yuchen Ni, Zhangyue Yin, Xiang Liu, Chuanjun Ji, Sen Liu, Xipeng Qiu, Guangnan Ye, Hongfeng Chai

    Abstract: While current tasks of converting natural language to SQL (NL2SQL) using Foundation Models have shown impressive achievements, adapting these approaches for converting natural language to Graph Query Language (NL2GQL) encounters hurdles due to the distinct nature of GQL compared to SQL, alongside the diverse forms of GQL. Moving away from traditional rule-based and slot-filling methodologies, we i… ▽ More

    Submitted 1 July, 2024; v1 submitted 3 November, 2023; originally announced November 2023.

  23. arXiv:2310.09192  [pdf, other

    cs.LG cs.AI

    Does Graph Distillation See Like Vision Dataset Counterpart?

    Authors: Beining Yang, Kai Wang, Qingyun Sun, Cheng Ji, Xingcheng Fu, Hao Tang, Yang You, Jianxin Li

    Abstract: Training on large-scale graphs has achieved remarkable results in graph representation learning, but its cost and storage have attracted increasing concerns. Existing graph condensation methods primarily focus on optimizing the feature matrices of condensed graphs while overlooking the impact of the structure information from the original graphs. To investigate the impact of the structure informat… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: Accepted by NeurIPS 2023

  24. arXiv:2310.06365  [pdf, other

    cs.CL

    Multi-Modal Knowledge Graph Transformer Framework for Multi-Modal Entity Alignment

    Authors: Qian Li, Cheng Ji, Shu Guo, Zhaoji Liang, Lihong Wang, Jianxin Li

    Abstract: Multi-Modal Entity Alignment (MMEA) is a critical task that aims to identify equivalent entity pairs across multi-modal knowledge graphs (MMKGs). However, this task faces challenges due to the presence of different types of information, including neighboring entities, multi-modal attributes, and entity types. Directly incorporating the above information (e.g., concatenation or attention) can lead… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

  25. arXiv:2308.14507  [pdf, other

    math.ST cs.IT cs.LG math.PR stat.ML

    Spectral Estimators for Structured Generalized Linear Models via Approximate Message Passing

    Authors: Yihan Zhang, Hong Chang Ji, Ramji Venkataramanan, Marco Mondelli

    Abstract: We consider the problem of parameter estimation in a high-dimensional generalized linear model. Spectral methods obtained via the principal eigenvector of a suitable data-dependent matrix provide a simple yet surprisingly effective solution. However, despite their wide use, a rigorous performance characterization, as well as a principled way to preprocess the data, are available only for unstructu… ▽ More

    Submitted 3 July, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

  26. arXiv:2308.07991  [pdf, other

    eess.SP cs.NI

    Demo: Reconfigurable Distributed Antennas and Reflecting Surface (RDARS)-aided Integrated Sensing and Communication System

    Authors: Jintao Wang, Chengwang Ji, Jiajia Guo, Shaodan Ma

    Abstract: Integrated sensing and communication (ISAC) system has been envisioned as a promising technology to be applied in future applications requiring both communication and high-accuracy sensing. Different from most research focusing on theoretical analysis and optimization in the area of ISAC, we implement a reconfigurable distributed antennas and reflecting surfaces (RDARS)-aided ISAC system prototype… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

    Comments: 2 pages, 3 figures. Accepted by IEEE/CIC International Conference on Communications in China, Dalian, China, 2023

  27. A Survey of Beam Management for mmWave and THz Communications Towards 6G

    Authors: Qing Xue, Chengwang Ji, Shaodan Ma, Jiajia Guo, Yongjun Xu, Qianbin Chen, Wei Zhang

    Abstract: Communication in millimeter wave (mmWave) and even terahertz (THz) frequency bands is ushering in a new era of wireless communications. Beam management, namely initial access and beam tracking, has been recognized as an essential technique to ensure robust mmWave/THz communications, especially for mobile scenarios. However, narrow beams at higher carrier frequency lead to huge beam measurement ove… ▽ More

    Submitted 6 February, 2024; v1 submitted 4 August, 2023; originally announced August 2023.

    Comments: accepted by IEEE Communications Surveys & Tutorials

  28. arXiv:2306.11020  [pdf, other

    cs.CL

    Dual-Gated Fusion with Prefix-Tuning for Multi-Modal Relation Extraction

    Authors: Qian Li, Shu Guo, Cheng Ji, Xutan Peng, Shiyao Cui, Jianxin Li

    Abstract: Multi-Modal Relation Extraction (MMRE) aims at identifying the relation between two entities in texts that contain visual clues. Rich visual content is valuable for the MMRE task, but existing works cannot well model finer associations among different modalities, failing to capture the truly helpful visual information and thus limiting relation extraction performance. In this paper, we propose a n… ▽ More

    Submitted 19 June, 2023; originally announced June 2023.

  29. Adaptive Ordered Information Extraction with Deep Reinforcement Learning

    Authors: Wenhao Huang, Jiaqing Liang, Zhixu Li, Yanghua Xiao, Chuanjun Ji

    Abstract: Information extraction (IE) has been studied extensively. The existing methods always follow a fixed extraction order for complex IE tasks with multiple elements to be extracted in one instance such as event extraction. However, we conduct experiments on several complex IE datasets and observe that different extraction orders can significantly affect the extraction results for a great portion of i… ▽ More

    Submitted 19 June, 2023; originally announced June 2023.

    Comments: Accepted to ACL 2023 Findings

  30. arXiv:2305.05087  [pdf, other

    cs.LG

    Large-Scale Study of Temporal Shift in Health Insurance Claims

    Authors: Christina X Ji, Ahmed M Alaa, David Sontag

    Abstract: Most machine learning models for predicting clinical outcomes are developed using historical data. Yet, even if these models are deployed in the near future, dataset shift over time may result in less than ideal performance. To capture this phenomenon, we consider a task--that is, an outcome to be predicted at a particular time point--to be non-stationary if a historical model is no longer optimal… ▽ More

    Submitted 18 June, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: To appear as an oral spotlight and poster at Conference on Health, Inference, and Learning (CHIL) 2023

  31. arXiv:2304.14760  [pdf, ps, other

    cs.AI cs.LG cs.LO

    A New Class of Explanations for Classifiers with Non-Binary Features

    Authors: Chunxi Ji, Adnan Darwiche

    Abstract: Two types of explanations have been receiving increased attention in the literature when analyzing the decisions made by classifiers. The first type explains why a decision was made and is known as a sufficient reason for the decision, also an abductive explanation or a PI-explanation. The second type explains why some other decision was not made and is known as a necessary reason for the decision… ▽ More

    Submitted 22 July, 2023; v1 submitted 28 April, 2023; originally announced April 2023.

    Comments: Will appear in proceedings of the 18th European Conference on Logics in Artificial Intelligence, JELIA 2023

  32. arXiv:2304.01563  [pdf, other

    cs.CL

    Attribute-Consistent Knowledge Graph Representation Learning for Multi-Modal Entity Alignment

    Authors: Qian Li, Shu Guo, Yangyifei Luo, Cheng Ji, Lihong Wang, Jiawei Sheng, Jianxin Li

    Abstract: The multi-modal entity alignment (MMEA) aims to find all equivalent entity pairs between multi-modal knowledge graphs (MMKGs). Rich attributes and neighboring entities are valuable for the alignment task, but existing works ignore contextual gap problems that the aligned entities have different numbers of attributes on specific modality when learning entity representations. In this paper, we propo… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

  33. arXiv:2302.09419  [pdf, other

    cs.AI cs.CL cs.LG

    A Comprehensive Survey on Pretrained Foundation Models: A History from BERT to ChatGPT

    Authors: Ce Zhou, Qian Li, Chen Li, Jun Yu, Yixin Liu, Guangjing Wang, Kai Zhang, Cheng Ji, Qiben Yan, Lifang He, Hao Peng, Jianxin Li, Jia Wu, Ziwei Liu, Pengtao Xie, Caiming Xiong, Jian Pei, Philip S. Yu, Lichao Sun

    Abstract: Pretrained Foundation Models (PFMs) are regarded as the foundation for various downstream tasks with different data modalities. A PFM (e.g., BERT, ChatGPT, and GPT-4) is trained on large-scale data which provides a reasonable parameter initialization for a wide range of downstream applications. BERT learns bidirectional encoder representations from Transformers, which are trained on large datasets… ▽ More

    Submitted 1 May, 2023; v1 submitted 18 February, 2023; originally announced February 2023.

    Comments: 99 pages, 16 figures

  34. arXiv:2301.12104  [pdf, other

    cs.LG cs.AI

    Unbiased and Efficient Self-Supervised Incremental Contrastive Learning

    Authors: Cheng Ji, Jianxin Li, Hao Peng, Jia Wu, Xingcheng Fu, Qingyun Sun, Phillip S. Yu

    Abstract: Contrastive Learning (CL) has been proved to be a powerful self-supervised approach for a wide range of domains, including computer vision and graph representation learning. However, the incremental learning issue of CL has rarely been studied, which brings the limitation in applying it to real-world applications. Contrastive learning identifies the samples with the negative ones from the noise di… ▽ More

    Submitted 28 January, 2023; originally announced January 2023.

  35. arXiv:2301.00061  [pdf, other

    math.OC cs.LG

    A Global Optimization Algorithm for K-Center Clustering of One Billion Samples

    Authors: Jiayang Ren, Ningning You, Kaixun Hua, Chaojie Ji, Yankai Cao

    Abstract: This paper presents a practical global optimization algorithm for the K-center clustering problem, which aims to select K samples as the cluster centers to minimize the maximum within-cluster distance. This algorithm is based on a reduced-space branch and bound scheme and guarantees convergence to the global optimum in a finite number of steps by only branching on the regions of centers. To improv… ▽ More

    Submitted 30 December, 2022; originally announced January 2023.

    Comments: 34 pages, 6 figures, and 5 tables

  36. arXiv:2212.03421  [pdf, other

    cs.LG cs.CV stat.AP

    Capturing the Flow of Art History

    Authors: Chenxi Ji

    Abstract: Do we really understand how machine classifies art styles? Historically, art is perceived and interpreted by human eyes and there are always controversial discussions over how people identify and understand art. Historians and general public tend to interpret the subject matter of art through the context of history and social factors. Style, however, is different from subject matter. Given the fac… ▽ More

    Submitted 6 December, 2022; originally announced December 2022.

  37. arXiv:2211.08168  [pdf, other

    cs.CL

    Type Information Utilized Event Detection via Multi-Channel GNNs in Electrical Power Systems

    Authors: Qian Li, Jianxin Li, Lihong Wang, Cheng Ji, Yiming Hei, Jiawei Sheng, Qingyun Sun, Shan Xue, Pengtao Xie

    Abstract: Event detection in power systems aims to identify triggers and event types, which helps relevant personnel respond to emergencies promptly and facilitates the optimization of power supply strategies. However, the limited length of short electrical record texts causes severe information sparsity, and numerous domain-specific terminologies of power systems makes it difficult to transfer knowledge fr… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

  38. arXiv:2210.04623  [pdf, other

    cs.DC

    DeltaFS: Pursuing Zero Update Overhead via Metadata-Enabled Delta Compression for Log-structured File System on Mobile Devices

    Authors: Chao Wu, Cheng Ji, Geng Yuan, Riwei Pan, Weichao Guo, Chao Yu, Zongwei Zhu, Yanzhi Wang

    Abstract: Data compression has been widely adopted to release mobile devices from intensive write pressure. Delta compression is particularly promising for its high compression efficacy over conventional compression methods. However, this method suffers from non-trivial system overheads incurred by delta maintenance and read penalty, which prevents its applicability on mobile devices. To this end, this pape… ▽ More

    Submitted 6 October, 2022; originally announced October 2022.

  39. Position-aware Structure Learning for Graph Topology-imbalance by Relieving Under-reaching and Over-squashing

    Authors: Qingyun Sun, Jianxin Li, Haonan Yuan, Xingcheng Fu, Hao Peng, Cheng Ji, Qian Li, Philip S. Yu

    Abstract: Topology-imbalance is a graph-specific imbalance problem caused by the uneven topology positions of labeled nodes, which significantly damages the performance of GNNs. What topology-imbalance means and how to measure its impact on graph learning remain under-explored. In this paper, we provide a new understanding of topology-imbalance from a global view of the supervision information distribution… ▽ More

    Submitted 17 August, 2022; originally announced August 2022.

    Comments: Accepted by CIKM 2022

  40. arXiv:2207.04750  [pdf, other

    cs.CV

    Geometry-aware Single-image Full-body Human Relighting

    Authors: Chaonan Ji, Tao Yu, Kaiwen Guo, Jingxin Liu, Yebin Liu

    Abstract: Single-image human relighting aims to relight a target human under new lighting conditions by decomposing the input image into albedo, shape and lighting. Although plausible relighting results can be achieved, previous methods suffer from both the entanglement between albedo and lighting and the lack of hard shadows, which significantly decrease the realism. To tackle these two problems, we propos… ▽ More

    Submitted 12 July, 2022; v1 submitted 11 July, 2022; originally announced July 2022.

    Comments: accepted by ECCV2022

  41. arXiv:2207.02031  [pdf, other

    cs.CV

    AvatarCap: Animatable Avatar Conditioned Monocular Human Volumetric Capture

    Authors: Zhe Li, Zerong Zheng, Hongwen Zhang, Chaonan Ji, Yebin Liu

    Abstract: To address the ill-posed problem caused by partial observations in monocular human volumetric capture, we present AvatarCap, a novel framework that introduces animatable avatars into the capture pipeline for high-fidelity reconstruction in both visible and invisible regions. Our method firstly creates an animatable avatar for the subject from a small number (~20) of 3D scans as a prior. Then given… ▽ More

    Submitted 12 July, 2022; v1 submitted 5 July, 2022; originally announced July 2022.

    Comments: Accepted by ECCV 2022, project page: http://www.liuyebin.com/avatarcap/avatarcap.html, code: https://github.com/lizhe00/AvatarCap

  42. arXiv:2207.01697  [pdf, other

    cs.CV

    BYHE: A Simple Framework for Boosting End-to-end Video-based Heart Rate Measurement Network

    Authors: Weiyu Sun, Xinyu Zhang, Ying Chen, Yun Ge, Chunyu Ji, Xiaolin Huang

    Abstract: Heart rate measuring based on remote photoplethysmography (rPPG) plays an important role in health caring, which estimates heart rate from facial video in a non-contact, less-constrained way. End-to-end neural network is a main branch of rPPG-based heart rate estimation methods, whose trait is recovering rPPG signal containing sufficient heart rate message from original facial video directly. Howe… ▽ More

    Submitted 27 September, 2022; v1 submitted 4 July, 2022; originally announced July 2022.

  43. arXiv:2206.07243  [pdf, ps, other

    cs.IT

    Closed-form Approximation for Performance Bound of Finite Blocklength Massive MIMO Transmission

    Authors: Xiaohu You, Bin Sheng, Yongming Huang, Wei Xu, Chuan Zhang, Dongming Wang, Pengcheng Zhu, Chen Ji

    Abstract: Ultra-reliable low latency communications (uRLLC) is adopted in the fifth generation (5G) mobile networks to better support mission-critical applications that demand high level of reliability and low latency. With the aid of well-established multiple-input multiple-output (MIMO) information theory, uRLLC in the future 6G is expected to provide enhanced capability towards extreme connectivity. Sinc… ▽ More

    Submitted 19 July, 2023; v1 submitted 14 June, 2022; originally announced June 2022.

  44. arXiv:2203.10451  [pdf, other

    cs.AI cs.LG cs.LO

    On the Computation of Necessary and Sufficient Explanations

    Authors: Adnan Darwiche, Chunxi Ji

    Abstract: The complete reason behind a decision is a Boolean formula that characterizes why the decision was made. This recently introduced notion has a number of applications, which include generating explanations, detecting decision bias and evaluating counterfactual queries. Prime implicants of the complete reason are known as sufficient reasons for the decision and they correspond to what is known as PI… ▽ More

    Submitted 20 March, 2022; originally announced March 2022.

    Comments: To appear in the proceedings of AAAI 2022

  45. arXiv:2203.01604  [pdf, other

    cs.LG cs.SI

    Curvature Graph Generative Adversarial Networks

    Authors: Jianxin Li, Xingcheng Fu, Qingyun Sun, Cheng Ji, Jiajun Tan, Jia Wu, Hao Peng

    Abstract: Generative adversarial network (GAN) is widely used for generalized and robust learning on graph data. However, for non-Euclidean graph data, the existing GAN-based graph representation methods generate negative samples by random walk or traverse in discrete space, leading to the information loss of topological properties (e.g. hierarchy and circularity). Moreover, due to the topological heterogen… ▽ More

    Submitted 3 March, 2022; originally announced March 2022.

    Comments: Accepted by Web Conference (WWW) 2022

  46. arXiv:2201.03166  [pdf, other

    cs.IT cs.AR

    Spatiotemporal 2-D Channel Coding for Very Low Latency Reliable MIMO Transmission

    Authors: Xiaohu You, Chuan Zhang, Bin Sheng, Yongming Huang, Chen Ji, Yifei Shen, Wenyue Zhou, Jian Liu

    Abstract: To fully support vertical industries, 5G and its corresponding channel coding are expected to meet requirements of different applications. However, for applications of 5G and beyond 5G (B5G) such as URLLC, the transmission latency is required to be much shorter than that in eMBB. Therefore, the resulting channel code length reduces drastically. In this case, the traditional 1-D channel coding suff… ▽ More

    Submitted 10 January, 2022; originally announced January 2022.

  47. arXiv:2112.08903  [pdf, other

    cs.LG cs.AI

    Graph Structure Learning with Variational Information Bottleneck

    Authors: Qingyun Sun, Jianxin Li, Hao Peng, Jia Wu, Xingcheng Fu, Cheng Ji, Philip S. Yu

    Abstract: Graph Neural Networks (GNNs) have shown promising results on a broad spectrum of applications. Most empirical studies of GNNs directly take the observed graph as input, assuming the observed structure perfectly depicts the accurate and complete relations between nodes. However, graphs in the real world are inevitably noisy or incomplete, which could even exacerbate the quality of graph representat… ▽ More

    Submitted 16 December, 2021; originally announced December 2021.

    Comments: Accepted by AAAI 2022, Preprint version with Appendix

  48. arXiv:2112.03134  [pdf, other

    cs.LG cs.CV

    Prototypical Model with Novel Information-theoretic Loss Function for Generalized Zero Shot Learning

    Authors: Chunlin Ji, Hanchu Shen, Zhan Xiong, Feng Chen, Meiying Zhang, Huiwen Yang

    Abstract: Generalized zero shot learning (GZSL) is still a technical challenge of deep learning as it has to recognize both source and target classes without data from target classes. To preserve the semantic relation between source and target classes when only trained with data from source classes, we address the quantification of the knowledge transfer and semantic relation from an information-theoretic v… ▽ More

    Submitted 6 December, 2021; originally announced December 2021.

  49. arXiv:2111.08274  [pdf, other

    cs.LG cs.AI

    HADFL: Heterogeneity-aware Decentralized Federated Learning Framework

    Authors: Jing Cao, Zirui Lian, Weihong Liu, Zongwei Zhu, Cheng Ji

    Abstract: Federated learning (FL) supports training models on geographically distributed devices. However, traditional FL systems adopt a centralized synchronous strategy, putting high communication pressure and model generalization challenge. Existing optimizations on FL either fail to speedup training on heterogeneous devices or suffer from poor communication efficiency. In this paper, we propose HADFL, a… ▽ More

    Submitted 16 November, 2021; originally announced November 2021.

    Comments: Accepted by DAC 2021

  50. arXiv:2110.14508  [pdf, other

    cs.LG cs.AI

    Finding Regions of Heterogeneity in Decision-Making via Expected Conditional Covariance

    Authors: Justin Lim, Christina X Ji, Michael Oberst, Saul Blecker, Leora Horwitz, David Sontag

    Abstract: Individuals often make different decisions when faced with the same context, due to personal preferences and background. For instance, judges may vary in their leniency towards certain drug-related offenses, and doctors may vary in their preference for how to start treatment for certain types of patients. With these examples in mind, we present an algorithm for identifying types of contexts (e.g.,… ▽ More

    Submitted 27 October, 2021; originally announced October 2021.

    Comments: To appear in NeurIPS 2021