Skip to main content

Showing 1–50 of 105 results for author: Yan, F

  1. arXiv:2407.02842  [pdf, other

    cs.CV cs.AI cs.CL

    MindBench: A Comprehensive Benchmark for Mind Map Structure Recognition and Analysis

    Authors: Lei Chen, Feng Yan, Yujie Zhong, Shaoxiang Chen, Zequn Jie, Lin Ma

    Abstract: Multimodal Large Language Models (MLLM) have made significant progress in the field of document analysis. Despite this, existing benchmarks typically focus only on extracting text and simple layout information, neglecting the complex interactions between elements in structured documents such as mind maps and flowcharts. To address this issue, we introduce the new benchmark named MindBench, which n… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: technical report

  2. arXiv:2406.18977  [pdf, other

    cs.RO cs.CL cs.CV

    RoboUniView: Visual-Language Model with Unified View Representation for Robotic Manipulaiton

    Authors: Fanfan Liu, Feng Yan, Liming Zheng, Chengjian Feng, Yiyang Huang, Lin Ma

    Abstract: Utilizing Vision-Language Models (VLMs) for robotic manipulation represents a novel paradigm, aiming to enhance the model's ability to generalize to new objects and instructions. However, due to variations in camera specifications and mounting positions, existing methods exhibit significant performance disparities across different robotic platforms. To address this challenge, we propose RoboUniVie… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  3. arXiv:2405.14292  [pdf, other

    cs.CV cs.RO

    A New Method in Facial Registration in Clinics Based on Structure Light Images

    Authors: Pengfei Li, Ziyue Ma, Hong Wang, Juan Deng, Yan Wang, Zhenyu Xu, Feng Yan, Wenjun Tu, Hong Sha

    Abstract: Background and Objective: In neurosurgery, fusing clinical images and depth images that can improve the information and details is beneficial to surgery. We found that the registration of face depth images was invalid frequently using existing methods. To abundant traditional image methods with depth information, a method in registering with depth images and traditional clinical images was investi… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  4. arXiv:2405.00839  [pdf, other

    cs.LG cs.AI cs.DC cs.MA cs.PF

    Communication-Efficient Training Workload Balancing for Decentralized Multi-Agent Learning

    Authors: Seyed Mahmoud Sajjadi Mohammadabadi, Lei Yang, Feng Yan, Junshan Zhang

    Abstract: Decentralized Multi-agent Learning (DML) enables collaborative model training while preserving data privacy. However, inherent heterogeneity in agents' resources (computation, communication, and task size) may lead to substantial variations in training time. This heterogeneity creates a bottleneck, lengthening the overall training time due to straggler effects and potentially wasting spare resourc… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: This paper has been accepted for presentation at ICDCS (44th IEEE International Conference on Distributed Computing Systems). Keywords: decentralized multi-agent learning, federated learning, edge computing, heterogeneous agents, workload balancing, and communication-efficient training )

  5. arXiv:2404.17152  [pdf, other

    cs.CV

    CSCO: Connectivity Search of Convolutional Operators

    Authors: Tunhou Zhang, Shiyu Li, Hsin-Pai Cheng, Feng Yan, Hai Li, Yiran Chen

    Abstract: Exploring dense connectivity of convolutional operators establishes critical "synapses" to communicate feature vectors from different levels and enriches the set of transformations on Computer Vision applications. Yet, even with heavy-machinery approaches such as Neural Architecture Search (NAS), discovering effective connectivity patterns requires tremendous efforts due to either constrained conn… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: To appear on Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops (2024)

  6. MalleTrain: Deep Neural Network Training on Unfillable Supercomputer Nodes

    Authors: Xiaolong Ma, Feng Yan, Lei Yang, Ian Foster, Michael E. Papka, Zhengchun Liu, Rajkumar Kettimuthu

    Abstract: First-come first-serve scheduling can result in substantial (up to 10%) of transiently idle nodes on supercomputers. Recognizing that such unfilled nodes are well-suited for deep neural network (DNN) training, due to the flexible nature of DNN training tasks, Liu et al. proposed that the re-scaling DNN training tasks to fit gaps in schedules be formulated as a mixed-integer linear programming (MIL… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  7. arXiv:2404.14661  [pdf, other

    cs.CV astro-ph.EP cs.LG

    First Mapping the Canopy Height of Primeval Forests in the Tallest Tree Area of Asia

    Authors: Guangpeng Fan, Fei Yan, Xiangquan Zeng, Qingtao Xu, Ruoyoulan Wang, Binghong Zhang, Jialing Zhou, Liangliang Nan, Jinhu Wang, Zhiwei Zhang, Jia Wang

    Abstract: We have developed the world's first canopy height map of the distribution area of world-level giant trees. This mapping is crucial for discovering more individual and community world-level giant trees, and for analyzing and quantifying the effectiveness of biodiversity conservation measures in the Yarlung Tsangpo Grand Canyon (YTGC) National Nature Reserve. We proposed a method to map the canopy h… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  8. arXiv:2404.05624  [pdf

    cs.CL cs.AI

    LTNER: Large Language Model Tagging for Named Entity Recognition with Contextualized Entity Marking

    Authors: Faren Yan, Peng Yu, Xin Chen

    Abstract: The use of LLMs for natural language processing has become a popular trend in the past two years, driven by their formidable capacity for context comprehension and learning, which has inspired a wave of research from academics and industry professionals. However, for certain NLP tasks, such as NER, the performance of LLMs still falls short when compared to supervised learning methods. In our resea… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: 13 pages

  9. arXiv:2404.01617  [pdf, other

    cs.NI cs.LG cs.MM

    LLM-ABR: Designing Adaptive Bitrate Algorithms via Large Language Models

    Authors: Zhiyuan He, Aashish Gottipati, Lili Qiu, Francis Y. Yan, Xufang Luo, Kenuo Xu, Yuqing Yang

    Abstract: We present LLM-ABR, the first system that utilizes the generative capabilities of large language models (LLMs) to autonomously design adaptive bitrate (ABR) algorithms tailored for diverse network characteristics. Operating within a reinforcement learning framework, LLM-ABR empowers LLMs to design key components such as states and neural network architectures. We evaluate LLM-ABR across diverse ne… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  10. arXiv:2403.16497  [pdf, other

    cs.CV cs.LG

    PathoTune: Adapting Visual Foundation Model to Pathological Specialists

    Authors: Jiaxuan Lu, Fang Yan, Xiaofan Zhang, Yue Gao, Shaoting Zhang

    Abstract: As natural image understanding moves towards the pretrain-finetune era, research in pathology imaging is concurrently evolving. Despite the predominant focus on pretraining pathological foundation models, how to adapt foundation models to downstream tasks is little explored. For downstream adaptation, we propose the existence of two domain gaps, i.e., the Foundation-Task Gap and the Task-Instance… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Submitted to MICCAI 2024

  11. ACCESS: Assurance Case Centric Engineering of Safety-critical Systems

    Authors: Ran Wei, Simon Foster, Haitao Mei, Fang Yan, Ruizhe Yang, Ibrahim Habli, Colin O'Halloran, Nick Tudor, Tim Kelly, Yakoub Nemouchi

    Abstract: Assurance cases are used to communicate and assess confidence in critical system properties such as safety and security. Historically, assurance cases have been manually created documents, which are evaluated by system stakeholders through lengthy and complicated processes. In recent years, model-based system assurance approaches have gained popularity to improve the efficiency and quality of syst… ▽ More

    Submitted 16 April, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

  12. arXiv:2403.07974  [pdf, other

    cs.SE cs.CL cs.LG

    LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code

    Authors: Naman Jain, King Han, Alex Gu, Wen-Ding Li, Fanjia Yan, Tianjun Zhang, Sida Wang, Armando Solar-Lezama, Koushik Sen, Ion Stoica

    Abstract: Large Language Models (LLMs) applied to code-related applications have emerged as a prominent field, attracting significant interest from both academia and industry. However, as new and improved LLMs are developed, existing evaluation benchmarks (e.g., HumanEval, MBPP) are no longer sufficient for assessing their capabilities. In this work, we propose LiveCodeBench, a comprehensive and contaminati… ▽ More

    Submitted 6 June, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: Website - https://livecodebench.github.io/

  13. arXiv:2403.06324  [pdf, other

    cs.NI cs.MM

    ACM MMSys 2024 Bandwidth Estimation in Real Time Communications Challenge

    Authors: Sami Khairy, Gabriel Mittag, Vishak Gopal, Francis Y. Yan, Zhixiong Niu, Ezra Ameri, Scott Inglis, Mehrsa Golestaneh, Ross Cutler

    Abstract: The quality of experience (QoE) delivered by video conferencing systems to end users depends in part on correctly estimating the capacity of the bottleneck link between the sender and the receiver over time. Bandwidth estimation for real-time communications (RTC) remains a significant challenge, primarily due to the continuously evolving heterogeneous network architectures and technologies. From t… ▽ More

    Submitted 15 March, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

  14. arXiv:2403.01246  [pdf, other

    cs.CV

    Dual Graph Attention based Disentanglement Multiple Instance Learning for Brain Age Estimation

    Authors: Fanzhe Yan, Gang Yang, Yu Li, Aiping Liu, Xun Chen

    Abstract: Deep learning techniques have demonstrated great potential for accurately estimating brain age by analyzing Magnetic Resonance Imaging (MRI) data from healthy individuals. However, current methods for brain age estimation often directly utilize whole input images, overlooking two important considerations: 1) the heterogeneous nature of brain aging, where different brain regions may degenerate at d… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

    Comments: 12 pages, 9 figures

  15. arXiv:2403.00169  [pdf, other

    cs.LO cs.FL cs.SE

    Quantitative Assurance and Synthesis of Controllers from Activity Diagrams

    Authors: Kangfeng Ye, Fang Yan, Simos Gerasimou

    Abstract: Probabilistic model checking is a widely used formal verification technique to automatically verify qualitative and quantitative properties for probabilistic models. However, capturing such systems, writing corresponding properties, and verifying them require domain knowledge. This makes it not accessible for researchers and engineers who may not have the required knowledge. Previous studies have… ▽ More

    Submitted 29 February, 2024; originally announced March 2024.

    Comments: 43 pages, 29 figures, 5 tables, submitted to Journal of Systems and Software (JSS)

    ACM Class: D.2.4; F.3.1; F.3.2; F.4.3

  16. arXiv:2402.08645  [pdf, other

    cs.CV cs.LG

    Peeking Behind the Curtains of Residual Learning

    Authors: Tunhou Zhang, Feng Yan, Hai Li, Yiran Chen

    Abstract: The utilization of residual learning has become widespread in deep and scalable neural nets. However, the fundamental principles that contribute to the success of residual learning remain elusive, thus hindering effective training of plain nets with depth scalability. In this paper, we peek behind the curtains of residual learning by uncovering the "dissipating inputs" phenomenon that leads to con… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: Arxiv Preprint

  17. arXiv:2402.02797  [pdf, other

    cs.CV cs.LG

    Joint Attention-Guided Feature Fusion Network for Saliency Detection of Surface Defects

    Authors: Xiaoheng Jiang, Feng Yan, Yang Lu, Ke Wang, Shuai Guo, Tianzhu Zhang, Yanwei Pang, Jianwei Niu, Mingliang Xu

    Abstract: Surface defect inspection plays an important role in the process of industrial manufacture and production. Though Convolutional Neural Network (CNN) based defect inspection methods have made huge leaps, they still confront a lot of challenges such as defect scale variation, complex background, low contrast, and so on. To address these issues, we propose a joint attention-guided feature fusion netw… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  18. arXiv:2401.14159  [pdf, other

    cs.CV

    Grounded SAM: Assembling Open-World Models for Diverse Visual Tasks

    Authors: Tianhe Ren, Shilong Liu, Ailing Zeng, Jing Lin, Kunchang Li, He Cao, Jiayu Chen, Xinyu Huang, Yukang Chen, Feng Yan, Zhaoyang Zeng, Hao Zhang, Feng Li, Jie Yang, Hongyang Li, Qing Jiang, Lei Zhang

    Abstract: We introduce Grounded SAM, which uses Grounding DINO as an open-set object detector to combine with the segment anything model (SAM). This integration enables the detection and segmentation of any regions based on arbitrary text inputs and opens a door to connecting various vision models. As shown in Fig.1, a wide range of vision tasks can be achieved by using the versatile Grounded SAM pipeline.… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

  19. arXiv:2401.03204  [pdf, ps, other

    cs.CR

    The 4-adic complexity of quaternary sequences with low autocorrelation and high linear complexity

    Authors: Feifei Yan, Pinhui Ke, Lingmei Xiao

    Abstract: Recently, Jiang et al. proposed several new classes of quaternary sequences with low autocorrelation and high linear complexity by using the inverse Gray mapping (JAMC, \textbf{69} (2023): 689--706). In this paper, we estimate the 4-adic complexity of these quaternary sequences. Our results show that these sequences have large 4-adic complexity to resist the attack of the rational approximation al… ▽ More

    Submitted 6 January, 2024; originally announced January 2024.

  20. arXiv:2312.09894  [pdf, other

    cs.CV cs.AI

    PathoDuet: Foundation Models for Pathological Slide Analysis of H&E and IHC Stains

    Authors: Shengyi Hua, Fang Yan, Tianle Shen, Xiaofan Zhang

    Abstract: Large amounts of digitized histopathological data display a promising future for developing pathological foundation models via self-supervised learning methods. Foundation models pretrained with these methods serve as a good basis for downstream tasks. However, the gap between natural and histopathological images hinders the direct application of existing methods. In this work, we present PathoDue… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

  21. arXiv:2312.05642  [pdf, other

    cs.LG cs.AI cs.MA cs.PF

    Speed Up Federated Learning in Heterogeneous Environment: A Dynamic Tiering Approach

    Authors: Seyed Mahmoud Sajjadi Mohammadabadi, Syed Zawad, Feng Yan, Lei Yang

    Abstract: Federated learning (FL) enables collaboratively training a model while keeping the training data decentralized and private. However, one significant impediment to training a model using FL, especially large models, is the resource constraints of devices with heterogeneous computation and communication capacities as well as varying task sizes. Such heterogeneity would render significant variations… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

  22. arXiv:2311.18213  [pdf, other

    cs.IR cs.AI

    Beyond Two-Tower Matching: Learning Sparse Retrievable Cross-Interactions for Recommendation

    Authors: Liangcai Su, Fan Yan, Jieming Zhu, Xi Xiao, Haoyi Duan, Zhou Zhao, Zhenhua Dong, Ruiming Tang

    Abstract: Two-tower models are a prevalent matching framework for recommendation, which have been widely deployed in industrial applications. The success of two-tower matching attributes to its efficiency in retrieval among a large number of items, since the item tower can be precomputed and used for fast Approximate Nearest Neighbor (ANN) search. However, it suffers two main challenges, including limited f… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: Accepted by SIGIR 2023. Code will be available at https://reczoo.github.io/SparCode

  23. arXiv:2311.00231  [pdf, other

    cs.IR cs.LG

    DistDNAS: Search Efficient Feature Interactions within 2 Hours

    Authors: Tunhou Zhang, Wei Wen, Igor Fedorov, Xi Liu, Buyun Zhang, Fangqiu Han, Wen-Yen Chen, Yiping Han, Feng Yan, Hai Li, Yiran Chen

    Abstract: Search efficiency and serving efficiency are two major axes in building feature interactions and expediting the model development process in recommender systems. On large-scale benchmarks, searching for the optimal feature interaction design requires extensive cost due to the sequential workflow on the large volume of data. In addition, fusing interactions of various sources, orders, and mathemati… ▽ More

    Submitted 31 October, 2023; originally announced November 2023.

  24. arXiv:2310.20705  [pdf, other

    cs.LG cs.IR

    Farthest Greedy Path Sampling for Two-shot Recommender Search

    Authors: Yufan Cao, Tunhou Zhang, Wei Wen, Feng Yan, Hai Li, Yiran Chen

    Abstract: Weight-sharing Neural Architecture Search (WS-NAS) provides an efficient mechanism for developing end-to-end deep recommender models. However, in complex search spaces, distinguishing between superior and inferior architectures (or paths) is challenging. This challenge is compounded by the limited coverage of the supernet and the co-adaptation of subnet weights, which restricts the exploration and… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

    Comments: 9 pages, 5 figures

  25. arXiv:2309.13850  [pdf, other

    stat.ML cs.LG

    Statistical Perspective of Top-K Sparse Softmax Gating Mixture of Experts

    Authors: Huy Nguyen, Pedram Akbarian, Fanqi Yan, Nhat Ho

    Abstract: Top-K sparse softmax gating mixture of experts has been widely used for scaling up massive deep-learning architectures without increasing the computational cost. Despite its popularity in real-world applications, the theoretical understanding of that gating function has remained an open problem. The main challenge comes from the structure of the top-K sparse softmax gating function, which partitio… ▽ More

    Submitted 23 February, 2024; v1 submitted 24 September, 2023; originally announced September 2023.

    Comments: Accepted to ICLR 2024, 38 pages, 3 figures, 1 table

  26. arXiv:2309.12858  [pdf, other

    cs.IR cs.AI

    Diffusion Augmentation for Sequential Recommendation

    Authors: Qidong Liu, Fan Yan, Xiangyu Zhao, Zhaocheng Du, Huifeng Guo, Ruiming Tang, Feng Tian

    Abstract: Sequential recommendation (SRS) has become the technical foundation in many applications recently, which aims to recommend the next item based on the user's historical interactions. However, sequential recommendation often faces the problem of data sparsity, which widely exists in recommender systems. Besides, most users only interact with a few items, but existing SRS models often underperform th… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

  27. arXiv:2309.12641  [pdf, other

    cs.CV

    Global Context Aggregation Network for Lightweight Saliency Detection of Surface Defects

    Authors: Feng Yan, Xiaoheng Jiang, Yang Lu, Lisha Cui, Shupan Li, Jiale Cao, Mingliang Xu, Dacheng Tao

    Abstract: Surface defect inspection is a very challenging task in which surface defects usually show weak appearances or exist under complex backgrounds. Most high-accuracy defect detection methods require expensive computation and storage overhead, making them less practical in some resource-constrained defect detection applications. Although some lightweight methods have achieved real-time inference speed… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

  28. arXiv:2309.12639  [pdf, other

    cs.CV

    CINFormer: Transformer network with multi-stage CNN feature injection for surface defect segmentation

    Authors: Xiaoheng Jiang, Kaiyi Guo, Yang Lu, Feng Yan, Hao Liu, Jiale Cao, Mingliang Xu, Dacheng Tao

    Abstract: Surface defect inspection is of great importance for industrial manufacture and production. Though defect inspection methods based on deep learning have made significant progress, there are still some challenges for these methods, such as indistinguishable weak defects and defect-like interference in the background. To address these issues, we propose a transformer network with multi-stage CNN (Co… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

  29. HAMUR: Hyper Adapter for Multi-Domain Recommendation

    Authors: Xiaopeng Li, Fan Yan, Xiangyu Zhao, Yichao Wang, Bo Chen, Huifeng Guo, Ruiming Tang

    Abstract: Multi-Domain Recommendation (MDR) has gained significant attention in recent years, which leverages data from multiple domains to enhance their performance concurrently.However, current MDR models are confronted with two limitations. Firstly, the majority of these models adopt an approach that explicitly shares parameters between domains, leading to mutual interference among them. Secondly, due to… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: Accepted by CIKM'2023

  30. arXiv:2309.06006  [pdf, ps, other

    cs.CV cs.AI

    SoccerNet 2023 Challenges Results

    Authors: Anthony Cioppa, Silvio Giancola, Vladimir Somers, Floriane Magera, Xin Zhou, Hassan Mkhallati, Adrien Deliège, Jan Held, Carlos Hinojosa, Amir M. Mansourian, Pierre Miralles, Olivier Barnich, Christophe De Vleeschouwer, Alexandre Alahi, Bernard Ghanem, Marc Van Droogenbroeck, Abdullah Kamal, Adrien Maglo, Albert Clapés, Amr Abdelaziz, Artur Xarles, Astrid Orcesi, Atom Scott, Bin Liu, Byoungkwon Lim , et al. (77 additional authors not shown)

    Abstract: The SoccerNet 2023 challenges were the third annual video understanding challenges organized by the SoccerNet team. For this third edition, the challenges were composed of seven vision-based tasks split into three main themes. The first theme, broadcast video understanding, is composed of three high-level tasks related to describing events occurring in the video broadcasts: (1) action spotting, fo… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

  31. arXiv:2307.01878  [pdf, other

    cs.CL cs.AI

    KDSTM: Neural Semi-supervised Topic Modeling with Knowledge Distillation

    Authors: Weijie Xu, Xiaoyu Jiang, Jay Desai, Bin Han, Fuqin Yan, Francis Iannacci

    Abstract: In text classification tasks, fine tuning pretrained language models like BERT and GPT-3 yields competitive accuracy; however, both methods require pretraining on large text datasets. In contrast, general topic modeling methods possess the advantage of analyzing documents to extract meaningful patterns of words without the need of pretraining. To leverage topic modeling's unsupervised insights ext… ▽ More

    Submitted 11 February, 2024; v1 submitted 4 July, 2023; originally announced July 2023.

    Comments: 12 pages, 4 figures, ICLR 2022 Workshop

    MSC Class: 68T50 ACM Class: I.2.6

    Journal ref: ICLR 2022 Workshop PML4DC

  32. arXiv:2307.00734  [pdf, other

    physics.ao-ph cs.LG physics.flu-dyn

    On the choice of training data for machine learning of geostrophic mesoscale turbulence

    Authors: F. E. Yan, J. Mak, Y. Wang

    Abstract: 'Data' plays a central role in data-driven methods, but is not often the subject of focus in investigations of machine learning algorithms as applied to Earth System Modeling related problems. Here we consider the case of eddy-mean interaction in rotating stratified turbulence in the presence of lateral boundaries, a problem of relevance to ocean modeling, where the eddy fluxes contain dynamically… ▽ More

    Submitted 2 July, 2023; originally announced July 2023.

    Comments: 23 pages, 8 figures

  33. arXiv:2306.10209  [pdf, other

    cs.DC cs.AI cs.LG cs.PF

    ZeRO++: Extremely Efficient Collective Communication for Giant Model Training

    Authors: Guanhua Wang, Heyang Qin, Sam Ade Jacobs, Connor Holmes, Samyam Rajbhandari, Olatunji Ruwase, Feng Yan, Lei Yang, Yuxiong He

    Abstract: Zero Redundancy Optimizer (ZeRO) has been used to train a wide range of large language models on massive GPUs clusters due to its ease of use, efficiency, and good scalability. However, when training on low-bandwidth clusters, or at scale which forces batch size per GPU to be small, ZeRO's effective throughput is limited because of high communication volume from gathering weights in forward pass,… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: 12 pages

  34. NFTVis: Visual Analysis of NFT Performance

    Authors: Fan Yan, Xumeng Wang, Ketian Mao, Wei Zhang, Wei Chen

    Abstract: A non-fungible token (NFT) is a data unit stored on the blockchain. Nowadays, more and more investors and collectors (NFT traders), who participate in transactions of NFTs, have an urgent need to assess the performance of NFTs. However, there are two challenges for NFT traders when analyzing the performance of NFT. First, the current rarity models have flaws and are sometimes not convincing. In ad… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: This manuscript is accepted for publication in Proceedings of the 16th IEEE Pacific Visualization Symposium (PacificVis '23)

    Journal ref: 2023 IEEE 16th Pacific Visualization Symposium (PacificVis)

  35. arXiv:2305.12724  [pdf, other

    cs.CV

    Bridging the Gap Between End-to-end and Non-End-to-end Multi-Object Tracking

    Authors: Feng Yan, Weixin Luo, Yujie Zhong, Yiyang Gan, Lin Ma

    Abstract: Existing end-to-end Multi-Object Tracking (e2e-MOT) methods have not surpassed non-end-to-end tracking-by-detection methods. One potential reason is its label assignment strategy during training that consistently binds the tracked objects with tracking queries and then assigns the few newborns to detection queries. With one-to-one bipartite matching, such an assignment will yield unbalanced traini… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

  36. arXiv:2305.12333  [pdf, other

    cs.MM cs.AI cs.NI

    GRACE: Loss-Resilient Real-Time Video through Neural Codecs

    Authors: Yihua Cheng, Ziyi Zhang, Hanchen Li, Anton Arapin, Yue Zhang, Qizheng Zhang, Yuhan Liu, Xu Zhang, Francis Y. Yan, Amrita Mazumdar, Nick Feamster, Junchen Jiang

    Abstract: In real-time video communication, retransmitting lost packets over high-latency networks is not viable due to strict latency requirements. To counter packet losses without retransmission, two primary strategies are employed -- encoder-based forward error correction (FEC) and decoder-based error concealment. The former encodes data with redundancy before transmission, yet determining the optimal re… ▽ More

    Submitted 12 March, 2024; v1 submitted 20 May, 2023; originally announced May 2023.

  37. arXiv:2305.02567  [pdf, other

    cs.CV

    LayoutDM: Transformer-based Diffusion Model for Layout Generation

    Authors: Shang Chai, Liansheng Zhuang, Fengying Yan

    Abstract: Automatic layout generation that can synthesize high-quality layouts is an important tool for graphic design in many applications. Though existing methods based on generative models such as Generative Adversarial Networks (GANs) and Variational Auto-Encoders (VAEs) have progressed, they still leave much room for improving the quality and diversity of the results. Inspired by the recent success of… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: Accepted by CVPR 2023

  38. arXiv:2304.11796  [pdf, other

    cs.RO

    Coordinated Control of Path Tracking and Yaw Stability for Distributed Drive Electric Vehicle Based on AMPC and DYC

    Authors: Dongmei Wu, Yuying Guan, Xin Xia, Changqing Du, Fuwu Yan, Yang Li, Min Hua, Wei Liu

    Abstract: Maintaining both path-tracking accuracy and yaw stability of distributed drive electric vehicles (DDEVs) under various driving conditions presents a significant challenge in the field of vehicle control. To address this limitation, a coordinated control strategy that integrates adaptive model predictive control (AMPC) path-tracking control and direct yaw moment control (DYC) is proposed for DDEVs.… ▽ More

    Submitted 23 April, 2023; originally announced April 2023.

  39. arXiv:2212.12180  [pdf, other

    cs.DC cs.LG

    Autothrottle: A Practical Bi-Level Approach to Resource Management for SLO-Targeted Microservices

    Authors: Zibo Wang, Pinghe Li, Chieh-Jan Mike Liang, Feng Wu, Francis Y. Yan

    Abstract: Achieving resource efficiency while preserving end-user experience is non-trivial for cloud application operators. As cloud applications progressively adopt microservices, resource managers are faced with two distinct levels of system behavior: end-to-end application latency and per-service resource usage. Translating between the two levels, however, is challenging because user requests traverse h… ▽ More

    Submitted 14 April, 2024; v1 submitted 23 December, 2022; originally announced December 2022.

    Comments: Accepted by USENIX NSDI '24

  40. arXiv:2212.03586  [pdf, other

    cs.CV

    Multiple Object Tracking Challenge Technical Report for Team MT_IoT

    Authors: Feng Yan, Zhiheng Li, Weixin Luo, Zequn jie, Fan Liang, Xiaolin Wei, Lin Ma

    Abstract: This is a brief technical report of our proposed method for Multiple-Object Tracking (MOT) Challenge in Complex Environments. In this paper, we treat the MOT task as a two-stage task including human detection and trajectory matching. Specifically, we designed an improved human detector and associated most of detection to guarantee the integrity of the motion trajectory. We also propose a location-… ▽ More

    Submitted 7 December, 2022; originally announced December 2022.

    Comments: This is a brief technical report for Multiple Object Tracking Challenge of ECCV workshop 2022

  41. arXiv:2211.15759  [pdf, other

    cs.CV

    PIDS: Joint Point Interaction-Dimension Search for 3D Point Cloud

    Authors: Tunhou Zhang, Mingyuan Ma, Feng Yan, Hai Li, Yiran Chen

    Abstract: The interaction and dimension of points are two important axes in designing point operators to serve hierarchical 3D models. Yet, these two axes are heterogeneous and challenging to fully explore. Existing works craft point operator under a single axis and reuse the crafted operator in all parts of 3D models. This overlooks the opportunity to better combine point interactions and dimensions by exp… ▽ More

    Submitted 26 April, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

    Comments: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 2023: 1298-1307

    Journal ref: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 2023: 1298-1307

  42. arXiv:2210.13763  [pdf, other

    cs.NI cs.LG

    Teal: Learning-Accelerated Optimization of WAN Traffic Engineering

    Authors: Zhiying Xu, Francis Y. Yan, Rachee Singh, Justin T. Chiu, Alexander M. Rush, Minlan Yu

    Abstract: The rapid expansion of global cloud wide-area networks (WANs) has posed a challenge for commercial optimization engines to efficiently solve network traffic engineering (TE) problems at scale. Existing acceleration strategies decompose TE optimization into concurrent subproblems but realize limited parallelism due to an inherent tradeoff between run time and allocation performance. We present Te… ▽ More

    Submitted 19 May, 2024; v1 submitted 25 October, 2022; originally announced October 2022.

  43. SoccerNet 2022 Challenges Results

    Authors: Silvio Giancola, Anthony Cioppa, Adrien Deliège, Floriane Magera, Vladimir Somers, Le Kang, Xin Zhou, Olivier Barnich, Christophe De Vleeschouwer, Alexandre Alahi, Bernard Ghanem, Marc Van Droogenbroeck, Abdulrahman Darwish, Adrien Maglo, Albert Clapés, Andreas Luyts, Andrei Boiarov, Artur Xarles, Astrid Orcesi, Avijit Shah, Baoyu Fan, Bharath Comandur, Chen Chen, Chen Zhang, Chen Zhao , et al. (69 additional authors not shown)

    Abstract: The SoccerNet 2022 challenges were the second annual video understanding challenges organized by the SoccerNet team. In 2022, the challenges were composed of 6 vision-based tasks: (1) action spotting, focusing on retrieving action timestamps in long untrimmed videos, (2) replay grounding, focusing on retrieving the live moment of an action shown in a replay, (3) pitch localization, focusing on det… ▽ More

    Submitted 5 October, 2022; originally announced October 2022.

    Comments: Accepted at ACM MMSports 2022

  44. arXiv:2209.01496  [pdf, other

    cs.DC

    InfiniStore: Elastic Serverless Cloud Storage

    Authors: Jingyuan Zhang, Ao Wang, Xiaolong Ma, Benjamin Carver, Nicholas John Newman, Ali Anwar, Lukas Rupprecht, Dimitrios Skourtis, Vasily Tarasov, Feng Yan, Yue Cheng

    Abstract: Cloud object storage such as AWS S3 is cost-effective and highly elastic but relatively slow, while high-performance cloud storage such as AWS ElastiCache is expensive and provides limited elasticity. We present a new cloud storage service called ServerlessMemory, which stores data using the memory of serverless functions. ServerlessMemory employs a sliding-window-based memory management strategy… ▽ More

    Submitted 16 March, 2023; v1 submitted 3 September, 2022; originally announced September 2022.

    Comments: An extensive report of the paper accepted by VLDB 2023

  45. arXiv:2208.12711  [pdf, other

    cs.CL

    SeSQL: Yet Another Large-scale Session-level Chinese Text-to-SQL Dataset

    Authors: Saihao Huang, Lijie Wang, Zhenghua Li, Zeyang Liu, Chenhui Dou, Fukang Yan, Xinyan Xiao, Hua Wu, Min Zhang

    Abstract: As the first session-level Chinese dataset, CHASE contains two separate parts, i.e., 2,003 sessions manually constructed from scratch (CHASE-C), and 3,456 sessions translated from English SParC (CHASE-T). We find the two parts are highly discrepant and incompatible as training and evaluation data. In this work, we present SeSQL, yet another large-scale session-level text-to-SQL dataset in Chinese,… ▽ More

    Submitted 26 August, 2022; originally announced August 2022.

    Comments: 12 pages,4 figures

  46. arXiv:2208.00761  [pdf, other

    cs.DC

    AI Augmented Edge and Fog Computing: Trends and Challenges

    Authors: Shreshth Tuli, Fatemeh Mirhakimi, Samodha Pallewatta, Syed Zawad, Giuliano Casale, Bahman Javadi, Feng Yan, Rajkumar Buyya, Nicholas R. Jennings

    Abstract: In recent years, the landscape of computing paradigms has witnessed a gradual yet remarkable shift from monolithic computing to distributed and decentralized paradigms such as Internet of Things (IoT), Edge, Fog, Cloud, and Serverless. The frontiers of these computing technologies have been boosted by shift from manually encoded algorithms to Artificial Intelligence (AI)-driven autonomous systems… ▽ More

    Submitted 14 April, 2023; v1 submitted 1 August, 2022; originally announced August 2022.

    Comments: Accepted in Elsevier Journal of Network and Computer Applications

  47. arXiv:2207.14696  [pdf, other

    cs.LG

    BiFeat: Supercharge GNN Training via Graph Feature Quantization

    Authors: Yuxin Ma, Ping Gong, Jun Yi, Zhewei Yao, Cheng Li, Yuxiong He, Feng Yan

    Abstract: Graph Neural Networks (GNNs) is a promising approach for applications with nonEuclidean data. However, training GNNs on large scale graphs with hundreds of millions nodes is both resource and time consuming. Different from DNNs, GNNs usually have larger memory footprints, and thus the GPU memory capacity and PCIe bandwidth are the main resource bottlenecks in GNN training. To address this problem,… ▽ More

    Submitted 17 February, 2023; v1 submitted 29 July, 2022; originally announced July 2022.

  48. arXiv:2207.08319  [pdf, other

    cs.CV

    Defect Transformer: An Efficient Hybrid Transformer Architecture for Surface Defect Detection

    Authors: Junpu Wang, Guili Xu, Fuju Yan, Jinjin Wang, Zhengsheng Wang

    Abstract: Surface defect detection is an extremely crucial step to ensure the quality of industrial products. Nowadays, convolutional neural networks (CNNs) based on encoder-decoder architecture have achieved tremendous success in various defect detection tasks. However, due to the intrinsic locality of convolution, they commonly exhibit a limitation in explicitly modeling long-range interactions, critical… ▽ More

    Submitted 17 July, 2022; originally announced July 2022.

  49. NASRec: Weight Sharing Neural Architecture Search for Recommender Systems

    Authors: Tunhou Zhang, Dehua Cheng, Yuchen He, Zhengxing Chen, Xiaoliang Dai, Liang Xiong, Feng Yan, Hai Li, Yiran Chen, Wei Wen

    Abstract: The rise of deep neural networks offers new opportunities in optimizing recommender systems. However, optimizing recommender systems using deep neural networks requires delicate architecture fabrication. We propose NASRec, a paradigm that trains a single supernet and efficiently produces abundant models/sub-architectures by weight sharing. To overcome the data multi-modality and architecture heter… ▽ More

    Submitted 12 February, 2023; v1 submitted 14 July, 2022; originally announced July 2022.

    Comments: Proceedings of the ACM Web Conference 2023 (WWW'23)

    Journal ref: Proceedings of the ACM Web Conference 2023 (WWW'23)

  50. arXiv:2205.01853  [pdf

    cs.DC cs.LG

    SMLT: A Serverless Framework for Scalable and Adaptive Machine Learning Design and Training

    Authors: Ahsan Ali, Syed Zawad, Paarijaat Aditya, Istemi Ekin Akkus, Ruichuan Chen, Feng Yan

    Abstract: In today's production machine learning (ML) systems, models are continuously trained, improved, and deployed. ML design and training are becoming a continuous workflow of various tasks that have dynamic resource demands. Serverless computing is an emerging cloud paradigm that provides transparent resource management and scaling for users and has the potential to revolutionize the routine of ML des… ▽ More

    Submitted 3 May, 2022; originally announced May 2022.