Skip to main content

Showing 1–49 of 49 results for author: Su, G

  1. arXiv:2406.05135  [pdf

    cs.RO math.OC

    Smart Navigation System for Parking Assignment at Large Events: Incorporating Heterogeneous Driver Characteristics

    Authors: Xi Cheng, Gaofeng Su, Siyuan Feng, Ke Liu, Chen Zhu, Hui Lin, Jilin Song, Jianan Chen

    Abstract: Parking challenges escalate significantly during large events such as concerts or sports games, yet few studies address dynamic parking lot assignments for such occasions. This paper introduces a smart navigation system designed to optimize parking assignments swiftly during large events, utilizing a mixed search algorithm that accounts for the heterogeneous characteristics of drivers. We conducte… ▽ More

    Submitted 14 May, 2024; originally announced June 2024.

  2. arXiv:2405.08298  [pdf, other

    cs.LG

    Deep Reinforcement Learning for Real-Time Ground Delay Program Revision and Corresponding Flight Delay Assignments

    Authors: Ke Liu, Fan Hu, Hui Lin, Xi Cheng, Jianan Chen, Jilin Song, Siyuan Feng, Gaofeng Su, Chen Zhu

    Abstract: This paper explores the optimization of Ground Delay Programs (GDP), a prevalent Traffic Management Initiative used in Air Traffic Management (ATM) to reconcile capacity and demand discrepancies at airports. Employing Reinforcement Learning (RL) to manage the inherent uncertainties in the national airspace system-such as weather variability, fluctuating flight demands, and airport arrival rates-we… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  3. arXiv:2404.12333  [pdf, other

    cs.CV

    Customizing Text-to-Image Diffusion with Camera Viewpoint Control

    Authors: Nupur Kumari, Grace Su, Richard Zhang, Taesung Park, Eli Shechtman, Jun-Yan Zhu

    Abstract: Model customization introduces new concepts to existing text-to-image models, enabling the generation of the new concept in novel contexts. However, such methods lack accurate camera view control w.r.t the object, and users must resort to prompt engineering (e.g., adding "top-view") to achieve coarse view control. In this work, we introduce a new task -- enabling explicit control of camera viewpoi… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: project page: https://customdiffusion360.github.io

  4. arXiv:2404.00488  [pdf

    cs.CL cs.AI cs.LG

    Noise-Aware Training of Layout-Aware Language Models

    Authors: Ritesh Sarkhel, Xiaoqi Ren, Lauro Beltrao Costa, Guolong Su, Vincent Perot, Yanan Xie, Emmanouil Koukoumidis, Arnab Nandi

    Abstract: A visually rich document (VRD) utilizes visual features along with linguistic cues to disseminate information. Training a custom extractor that identifies named entities from a document requires a large number of instances of the target document type annotated at textual and visual modalities. This is an expensive bottleneck in enterprise scenarios, where we want to train custom extractors for tho… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

  5. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  6. arXiv:2403.01433  [pdf, other

    cs.CE q-bio.NC

    BrainMass: Advancing Brain Network Analysis for Diagnosis with Large-scale Self-Supervised Learning

    Authors: Yanwu Yang, Chenfei Ye, Guinan Su, Ziyao Zhang, Zhikai Chang, Hairui Chen, Piu Chan, Yue Yu, Ting Ma

    Abstract: Foundation models pretrained on large-scale datasets via self-supervised learning demonstrate exceptional versatility across various tasks. Due to the heterogeneity and hard-to-collect medical data, this approach is especially beneficial for medical image analysis and neuroscience research, as it streamlines broad downstream tasks without the need for numerous costly annotations. However, there ha… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

  7. "May I Speak?": Multi-modal Attention Guidance in Social VR Group Conversations

    Authors: Geonsun Lee, Dae Yeol Lee, Guan-Ming Su, Dinesh Manocha

    Abstract: In this paper, we present a novel multi-modal attention guidance method designed to address the challenges of turn-taking dynamics in meetings and enhance group conversations within virtual reality (VR) environments. Recognizing the difficulties posed by a confined field of view and the absence of detailed gesture tracking in VR, our proposed method aims to mitigate the challenges of noticing new… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

  8. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  9. Analysis of Coding Gain Due to In-Loop Reshaping

    Authors: Chau-Wai Wong, Chang-Hong Fu, Mengting Xu, Guan-Ming Su

    Abstract: Reshaping, a point operation that alters the characteristics of signals, has been shown capable of improving the compression ratio in video coding practices. Out-of-loop reshaping that directly modifies the input video signal was first adopted as the supplemental enhancement information (SEI) for the HEVC/H.265 without the need to alter the core design of the video codec. VVC/H.266 further improve… ▽ More

    Submitted 19 June, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: Published in IEEE Transactions on Image Processing

  10. arXiv:2311.11258  [pdf, other

    quant-ph cs.AI cs.LG

    Tensor networks for interpretable and efficient quantum-inspired machine learning

    Authors: Shi-Ju Ran, Gang Su

    Abstract: It is a critical challenge to simultaneously gain high interpretability and efficiency with the current schemes of deep machine learning (ML). Tensor network (TN), which is a well-established mathematical tool originating from quantum mechanics, has shown its unique advantages on developing efficient ``white-box'' ML schemes. Here, we give a brief review on the inspiring progresses made in TN-base… ▽ More

    Submitted 19 November, 2023; originally announced November 2023.

    Comments: 12 pages, 3 figures

    Journal ref: Intelligent Computing 2, 0061 (2023)

  11. arXiv:2311.05114  [pdf, other

    cs.HC

    Prompt Your Mind: Refine Personalized Text Prompts within Your Mind

    Authors: Guinan Su, Yanwu Yang, Jie Guo

    Abstract: Large language models (LLMs) have demonstrated remarkable potential in natural language understanding and generation, making them valuable tools for enhancing conversational interactions. However, LLMs encounter challenges such as lacking multi-step reasoning capabilities, and heavy reliance on prompts. In this regard, we introduce a prompt-refinement system named PromptMind, also known as "Prompt… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

  12. arXiv:2311.04766  [pdf, other

    cs.CV

    DualTalker: A Cross-Modal Dual Learning Approach for Speech-Driven 3D Facial Animation

    Authors: Guinan Su, Yanwu Yang, Zhifeng Li

    Abstract: In recent years, audio-driven 3D facial animation has gained significant attention, particularly in applications such as virtual reality, gaming, and video conferencing. However, accurately modeling the intricate and subtle dynamics of facial expressions remains a challenge. Most existing studies approach the facial animation task as a single regression problem, which often fail to capture the int… ▽ More

    Submitted 12 November, 2023; v1 submitted 8 November, 2023; originally announced November 2023.

  13. arXiv:2309.10952  [pdf, other

    cs.CL cs.AI cs.LG

    LMDX: Language Model-based Document Information Extraction and Localization

    Authors: Vincent Perot, Kai Kang, Florian Luisier, Guolong Su, Xiaoyu Sun, Ramya Sree Boppana, Zilong Wang, Zifeng Wang, Jiaqi Mu, Hao Zhang, Chen-Yu Lee, Nan Hua

    Abstract: Large Language Models (LLM) have revolutionized Natural Language Processing (NLP), improving state-of-the-art and exhibiting emergent capabilities across various tasks. However, their application in extracting information from visually rich documents, which is at the core of many document processing workflows and involving the extraction of key entities from semi-structured documents, has not yet… ▽ More

    Submitted 21 June, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

  14. arXiv:2306.09182  [pdf

    cs.RO

    Rolling control and dynamics model of two section articulated-wing ornithopter

    Authors: G. Su, Y. Cai, J. Zhao

    Abstract: This paper invented a new rolling control mechanism of two section articulated-wing ornithopter, which is analogues to aileron control in plane, however, similar control mechanism leads to opposite result, indicating the ornithopter supposed to go left now go right instead. This research gives a qualitative dynamics model which explains this new phenomenon. Because of wing folding, the differentia… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

  15. arXiv:2306.06489  [pdf, other

    cs.RO cs.AI

    On Robot Grasp Learning Using Equivariant Models

    Authors: Xupeng Zhu, Dian Wang, Guanang Su, Ondrej Biza, Robin Walters, Robert Platt

    Abstract: Real-world grasp detection is challenging due to the stochasticity in grasp dynamics and the noise in hardware. Ideally, the system would adapt to the real world by training directly on physical systems. However, this is generally difficult due to the large amount of training data required by most grasp learning models. In this paper, we note that the planar grasp function is $\SE(2)$-equivariant… ▽ More

    Submitted 10 June, 2023; originally announced June 2023.

    Comments: Accepted in Autonomous Robot. arXiv admin note: substantial text overlap with arXiv:2202.09468

  16. arXiv:2306.01268  [pdf, other

    cs.CV cs.DL cs.IR

    DeepScribe: Localization and Classification of Elamite Cuneiform Signs Via Deep Learning

    Authors: Edward C. Williams, Grace Su, Sandra R. Schloen, Miller C. Prosser, Susanne Paulus, Sanjay Krishnan

    Abstract: Twenty-five hundred years ago, the paperwork of the Achaemenid Empire was recorded on clay tablets. In 1933, archaeologists from the University of Chicago's Oriental Institute (OI) found tens of thousands of these tablets and fragments during the excavation of Persepolis. Many of these tablets have been painstakingly photographed and annotated by expert cuneiformists, and now provide a rich datase… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: Currently under review in the ACM JOCCH

  17. arXiv:2305.04397  [pdf, other

    cs.MA

    Multi-Objective Task Assignment and Multiagent Planning with Hybrid GPU-CPU Acceleration

    Authors: Thomas Robinson, Guoxin Su

    Abstract: Allocation and planning with a collection of tasks and a group of agents is an important problem in multiagent systems. One commonly faced bottleneck is scalability, as in general the multiagent model increases exponentially in size with the number of agents. We consider the combination of random task assignment and multiagent planning under multiple-objective constraints, and show that this probl… ▽ More

    Submitted 7 May, 2023; originally announced May 2023.

  18. arXiv:2305.02549  [pdf, other

    cs.CL cs.CV cs.LG

    FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction

    Authors: Chen-Yu Lee, Chun-Liang Li, Hao Zhang, Timothy Dozat, Vincent Perot, Guolong Su, Xiang Zhang, Kihyuk Sohn, Nikolai Glushnev, Renshen Wang, Joshua Ainslie, Shangbang Long, Siyang Qin, Yasuhisa Fujii, Nan Hua, Tomas Pfister

    Abstract: The recent advent of self-supervised pre-training techniques has led to a surge in the use of multimodal learning in form document understanding. However, existing approaches that extend the mask language modeling to other modalities require careful multi-task tuning, complex reconstruction target designs, or additional pre-training data. In FormNetV2, we introduce a centralized multimodal graph c… ▽ More

    Submitted 13 June, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL 2023

  19. arXiv:2303.06340  [pdf, other

    q-bio.QM cs.LG eess.IV

    Intelligent diagnostic scheme for lung cancer screening with Raman spectra data by tensor network machine learning

    Authors: Yu-Jia An, Sheng-Chen Bai, Lin Cheng, Xiao-Guang Li, Cheng-en Wang, Xiao-Dong Han, Gang Su, Shi-Ju Ran, Cong Wang

    Abstract: Artificial intelligence (AI) has brought tremendous impacts on biomedical sciences from academic researches to clinical applications, such as in biomarkers' detection and diagnosis, optimization of treatment, and identification of new therapeutic targets in drug discovery. However, the contemporary AI technologies, particularly deep machine learning (ML), severely suffer from non-interpretability,… ▽ More

    Submitted 11 March, 2023; originally announced March 2023.

    Comments: 10 pages, 7 figures

  20. arXiv:2303.04745  [pdf, other

    cs.LG stat.ML

    A General Theory of Correct, Incorrect, and Extrinsic Equivariance

    Authors: Dian Wang, Xupeng Zhu, Jung Yeon Park, Mingxi Jia, Guanang Su, Robert Platt, Robin Walters

    Abstract: Although equivariant machine learning has proven effective at many tasks, success depends heavily on the assumption that the ground truth function is symmetric over the entire domain matching the symmetry in an equivariant neural network. A missing piece in the equivariant learning literature is the analysis of equivariant networks when symmetry exists only partially in the domain. In this work, w… ▽ More

    Submitted 28 October, 2023; v1 submitted 8 March, 2023; originally announced March 2023.

    Comments: Published at NeurIPS 2023

  21. arXiv:2211.07730  [pdf, other

    cs.LG cs.AI cs.CL

    QueryForm: A Simple Zero-shot Form Entity Query Framework

    Authors: Zifeng Wang, Zizhao Zhang, Jacob Devlin, Chen-Yu Lee, Guolong Su, Hao Zhang, Jennifer Dy, Vincent Perot, Tomas Pfister

    Abstract: Zero-shot transfer learning for document understanding is a crucial yet under-investigated scenario to help reduce the high cost involved in annotating document entities. We present a novel query-based framework, QueryForm, that extracts entity values from form-like documents in a zero-shot fashion. QueryForm contains a dual prompting mechanism that composes both the document schema and a specific… ▽ More

    Submitted 27 June, 2023; v1 submitted 14 November, 2022; originally announced November 2022.

    Comments: Accepted to Findings of ACL 2023

  22. arXiv:2211.00194  [pdf, other

    cs.RO

    SEIL: Simulation-augmented Equivariant Imitation Learning

    Authors: Mingxi Jia, Dian Wang, Guanang Su, David Klee, Xupeng Zhu, Robin Walters, Robert Platt

    Abstract: In robotic manipulation, acquiring samples is extremely expensive because it often requires interacting with the real world. Traditional image-level data augmentation has shown the potential to improve sample efficiency in various machine learning tasks. However, image-level data augmentation is insufficient for an imitation learning agent to learn good manipulation policies in a reasonable amount… ▽ More

    Submitted 31 October, 2022; originally announced November 2022.

  23. arXiv:2208.03953  [pdf, ps, other

    eess.SP cs.IT

    Intelligent MIMO Detection Using Meta Learning

    Authors: Haomiao Huo, Jindan Xu, Gege Su, Wei Xu, Ning Wang

    Abstract: In a K-best detector for multiple-input-multiple-output(MIMO) systems, the value of K needs to be sufficiently large to achieve near-maximum-likelihood (ML) performance. By treating K as a variable that can be adjusted according to a fitting function of some learnable coefficients, an intelligent MIMO detection network based on deep neural networks (DNN) is proposed to reduce complexity of the det… ▽ More

    Submitted 8 August, 2022; originally announced August 2022.

  24. arXiv:2206.14424  [pdf, other

    cs.RO cs.AI eess.SY

    Collaborative Navigation and Manipulation of a Cable-towed Load by Multiple Quadrupedal Robots

    Authors: Chenyu Yang, Guo Ning Sue, Zhongyu Li, Lizhi Yang, Haotian Shen, Yufeng Chi, Akshara Rai, Jun Zeng, Koushil Sreenath

    Abstract: This paper tackles the problem of robots collaboratively towing a load with cables to a specified goal location while avoiding collisions in real time. The introduction of cables (as opposed to rigid links) enables the robotic team to travel through narrow spaces by changing its intrinsic dimensions through slack/taut switches of the cable. However, this is a challenging problem because of the hyb… ▽ More

    Submitted 29 June, 2022; originally announced June 2022.

    Comments: Extended version of the manuscript accepted to IEEE Robotics and Automation Letters (RA-L) 2022

  25. arXiv:2204.04799  [pdf, other

    cs.LG cs.CV

    DualPrompt: Complementary Prompting for Rehearsal-free Continual Learning

    Authors: Zifeng Wang, Zizhao Zhang, Sayna Ebrahimi, Ruoxi Sun, Han Zhang, Chen-Yu Lee, Xiaoqi Ren, Guolong Su, Vincent Perot, Jennifer Dy, Tomas Pfister

    Abstract: Continual learning aims to enable a single model to learn a sequence of tasks without catastrophic forgetting. Top-performing methods usually require a rehearsal buffer to store past pristine examples for experience replay, which, however, limits their practical value due to privacy and memory constraints. In this work, we present a simple yet effective framework, DualPrompt, which learns a tiny s… ▽ More

    Submitted 5 August, 2022; v1 submitted 10 April, 2022; originally announced April 2022.

    Comments: Published at ECCV 2022 as a conference paper

  26. arXiv:2203.08411  [pdf, other

    cs.CL cs.CV cs.LG

    FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction

    Authors: Chen-Yu Lee, Chun-Liang Li, Timothy Dozat, Vincent Perot, Guolong Su, Nan Hua, Joshua Ainslie, Renshen Wang, Yasuhisa Fujii, Tomas Pfister

    Abstract: Sequence modeling has demonstrated state-of-the-art performance on natural language and document understanding tasks. However, it is challenging to correctly serialize tokens in form-like documents in practice due to their variety of layout patterns. We propose FormNet, a structure-aware sequence model to mitigate the suboptimal serialization of forms. First, we design Rich Attention that leverage… ▽ More

    Submitted 23 March, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

    Comments: Accepted to ACL 2022

  27. arXiv:2203.01217  [pdf, other

    cs.CV

    Hybrid Tracker with Pixel and Instance for Video Panoptic Segmentation

    Authors: Weicai Ye, Xinyue Lan, Ge Su, Hujun Bao, Zhaopeng Cui, Guofeng Zhang

    Abstract: Video Panoptic Segmentation (VPS) aims to generate coherent panoptic segmentation and track the identities of all pixels across video frames. Existing methods predominantly utilize the trained instance embedding to keep the consistency of panoptic segmentation. However, they inevitably struggle to cope with the challenges of small objects, similar appearance but inconsistent identities, occlusion,… ▽ More

    Submitted 11 December, 2023; v1 submitted 2 March, 2022; originally announced March 2022.

  28. arXiv:2202.09468  [pdf, other

    cs.RO

    Sample Efficient Grasp Learning Using Equivariant Models

    Authors: Xupeng Zhu, Dian Wang, Ondrej Biza, Guanang Su, Robin Walters, Robert Platt

    Abstract: In planar grasp detection, the goal is to learn a function from an image of a scene onto a set of feasible grasp poses in $\mathrm{SE}(2)$. In this paper, we recognize that the optimal grasp function is $\mathrm{SE}(2)$-equivariant and can be modeled using an equivariant convolutional neural network. As a result, we are able to significantly improve the sample efficiency of grasp learning, obtaini… ▽ More

    Submitted 18 February, 2022; originally announced February 2022.

  29. arXiv:2112.08654  [pdf, other

    cs.LG cs.CV

    Learning to Prompt for Continual Learning

    Authors: Zifeng Wang, Zizhao Zhang, Chen-Yu Lee, Han Zhang, Ruoxi Sun, Xiaoqi Ren, Guolong Su, Vincent Perot, Jennifer Dy, Tomas Pfister

    Abstract: The mainstream paradigm behind continual learning has been to adapt the model parameters to non-stationary data distributions, where catastrophic forgetting is the central challenge. Typical methods rely on a rehearsal buffer or known task identity at test time to retrieve learned knowledge and address forgetting, while this work presents a new paradigm for continual learning that aims to train a… ▽ More

    Submitted 21 March, 2022; v1 submitted 16 December, 2021; originally announced December 2021.

    Comments: Published at CVPR 2022 as a conference paper

  30. arXiv:2108.11636  [pdf, other

    cs.CV

    SketchLattice: Latticed Representation for Sketch Manipulation

    Authors: Yonggang Qi, Guoyao Su, Pinaki Nath Chowdhury, Mingkang Li, Yi-Zhe Song

    Abstract: The key challenge in designing a sketch representation lies with handling the abstract and iconic nature of sketches. Existing work predominantly utilizes either, (i) a pixelative format that treats sketches as natural images employing off-the-shelf CNN-based networks, or (ii) an elaborately designed vector format that leverages the structural information of drawing orders using sequential RNN-bas… ▽ More

    Submitted 26 August, 2021; originally announced August 2021.

    Comments: accepted to ICCV 2021

  31. arXiv:2107.01154  [pdf, other

    cs.LG cs.CR

    Gradient-Leakage Resilient Federated Learning

    Authors: Wenqi Wei, Ling Liu, Yanzhao Wu, Gong Su, Arun Iyengar

    Abstract: Federated learning(FL) is an emerging distributed learning paradigm with default client privacy because clients can keep sensitive data on their devices and only share local training parameter updates with the federated server. However, recent studies reveal that gradient leakages in FL may compromise the privacy of client training data. This paper presents a gradient leakage resilient approach to… ▽ More

    Submitted 2 July, 2021; originally announced July 2021.

  32. arXiv:2104.12197  [pdf, other

    cs.DC

    RDMAbox : Optimizing RDMA for Memory Intensive Workloads

    Authors: Juhyun Bae, Ling Liu, Yanzhao Wu, Gong Su, Arun Iyengar

    Abstract: We present RDMAbox, a set of low level RDMA optimizations that provide better performance than previous approaches. The optimizations are packaged in easy-to-use kernel and user space libraries for applications and systems in data center. We demonstrate the flexibility and effectiveness of RDMAbox by implementing a kernel remote paging system and a user space file system using RDMAbox. RDMAbox emp… ▽ More

    Submitted 13 August, 2021; v1 submitted 25 April, 2021; originally announced April 2021.

    Comments: 10 pages, 12 figures

  33. arXiv:2008.08272  [pdf, other

    cs.PL cs.LG

    Compiling ONNX Neural Network Models Using MLIR

    Authors: Tian Jin, Gheorghe-Teodor Bercea, Tung D. Le, Tong Chen, Gong Su, Haruki Imai, Yasushi Negishi, Anh Leu, Kevin O'Brien, Kiyokuni Kawachiya, Alexandre E. Eichenberger

    Abstract: Deep neural network models are becoming increasingly popular and have been used in various tasks such as computer vision, speech recognition, and natural language processing. Machine learning models are commonly trained in a resource-rich environment and then deployed in a distinct environment such as high availability machines or edge devices. To assist the portability of models, the open-source… ▽ More

    Submitted 30 September, 2020; v1 submitted 19 August, 2020; originally announced August 2020.

    Comments: 8 pages

  34. arXiv:2008.00902  [pdf, other

    cs.DC

    Efficient Orchestration of Host and Remote Shared Memory for Memory Intensive Workloads

    Authors: Juhyun Bae, Gong Su, Arun Iyengar, Yanzhao Wu, Ling Liu

    Abstract: Since very few contributions to the development of an unified memory orchestration framework for efficient management of both host and remote idle memory have been made, we present Valet, an efficient approach to orchestration of host and remote shared memory for improving performance of memory intensive workloads. The paper makes three original contributions. First, we redesign the data flow in t… ▽ More

    Submitted 28 August, 2020; v1 submitted 3 August, 2020; originally announced August 2020.

    Comments: 13 pages, 23 figures, 8 tables, MemSys '20: The International Symposium on Memory Systems, Sept 2020, Washington, DC

  35. arXiv:2007.06813  [pdf, other

    cs.CR

    BDTF: A Blockchain-Based Data Trading Framework with Trusted Execution Environment

    Authors: Guoxiong Su, Wenyuan Yang, Zhengding Luo, Yinghong Zhang, Zhiqiang Bai, Yuesheng Zhu

    Abstract: The need for data trading promotes the emergence of data market. However, in conventional data markets, both data buyers and data sellers have to use a centralized trading platform which might be dishonest. A dishonest centralized trading platform may steal and resell the data seller's data, or may refuse to send data after receiving payment from the data buyer. It seriously affects the fair data… ▽ More

    Submitted 14 July, 2020; originally announced July 2020.

    Comments: 6 pages, 5figures

  36. arXiv:2006.09134  [pdf, other

    cs.CV cs.LG eess.IV

    AlphaGAN: Fully Differentiable Architecture Search for Generative Adversarial Networks

    Authors: Yuesong Tian, Li Shen, Li Shen, Guinan Su, Zhifeng Li, Wei Liu

    Abstract: Generative Adversarial Networks (GANs) are formulated as minimax game problems, whereby generators attempt to approach real data distributions by virtue of adversarial learning against discriminators. The intrinsic problem complexity poses the challenge to enhance the performance of generative networks. In this work, we aim to boost model learning from the perspective of network architectures, by… ▽ More

    Submitted 7 August, 2021; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: In IEEE Transactions on Pattern Analysis and Machine Intelligence

  37. Adaptive Dithering Using Curved Markov-Gaussian Noise in the Quantized Domain for Mapping SDR to HDR Image

    Authors: Subhayan Mukherjee, Guan-Ming Su, Irene Cheng

    Abstract: High Dynamic Range (HDR) imaging is gaining increased attention due to its realistic content, for not only regular displays but also smartphones. Before sufficient HDR content is distributed, HDR visualization still relies mostly on converting Standard Dynamic Range (SDR) content. SDR images are often quantized, or bit depth reduced, before SDR-to-HDR conversion, e.g. for video transmission. Quant… ▽ More

    Submitted 20 January, 2020; originally announced January 2020.

    Comments: 2018 International Conference on Smart Multimedia

  38. arXiv:2001.04029  [pdf, other

    cs.LG cond-mat.dis-nn stat.ML

    Tangent-Space Gradient Optimization of Tensor Network for Machine Learning

    Authors: Zheng-zhi Sun, Shi-ju Ran, Gang Su

    Abstract: The gradient-based optimization method for deep machine learning models suffers from gradient vanishing and exploding problems, particularly when the computational graph becomes deep. In this work, we propose the tangent-space gradient optimization (TSGO) for the probabilistic models to keep the gradients from vanishing or exploding. The central idea is to guarantee the orthogonality between the v… ▽ More

    Submitted 10 January, 2020; originally announced January 2020.

    Comments: 5 pages, 4 figures

    Journal ref: Phys. Rev. E 102, 012152 (2020)

  39. arXiv:1912.10729  [pdf, other

    cs.LG cs.CL cs.NE stat.ML

    TextNAS: A Neural Architecture Search Space tailored for Text Representation

    Authors: Yujing Wang, Yaming Yang, Yiren Chen, Jing Bai, Ce Zhang, Guinan Su, Xiaoyu Kou, Yunhai Tong, Mao Yang, Lidong Zhou

    Abstract: Learning text representation is crucial for text classification and other language related tasks. There are a diverse set of text representation networks in the literature, and how to find the optimal one is a non-trivial problem. Recently, the emerging Neural Architecture Search (NAS) techniques have demonstrated good potential to solve the problem. Nevertheless, most of the existing works of NAS… ▽ More

    Submitted 23 December, 2019; originally announced December 2019.

  40. arXiv:1907.10290  [pdf, other

    stat.ML cond-mat.str-el cs.LG quant-ph

    Quantum Compressed Sensing with Unsupervised Tensor-Network Machine Learning

    Authors: Shi-Ju Ran, Zheng-Zhi Sun, Shao-Ming Fei, Gang Su, Maciej Lewenstein

    Abstract: We propose tensor-network compressed sensing (TNCS) by combining the ideas of compressed sensing, tensor network (TN), and machine learning, which permits novel and efficient quantum communications of realistic data. The strategy is to use the unsupervised TN machine learning algorithm to obtain the entangled state $|Ψ\rangle$ that describes the probability distribution of a huge amount of classic… ▽ More

    Submitted 13 October, 2019; v1 submitted 24 July, 2019; originally announced July 2019.

    Comments: 5+6 pages, 3+6 figures. Essential changes and new data were added to this new version

    Journal ref: Phys. Rev. Research 2, 033293 (2020)

  41. arXiv:1907.03710  [pdf, other

    cs.CR

    StackVault: Protection from Untrusted Functions

    Authors: Qi Zhang, Zehra Sura, Ashish Kundu, Gong Su, Arun Iyengar, Ling Liu

    Abstract: Data exfiltration attacks have led to huge data breaches. Recently, the Equifax attack affected 147M users and a third-party library - Apache Struts - was alleged to be responsible for it. These attacks often exploit the fact that sensitive data are stored unencrypted in process memory and can be accessed by any function executing within the same process, including untrusted third-party library fu… ▽ More

    Submitted 8 July, 2019; originally announced July 2019.

    Comments: 11 pages

  42. arXiv:1903.10742  [pdf, other

    cs.LG cond-mat.str-el quant-ph stat.ML

    Generative Tensor Network Classification Model for Supervised Machine Learning

    Authors: Zheng-Zhi Sun, Cheng Peng, Ding Liu, Shi-Ju Ran, Gang Su

    Abstract: Tensor network (TN) has recently triggered extensive interests in developing machine-learning models in quantum many-body Hilbert space. Here we purpose a generative TN classification (GTNC) approach for supervised learning. The strategy is to train the generative TN for each class of the samples to construct the classifiers. The classification is implemented by comparing the distance in the many-… ▽ More

    Submitted 26 March, 2019; originally announced March 2019.

    Comments: 7 pages, 5 figures

    Journal ref: Phys. Rev. B 101, 075135 (2020)

  43. arXiv:1606.05798  [pdf, ps, other

    stat.ML cs.LG

    Interpretable Two-level Boolean Rule Learning for Classification

    Authors: Guolong Su, Dennis Wei, Kush R. Varshney, Dmitry M. Malioutov

    Abstract: As a contribution to interpretable machine learning research, we develop a novel optimization framework for learning accurate and sparse two-level Boolean rules. We consider rules in both conjunctive normal form (AND-of-ORs) and disjunctive normal form (OR-of-ANDs). A principled objective function is proposed to trade classification accuracy and interpretability, where we use Hamming loss to chara… ▽ More

    Submitted 18 June, 2016; originally announced June 2016.

    Comments: presented at 2016 ICML Workshop on Human Interpretability in Machine Learning (WHI 2016), New York, NY

    Report number: WHI 2016 submission

  44. Impact Analysis of Baseband Quantizer on Coding Efficiency for HDR Video

    Authors: Chau-Wai Wong, Guan-Ming Su, Min Wu

    Abstract: Digitally acquired high dynamic range (HDR) video baseband signal can take 10 to 12 bits per color channel. It is economically important to be able to reuse the legacy 8 or 10-bit video codecs to efficiently compress the HDR video. Linear or nonlinear mapping on the intensity can be applied to the baseband signal to reduce the dynamic range before the signal is sent to the codec, and we refer to t… ▽ More

    Submitted 1 August, 2016; v1 submitted 9 March, 2016; originally announced March 2016.

    Comments: Accepted for publication in IEEE Signal Processing Letters

  45. arXiv:1511.07361  [pdf, ps, other

    cs.LG cs.AI

    Interpretable Two-level Boolean Rule Learning for Classification

    Authors: Guolong Su, Dennis Wei, Kush R. Varshney, Dmitry M. Malioutov

    Abstract: This paper proposes algorithms for learning two-level Boolean rules in Conjunctive Normal Form (CNF, i.e. AND-of-ORs) or Disjunctive Normal Form (DNF, i.e. OR-of-ANDs) as a type of human-interpretable classification model, aiming for a favorable trade-off between the classification accuracy and the simplicity of the rule. Two formulations are proposed. The first is an integer program whose objecti… ▽ More

    Submitted 23 November, 2015; originally announced November 2015.

  46. arXiv:1304.7614  [pdf, ps, other

    cs.SE cs.LO

    Asymptotic Bounds for Quantitative Verification of Perturbed Probabilistic Systems

    Authors: Guoxin Su, David S. Rosenblum

    Abstract: The majority of existing probabilistic model checking case studies are based on well understood theoretical models and distributions. However, real-life probabilistic systems usually involve distribution parameters whose values are obtained by empirical measurements and thus are subject to small perturbations. In this paper, we consider perturbation analysis of reachability in the parametric model… ▽ More

    Submitted 27 August, 2013; v1 submitted 29 April, 2013; originally announced April 2013.

    Comments: This paper is a long version of the paper Asymptotic Bounds for Quantitative Verification of Perturbed Probabilistic Systems in the proceedings of 15th International Conference on Formal Engineering Methods

  47. arXiv:1210.2125  [pdf, ps, other

    cs.PL cs.SE

    Session Communication and Integration

    Authors: Guoxin Su, Mingsheng Ying, Chengqi Zhang

    Abstract: The scenario-based specification of a large distributed system is usually naturally decomposed into various modules. The integration of specification modules contrasts to the parallel composition of program components, and includes various ways such as scenario concatenation, choice, and nesting. The recent development of multiparty session types for process calculi provides useful techniques to a… ▽ More

    Submitted 7 October, 2012; originally announced October 2012.

    Comments: A short version of this paper is submitted for review

  48. Performance Analysis of l_0 Norm Constraint Least Mean Square Algorithm

    Authors: Guolong Su, Jian Jin, Yuantao Gu, Jian Wang

    Abstract: As one of the recently proposed algorithms for sparse system identification, $l_0$ norm constraint Least Mean Square ($l_0$-LMS) algorithm modifies the cost function of the traditional method with a penalty of tap-weight sparsity. The performance of $l_0$-LMS is quite attractive compared with its various precursors. However, there has been no detailed study of its performance. This paper presents… ▽ More

    Submitted 9 March, 2013; v1 submitted 7 March, 2012; originally announced March 2012.

    Comments: 31 pages, 8 figures

    Journal ref: IEEE Transactions on Signal Processing, 60(5): 2223-2235, 2012

  49. arXiv:1103.3196  [pdf, ps, other

    cond-mat.stat-mech cs.SI nlin.AO physics.soc-ph

    Condensation phase transition in nonlinear fitness networks

    Authors: Guifeng Su, Xiaobing Zhang, Yi Zhang

    Abstract: We analyze the condensation phase transitions in out-of-equilibrium complex networks in a unifying framework which includes the nonlinear model and the fitness model as its appropriate limits. We show a novel phase structure which depends on both the fitness parameter and the nonlinear exponent. The occurrence of the condensation phase transitions in the dynamical evolution of the network is demon… ▽ More

    Submitted 3 December, 2012; v1 submitted 16 March, 2011; originally announced March 2011.

    Comments: 6 pages, 5 figures

    Journal ref: Europhys. Lett., 100 (2012) 38003