Skip to main content

Showing 1–14 of 14 results for author: Chang, G

  1. arXiv:2404.01548  [pdf, other

    cs.CV cs.AI

    mChartQA: A universal benchmark for multimodal Chart Question Answer based on Vision-Language Alignment and Reasoning

    Authors: Jingxuan Wei, Nan Xu, Guiyong Chang, Yin Luo, BiHui Yu, Ruifeng Guo

    Abstract: In the fields of computer vision and natural language processing, multimodal chart question-answering, especially involving color, structure, and textless charts, poses significant challenges. Traditional methods, which typically involve either direct multimodal processing or a table-to-text conversion followed by language model analysis, have limitations in effectively handling these complex scen… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  2. arXiv:2403.03721  [pdf, other

    cs.CV

    CMDA: Cross-Modal and Domain Adversarial Adaptation for LiDAR-Based 3D Object Detection

    Authors: Gyusam Chang, Wonseok Roh, Sujin Jang, Dongwook Lee, Daehyun Ji, Gyeongrok Oh, Jinsun Park, Jinkyu Kim, Sangpil Kim

    Abstract: Recent LiDAR-based 3D Object Detection (3DOD) methods show promising results, but they often do not generalize well to target domains outside the source (or training) data distribution. To reduce such domain gaps and thus to make 3DOD models more generalizable, we introduce a novel unsupervised domain adaptation (UDA) method, called CMDA, which (i) leverages visual semantic cues from an image moda… ▽ More

    Submitted 6 March, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    Comments: Accepted by AAAI 2024

  3. arXiv:2309.15857  [pdf, other

    cs.CL cs.AI cs.MM

    A Survey on Image-text Multimodal Models

    Authors: Ruifeng Guo, Jingxuan Wei, Linzhuang Sun, Bihui Yu, Guiyong Chang, Dawei Liu, Sibo Zhang, Zhengbing Yao, Mingjun Xu, Liping Bu

    Abstract: With the significant advancements of Large Language Models (LLMs) in the field of Natural Language Processing (NLP), the development of image-text multimodal models has garnered widespread attention. Current surveys on image-text multimodal models mainly focus on representative models or application domains, but lack a review on how general technical models influence the development of domain-spec… ▽ More

    Submitted 18 June, 2024; v1 submitted 23 September, 2023; originally announced September 2023.

  4. arXiv:2305.19535  [pdf, other

    stat.ML cs.LG

    Low-rank extended Kalman filtering for online learning of neural networks from streaming data

    Authors: Peter G. Chang, Gerardo Durán-Martín, Alexander Y Shestopaloff, Matt Jones, Kevin Murphy

    Abstract: We propose an efficient online approximate Bayesian inference algorithm for estimating the parameters of a nonlinear function from a potentially non-stationary data stream. The method is based on the extended Kalman filter (EKF), but uses a novel low-rank plus diagonal decomposition of the posterior precision matrix, which gives a cost per step which is linear in the number of model parameters. In… ▽ More

    Submitted 27 June, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

    Journal ref: COLLAS conference 2023

  5. arXiv:2207.00865  [pdf, other

    cs.CV

    ORA3D: Overlap Region Aware Multi-view 3D Object Detection

    Authors: Wonseok Roh, Gyusam Chang, Seokha Moon, Giljoo Nam, Chanyoung Kim, Younghyun Kim, Jinkyu Kim, Sangpil Kim

    Abstract: Current multi-view 3D object detection methods often fail to detect objects in the overlap region properly, and the networks' understanding of the scene is often limited to that of a monocular detection network. Moreover, objects in the overlap region are often largely occluded or suffer from deformation due to camera distortion, causing a domain shift. To mitigate this issue, we propose using the… ▽ More

    Submitted 29 June, 2023; v1 submitted 2 July, 2022; originally announced July 2022.

    Comments: BMVC2022

  6. arXiv:2111.00364  [pdf, other

    cs.LG cs.AI cs.AR

    Sustainable AI: Environmental Implications, Challenges and Opportunities

    Authors: Carole-Jean Wu, Ramya Raghavendra, Udit Gupta, Bilge Acun, Newsha Ardalani, Kiwan Maeng, Gloria Chang, Fiona Aga Behram, James Huang, Charles Bai, Michael Gschwind, Anurag Gupta, Myle Ott, Anastasia Melnikov, Salvatore Candido, David Brooks, Geeta Chauhan, Benjamin Lee, Hsien-Hsin S. Lee, Bugra Akyildiz, Maximilian Balandat, Joe Spisak, Ravi Jain, Mike Rabbat, Kim Hazelwood

    Abstract: This paper explores the environmental impact of the super-linear growth trends for AI from a holistic perspective, spanning Data, Algorithms, and System Hardware. We characterize the carbon footprint of AI computing by examining the model development cycle across industry-scale machine learning use cases and, at the same time, considering the life cycle of system hardware. Taking a step further, w… ▽ More

    Submitted 9 January, 2022; v1 submitted 30 October, 2021; originally announced November 2021.

  7. arXiv:2001.07698  [pdf

    cs.NI cs.LG eess.SP

    Intelligent Bandwidth Allocation for Latency Management in NG-EPON using Reinforcement Learning Methods

    Authors: Qi Zhou, Jingjie Zhu, Junwen Zhang, Zhensheng Jia, Bernardo Huberman, Gee-Kung Chang

    Abstract: A novel intelligent bandwidth allocation scheme in NG-EPON using reinforcement learning is proposed and demonstrated for latency management. We verify the capability of the proposed scheme under both fixed and dynamic traffic loads scenarios to achieve <1ms average latency. The RL agent demonstrates an efficient intelligent mechanism to manage the latency, which provides a promising IBA solution f… ▽ More

    Submitted 21 January, 2020; originally announced January 2020.

  8. arXiv:1911.10442  [pdf, other

    eess.IV cs.CV cs.LG cs.NE stat.ML

    Ground Truth Simulation for Deep Learning Classification of Mid-Resolution Venus Images Via Unmixing of High-Resolution Hyperspectral Fenix Data

    Authors: Ido Faran, Nathan S. Netanyahu, Eli David, Maxim Shoshany, Fadi Kizel, Jisung Geba Chang, Ronit Rud

    Abstract: Training a deep neural network for classification constitutes a major problem in remote sensing due to the lack of adequate field data. Acquiring high-resolution ground truth (GT) by human interpretation is both cost-ineffective and inconsistent. We propose, instead, to utilize high-resolution, hyperspectral images for solving this problem, by unmixing these images to obtain reliable GT for traini… ▽ More

    Submitted 23 November, 2019; originally announced November 2019.

    Journal ref: IEEE International Geoscience and Remote Sensing Symposium (IGARSS), pages 807-810, Yokohama, Japan, July 2019

  9. arXiv:1902.00046  [pdf

    cs.NI

    Power Loading based on Portfolio Theory for Densified Millimeter-Wave Small-Cell Communications

    Authors: Shuyi Shen, Bernardo A. Huberman, Lin Cheng, Gee-Kung Chang

    Abstract: We experimentally demonstrate a novel scheme of power loading based on portfolio theory for millimeter-wave small-cell densification. By exploiting the statistical characteristics of interference, this approach improves the average throughput by 91% and reduces the variance.

    Submitted 31 January, 2019; originally announced February 2019.

  10. arXiv:1710.06541  [pdf

    cs.NI eess.SP

    Design Considerations of a Sub-50 μW Receiver Front-end for Implantable Devices in MedRadio Band

    Authors: Gregory Chang, Shovan Maity, Baibhab Chatterjee, Shreyas Sen

    Abstract: Emerging health-monitor applications, such as information transmission through multi-channel neural implants, image and video communication from inside the body etc., calls for ultra-low active power (<50$μ$W) high data-rate, energy-scalable, highly energy-efficient (pJ/bit) radios. Previous literature has strongly focused on low average power duty-cycled radios or low power but low-date radios. I… ▽ More

    Submitted 17 October, 2017; originally announced October 2017.

    Comments: Accepted to appear on International Conference on VLSI Design 2018 (VLSID)

  11. arXiv:1706.07363  [pdf

    cs.CY

    Smart Wireless Communication is the Cornerstone of Smart Infrastructures

    Authors: Mary Ann Weitnauer, Jennifer Rexford, Nicholas Laneman, Matthieu Bloch, Santiago Griljava, Catherine Ross, Gee-Kung Chang

    Abstract: Emerging smart infrastructures, such as Smart City, Smart Grid, Smart Health, and Smart Transportation, need smart wireless connectivity. However, the requirements of these smart infrastructures cannot be met with today's wireless networks. A new wireless infrastructure is needed to meet unprecedented needs in terms of agility, reliability, security, scalability, and partnerships. We are at the… ▽ More

    Submitted 22 June, 2017; originally announced June 2017.

    Comments: A Computing Community Consortium (CCC) white paper, 5 pages

  12. arXiv:1704.06176  [pdf, other

    cs.CV cs.LG stat.ML

    Segmentation of the Proximal Femur from MR Images using Deep Convolutional Neural Networks

    Authors: Cem M. Deniz, Siyuan Xiang, Spencer Hallyburton, Arakua Welbeck, James S. Babb, Stephen Honig, Kyunghyun Cho, Gregory Chang

    Abstract: Magnetic resonance imaging (MRI) has been proposed as a complimentary method to measure bone quality and assess fracture risk. However, manual segmentation of MR images of bone is time-consuming, limiting the use of MRI measurements in the clinical practice. The purpose of this paper is to present an automatic proximal femur segmentation method that is based on deep convolutional neural networks (… ▽ More

    Submitted 5 February, 2019; v1 submitted 20 April, 2017; originally announced April 2017.

    Comments: This is a pre-print of an article published in Scientific Reports. The final authenticated version is available online at: https://doi.org/10.1038/s41598-018-34817-6

    Journal ref: Scientific Reports, volume 8, Article number: 16485 (2018)

  13. arXiv:cs/0409013  [pdf, ps, other

    cs.DS cs.DM

    Locally connected spanning trees on graphs

    Authors: Ching-Chi Lin, Gerard J. Chang, Gen-Huey Chen

    Abstract: A locally connected spanning tree of a graph $G$ is a spanning tree $T$ of $G$ such that the set of all neighbors of $v$ in $T$ induces a connected subgraph of $G$ for every $v\in V(G)$. The purpose of this paper is to give linear-time algorithms for finding locally connected spanning trees on strongly chordal graphs and proper circular-arc graphs, respectively.

    Submitted 8 September, 2004; originally announced September 2004.

    Comments: 14 pages, 3 figures

    ACM Class: F.2.2; G.2.2

  14. arXiv:cs/0408022  [pdf, ps, other

    cs.NI

    Diagnosabilities of regular networks

    Authors: Guey-Yun Chang, Gerard J. Chang, Gen-Huey Chen

    Abstract: In this paper, we study diagnosabilities of multiprocessor systems under two diagnosis models: the PMC model and the comparison model. In each model, we further consider two different diagnosis strategies: the precise diagnosis strategy proposed by Preparata et al. and the pessimistic diagnosis strategy proposed by Friedman. The main result of this paper is to determine diagnosabilities of regul… ▽ More

    Submitted 9 August, 2004; originally announced August 2004.

    Comments: 26 pages

    Report number: NCTS/TPE-Math Technical Report 2004-013