Skip to main content

Showing 1–50 of 160 results for author: Kuo, C - J

  1. arXiv:2405.16144  [pdf, other

    cs.CV cs.AI

    GreenCOD: A Green Camouflaged Object Detection Method

    Authors: Hong-Shuo Chen, Yao Zhu, Suya You, Azad M. Madni, C. -C. Jay Kuo

    Abstract: We introduce GreenCOD, a green method for detecting camouflaged objects, distinct in its avoidance of backpropagation techniques. GreenCOD leverages gradient boosting and deep features extracted from pre-trained Deep Neural Networks (DNNs). Traditional camouflaged object detection (COD) approaches often rely on complex deep neural network architectures, seeking performance improvements through bac… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  2. arXiv:2402.06982  [pdf, other

    cs.CV cs.AI physics.med-ph

    Treatment-wise Glioblastoma Survival Inference with Multi-parametric Preoperative MRI

    Authors: Xiaofeng Liu, Nadya Shusharina, Helen A Shih, C. -C. Jay Kuo, Georges El Fakhri, Jonghye Woo

    Abstract: In this work, we aim to predict the survival time (ST) of glioblastoma (GBM) patients undergoing different treatments based on preoperative magnetic resonance (MR) scans. The personalized and precise treatment planning can be achieved by comparing the ST of different treatments. It is well established that both the current status of the patient (as represented by the MR scans) and the choice of tr… ▽ More

    Submitted 10 February, 2024; originally announced February 2024.

    Comments: SPIE Medical Imaging 2024: Computer-Aided Diagnosis

  3. arXiv:2401.07475  [pdf, other

    cs.CL

    GWPT: A Green Word-Embedding-based POS Tagger

    Authors: Chengwei Wei, Runqi Pang, C. -C. Jay Kuo

    Abstract: As a fundamental tool for natural language processing (NLP), the part-of-speech (POS) tagger assigns the POS label to each word in a sentence. A novel lightweight POS tagger based on word embeddings is proposed and named GWPT (green word-embedding-based POS tagger) in this work. Following the green learning (GL) methodology, GWPT contains three modules in cascade: 1) representation learning, 2) fe… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

  4. arXiv:2312.14968  [pdf, other

    eess.IV cs.CV cs.LG

    Enhancing Edge Intelligence with Highly Discriminant LNT Features

    Authors: Xinyu Wang, Vinod K. Mishra, C. -C. Jay Kuo

    Abstract: AI algorithms at the edge demand smaller model sizes and lower computational complexity. To achieve these objectives, we adopt a green learning (GL) paradigm rather than the deep learning paradigm. GL has three modules: 1) unsupervised representation learning, 2) supervised feature learning, and 3) supervised decision learning. We focus on the second module in this work. In particular, we derive n… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 2023 IEEE International Conference on Big Data, AI and Adaptive Computing for Edge Sensing and Processing Workshop

  5. arXiv:2310.04995  [pdf, other

    cs.CV

    SemST: Semantically Consistent Multi-Scale Image Translation via Structure-Texture Alignment

    Authors: Ganning Zhao, Wenhui Cui, Suya You, C. -C. Jay Kuo

    Abstract: Unsupervised image-to-image (I2I) translation learns cross-domain image mapping that transfers input from the source domain to output in the target domain while preserving its semantics. One challenge is that different semantic statistics in source and target domains result in content discrepancy known as semantic distortion. To address this problem, a novel I2I method that maintains semantic cons… ▽ More

    Submitted 7 October, 2023; originally announced October 2023.

  6. arXiv:2309.12501  [pdf, other

    cs.AI cs.CL cs.LG

    Knowledge Graph Embedding: An Overview

    Authors: Xiou Ge, Yun-Cheng Wang, Bin Wang, C. -C. Jay Kuo

    Abstract: Many mathematical models have been leveraged to design embeddings for representing Knowledge Graph (KG) entities and relations for link prediction and many downstream tasks. These mathematically-inspired models are not only highly scalable for inference in large KGs, but also have many explainable advantages in modeling different relation patterns that can be validated through both formal proofs a… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

  7. arXiv:2309.09078  [pdf, other

    cs.CV

    Unsupervised Green Object Tracker (GOT) without Offline Pre-training

    Authors: Zhiruo Zhou, Suya You, C. -C. Jay Kuo

    Abstract: Supervised trackers trained on labeled data dominate the single object tracking field for superior tracking accuracy. The labeling cost and the huge computational complexity hinder their applications on edge devices. Unsupervised learning methods have also been investigated to reduce the labeling cost but their complexity remains high. Aiming at lightweight high-performance tracking, feasibility w… ▽ More

    Submitted 16 September, 2023; originally announced September 2023.

  8. arXiv:2309.08836  [pdf, other

    cs.CL cs.AI cs.CY

    Bias and Fairness in Chatbots: An Overview

    Authors: Jintang Xue, Yun-Cheng Wang, Chengwei Wei, Xiaofeng Liu, Jonghye Woo, C. -C. Jay Kuo

    Abstract: Chatbots have been studied for more than half a century. With the rapid development of natural language processing (NLP) technologies in recent years, chatbots using large language models (LLMs) have received much attention nowadays. Compared with traditional ones, modern chatbots are more powerful and have been used in real-world applications. There are however, bias and fairness concerns in mode… ▽ More

    Submitted 10 December, 2023; v1 submitted 15 September, 2023; originally announced September 2023.

  9. arXiv:2308.16055  [pdf, other

    cs.CL cs.AI

    AsyncET: Asynchronous Learning for Knowledge Graph Entity Typing with Auxiliary Relations

    Authors: Yun-Cheng Wang, Xiou Ge, Bin Wang, C. -C. Jay Kuo

    Abstract: Knowledge graph entity typing (KGET) is a task to predict the missing entity types in knowledge graphs (KG). Previously, KG embedding (KGE) methods tried to solve the KGET task by introducing an auxiliary relation, 'hasType', to model the relationship between entities and their types. However, a single auxiliary relation has limited expressiveness for diverse entity-type patterns. We improve the e… ▽ More

    Submitted 30 August, 2023; originally announced August 2023.

  10. arXiv:2306.17170  [pdf, other

    cs.DC cs.AI eess.SY

    An Overview on Generative AI at Scale with Edge-Cloud Computing

    Authors: Yun-Cheng Wang, Jintang Xue, Chengwei Wei, C. -C. Jay Kuo

    Abstract: As a specific category of artificial intelligence (AI), generative artificial intelligence (GenAI) generates new content that resembles what is created by humans. The rapid development of GenAI systems has created a huge amount of new data on the Internet, posing new challenges to current computing and communication frameworks. Currently, GenAI services rely on the traditional cloud computing fram… ▽ More

    Submitted 9 July, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

  11. arXiv:2306.04008  [pdf

    eess.IV cs.CR cs.LG

    Green Steganalyzer: A Green Learning Approach to Image Steganalysis

    Authors: Yao Zhu, Xinyu Wang, Hong-Shuo Chen, Ronald Salloum, C. -C. Jay Kuo

    Abstract: A novel learning solution to image steganalysis based on the green learning paradigm, called Green Steganalyzer (GS), is proposed in this work. GS consists of three modules: 1) pixel-based anomaly prediction, 2) embedding location detection, and 3) decision fusion for image-level detection. In the first module, GS decomposes an image into patches, adopts Saab transforms for feature extraction, and… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

  12. arXiv:2304.12591  [pdf, other

    cs.CV cs.AI eess.IV

    Unsupervised Synthetic Image Refinement via Contrastive Learning and Consistent Semantic-Structural Constraints

    Authors: Ganning Zhao, Tingwei Shen, Suya You, C. -C. Jay Kuo

    Abstract: Ensuring the realism of computer-generated synthetic images is crucial to deep neural network (DNN) training. Due to different semantic distributions between synthetic and real-world captured datasets, there exists semantic mismatch between synthetic and refined images, which in turn results in the semantic distortion. Recently, contrastive learning (CL) has been successfully used to pull correlat… ▽ More

    Submitted 26 April, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

  13. arXiv:2304.00378  [pdf, other

    cs.AI cs.LG

    Knowledge Graph Embedding with 3D Compound Geometric Transformations

    Authors: Xiou Ge, Yun-Cheng Wang, Bin Wang, C. -C. Jay Kuo

    Abstract: The cascade of 2D geometric transformations were exploited to model relations between entities in a knowledge graph (KG), leading to an effective KG embedding (KGE) model, CompoundE. Furthermore, the rotation in the 3D space was proposed as a new KGE model, Rotate3D, by leveraging its non-commutative property. Inspired by CompoundE and Rotate3D, we leverage 3D compound geometric transformations, i… ▽ More

    Submitted 1 April, 2023; originally announced April 2023.

  14. arXiv:2303.10898  [pdf, other

    cs.CV cs.LG

    A Tiny Machine Learning Model for Point Cloud Object Classification

    Authors: Min Zhang, Jintang Xue, Pranav Kadam, Hardik Prajapati, Shan Liu, C. -C. Jay Kuo

    Abstract: The design of a tiny machine learning model, which can be deployed in mobile and edge devices, for point cloud object classification is investigated in this work. To achieve this objective, we replace the multi-scale representation of a point cloud object with a single-scale representation for complexity reduction, and exploit rich 3D geometric information of a point cloud object for performance i… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: 13 pages, 4 figures

  15. arXiv:2303.05759  [pdf, other

    cs.CL

    An Overview on Language Models: Recent Developments and Outlook

    Authors: Chengwei Wei, Yun-Cheng Wang, Bin Wang, C. -C. Jay Kuo

    Abstract: Language modeling studies the probability distributions over strings of texts. It is one of the most fundamental tasks in natural language processing (NLP). It has been widely used in text generation, speech recognition, machine translation, etc. Conventional language models (CLMs) aim to predict the probability of linguistic sequences in a causal manner, while pre-trained language models (PLMs) c… ▽ More

    Submitted 3 July, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

    Report number: APSIPA Transactions on Signal and Information Processing: Vol. 13: No. 2, e101

  16. arXiv:2302.14193  [pdf, other

    cs.CV

    PointFlowHop: Green and Interpretable Scene Flow Estimation from Consecutive Point Clouds

    Authors: Pranav Kadam, Jiahao Gu, Shan Liu, C. -C. Jay Kuo

    Abstract: An efficient 3D scene flow estimation method called PointFlowHop is proposed in this work. PointFlowHop takes two consecutive point clouds and determines the 3D flow vectors for every point in the first point cloud. PointFlowHop decomposes the scene flow estimation task into a set of subtasks, including ego-motion compensation, object association and object-wise motion estimation. It follows the g… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

    Comments: 13 pages, 5 figures

  17. arXiv:2302.13596  [pdf, other

    eess.IV cs.CV

    LSR: A Light-Weight Super-Resolution Method

    Authors: Wei Wang, Xuejing Lei, Yueru Chen, Ming-Sui Lee, C. -C. Jay Kuo

    Abstract: A light-weight super-resolution (LSR) method from a single image targeting mobile applications is proposed in this work. LSR predicts the residual image between the interpolated low-resolution (ILR) and high-resolution (HR) images using a self-supervised framework. To lower the computational complexity, LSR does not adopt the end-to-end optimization deep networks. It consists of three modules: 1)… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

    Comments: 8 pages, 3 figures, 10 tables

    ACM Class: I.4.3

  18. arXiv:2302.11506  [pdf, other

    cs.CV

    S3I-PointHop: SO(3)-Invariant PointHop for 3D Point Cloud Classification

    Authors: Pranav Kadam, Hardik Prajapati, Min Zhang, Jintang Xue, Shan Liu, C. -C. Jay Kuo

    Abstract: Many point cloud classification methods are developed under the assumption that all point clouds in the dataset are well aligned with the canonical axes so that the 3D Cartesian point coordinates can be employed to learn features. When input point clouds are not aligned, the classification performance drops significantly. In this work, we focus on a mathematically transparent point cloud classific… ▽ More

    Submitted 22 February, 2023; originally announced February 2023.

    Comments: 5 pages, 3 figures

  19. arXiv:2301.08959  [pdf, other

    eess.IV cs.CV

    Successive Subspace Learning for Cardiac Disease Classification with Two-phase Deformation Fields from Cine MRI

    Authors: Xiaofeng Liu, Fangxu Xing, Hanna K. Gaggin, C. -C. Jay Kuo, Georges El Fakhri, Jonghye Woo

    Abstract: Cardiac cine magnetic resonance imaging (MRI) has been used to characterize cardiovascular diseases (CVD), often providing a noninvasive phenotyping tool.~While recently flourished deep learning based approaches using cine MRI yield accurate characterization results, the performance is often degraded by small training samples. In addition, many deep learning models are deemed a ``black box," for w… ▽ More

    Submitted 21 January, 2023; originally announced January 2023.

    Comments: ISBI 2023

  20. arXiv:2212.11484  [pdf, other

    cs.CV eess.IV

    SALVE: Self-supervised Adaptive Low-light Video Enhancement

    Authors: Zohreh Azizi, C. -C. Jay Kuo

    Abstract: A self-supervised adaptive low-light video enhancement method, called SALVE, is proposed in this work. SALVE first enhances a few key frames of an input low-light video using a retinex-based low-light image enhancement technique. For each keyframe, it learns a mapping from low-light image patches to enhanced ones via ridge regression. These mappings are then used to enhance the remaining frames in… ▽ More

    Submitted 21 February, 2023; v1 submitted 22 December, 2022; originally announced December 2022.

    Comments: 12 pages, 7 figures, 4 tables

  21. Recovering Sign Bits of DCT Coefficients in Digital Images as an Optimization Problem

    Authors: Ruiyuan Lin, Sheng Liu, Jun Jiang, Shujun Li, Chengqing Li, C. -C. Jay Kuo

    Abstract: Recovering unknown, missing, damaged, distorted, or lost information in DCT coefficients is a common task in multiple applications of digital image processing, including image compression, selective image encryption, and image communication. This paper investigates the recovery of sign bits in DCT coefficients of digital images, by proposing two different approximation methods to solve a mixed int… ▽ More

    Submitted 8 January, 2024; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: 22 pages, 8 figures

    MSC Class: 68P30

    Journal ref: Journal of Visual Communication and Image Representation, vol. 98, art. no. 104045, 2024

  22. arXiv:2210.03689  [pdf, ps, other

    eess.IV cs.CV

    GENHOP: An Image Generation Method Based on Successive Subspace Learning

    Authors: Xuejing Lei, Wei Wang, C. -C. Jay Kuo

    Abstract: Being different from deep-learning-based (DL-based) image generation methods, a new image generative model built upon successive subspace learning principle is proposed and named GenHop (an acronym of Generative PixelHop) in this work. GenHop consists of three modules: 1) high-to-low dimension reduction, 2) seed image generation, and 3) low-to-high dimension expansion. In the first module, it buil… ▽ More

    Submitted 7 October, 2022; originally announced October 2022.

    Comments: 10 pages, 5 figures, accepted by ISCAS 2022

  23. arXiv:2210.00965  [pdf, other

    cs.LG

    Green Learning: Introduction, Examples and Outlook

    Authors: C. -C. Jay Kuo, Azad M. Madni

    Abstract: Rapid advances in artificial intelligence (AI) in the last decade have largely been built upon the wide applications of deep learning (DL). However, the high carbon footprint yielded by larger and larger DL networks becomes a concern for sustainability. Furthermore, DL decision mechanism is somewhat obsecure and can only be verified by test data. Green learning (GL) has been proposed as an alterna… ▽ More

    Submitted 3 October, 2022; originally announced October 2022.

    Journal ref: Journal of Visual Communication and Image Representation 2022

  24. arXiv:2209.12139  [pdf, other

    cs.CV

    Lightweight Image Codec via Multi-Grid Multi-Block-Size Vector Quantization (MGBVQ)

    Authors: Yifan Wang, Zhanxuan Mei, Ioannis Katsavounidis, C. -C. Jay Kuo

    Abstract: A multi-grid multi-block-size vector quantization (MGBVQ) method is proposed for image coding in this work. The fundamental idea of image coding is to remove correlations among pixels before quantization and entropy coding, e.g., the discrete cosine transform (DCT) and intra predictions, adopted by modern image coding standards. We present a new method to remove pixel correlations. First, by decom… ▽ More

    Submitted 25 September, 2022; originally announced September 2022.

    Comments: GIC-python-v2

  25. arXiv:2209.11549  [pdf, other

    cs.CV cs.AI cs.LG

    MAGIC: Mask-Guided Image Synthesis by Inverting a Quasi-Robust Classifier

    Authors: Mozhdeh Rouhsedaghat, Masoud Monajatipoor, C. -C. Jay Kuo, Iacopo Masi

    Abstract: We offer a method for one-shot mask-guided image synthesis that allows controlling manipulations of a single image by inverting a quasi-robust classifier equipped with strong regularizers. Our proposed method, entitled MAGIC, leverages structured gradients from a pre-trained quasi-robust classifier to better preserve the input semantics while preserving its classification accuracy, thereby guarant… ▽ More

    Submitted 30 June, 2023; v1 submitted 23 September, 2022; originally announced September 2022.

    Comments: Accepted to the Thirty-Seventh Conference on Artificial Intelligence (AAAI) 2023 - 12 pages, 9 figures

  26. arXiv:2208.09137  [pdf, other

    cs.AI

    GreenKGC: A Lightweight Knowledge Graph Completion Method

    Authors: Yun-Cheng Wang, Xiou Ge, Bin Wang, C. -C. Jay Kuo

    Abstract: Knowledge graph completion (KGC) aims to discover missing relationships between entities in knowledge graphs (KGs). Most prior KGC work focuses on learning embeddings for entities and relations through a simple scoring function. Yet, a higher-dimensional embedding space is usually required for a better reasoning capability, which leads to a larger model size and hinders applicability to real-world… ▽ More

    Submitted 9 July, 2023; v1 submitted 18 August, 2022; originally announced August 2022.

    Comments: Accepted to ACL2023

  27. arXiv:2208.07769  [pdf, other

    cs.CV cs.AI cs.LG eess.IV eess.SP

    Unsupervised Domain Adaptation for Segmentation with Black-box Source Model

    Authors: Xiaofeng Liu, Chaehwa Yoo, Fangxu Xing, C. -C. Jay Kuo, Georges El Fakhri, Jonghye Woo

    Abstract: Unsupervised domain adaptation (UDA) has been widely used to transfer knowledge from a labeled source domain to an unlabeled target domain to counter the difficulty of labeling in a new domain. The training of conventional solutions usually relies on the existence of both source and target domain data. However, privacy of the large-scale and well-labeled data in the source domain and trained model… ▽ More

    Submitted 16 August, 2022; originally announced August 2022.

    Comments: SPIE Medical Imaging 2022: Image Processing

  28. arXiv:2208.07754  [pdf, other

    cs.CV cs.AI cs.LG eess.IV eess.SP

    Subtype-Aware Dynamic Unsupervised Domain Adaptation

    Authors: Xiaofeng Liu, Fangxu Xing, Jia You, Jun Lu, C. -C. Jay Kuo, Georges El Fakhri, Jonghye Woo

    Abstract: Unsupervised domain adaptation (UDA) has been successfully applied to transfer knowledge from a labeled source domain to target domains without their labels. Recently introduced transferable prototypical networks (TPN) further addresses class-wise conditional alignment. In TPN, while the closeness of class centers between source and target domains is explicitly enforced in a latent space, the unde… ▽ More

    Submitted 16 August, 2022; originally announced August 2022.

    Comments: IEEE Transactions on Neural Networks and Learning Systems (TNNLS)

  29. arXiv:2208.07023  [pdf, ps, other

    cs.LG

    Acceleration of Subspace Learning Machine via Particle Swarm Optimization and Parallel Processing

    Authors: Hongyu Fu, Yijing Yang, Yuhuai Liu, Joseph Lin, Ethan Harrison, Vinod K. Mishra, C. -C. Jay Kuo

    Abstract: Built upon the decision tree (DT) classification and regression idea, the subspace learning machine (SLM) has been recently proposed to offer higher performance in general classification and regression tasks. Its performance improvement is reached at the expense of higher computational complexity. In this work, we investigate two ways to accelerate SLM. First, we adopt the particle swarm optimizat… ▽ More

    Submitted 15 August, 2022; originally announced August 2022.

  30. arXiv:2208.02932  [pdf, other

    cs.AI cs.HC cs.LG

    Human Decision Makings on Curriculum Reinforcement Learning with Difficulty Adjustment

    Authors: Yilei Zeng, Jiali Duan, Yang Li, Emilio Ferrara, Lerrel Pinto, C. -C. Jay Kuo, Stefanos Nikolaidis

    Abstract: Human-centered AI considers human experiences with AI performance. While abundant research has been helping AI achieve superhuman performance either by fully automatic or weak supervision learning, fewer endeavors are experimenting with how AI can tailor to humans' preferred skill level given fine-grained input. In this work, we guide the curriculum reinforcement learning results towards a preferr… ▽ More

    Submitted 4 August, 2022; originally announced August 2022.

    Comments: 6 pages, 7 figures

    ACM Class: I.2.6

  31. arXiv:2208.01823  [pdf, other

    cs.CV

    Statistical Attention Localization (SAL): Methodology and Application to Object Classification

    Authors: Yijing Yang, Vasileios Magoulianitis, Xinyu Wang, C. -C. Jay Kuo

    Abstract: A statistical attention localization (SAL) method is proposed to facilitate the object classification task in this work. SAL consists of three steps: 1) preliminary attention window selection via decision statistics, 2) attention map refinement, and 3) rectangular attention region finalization. SAL computes soft-decision scores of local squared windows and uses them to identify salient regions in… ▽ More

    Submitted 2 August, 2022; originally announced August 2022.

    Comments: 11 pages, 9 figures

  32. arXiv:2208.00475  [pdf, other

    cs.CV

    Augmenting Vision Language Pretraining by Learning Codebook with Visual Semantics

    Authors: Xiaoyuan Guo, Jiali Duan, C. -C. Jay Kuo, Judy Wawira Gichoya, Imon Banerjee

    Abstract: Language modality within the vision language pretraining framework is innately discretized, endowing each word in the language vocabulary a semantic meaning. In contrast, visual modality is inherently continuous and high-dimensional, which potentially prohibits the alignment as well as fusion between vision and language modalities. We therefore propose to "discretize" the visual representation by… ▽ More

    Submitted 31 July, 2022; originally announced August 2022.

    Comments: 7 pages, 4 figures, ICPR2022. arXiv admin note: text overlap with arXiv:2203.00048

  33. Enhancing Image Rescaling using Dual Latent Variables in Invertible Neural Network

    Authors: Min Zhang, Zhihong Pan, Xin Zhou, C. -C. Jay Kuo

    Abstract: Normalizing flow models have been used successfully for generative image super-resolution (SR) by approximating complex distribution of natural images to simple tractable distribution in latent space through Invertible Neural Networks (INN). These models can generate multiple realistic SR images from one low-resolution (LR) input using randomly sampled points in the latent space, simulating the il… ▽ More

    Submitted 24 July, 2022; originally announced July 2022.

    Comments: Accepted by ACM Multimedia 2022

    ACM Class: I.4.5

  34. arXiv:2207.07629  [pdf, other

    cs.CV

    GUSOT: Green and Unsupervised Single Object Tracking for Long Video Sequences

    Authors: Zhiruo Zhou, Hongyu Fu, Suya You, C. -C. Jay Kuo

    Abstract: Supervised and unsupervised deep trackers that rely on deep learning technologies are popular in recent years. Yet, they demand high computational complexity and a high memory cost. A green unsupervised single-object tracker, called GUSOT, that aims at object tracking for long videos under a resource-constrained environment is proposed in this work. Built upon a baseline tracker, UHP-SOT++, which… ▽ More

    Submitted 15 July, 2022; originally announced July 2022.

  35. arXiv:2207.05324  [pdf, other

    cs.AI cs.CL cs.LG

    CompoundE: Knowledge Graph Embedding with Translation, Rotation and Scaling Compound Operations

    Authors: Xiou Ge, Yun-Cheng Wang, Bin Wang, C. -C. Jay Kuo

    Abstract: Translation, rotation, and scaling are three commonly used geometric manipulation operations in image processing. Besides, some of them are successfully used in developing effective knowledge graph embedding (KGE) models such as TransE and RotatE. Inspired by the synergy, we propose a new KGE model by leveraging all three operations in this work. Since translation, rotation, and scaling operations… ▽ More

    Submitted 12 July, 2022; originally announced July 2022.

    Comments: 16 pages

  36. arXiv:2206.10029  [pdf, other

    cs.CL

    SynWMD: Syntax-aware Word Mover's Distance for Sentence Similarity Evaluation

    Authors: Chengwei Wei, Bin Wang, C. -C. Jay Kuo

    Abstract: Word Mover's Distance (WMD) computes the distance between words and models text similarity with the moving cost between words in two text sequences. Yet, it does not offer good performance in sentence similarity evaluation since it does not incorporate word importance and fails to take inherent contextual and structural information in a sentence into account. An improved WMD method using the synta… ▽ More

    Submitted 20 June, 2022; originally announced June 2022.

  37. arXiv:2206.09061  [pdf, other

    cs.CV

    Design of Supervision-Scalable Learning Systems: Methodology and Performance Benchmarking

    Authors: Yijing Yang, Hongyu Fu, C. -C. Jay Kuo

    Abstract: The design of robust learning systems that offer stable performance under a wide range of supervision degrees is investigated in this work. We choose the image classification problem as an illustrative example and focus on the design of modularized systems that consist of three learning modules: representation learning, feature learning and decision learning. We discuss ways to adjust each module… ▽ More

    Submitted 16 August, 2022; v1 submitted 17 June, 2022; originally announced June 2022.

    Comments: 16 pages, 12 figures, 4 tables, under consideration at Pattern Recognition

  38. arXiv:2206.00162  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    PAGER: Progressive Attribute-Guided Extendable Robust Image Generation

    Authors: Zohreh Azizi, C. -C. Jay Kuo

    Abstract: This work presents a generative modeling approach based on successive subspace learning (SSL). Unlike most generative models in the literature, our method does not utilize neural networks to analyze the underlying source distribution and synthesize images. The resulting method, called the progressive attribute-guided extendable robust image generative (PAGER) model, has advantages in mathematical… ▽ More

    Submitted 22 August, 2022; v1 submitted 31 May, 2022; originally announced June 2022.

    Comments: 19 pages, 12 figures, 2 tables

  39. arXiv:2205.05296  [pdf, other

    cs.LG

    Subspace Learning Machine (SLM): Methodology and Performance

    Authors: Hongyu Fu, Yijing Yang, Vinod K. Mishra, C. -C. Jay Kuo

    Abstract: Inspired by the feedforward multilayer perceptron (FF-MLP), decision tree (DT) and extreme learning machine (ELM), a new classification model, called the subspace learning machine (SLM), is proposed in this work. SLM first identifies a discriminant subspace, $S^0$, by examining the discriminant power of each input feature. Then, it uses probabilistic projections of features in $S^0$ to yield 1D su… ▽ More

    Submitted 11 May, 2022; originally announced May 2022.

  40. arXiv:2205.00211  [pdf, other

    cs.CV

    DefakeHop++: An Enhanced Lightweight Deepfake Detector

    Authors: Hong-Shuo Chen, Shuowen Hu, Suya You, C. -C. Jay Kuo

    Abstract: On the basis of DefakeHop, an enhanced lightweight Deepfake detector called DefakeHop++ is proposed in this work. The improvements lie in two areas. First, DefakeHop examines three facial regions (i.e., two eyes and mouth) while DefakeHop++ includes eight more landmarks for broader coverage. Second, for discriminant features selection, DefakeHop uses an unsupervised approach while DefakeHop++ adop… ▽ More

    Submitted 30 April, 2022; originally announced May 2022.

  41. arXiv:2204.08646  [pdf, other

    cs.LG cs.AI

    Label Efficient Regularization and Propagation for Graph Node Classification

    Authors: Tian Xie, Rajgopal Kannan, C. -C. Jay Kuo

    Abstract: An enhanced label propagation (LP) method called GraphHop was proposed recently. It outperforms graph convolutional networks (GCNs) in the semi-supervised node classification task on various networks. Although the performance of GraphHop was explained intuitively with joint node attribute and label signal smoothening, its rigorous mathematical treatment is lacking. In this paper, we propose a labe… ▽ More

    Submitted 30 October, 2022; v1 submitted 18 April, 2022; originally announced April 2022.

  42. arXiv:2203.14887  [pdf, other

    eess.IV cs.CV

    HUNIS: High-Performance Unsupervised Nuclei Instance Segmentation

    Authors: Vasileios Magoulianitis, Yijing Yang, C. -C. Jay Kuo

    Abstract: A high-performance unsupervised nuclei instance segmentation (HUNIS) method is proposed in this work. HUNIS consists of two-stage block-wise operations. The first stage includes: 1) adaptive thresholding of pixel intensities, 2) incorporation of nuclei size/shape priors and 3) removal of false positive nuclei instances. Then, HUNIS conducts the second stage segmentation by receiving guidance from… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: 8 pages, 3 figures, 3 tables

  43. arXiv:2203.11924  [pdf, other

    cs.LG

    On Supervised Feature Selection from High Dimensional Feature Spaces

    Authors: Yijing Yang, Wei Wang, Hongyu Fu, C. -C. Jay Kuo

    Abstract: The application of machine learning to image and video data often yields a high dimensional feature space. Effective feature selection techniques identify a discriminant feature subspace that lowers computational and modeling costs with little performance degradation. A novel supervised feature selection methodology is proposed for machine learning decisions in this work. The resulting tests are c… ▽ More

    Submitted 19 June, 2022; v1 submitted 22 March, 2022; originally announced March 2022.

    Comments: 14 pages, 9 figures, 9 tables, under consideration at APSIPA Transactions on Signal and Information Processing

  44. arXiv:2203.02679  [pdf, other

    cs.CL cs.AI

    Just Rank: Rethinking Evaluation with Word and Sentence Similarities

    Authors: Bin Wang, C. -C. Jay Kuo, Haizhou Li

    Abstract: Word and sentence embeddings are useful feature representations in natural language processing. However, intrinsic evaluation for embeddings lags far behind, and there has been no significant update since the past decade. Word and sentence similarity tasks have become the de facto evaluation method. It leads models to overfit to such evaluations, negatively impacting embedding models' development.… ▽ More

    Submitted 21 March, 2022; v1 submitted 5 March, 2022; originally announced March 2022.

    Comments: Accepted as Main Conference for ACL 2022. Code: https://github.com/BinWang28/EvalRank-Embedding-Evaluation

  45. arXiv:2202.07843  [pdf, other

    cs.CV

    PCRP: Unsupervised Point Cloud Object Retrieval and Pose Estimation

    Authors: Pranav Kadam, Qingyang Zhou, Shan Liu, C. -C. Jay Kuo

    Abstract: An unsupervised point cloud object retrieval and pose estimation method, called PCRP, is proposed in this work. It is assumed that there exists a gallery point cloud set that contains point cloud objects with given pose orientation information. PCRP attempts to register the unknown point cloud object with those in the gallery set so as to achieve content-based object retrieval and pose estimation… ▽ More

    Submitted 15 February, 2022; originally announced February 2022.

    Comments: 8 pages, 3 figures

  46. arXiv:2112.12284  [pdf, other

    cs.MM eess.IV

    A Survey on Perceptually Optimized Video Coding

    Authors: Yun Zhang, Linwei Zhu, Gangyi Jiang, Sam Kwong, C. -C. Jay Kuo

    Abstract: To provide users with more realistic visual experiences, videos are developing in the trends of Ultra High Definition (UHD), High Frame Rate (HFR), High Dynamic Range (HDR), Wide Color Gammut (WCG) and high clarity. However, the data amount of videos increases exponentially, which requires high efficiency video compression for storage and network transmission. Perceptually optimized video coding a… ▽ More

    Submitted 15 November, 2022; v1 submitted 22 December, 2021; originally announced December 2021.

    Comments: 36 pages, 12 figures, 6 tables, accepted by ACM Computing Surveys

  47. CORE: A Knowledge Graph Entity Type Prediction Method via Complex Space Regression and Embedding

    Authors: Xiou Ge, Yun-Cheng Wang, Bin Wang, C. -C. Jay Kuo

    Abstract: Entity type prediction is an important problem in knowledge graph (KG) research. A new KG entity type prediction method, named CORE (COmplex space Regression and Embedding), is proposed in this work. The proposed CORE method leverages the expressive power of two complex space embedding models; namely, RotatE and ComplEx models. It embeds entities and types in two different complex spaces using eit… ▽ More

    Submitted 19 December, 2021; originally announced December 2021.

    Journal ref: Pattern Recognit. Lett. 157 (2022) 97-103

  48. KGBoost: A Classification-based Knowledge Base Completion Method with Negative Sampling

    Authors: Yun-Cheng Wang, Xiou Ge, Bin Wang, C. -C. Jay Kuo

    Abstract: Knowledge base completion is formulated as a binary classification problem in this work, where an XGBoost binary classifier is trained for each relation using relevant links in knowledge graphs (KGs). The new method, named KGBoost, adopts a modularized design and attempts to find hard negative samples so as to train a powerful classifier for missing link prediction. We conduct experiments on multi… ▽ More

    Submitted 17 December, 2021; originally announced December 2021.

    Journal ref: Pattern Recognition Letters, 2022

  49. arXiv:2112.04054  [pdf, other

    cs.CV

    GreenPCO: An Unsupervised Lightweight Point Cloud Odometry Method

    Authors: Pranav Kadam, Min Zhang, Jiahao Gu, Shan Liu, C. -C. Jay Kuo

    Abstract: Visual odometry aims to track the incremental motion of an object using the information captured by visual sensors. In this work, we study the point cloud odometry problem, where only the point cloud scans obtained by the LiDAR (Light Detection And Ranging) are used to estimate object's motion trajectory. A lightweight point cloud odometry solution is proposed and named the green point cloud odome… ▽ More

    Submitted 17 July, 2022; v1 submitted 7 December, 2021; originally announced December 2021.

    Comments: 10 pages, 5 figures

  50. arXiv:2111.07548  [pdf, other

    cs.CV

    Unsupervised Lightweight Single Object Tracking with UHP-SOT++

    Authors: Zhiruo Zhou, Hongyu Fu, Suya You, C. -C. Jay Kuo

    Abstract: An unsupervised, lightweight and high-performance single object tracker, called UHP-SOT, was proposed by Zhou et al. recently. As an extension, we present an enhanced version and name it UHP-SOT++ in this work. Built upon the foundation of the discriminative-correlation-filters-based (DCF-based) tracker, two new ingredients are introduced in UHP-SOT and UHP-SOT++: 1) background motion modeling and… ▽ More

    Submitted 6 April, 2022; v1 submitted 15 November, 2021; originally announced November 2021.

    Comments: updated content: comparison with state-of-the-art deep unsupervised methods