subscribe to arXiv mailings

Texture Edge detection by Patch consensus (TEP)

Abstract: We propose Texture Edge detection using Patch consensus (TEP) which is a training-free method to detect the boundary of texture. We propose a new simple way to identify the texture edge location, using the consensus of segmented local patch information. While on the boundary, even using local patch information, the distinction between textures are typically not clear, but using neighbor consensus… ▽ More We propose Texture Edge detection using Patch consensus (TEP) which is a training-free method to detect the boundary of texture. We propose a new simple way to identify the texture edge location, using the consensus of segmented local patch information. While on the boundary, even using local patch information, the distinction between textures are typically not clear, but using neighbor consensus give a clear idea of the boundary. We utilize local patch, and its response against neighboring regions, to emphasize the similarities and the differences across different textures. The step of segmentation of response further emphasizes the edge location, and the neighborhood voting gives consensus and stabilize the edge detection. We analyze texture as a stationary process to give insight into the patch width parameter verses the quality of edge detection. We derive the necessary condition for textures to be distinguished, and analyze the patch width with respect to the scale of textures. Various experiments are presented to validate the proposed model. △ Less

Submitted 16 March, 2024; originally announced March 2024.

arXiv:2308.00307 [pdf, other]

Domain Adaptation based on Human Feedback for Enhancing Generative Model Denoising Abilities

Authors: Hyun-Cheol Park, Sung Ho Kang

Abstract: How can we apply human feedback into generative model? As answer of this question, in this paper, we show the method applied on denoising problem and domain adaptation using human feedback. Deep generative models have demonstrated impressive results in image denoising. However, current image denoising models often produce inappropriate results when applied to domains different from the ones they w… ▽ More How can we apply human feedback into generative model? As answer of this question, in this paper, we show the method applied on denoising problem and domain adaptation using human feedback. Deep generative models have demonstrated impressive results in image denoising. However, current image denoising models often produce inappropriate results when applied to domains different from the ones they were trained on. If there are `Good' and `Bad' result for unseen data, how to raise up quality of `Bad' result. Most methods use an approach based on generalization of model. However, these methods require target image for training or adapting unseen domain. In this paper, to adapting domain, we deal with non-target image for unseen domain, and improve specific failed image. To address this, we propose a method for fine-tuning inappropriate results generated in a different domain by utilizing human feedback. First, we train a generator to denoise images using only the noisy MNIST digit '0' images. The denoising generator trained on the source domain leads to unintended results when applied to target domain images. To achieve domain adaptation, we construct a noise-image denoising generated image data set and train a reward model predict human feedback. Finally, we fine-tune the generator on the different domain using the reward model with auxiliary loss function, aiming to transfer denoising capabilities to target domain. Our approach demonstrates the potential to efficiently fine-tune a generator trained on one domain using human feedback from another domain, thereby enhancing denoising abilities in different domains. △ Less

Submitted 1 August, 2023; originally announced August 2023.

arXiv:2211.03134 [pdf, other]

doi 10.1016/j.jcp.2023.112069

WeakIdent: Weak formulation for Identifying Differential Equations using Narrow-fit and Trimming

Authors: Mengyi Tang, Wenjing Liao, Rachel Kuske, Sung Ha Kang

Abstract: Data-driven identification of differential equations is an interesting but challenging problem, especially when the given data are corrupted by noise. When the governing differential equation is a linear combination of various differential terms, the identification problem can be formulated as solving a linear system, with the feature matrix consisting of linear and nonlinear terms multiplied by a… ▽ More Data-driven identification of differential equations is an interesting but challenging problem, especially when the given data are corrupted by noise. When the governing differential equation is a linear combination of various differential terms, the identification problem can be formulated as solving a linear system, with the feature matrix consisting of linear and nonlinear terms multiplied by a coefficient vector. This product is equal to the time derivative term, and thus generates dynamical behaviors. The goal is to identify the correct terms that form the equation to capture the dynamics of the given data. We propose a general and robust framework to recover differential equations using a weak formulation, for both ordinary and partial differential equations (ODEs and PDEs). The weak formulation facilitates an efficient and robust way to handle noise. For a robust recovery against noise and the choice of hyper-parameters, we introduce two new mechanisms, narrow-fit and trimming, for the coefficient support and value recovery, respectively. For each sparsity level, Subspace Pursuit is utilized to find an initial set of support from the large dictionary. Then, we focus on highly dynamic regions (rows of the feature matrix), and error normalize the feature matrix in the narrow-fit step. The support is further updated via trimming of the terms that contribute the least. Finally, the support set of features with the smallest Cross-Validation error is chosen as the result. A comprehensive set of numerical experiments are presented for both systems of ODEs and PDEs with various noise levels. The proposed method gives a robust recovery of the coefficients, and a significant denoising effect which can handle up to $100\%$ noise-to-signal ratio for some equations. We compare the proposed method with several state-of-the-art algorithms for the recovery of differential equations. △ Less

Submitted 6 November, 2022; originally announced November 2022.

MSC Class: 65N99(Primary) 37M99; 65Z05(Secondary) ACM Class: G.1.8; G.1.10

arXiv:2201.05724 [pdf, ps, other]

StemP: A fast and deterministic Stem-graph approach for RNA and protein folding prediction

Authors: Mengyi Tang, Kumbit Hwang, Sung Ha Kang

Abstract: We propose a new deterministic methodology to predict RNA sequence and protein folding. Is stem enough for structure prediction? The main idea is to consider all possible stem formation in the given sequence. With the stem loop energy and the strength of stem, we explore how to deterministically utilize stem information for RNA sequence and protein folding structure prediction. We use graph notati… ▽ More We propose a new deterministic methodology to predict RNA sequence and protein folding. Is stem enough for structure prediction? The main idea is to consider all possible stem formation in the given sequence. With the stem loop energy and the strength of stem, we explore how to deterministically utilize stem information for RNA sequence and protein folding structure prediction. We use graph notation, where all possible stems are represented as vertices, and co-existence as edges. This full Stem-graph presents all possible folding structure, and we pick sub-graph(s) which give the best matching energy for folding structure prediction. We introduce a Stem-Loop score to add structure information and to speed up the computation. The proposed method can handle secondary structure prediction as well as protein folding with pseudo knots. Numerical experiments are done using a laptop and results take only a few minutes or seconds. One of the strengths of this approach is in the simplicity and flexibility of the algorithm, and it gives deterministic answer. We explore protein sequences from Protein Data Bank, rRNA 5S sequences, and tRNA sequences from the Gutell Lab. Various experiments and comparisons are included to validate the propose method. △ Less

Submitted 14 January, 2022; originally announced January 2022.

MSC Class: 92-10 (Primary) 68R99 (Secondary) ACM Class: G.2.3

arXiv:2110.08365 [pdf, other]

doi 10.1016/j.jvcir.2022.103527

Counting Objects by Diffused Index: geometry-free and training-free approach

Authors: Mengyi Tang, Maryam Yashtini, Sung Ha Kang

Abstract: Counting objects is a fundamental but challenging problem. In this paper, we propose diffusion-based, geometry-free, and learning-free methodologies to count the number of objects in images. The main idea is to represent each object by a unique index value regardless of its intensity or size, and to simply count the number of index values. First, we place different vectors, refer to as seed vector… ▽ More Counting objects is a fundamental but challenging problem. In this paper, we propose diffusion-based, geometry-free, and learning-free methodologies to count the number of objects in images. The main idea is to represent each object by a unique index value regardless of its intensity or size, and to simply count the number of index values. First, we place different vectors, refer to as seed vectors, uniformly throughout the mask image. The mask image has boundary information of the objects to be counted. Secondly, the seeds are diffused using an edge-weighted harmonic variational optimization model within each object. We propose an efficient algorithm based on an operator splitting approach and alternating direction minimization method, and theoretical analysis of this algorithm is given. An optimal solution of the model is obtained when the distributed seeds are completely diffused such that there is a unique intensity within each object, which we refer to as an index. For computational efficiency, we stop the diffusion process before a full convergence, and propose to cluster these diffused index values. We refer to this approach as Counting Objects by Diffused Index (CODI). We explore scalar and multi-dimensional seed vectors. For Scalar seeds, we use Gaussian fitting in histogram to count, while for vector seeds, we exploit a high-dimensional clustering method for the final step of counting via clustering. The proposed method is flexible even if the boundary of the object is not clear nor fully enclosed. We present counting results in various applications such as biological cells, agriculture, concert crowd, and transportation. Some comparisons with existing methods are presented. △ Less

Submitted 15 October, 2021; originally announced October 2021.

MSC Class: 65Z05(Primary); 65S05(Secondary) ACM Class: I.4.9; I.4.6; I.5.5

arXiv:2104.02680 [pdf, ps, other]

A New Parallel Adaptive Clustering and its Application to Streaming Data

Authors: Benjamin McLaughlin, Sung Ha Kang

Abstract: This paper presents a parallel adaptive clustering (PAC) algorithm to automatically classify data while simultaneously choosing a suitable number of classes. Clustering is an important tool for data analysis and understanding in a broad set of areas including data reduction, pattern analysis, and classification. However, the requirement to specify the number of clusters in advance and the computat… ▽ More This paper presents a parallel adaptive clustering (PAC) algorithm to automatically classify data while simultaneously choosing a suitable number of classes. Clustering is an important tool for data analysis and understanding in a broad set of areas including data reduction, pattern analysis, and classification. However, the requirement to specify the number of clusters in advance and the computational burden associated with clustering large sets of data persist as challenges in clustering. We propose a new parallel adaptive clustering (PAC) algorithm that addresses these challenges by adaptively computing the number of clusters and leveraging the power of parallel computing. The algorithm clusters disjoint subsets of the data on parallel computation threads. We develop regularized set \mi{k}-means to efficiently cluster the results from the parallel threads. A refinement step further improves the clusters. The PAC algorithm offers the capability to adaptively cluster data sets which change over time by reusing the information from previous time steps to decrease computation. We provide theoretical analysis and numerical experiments to characterize the performance of the method, validate its properties, and demonstrate the computational efficiency of the method. △ Less

Submitted 6 April, 2021; originally announced April 2021.

Comments: This work was funded by NAVSEA. Distribution Statement A: Approved for Public Release, Distribution is Unlimited

arXiv:2007.12117 [pdf, other]

Silhouette Vectorization by Affine Scale-space

Authors: Yuchen He, Sung Ha Kang, Jean-Michel Morel

Abstract: Silhouettes or 2D planar shapes are extremely important in human communication, which involves many logos, graphics symbols and fonts in vector form. Many more shapes can be extracted from image by binarization or segmentation, thus in raster form that requires a vectorization. There is a need for disposing of a mathematically well defined and justified shape vectorization process, which in additi… ▽ More Silhouettes or 2D planar shapes are extremely important in human communication, which involves many logos, graphics symbols and fonts in vector form. Many more shapes can be extracted from image by binarization or segmentation, thus in raster form that requires a vectorization. There is a need for disposing of a mathematically well defined and justified shape vectorization process, which in addition provides a minimal set of control points with geometric meaning. In this paper we propose a silhouette vectorization method which extracts the outline of a 2D shape from a raster binary image, and converts it to a combination of cubic Bézier polygons and perfect circles. Starting from the boundary curvature extrema computed at sub-pixel level, we identify a set of control points based on the affine scale-space induced by the outline. These control points capture similarity invariant geometric features of the given silhouette and give precise locations of the shape's corners.of the given silhouette. Then, piecewise Bézier cubics are computed by least-square fitting combined with an adaptive splitting to guarantee a predefined accuracy. When there are no curvature extrema identified, either the outline is recognized as a circle using the isoperimetric inequality, or a pair of the most distant outline points are chosen to initiate the fitting. Given their construction, most of our control points are geometrically stable under affine transformations. By comparing with other feature detectors, we show that our method can be used as a reliable feature point detector for silhouettes. Compared to state-of-the-art image vectorization software, our algorithm demonstrates superior reduction on the number of control points, while maintaining high accuracy. △ Less

Submitted 23 July, 2020; originally announced July 2020.

arXiv:2001.07884 [pdf, other]

Curvature Regularized Surface Reconstruction from Point Cloud

Authors: Yuchen He, Sung Ha Kang, Hao Liu

Abstract: We propose a variational functional and fast algorithms to reconstruct implicit surface from point cloud data with a curvature constraint. The minimizing functional balances the distance function from the point cloud and the mean curvature term. Only the point location is used, without any local normal or curvature estimation at each point. With the added curvature constraint, the computation beco… ▽ More We propose a variational functional and fast algorithms to reconstruct implicit surface from point cloud data with a curvature constraint. The minimizing functional balances the distance function from the point cloud and the mean curvature term. Only the point location is used, without any local normal or curvature estimation at each point. With the added curvature constraint, the computation becomes particularly challenging. To enhance the computational efficiency, we solve the problem by a novel operator splitting scheme. It replaces the original high-order PDEs by a decoupled PDE system, which is solved by a semi-implicit method. We also discuss approach using an augmented Lagrangian method. The proposed method shows robustness against noise, and recovers concave features and sharp corners better compared to models without curvature constraint. Numerical experiments in two and three dimensional data sets, noisy and sparse data are presented to validate the model. △ Less

Submitted 9 September, 2020; v1 submitted 22 January, 2020; originally announced January 2020.

Comments: 22 pages, 15 figures

MSC Class: 65D18

arXiv:1901.02520 [pdf, other]

Lattice Identification and Separation: Theory and Algorithm

Authors: Yuchen He, Sung Ha Kang

Abstract: Motivated by lattice mixture identification and grain boundary detection, we present a framework for lattice pattern representation and comparison, and propose an efficient algorithm for lattice separation. We define new scale and shape descriptors, which helps to considerably reduce the size of equivalence classes of lattice bases. These finitely many equivalence relations are fully characterized… ▽ More Motivated by lattice mixture identification and grain boundary detection, we present a framework for lattice pattern representation and comparison, and propose an efficient algorithm for lattice separation. We define new scale and shape descriptors, which helps to considerably reduce the size of equivalence classes of lattice bases. These finitely many equivalence relations are fully characterized by modular group theory. We construct the lattice space $\mathscr{L}$ based on the equivalent descriptors and define a metric $d_{\mathscr{L}}$ to accurately quantify the visual similarities and differences between lattices. Furthermore, we introduce the Lattice Identification and Separation Algorithm (LISA), which identifies each lattice patterns from superposed lattices. LISA finds lattice candidates from the high responses in the image spectrum, then sequentially extracts different layers of lattice patterns one by one. Analyzing the frequency components, we reveal the intricate dependency of LISA's performances on particle radius, lattice density, and relative translations. Various numerical experiments are designed to show LISA's robustness against a large number of lattice layers, moiré patterns and missing particles. △ Less

Submitted 19 December, 2018; originally announced January 2019.

Comments: 30 Pages plus 4 pages of Appendix. 4 Pages of References. 24 Figures

arXiv:1811.07889 [pdf]

doi 10.1080/21681163.2019.1674696

Automatic Three-Dimensional Cephalometric Annotation System Using Three-Dimensional Convolutional Neural Networks

Authors: Sung Ho Kang, Kiwan Jeon, Hak-Jin Kim, Jin Keun Seo, Sang-Hwy Lee

Abstract: Background: Three-dimensional (3D) cephalometric analysis using computerized tomography data has been rapidly adopted for dysmorphosis and anthropometry. Several different approaches to automatic 3D annotation have been proposed to overcome the limitations of traditional cephalometry. The purpose of this study was to evaluate the accuracy of our newly-developed system using a deep learning algorit… ▽ More Background: Three-dimensional (3D) cephalometric analysis using computerized tomography data has been rapidly adopted for dysmorphosis and anthropometry. Several different approaches to automatic 3D annotation have been proposed to overcome the limitations of traditional cephalometry. The purpose of this study was to evaluate the accuracy of our newly-developed system using a deep learning algorithm for automatic 3D cephalometric annotation. Methods: To overcome current technical limitations, some measures were developed to directly annotate 3D human skull data. Our deep learning-based model system mainly consisted of a 3D convolutional neural network and image data resampling. Results: The discrepancies between the referenced and predicted coordinate values in three axes and in 3D distance were calculated to evaluate system accuracy. Our new model system yielded prediction errors of 3.26, 3.18, and 4.81 mm (for three axes) and 7.61 mm (for 3D). Moreover, there was no difference among the landmarks of the three groups, including the midsagittal plane, horizontal plane, and mandible (p>0.05). Conclusion: A new 3D convolutional neural network-based automatic annotation system for 3D cephalometry was developed. The strategies used to implement the system were detailed and measurement results were evaluated for accuracy. Further development of this system is planned for full clinical application of automatic 3D cephalometric annotation. △ Less

Submitted 19 November, 2018; originally announced November 2018.

Showing 1–10 of 10 results for author: Kang, S H