-
Systematic literature review of the trust reinforcement mechanisms exist in package ecosystems
Authors:
Angel Temelko,
Fang Hou,
Siamak Farshidi,
Slinger Jansen
Abstract:
We conducted a thorough SLR to better grasp the challenges and possible solutions associated with existing npm security tools. Our goal was to delve into documented experiences and findings. Specifically, we were keen to learn about the motivations behind choosing third-party packages, software engineers' responses to warning messages, and their overall understanding of security issues. The main aim of this review was to pinpoint prevailing trends, methods, and concerns in trust tools for the present npm environment. Furthermore, we sought to understand the complexities of integrating SECO into platforms such as npm. By analyzing earlier studies, our intention was to spot any overlooked areas and steer our research to address them.
Submitted 26 June, 2024;
originally announced July 2024.
-
Learning Unsigned Distance Fields from Local Shape Functions for 3D Surface Reconstruction
Authors:
Jiangbei Hu,
Yanggeng Li,
Fei Hou,
Junhui Hou,
Zhebin Zhang,
Shengfa Wang,
Na Lei,
Ying He
Abstract:
Unsigned distance fields (UDFs) provide a versatile framework for representing a diverse array of 3D shapes, encompassing both watertight and non-watertight geometries. Traditional UDF learning methods typically require extensive training on large datasets of 3D shapes, which is costly and often necessitates hyperparameter adjustments for new datasets. This paper presents a novel neural framework, LoSF-UDF, for reconstructing surfaces from 3D point clouds by leveraging local shape functions to learn UDFs. We observe that 3D shapes manifest simple patterns within localized areas, prompting us to create a training dataset of point cloud patches characterized by mathematical functions that represent a continuum from smooth surfaces to sharp edges and corners. Our approach learns features within a specific radius around each query point and utilizes an attention mechanism to focus on the crucial features for UDF estimation. This method enables efficient and robust surface reconstruction from point clouds without the need for shape-specific training. Additionally, our method exhibits enhanced resilience to noise and outliers in point clouds compared to existing methods. We present comprehensive experiments and comparisons across various datasets, including synthetic and real-scanned point clouds, to validate our method's efficacy.
Submitted 1 July, 2024;
originally announced July 2024.
-
Multi-Functional Beamforming Design for Integrated Sensing, Communication, and Computation
Authors:
Yapeng Zhao,
Qingqing Wu,
Wen Chen,
Yong Zeng,
Ruiqi Liu,
Weidong Mei,
Fen Hou,
Shaodan Ma
Abstract:
Integrated sensing and communication (ISAC) systems may face a heavy computation burden since the sensory data needs to be further processed. This paper studies a novel system that integrates sensing, communication, and computation, aiming to provide services for different objectives efficiently. This system consists of a multi-antenna multi-functional base station (BS), an edge server, a target, and multiple single-antenna communication users. The BS needs to allocate the available resources to efficiently provide sensing, communication, and computation services. Due to the heavy service burden and limited power budget, the BS can partially offload the tasks to the nearby edge server instead of computing them locally. We consider the estimation of the target response matrix, a general problem in radar sensing, and utilize Cramer-Rao bound (CRB) as the corresponding performance metric. To tackle the non-convex optimization problem, we propose both semidefinite relaxation (SDR)-based alternating optimization and SDR-based successive convex approximation (SCA) algorithms to minimize the CRB of radar sensing while meeting the requirement of communication users and the need for task computing. Furthermore, we demonstrate that the optimal rank-one solutions of both the alternating and SCA algorithms can be directly obtained via the solver or further constructed even when dealing with multiple functionalities. Simulation results show that the proposed algorithms can provide higher target estimation performance than state-of-the-art benchmarks while satisfying the communication and computation constraints.
Submitted 1 July, 2024;
originally announced July 2024.
-
GS-Octree: Octree-based 3D Gaussian Splatting for Robust Object-level 3D Reconstruction Under Strong Lighting
Authors:
Jiaze Li,
Zhengyu Wen,
Luo Zhang,
Jiangbei Hu,
Fei Hou,
Zhebin Zhang,
Ying He
Abstract:
The 3D Gaussian Splatting technique has significantly advanced the construction of radiance fields from multi-view images, enabling real-time rendering. While point-based rasterization effectively reduces computational demands for rendering, it often struggles to accurately reconstruct the geometry of the target object, especially under strong lighting. To address this challenge, we introduce a novel approach that combines octree-based implicit surface representations with Gaussian splatting. Our method consists of four stages. Initially, it reconstructs a signed distance field (SDF) and a radiance field through volume rendering, encoding them in a low-resolution octree. The initial SDF represents the coarse geometry of the target object. Subsequently, it introduces 3D Gaussians as additional degrees of freedom, which are guided by the SDF. In the third stage, the optimized Gaussians further improve the accuracy of the SDF, allowing it to recover finer geometric details compared to the initial SDF obtained in the first stage. Finally, it adopts the refined SDF to further optimize the 3D Gaussians via splatting, eliminating those that contribute little to visual appearance. Experimental results show that our method, which leverages the distribution of 3D Gaussians with SDFs, reconstructs more accurate geometry, particularly in images with specular highlights caused by strong lighting.
Submitted 26 June, 2024;
originally announced June 2024.
-
Details Enhancement in Unsigned Distance Field Learning for High-fidelity 3D Surface Reconstruction
Authors:
Cheng Xu,
Fei Hou,
Wencheng Wang,
Hong Qin,
Zhebin Zhang,
Ying He
Abstract:
While Signed Distance Fields (SDF) are well-established for modeling watertight surfaces, Unsigned Distance Fields (UDF) broaden the scope to include open surfaces and models with complex inner structures. Despite their flexibility, UDFs encounter significant challenges in high-fidelity 3D reconstruction, such as non-differentiability at the zero level set, difficulty in achieving the exact zero value, numerous local minima, vanishing gradients, and oscillating gradient directions near the zero level set. To address these challenges, we propose Details Enhanced UDF (DEUDF) learning that integrates normal alignment and the SIREN network for capturing fine geometric details, adaptively weighted Eikonal constraints to address vanishing gradients near the target surface, unconditioned MLP-based UDF representation to relax non-negativity constraints, and a UDF-tailored method for extracting iso-surface with non-constant iso-values. These strategies collectively stabilize the learning process from unoriented point clouds and enhance the accuracy of UDFs. Our computational results demonstrate that DEUDF outperforms existing UDF learning methods in both accuracy and the quality of reconstructed surfaces. We will make the source code publicly available.
Submitted 1 June, 2024;
originally announced June 2024.
-
Diffusing Winding Gradients (DWG): A Parallel and Scalable Method for 3D Reconstruction from Unoriented Point Clouds
Authors:
Weizhou Liu,
Jiaze Li,
Xuhui Chen,
Fei Hou,
Shiqing Xin,
Xingce Wang,
Zhongke Wu,
Chen Qian,
Ying He
Abstract:
This paper presents a method for reconstructing watertight 3D surfaces from unoriented point clouds. Starting with randomly initialized normals, the method iteratively refines each normal by diffusing the gradient of the generalized winding number (GWN) field. Upon convergence, the target surface is extracted using the standard Marching Cubes algorithm. Our method is conceptually simple, easy to implement, and does not require numerical solvers, which distinguishes it from existing approaches. Designed for parallelization and scalability, it efficiently handles large-scale models on both CPUs and GPUs. Experimental results demonstrate that our method outperforms all existing methods in reconstructing from unoriented point clouds, particularly in terms of runtime performance. On large-scale models with 10 to 20 million points, our CUDA implementation on an NVIDIA GTX 4090 GPU is typically 30-100x faster than iPSR, the leading sequential method tested on a high-end PC with an Intel i9 CPU. Furthermore, our approach exhibits superior robustness against noise and effectively handles models with thin structures, surpassing existing methods. We will make the source code publicly available to encourage further research and applications.
Submitted 22 May, 2024;
originally announced May 2024.
-
Global existence and scattering of small data smooth solutions to a class of quasilinear wave systems on $\mathbb{R}^2\times\mathbb{T}$
Authors:
Fei Hou,
Fei Tao,
Huicheng Yin
Abstract:
In this paper, we are concerned with the global existence and scattering of small data smooth solutions to a class of quasilinear wave systems on the product space $\mathbb{R}^2\times\mathbb{T}$. These quasilinear wave systems include 3D irrotational potential flow equation of Chaplygin gases, 3D relativistic membrane equation, some 3D quasilinear wave equations which come from the corresponding Lagrangian functionals as perturbations of the Lagrangian densities of linear waves, and nonlinear wave maps system. Through looking for some suitable transformations of unknown functions, the nonlinear wave system can be reduced into a more tractable form. Subsequently, by applying the vector-field method together with the ghost weight technique as well as deriving some kinds of weighted $L^\infty-L^\infty$ and $L^\infty-L^2$ estimates of solution $w$ to the 2D linear wave equation $\Box w=f(t,x)$, the global existence and scattering of small data solutions are established.
Submitted 6 May, 2024;
originally announced May 2024.
-
A new social welfare function with a number of desirable properties
Authors:
Fujun Hou
Abstract:
By relaxing the dominating set in three ways (e.g., from "each member beats every non-member" to "each member beats or ties every non-member, with an additional requirement that at least one member beat every non-member"), we propose a new social welfare function, which satisfies a number of desirable properties including Condorcet winner principle, Condorcet loser principle, strong Gehrlein-stability (hence Smith set principle), anonymity, neutrality, weak Pareto, strong Pareto, non-dictatorship, and [independence of irrelevant alternatives (IIA) when the pairwise majority relation is an ordering on the alternative set]. If the pairwise majority relation is complete and transitive, the proposed method yields a collective preference relation that coincides with the input majority relation. It thus shares the same collective preference function on the dichotomous domain with the approval voting and the majority voting. It runs in polynomial time and thus possesses a competitive advantage over a number of computationally intractable voting rules such as the Dodgson's rule, the Kemeny's rule, the Slater's rule, the Banks rule, and the Schwartz's tournament equilibrium set (TEQ) rule. When it is used in tournaments, its winner belongs to the uncovered set, the top cycle set, the Smith set, and the Schwartz set. In addition, in a tournament where the number of alternatives is not more than 4, its winner set is a subset, sometimes proper, of the Copeland winner set. Whether this attractive argument is still valid in four-more-alternative tournaments remains an open question.
Submitted 24 March, 2024;
originally announced March 2024.
-
Freshness-aware Resource Allocation for Non-orthogonal Wireless-powered IoT Networks
Authors:
Yunfeng Chen,
Yong Liu,
Jinhao Xiao,
Qunying Wu,
Han Zhang,
Fen Hou
Abstract:
This paper investigates a wireless-powered Internet of Things (IoT) network comprising a hybrid access point (HAP) and two devices. The HAP facilitates downlink wireless energy transfer (WET) for device charging and uplink wireless information transfer (WIT) to collect status updates from the devices. To keep the information fresh, concurrent WET and WIT are allowed, and orthogonal multiple access (OMA) and non-orthogonal multiple access (NOMA) are adaptively scheduled for WIT. Consequently, we formulate an expected weighted sum age of information (EWSAoI) minimization problem to adaptively schedule the transmission scheme, choosing from WET, OMA, NOMA, and WET+OMA, and to allocate transmit power. To address this, we reformulate the problem as a Markov decision process (MDP) and develop an optimal policy based on instantaneous AoI and remaining battery power to determine scheme selection and transmit power allocation. Extensive results demonstrate the effectiveness of the proposed policy, and the optimal policy has a distinct decision boundary-switching property, providing valuable insights for practical system design.
Submitted 27 February, 2024;
originally announced March 2024.
-
DomainVerse: A Benchmark Towards Real-World Distribution Shifts For Tuning-Free Adaptive Domain Generalization
Authors:
Feng Hou,
Jin Yuan,
Ying Yang,
Yang Liu,
Yang Zhang,
Cheng Zhong,
Zhongchao Shi,
Jianping Fan,
Yong Rui,
Zhiqiang He
Abstract:
Traditional cross-domain tasks, including domain adaptation and domain generalization, rely heavily on training model by source domain data. With the recent advance of vision-language models (VLMs), viewed as natural source models, the cross-domain task changes to directly adapt the pre-trained source model to arbitrary target domains equipped with prior domain knowledge, and we name this task Adaptive Domain Generalization (ADG). However, current cross-domain datasets have many limitations, such as unrealistic domains, unclear domain definitions, and the inability to fine-grained domain decomposition, which drives us to establish a novel dataset DomainVerse for ADG. Benefiting from the introduced hierarchical definition of domain shifts, DomainVerse consists of about 0.5 million images from 390 fine-grained realistic domains. With the help of the constructed DomainVerse and VLMs, we propose two methods called Domain CLIP and Domain++ CLIP for tuning-free adaptive domain generalization. Extensive and comprehensive experiments demonstrate the significance of the dataset and the effectiveness of the proposed methods.
Submitted 5 March, 2024;
originally announced March 2024.
-
Topology-Aware Latent Diffusion for 3D Shape Generation
Authors:
Jiangbei Hu,
Ben Fei,
Baixin Xu,
Fei Hou,
Weidong Yang,
Shengfa Wang,
Na Lei,
Chen Qian,
Ying He
Abstract:
We introduce a new generative model that combines latent diffusion with persistent homology to create 3D shapes with high diversity, with a special emphasis on their topological characteristics. Our method involves representing 3D shapes as implicit fields, then employing persistent homology to extract topological features, including Betti numbers and persistence diagrams. The shape generation process consists of two steps. Initially, we employ a transformer-based autoencoding module to embed the implicit representation of each 3D shape into a set of latent vectors. Subsequently, we navigate through the learned latent space via a diffusion model. By strategically incorporating topological features into the diffusion process, our generative module is able to produce a richer variety of 3D shapes with different topological structures. Furthermore, our framework is flexible, supporting generation tasks constrained by a variety of inputs, including sparse and partial point clouds, as well as sketches. By modifying the persistence diagrams, we can alter the topology of the shapes generated from these input modalities.
Submitted 31 January, 2024;
originally announced January 2024.
-
PhasePerturbation: Speech Data Augmentation via Phase Perturbation for Automatic Speech Recognition
Authors:
Chengxi Lei,
Satwinder Singh,
Feng Hou,
Xiaoyun Jia,
Ruili Wang
Abstract:
Most of the current speech data augmentation methods operate on either the raw waveform or the amplitude spectrum of speech. In this paper, we propose a novel speech data augmentation method called PhasePerturbation that operates dynamically on the phase spectrum of speech. Instead of statically rotating a phase by a constant degree, PhasePerturbation utilizes three dynamic phase spectrum operations, i.e., a randomization operation, a frequency masking operation, and a temporal masking operation, to enhance the diversity of speech data. We conduct experiments on wav2vec2.0 pre-trained ASR models by fine-tuning them with the PhasePerturbation augmented TIMIT corpus. The experimental results demonstrate 10.9% relative reduction in the word error rate (WER) compared with the baseline model fine-tuned without any augmentation operation. Furthermore, the proposed method achieves additional improvements (12.9% and 15.9%) in WER by complementing the Vocal Tract Length Perturbation (VTLP) and the SpecAug, which are both amplitude spectrum-based augmentation methods. The results highlight the capability of PhasePerturbation to improve the current amplitude spectrum-based augmentation methods.
Submitted 13 December, 2023;
originally announced December 2023.
-
Parameterization-driven Neural Surface Reconstruction for Object-oriented Editing in Neural Rendering
Authors:
Baixin Xu,
Jiangbei Hu,
Fei Hou,
Kwan-Yee Lin,
Wayne Wu,
Chen Qian,
Ying He
Abstract:
The advancements in neural rendering have increased the need for techniques that enable intuitive editing of 3D objects represented as neural implicit surfaces. This paper introduces a novel neural algorithm for parameterizing neural implicit surfaces to simple parametric domains like spheres and polycubes. Our method allows users to specify the number of cubes in the parametric domain, learning a configuration that closely resembles the target 3D object's geometry. It computes bi-directional deformation between the object and the domain using a forward mapping from the object's zero level set and an inverse deformation for backward mapping. We ensure nearly bijective mapping with a cycle loss and optimize deformation smoothness. The parameterization quality, assessed by angle and area distortions, is guaranteed using a Laplacian regularizer and an optimized learned parametric domain. Our framework integrates with existing neural rendering pipelines, using multi-view images of a single object or multiple objects of similar geometries to reconstruct 3D geometry and compute texture maps automatically, eliminating the need for any prior information. We demonstrate the method's effectiveness on images of human heads and man-made objects.
Submitted 13 July, 2024; v1 submitted 9 October, 2023;
originally announced October 2023.
-
Robust Zero Level-Set Extraction from Unsigned Distance Fields Based on Double Covering
Authors:
Fei Hou,
Xuhui Chen,
Wencheng Wang,
Hong Qin,
Ying He
Abstract:
In this paper, we propose a new method, called DoubleCoverUDF, for extracting the zero level-set from unsigned distance fields (UDFs). DoubleCoverUDF takes a learned UDF and a user-specified parameter $r$ (a small positive real number) as input and extracts an iso-surface with an iso-value $r$ using the conventional marching cubes algorithm. We show that the computed iso-surface is the boundary of the $r$-offset volume of the target zero level-set $S$, which is an orientable manifold, regardless of the topology of $S$. Next, the algorithm computes a covering map to project the boundary mesh onto $S$, preserving the mesh's topology and avoiding folding. If $S$ is an orientable manifold surface, our algorithm separates the double-layered mesh into a single layer using a robust minimum-cut post-processing step. Otherwise, it keeps the double-layered mesh as the output. We validate our algorithm by reconstructing 3D surfaces of open models and demonstrate its efficacy and effectiveness on synthetic models and benchmark datasets. Our experimental results confirm that our method is robust and produces meshes with better quality in terms of both visual evaluation and quantitative measures than existing UDF-based methods. The source code is available at https://github.com/jjjkkyz/DCUDF.
Submitted 10 January, 2024; v1 submitted 5 October, 2023;
originally announced October 2023.
-
Photoconductive Effects in Single Crystals of BaZrS$_3$
Authors:
Boyang Zhao,
Huandong Chen,
Ragib Ahsan,
Fei Hou,
Eric R Hoglund,
Shantanu Singh,
Huan Zhao,
Han Htoon,
Andrey Krayev,
Maruda Shanmugasundaram,
Patrick E Hopkins,
Jan Seidel,
Rehan Kapadia,
Jayakanth Ravichandran
Abstract:
Chalcogenide perovskites, such as BaZrS$_3$, are emerging semiconductors with potential for high photovoltaic power conversion efficiency. The role of defects in the efficiency of the generation and collection of photo-excited carriers has not been experimentally investigated extensively. We study the effect of processing-induced defects on the photoconductive properties of single crystals of BaZrS$_3$. We achieved ohmic contacts to single crystals of BaZrS$_3$ and observed positive surface photovoltage, which is typically observed in p-type semiconductors. However, mechanical polishing of BaZrS$_3$ to remove the surface oxide leads to dense deformation grain boundaries and leads to trap-dominated photoconductive response. In comparison, ohmic contacts achieved in cleaved crystals leave fewer deformation defects and greatly improve optoelectronic properties. Defect-controlled crystal growth and contact fabrication are potentially limiting factors for achieving high photon-to-excited electron conversion efficiency in BaZrS$_3$.
Submitted 4 October, 2023;
originally announced October 2023.
-
Almost global solutions of 1D nonlinear Klein-Gordon equations with small weakly decaying initial data
Authors:
Fei Hou,
Fei Tao,
Huicheng Yin
Abstract:
It has been known that if the initial data decay sufficiently fast at space infinity, then 1D Klein-Gordon equations with quadratic nonlinearity admit classical solutions up to time $e^{C/ε^2}$ while $e^{C/ε^2}$ is also the upper bound of the lifespan, where $C>0$ is some suitable constant and $ε>0$ is the size of the initial data. In this paper, we will focus on the 1D nonlinear Klein-Gordon equations with weakly decaying initial data. It is shown that if the $H^s$-Sobolev norm with $(1+|x|)^{1/2+}$ weight of the initial data is small, then the almost global solutions exist; if the initial $H^s$-Sobolev norm with $(1+|x|)^{1/2}$ weight is small, then for any $M>0$, the solutions exist on $[0,ε^{-M}]$. Our proof is based on the dispersive estimate with a suitable $Z$-norm and a delicate analysis on the phase function.
Submitted 28 September, 2023;
originally announced September 2023.
-
A Novel Self-training Approach for Low-resource Speech Recognition
Authors:
Satwinder Singh,
Feng Hou,
Ruili Wang
Abstract:
In this paper, we propose a self-training approach for automatic speech recognition (ASR) for low-resource settings. While self-training approaches have been extensively developed and evaluated for high-resource languages such as English, their applications to low-resource languages like Punjabi have been limited, despite the language being spoken by millions globally. The scarcity of annotated data has hindered the development of accurate ASR systems, especially for low-resource languages (e.g., Punjabi and Māori languages). To address this issue, we propose an effective self-training approach that generates highly accurate pseudo-labels for unlabeled low-resource speech. Our experimental analysis demonstrates that our approach significantly improves word error rate, achieving a relative improvement of 14.94% compared to a baseline model across four real speech datasets. Further, our proposed approach reports the best results on the Common Voice Punjabi dataset.
Submitted 9 August, 2023;
originally announced August 2023.
-
Improved Real-time Image Smoothing with Weak Structures Preserved and High-contrast Details Removed
Authors:
Shengchun Wang,
Wencheng Wang,
Fei Hou
Abstract:
Image smoothing works by reducing pixel-wise gradients to smooth out details. Because existing methods rely on gradients to determine how to smooth, it is difficult to distinguish structures from details and handle them differently, since the gradient ranges of structures and details overlap. Thus, it is still challenging to achieve high-quality results, especially in preserving weak structures and removing high-contrast details. In this paper, we address this challenge by improving the real-time optimization-based method via iterative least squares (called ILS). We observe that 1) ILS uses gradients as the independent variable in its penalty function for determining how to smooth, and 2) the framework of ILS can still work for image smoothing when we use some values instead of gradients in the penalty function. Thus, depending on whether pixels lie on structures or not, we compute some values to use in the penalty function to determine how to smooth, and so we can handle structures and details differently, no matter whether their gradients are high or low. As a result, we can conveniently remove high-contrast details while preserving weak structures. Moreover, such values can be adjusted to accelerate the optimization, so that we can use fewer iterations than the original ILS method for efficiency. This also reduces the changes to structures, which helps structure preservation. Experimental results show our advantages over existing methods in efficiency and quality.
Submitted 12 July, 2023;
originally announced July 2023.
-
Intelligent Reflecting Surface Empowered Self-Interference Cancellation in Full-Duplex Systems
Authors:
Chi Qiu,
Meng Hua,
Qingqing Wu,
Wen Chen,
Shaodan Ma,
Fen Hou,
Derrick Wing Kwan Ng,
A. Lee Swindlehurst
Abstract:
Compared with traditional half-duplex wireless systems, the application of emerging full-duplex (FD) technology can potentially double the system capacity theoretically. However, conventional techniques for suppressing self-interference (SI) adopted in FD systems require exceedingly high power consumption and expensive hardware. In this paper, we consider employing an intelligent reflecting surface (IRS) in the proximity of an FD base station (BS) to mitigate SI for simultaneously receiving data from uplink users and transmitting information to downlink users. The objective considered is to maximize the weighted sum-rate of the system by jointly optimizing the IRS phase shifts, the BS transmit beamformers, and the transmit power of the uplink users. To visualize the role of the IRS in SI cancellation by isolating other interference, we first study a simple scenario with one downlink user and one uplink user. To address the formulated non-convex problem, a low-complexity algorithm based on successive convex approximation is proposed. For the more general case considering multiple downlink and uplink users, an efficient alternating optimization algorithm based on element-wise optimization is proposed. Numerical results demonstrate that the FD system with the proposed schemes can achieve a larger gain over the half-duplex system, and the IRS is able to achieve a balance between suppressing SI and providing beamforming gain.
Submitted 24 June, 2023;
originally announced June 2023.
-
Ferromagnetic Superconductivity in Two-dimensional Niobium Diselenide
Authors:
Tingyu Qu,
Shangjian Jin,
Fuchen Hou,
Deyi Fu,
Junye Huang,
Darryl Foo Chuan Wei,
Xiao Chang,
Kenji Watanabe,
Takashi Taniguchi,
Junhao Lin,
Shaffique Adam,
Barbaros Özyilmaz
Abstract:
The co-existence of ferromagnetism and superconductivity becomes possible through unconventional pairing in the superconducting state. Such materials are exceedingly rare in solid-state systems but are promising platforms to explore topological phases, such as Majorana bound states. Theoretical investigations date back to the late 1950s, but only a few systems have so far been experimentally identified as potential hosts. Here, we show that atomically-thin niobium diselenide (NbSe$_2$) intercalated with dilute cobalt atoms spontaneously displays ferromagnetism below the superconducting transition temperature ($T_c$). We elucidate the origin of this phase by constructing a magnetic tunnel junction that consists of cobalt and cobalt-doped niobium diselenide (Co-NbSe$_2$) as the two ferromagnetic electrodes, with ultra-thin boron nitride as the tunnelling barrier. At a temperature well below $T_c$, the tunnelling magnetoresistance shows a bistable state, suggesting a ferromagnetic order in Co-NbSe$_2$. We propose an RKKY exchange coupling mechanism based on the spin-triplet superconducting order parameter to mediate such ferromagnetism. We further perform non-local lateral spin valve measurements to confirm the origin of the ferromagnetism. The observation of Hanle precession signals shows spin diffusion lengths up to micrometres below $T_c$, demonstrating an intrinsic spin-triplet nature in superconducting NbSe$_2$. Our discovery of superconductivity-mediated ferromagnetism opens the door to an alternative design of ferromagnetic superconductors.
Submitted 11 June, 2023;
originally announced June 2023.
-
How to Design Translation Prompts for ChatGPT: An Empirical Study
Authors:
Yuan Gao,
Ruili Wang,
Feng Hou
Abstract:
The recently released ChatGPT has demonstrated surprising abilities in natural language understanding and natural language generation. Machine translation relies heavily on the abilities of language understanding and generation. Thus, in this paper, we explore how to assist machine translation with ChatGPT. We adopt several translation prompts on a wide range of translations. Our experimental results show that ChatGPT with designed translation prompts can achieve comparable or better performance than commercial translation systems for high-resource language translations. We further evaluate the translation quality using multiple references, and ChatGPT achieves superior performance compared to commercial systems. We also conduct experiments on domain-specific translations; the final results show that ChatGPT is able to comprehend the provided domain keyword and adjust accordingly to output proper translations. Finally, we use few-shot prompts, which show consistent improvement across different base prompts. Our work provides empirical evidence that ChatGPT still has great potential in translations.
Submitted 21 April, 2023; v1 submitted 4 April, 2023;
originally announced April 2023.
-
2S-UDF: A Novel Two-stage UDF Learning Method for Robust Non-watertight Model Reconstruction from Multi-view Images
Authors:
Junkai Deng,
Fei Hou,
Xuhui Chen,
Wencheng Wang,
Ying He
Abstract:
Recently, building on the foundation of neural radiance field, various techniques have emerged to learn unsigned distance fields (UDF) to reconstruct 3D non-watertight models from multi-view images. Yet, a central challenge in UDF-based volume rendering is formulating a proper way to convert unsigned distance values into volume density, ensuring that the resulting weight function remains unbiased and sensitive to occlusions. Falling short on these requirements often results in incorrect topology or large reconstruction errors in resulting models. This paper addresses this challenge by presenting a novel two-stage algorithm, 2S-UDF, for learning a high-quality UDF from multi-view images. Initially, the method applies an easily trainable density function that, while slightly biased and transparent, aids in coarse reconstruction. The subsequent stage then refines the geometry and appearance of the object to achieve a high-quality reconstruction by directly adjusting the weight function used in volume rendering to ensure that it is unbiased and occlusion-aware. Decoupling density and weight in two stages makes our training stable and robust, distinguishing our technique from existing UDF learning approaches. Evaluations on the DeepFashion3D, DTU, and BlendedMVS datasets validate the robustness and effectiveness of our proposed approach. In both quantitative metrics and visual quality, the results indicate our superior performance over other UDF learning techniques in reconstructing 3D non-watertight models from multi-view images. Our code is available at https://bitbucket.org/jkdeng/2sudf/.
Submitted 16 April, 2024; v1 submitted 27 March, 2023;
originally announced March 2023.
-
The partial null conditions and global smooth solutions of the nonlinear wave equations on $\mathbb{R}^d\times\mathbb{T}$ with $d=2,3$
Authors:
Fei Hou,
Fei Tao,
Huicheng Yin
Abstract:
In this paper, we investigate the fully nonlinear wave equations on the product space $\mathbb{R}^3\times\mathbb{T}$ with quadratic nonlinearities and on $\mathbb{R}^2\times\mathbb{T}$ with cubic nonlinearities, respectively. It is shown that for small initial data satisfying some space-decay rates at infinity, these nonlinear equations admit global smooth solutions when the corresponding partial null conditions hold, and almost global smooth solutions when the partial null conditions are violated. Our proof relies on the Fourier mode decomposition of the solutions with respect to the periodic direction, the efficient combinations of time-decay estimates for the solutions to the linear wave equations and the linear Klein-Gordon equations, and the global weighted energy estimates. In addition, an interesting auxiliary energy is introduced. As a byproduct, our results can be applied to the 4D irrotational compressible Euler equations of polytropic gases or Chaplygin gases on $\mathbb{R}^3\times\mathbb{T}$, the 3D relativistic membrane equation and the 3D nonlinear membrane equation on $\mathbb{R}^2\times\mathbb{T}$.
Submitted 20 February, 2023;
originally announced February 2023.
-
Long time solutions of quasilinear Klein-Gordon equations with small weakly decaying initial data
Authors:
Fei Hou,
Huicheng Yin
Abstract:
It is well known that for the quasilinear Klein-Gordon equation with quadratic nonlinearity and sufficiently decaying small initial data, there exists a global smooth solution if the space dimensions $d\geq2$. When the initial data are of size $\varepsilon>0$ in the Sobolev space, for the semilinear Klein-Gordon equation satisfying the null condition, the authors in the article (J.-M. Delort, Daoyuan Fang, Almost global existence for solutions of semilinear Klein-Gordon equations with small weakly decaying Cauchy data, Comm. Partial Differential Equations 25 (2000), no. 11-12, 2119--2169) prove that the solution exists in time $[0,T_\varepsilon)$ with $T_\varepsilon\ge Ce^{C\varepsilon^{-μ}}$ ($μ=1$ if $d\ge3$, $μ=2/3$ if $d=2$). In the present paper, we will focus on the general quasilinear Klein-Gordon equation without the null condition and further show that the existence time of the solution can be improved to $T_\varepsilon=+\infty$ if $d\geq3$ and $T_\varepsilon\ge e^{C\varepsilon^{-2}}$ if $d=2$. In addition, for $d=2$ and any fixed number $α>0$, if the weighted $L^2$ norm of the initial data with the weight $(1+|x|)^α$ is small, then the solution exists globally and scatters to a free solution. The arguments are based on the introduction of a good unknown, the Strichartz estimate, the weighted $L^2$-norm estimate and the resonance analysis.
Submitted 20 February, 2023;
originally announced February 2023.
-
Learning to Learn Domain-invariant Parameters for Domain Generalization
Authors:
Feng Hou,
Yao Zhang,
Yang Liu,
Jin Yuan,
Cheng Zhong,
Yang Zhang,
Zhongchao Shi,
Jianping Fan,
Zhiqiang He
Abstract:
Due to domain shift, deep neural networks (DNNs) usually fail to generalize well on unknown test data in practice. Domain generalization (DG) aims to overcome this issue by capturing domain-invariant representations from source domains. Motivated by the insight that only partial parameters of DNNs are optimized to extract domain-invariant representations, we expect a general model that is capable of well perceiving and emphatically updating such domain-invariant parameters. In this paper, we propose two modules of Domain Decoupling and Combination (DDC) and Domain-invariance-guided Backpropagation (DIGB), which can encourage such general model to focus on the parameters that have a unified optimization direction between pairs of contrastive samples. Our extensive experiments on two benchmarks have demonstrated that our proposed method has achieved state-of-the-art performance with strong generalization capability.
Submitted 4 November, 2022;
originally announced November 2022.
-
Optimal Power Allocation for HARQ Schemes over Time-Correlated Nakagami-m Fading Channels
Authors:
Zheng Shi,
Shaodan Ma,
Fen Hou,
Kam-Weng Tam,
Yik-Chung Wu
Abstract:
This paper investigates the problem of power allocation for hybrid automatic repeat request (HARQ) schemes over time-correlated Nakagami-m fading channels under outage constraint. The presence of time correlation complicates the power allocation problem due to the involvement of multiple correlated fading channels. Under a general time-correlated Nakagami-m fading channel with exponential correlation, outage probabilities for three widely adopted HARQ schemes, including Type I HARQ, HARQ with chase combining (HARQ-CC) and HARQ with incremental redundancy (HARQ-IR), are first derived. With these results, power allocation schemes are proposed to minimize the average total transmission power with guaranteed outage performance. Simulation results demonstrate the accuracy of our outage analysis and the effectiveness of our proposed power allocation schemes. It is shown that our proposed power allocation schemes can achieve significant power savings when compared with fixed power allocation. Moreover, under practical low outage constraint, the power efficiency is further improved when the time correlation is reduced and/or the fading order is increased.
Submitted 24 September, 2022;
originally announced September 2022.
-
Zero-Forcing Based Downlink Virtual MIMO-NOMA Communications in IoT Networks
Authors:
Zheng Shi,
Hong Wang,
Yaru Fu,
Guanghua Yang,
Shaodan Ma,
Fen Hou,
Theodoros A. Tsiftsis
Abstract:
To support massive connectivity and boost spectral efficiency for the internet of things (IoT), a downlink scheme combining virtual multiple-input multiple-output (MIMO) and nonorthogonal multiple access (NOMA) is proposed. All the single-antenna IoT devices in each cluster cooperate with each other to establish a virtual MIMO entity, and multiple independent data streams are requested by each cluster. NOMA is employed to superimpose all the requested data streams, and each cluster leverages zero-forcing detection to de-multiplex the input data streams. Only statistical channel state information (CSI) is available at the base station, to avoid wasting energy and bandwidth on frequent CSI estimations. The outage probability and goodput of the virtual MIMO-NOMA system are thoroughly investigated by considering the Kronecker model, which embraces both the transmit and receive correlations. Furthermore, the asymptotic results facilitate not only the exploration of physical insights but also the goodput maximization. In particular, the asymptotic outage expressions quantify the impacts of various system parameters and enable the investigation of the diversity-multiplexing tradeoff (DMT). Moreover, power allocation coefficients and/or transmission rates can be properly chosen to achieve the maximal goodput. By virtue of the Karush-Kuhn-Tucker conditions, the goodput maximization problems can be solved in closed form, with which the joint power and rate selection is realized by alternating iterative optimization. Besides, the optimization algorithms tend to allocate more power to clusters under unfavorable channel conditions and support clusters with higher transmission rates under benign channel conditions.
Submitted 22 September, 2022;
originally announced September 2022.
-
Iterative Poisson Surface Reconstruction (iPSR) for Unoriented Points
Authors:
Fei Hou,
Chiyu Wang,
Wencheng Wang,
Hong Qin,
Chen Qian,
Ying He
Abstract:
Poisson surface reconstruction (PSR) remains a popular technique for reconstructing watertight surfaces from 3D point samples thanks to its efficiency, simplicity, and robustness. Yet, the existing PSR method and subsequent variants work only for oriented points. This paper intends to validate that an improved PSR, called iPSR, can completely eliminate the requirement of point normals and proceed in an iterative manner. In each iteration, iPSR takes as input point samples with normals directly computed from the surface obtained in the preceding iteration, and then generates a new surface with better quality. Extensive quantitative evaluation confirms that the new iPSR algorithm converges in 5-30 iterations even with randomly initialized normals. If initialized with a simple visibility based heuristic, iPSR can further reduce the number of iterations. We conduct comprehensive comparisons with PSR and other powerful implicit-function based methods. Finally, we confirm iPSR's effectiveness and scalability on the AIM@SHAPE dataset and challenging (indoor and outdoor) scenes. Code and data for this paper are at https://github.com/houfei0801/ipsr.
Submitted 20 September, 2022;
originally announced September 2022.
-
Conditions for none to be whipped by `Rank and Yank' under the majority rule
Authors:
Fujun Hou
Abstract:
`Rank and Yank' is practiced in many organizations. This paper is concerned with the conditions for none to be whipped by `Rank and Yank' when the evaluation data under each criterion are assumed to be ordinal rankings and the majority rule is used. Two sufficient conditions are set forth, of which the first one formulates the alternatives indifference definition in terms of the election matrix, while the second one specifies a certain balance in the probabilities of alternatives being ranked at positions. In a sense, `none to be whipped' means that the organization is stable. Thus the second sufficient condition indicates an intrinsic relation between balance and organization stability. In addition, directions for future research are put forward.
Submitted 9 August, 2022;
originally announced August 2022.
-
Hierarchical Vectorization for Portrait Images
Authors:
Qian Fu,
Linlin Liu,
Fei Hou,
Ying He
Abstract:
Aiming at developing intuitive and easy-to-use portrait editing tools, we propose a novel vectorization method that can automatically convert raster images into a 3-tier hierarchical representation. The base layer consists of a set of sparse diffusion curves (DC) which characterize salient geometric features and low-frequency colors and provide means for semantic color transfer and facial expression editing. The middle level encodes specular highlights and shadows to large and editable Poisson regions (PR) and allows the user to directly adjust illumination via tuning the strength and/or changing shape of PR. The top level contains two types of pixel-sized PRs for high-frequency residuals and fine details such as pimples and pigmentation. We also train a deep generative model that can produce high-frequency residuals automatically. Thanks to the meaningful organization of vector primitives, editing portraits becomes easy and intuitive. In particular, our method supports color transfer, facial expression editing, highlight and shadow editing and automatic retouching. Thanks to the linearity of the Laplace operator, we introduce alpha blending, linear dodge and linear burn to vector editing and show that they are effective in editing highlights and shadows. To quantitatively evaluate the results, we extend the commonly used FLIP metric (which measures differences between two images) by considering illumination. The new metric, called illumination-sensitive FLIP or IS-FLIP, can effectively capture the salient changes in color transfer results, and is more consistent with human perception than FLIP and other quality measures on portrait images. We evaluate our method on the FFHQR dataset and show that our method is effective for common portrait editing tasks, such as retouching, light editing, color transfer and expression editing. We will make the code and trained models publicly available.
Submitted 24 May, 2022;
originally announced May 2022.
-
Conditions for Social Preference Transitivity When Cycle Involved and A $\hat{O}\mbox{-}\hat{I}$ Framework
Authors:
Fujun Hou
Abstract:
We present some conditions for social preference transitivity under the majority rule when the individual preferences include cycles. First, our concern is with the restriction on the preference orderings of individuals except those (called cycle members) whose preferences constitute the cycles, but the considered transitivity is, of course, of the society as a whole. In our discussion, the individual preferences are assumed concerned and the cycle members' preferences are assumed to be strict orderings. In particular, for an alternative triple when one cycle is involved and the society is sufficiently large (at least 5 individuals in the society), we present a sufficient condition for social transitivity; when two antagonistic cycles are involved and the society has at least 9 individuals, necessary and sufficient conditions are presented which are restricted merely to the preferences of those individuals other than the cycle members. Based on the work due to Slutsky (1977) and Gaertner \& Heinecke (1978), we then outline a conceptual $\hat{O}\mbox{-}\hat{I}$ framework of social transitivity in an axiomatic manner. Connections between some already identified conditions and the $\hat{O}\mbox{-}\hat{I}$ framework are examined.
Submitted 31 May, 2022; v1 submitted 17 May, 2022;
originally announced May 2022.
-
Reformulating the Value Restriction and the Not-Strict Value Restriction in Terms of Possibility Preference Map
Authors:
Fujun Hou
Abstract:
In social choice theory, Sen's value restriction and Pattanaik's not-strict value restriction are both attractive conditions for testing social preference transitivity and/or non-empty social choice set existence. This article introduces a novel mathematical representation tool, called possibility preference map (PPM), for weak orderings, and then reformulates the value restriction and the not-strict value restriction in terms of PPM. The reformulations all appear elegant since they take the form of minmax.
Submitted 15 May, 2022;
originally announced May 2022.
-
Improved Meta Learning for Low Resource Speech Recognition
Authors:
Satwinder Singh,
Ruili Wang,
Feng Hou
Abstract:
We propose a new meta learning based framework for low resource speech recognition that improves the previous model agnostic meta learning (MAML) approach. The MAML is a simple yet powerful meta learning approach. However, the MAML presents some core deficiencies such as training instabilities and slower convergence speed. To address these issues, we adopt multi-step loss (MSL). The MSL aims to calculate losses at every step of the inner loop of MAML and then combines them with a weighted importance vector. The importance vector ensures that the loss at the last step has more importance than the previous steps. Our empirical evaluation shows that MSL significantly improves the stability of the training procedure and it thus also improves the accuracy of the overall system. Our proposed system outperforms MAML based low resource ASR system on various languages in terms of character error rates and stable training behavior.
Submitted 11 May, 2022;
originally announced May 2022.
-
A counter example to the theorems of social preference transitivity and social choice set existence under the majority rule
Authors:
Fujun Hou
Abstract:
I present an example in which the individuals' preferences are strict orderings, and under the majority rule, a transitive social ordering can be obtained and thus a non-empty choice set can also be obtained. However, the individuals' preferences in that example do not satisfy any of the conditions (restrictions) of which at least one is required by Inada (1969) for social preference transitivity under the majority rule. Moreover, the considered individuals' preferences satisfy none of the conditions of value restriction (VR), extremal restriction (ER) or limited agreement (LA), some of which are required by Sen and Pattanaik (1969) for the existence of a non-empty social choice set. Therefore, the example is an exception to a number of theorems of social preference transitivity and social choice set existence under the majority rule. This observation indicates that the collection of the conditions listed by Inada (1969) is not as complete as might be supposed. This is also the case for the collection of conditions VR, ER and LA considered by Sen and Pattanaik (1969). This observation is a challenge to some necessary conditions in current social choice theory. In addition to seeking new conditions, one possible way to deal with this challenge may be, from a theoretical perspective, to represent the identified conditions (such as VR, ER and LA) in terms of a common mathematical tool, after which further conditions may be found.
△ Less
Submitted 5 May, 2022; v1 submitted 4 May, 2022;
originally announced May 2022.
-
Describing Sen's Transitivity Condition in Inequalities and Equations
Authors:
Fujun Hou
Abstract:
In social choice theory, Sen's value restriction condition is a sufficient condition imposed on individuals' ordinal preferences so as to obtain a transitive social preference under the majority decision rule. In this article, Sen's transitivity condition is described using inequalities and equations. First, for a triple of alternatives, an individual's preference is represented by a preference map, whose entries are sets containing the ranking position or positions derived from the individual's preference over that triple. Second, by using the union operation of sets and the cardinality concept, Sen's transitivity condition is described by inequalities. Finally, by using the membership function of sets, Sen's transitivity condition is further described by equations.
Submitted 7 April, 2022;
originally announced April 2022.
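The idea of describing value restriction through sets of ranking positions and their cardinality can be sketched as follows. This is a minimal illustration under stated assumptions: preferences are strict orderings (so every voter is "concerned" about every triple), positions are numbered 1 to 3 within a triple, and all names are hypothetical. It is not the paper's exact formulation, which also covers indifference.

```python
from itertools import combinations

def positions_in_triple(ranking, triple):
    """Rank (1 = best, 3 = worst) of each alternative of `triple` in one strict ranking."""
    ordered = [a for a in ranking if a in triple]
    return {a: i + 1 for i, a in enumerate(ordered)}

def value_restricted(profile, triple):
    """True if some alternative never takes one of the three positions.

    For each alternative we take the union of the positions it occupies
    across voters; value restriction holds when one union has at most 2 elements.
    """
    seen = {a: set() for a in triple}
    for ranking in profile:
        for a, pos in positions_in_triple(ranking, triple).items():
            seen[a].add(pos)
    return any(len(s) <= 2 for s in seen.values())

def satisfies_vr(profile):
    """Value restriction must hold on every triple of alternatives."""
    return all(value_restricted(profile, t) for t in combinations(profile[0], 3))
```

For instance, the cyclic profile `[("a","b","c"), ("b","c","a"), ("c","a","b")]` places every alternative in every position, so the check fails, whereas any profile in which one alternative is never ranked last passes.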
-
Self-Supervised Graph Neural Network for Multi-Source Domain Adaptation
Authors:
Jin Yuan,
Feng Hou,
Yangzhou Du,
Zhongchao Shi,
Xin Geng,
Jianping Fan,
Yong Rui
Abstract:
Domain adaptation (DA) tackles scenarios in which the test data do not fully follow the distribution of the training data, and multi-source domain adaptation (MSDA) is very attractive for real-world applications. By learning from large-scale unlabeled samples, self-supervised learning has become a new trend in deep learning. It is worth noting that self-supervised learning and multi-source domain adaptation share a similar goal: both aim to leverage unlabeled data to learn more expressive representations. Unfortunately, traditional multi-task self-supervised learning faces two challenges: (1) the pretext task may not strongly relate to the downstream task, so it can be difficult to transfer useful knowledge from the pretext task to the target task; (2) when the same feature extractor is shared between the pretext task and the downstream one and only different prediction heads are used, inter-task information exchange and knowledge sharing are ineffective. To address these issues, we propose a novel Self-Supervised Graph Neural Network (SSG), where a graph neural network is used as the bridge to enable more effective inter-task information exchange and knowledge sharing. A more expressive representation is learned by adopting a mask token strategy to mask some domain information. Our extensive experiments demonstrate that the proposed SSG method achieves state-of-the-art results on four multi-source domain adaptation datasets, showing its effectiveness from different aspects.
Submitted 15 January, 2024; v1 submitted 7 April, 2022;
originally announced April 2022.
-
Flexible Portrait Image Editing with Fine-Grained Control
Authors:
Linlin Liu,
Qian Fu,
Fei Hou,
Ying He
Abstract:
We develop a new method for portrait image editing, which supports fine-grained editing of geometries, colors, lights and shadows using a single neural network model. We adopt a novel asymmetric conditional GAN architecture: the generators take the transformed conditional inputs, such as edge maps, color palette, sliders and masks, that can be directly edited by the user; the discriminators take the conditional inputs in the way that can guide controllable image generation more effectively. Taking color editing as an example, we feed color palettes (which can be edited easily) into the generator, and color maps (which contain positional information of colors) into the discriminator. We also design a region-weighted discriminator so that higher weights are assigned to more important regions, like eyes and skin. Using a color palette, the user can directly specify the desired colors of hair, skin, eyes, lip and background. Color sliders allow the user to blend colors in an intuitive manner. The user can also edit lights and shadows by modifying the corresponding masks. We demonstrate the effectiveness of our method by evaluating it on the CelebAMask-HQ dataset with a wide range of tasks, including geometry/color/shadow/light editing, hand-drawn sketch to image translation, and color transfer. We also present ablation studies to justify our design.
Submitted 4 April, 2022;
originally announced April 2022.
-
A Systematic Literature Review on Trust in the Software Ecosystem
Authors:
Fang Hou,
Slinger Jansen
Abstract:
We conduct a systematic literature review on the concept of trust in the worldwide software ecosystem. We acknowledge that trust is something between two actors in the software ecosystem, and we examine what role trust plays in the relationships between end-users and (1) software products, (2) package managers, (3) software producing organizations, and (4) software engineers. Two major findings emerged from the systematic literature review. First, we provide a definition of trust in the software ecosystem, including a theoretical framework that decomposes and signifies a theoretical understanding of trust. Second, we provide a list of trust factors that can be used to assemble an overview of software trust.
Submitted 13 February, 2022;
originally announced March 2022.
-
A Standardized Pipeline for Colon Nuclei Identification and Counting Challenge
Authors:
Jijun Cheng,
Xipeng Pan,
Feihu Hou,
Bingchao Zhao,
Jiatai Lin,
Zhenbing Liu,
Zaiyi Liu,
Chu Han
Abstract:
Nuclear segmentation and classification is an essential step for computational pathology. TIA lab from Warwick University organized a nuclear segmentation and classification challenge (CoNIC) for H&E stained histopathology images in colorectal cancer with two highly correlated tasks: a nuclei segmentation and classification task and a cellular composition task. There are a few obstacles we have to address in this challenge: 1) limited training samples, 2) color variation, 3) imbalanced annotations, 4) similar morphological appearance among classes. To deal with these challenges, we proposed a standardized pipeline for nuclear segmentation and classification by integrating several pluggable components. First, we built a GAN-based model to automatically generate pseudo images for data augmentation. Then we trained a self-supervised stain normalization model to solve the color variation problem. Next we constructed a baseline model HoVer-Net with cost-sensitive loss to encourage the model to pay more attention to the minority classes. According to the results of the leaderboard, our proposed pipeline achieves 0.40665 mPQ+ (Rank 49th) and 0.62199 r2 (Rank 10th) in the preliminary test phase.
Submitted 20 March, 2022; v1 submitted 28 February, 2022;
originally announced March 2022.
-
Quantum microscopy with van der Waals heterostructures
Authors:
A. J. Healey,
S. C. Scholten,
T. Yang,
J. A. Scott,
G. J. Abrahams,
I. O. Robertson,
X. F. Hou,
Y. F. Guo,
S. Rahman,
Y. Lu,
M. Kianinia,
I. Aharonovich,
J. -P. Tetienne
Abstract:
Quantum microscopes based on solid-state spin quantum sensors have recently emerged as powerful tools for probing material properties and physical processes in regimes not accessible to classical sensors, especially on the nanoscale. Such microscopes have already found utility in a variety of problems, from imaging magnetism and charge transport in nanoscale devices, to mapping remanent magnetic fields from ancient rocks and biological organisms. However, applications of quantum microscopes have so far relied on sensors hosted in a rigid, three-dimensional crystal, typically diamond, which limits their ability to closely interact with the sample under study. Here we demonstrate a versatile and robust quantum microscope using quantum sensors embedded within a thin layer of a van der Waals (vdW) material, hexagonal boron nitride (hBN). To showcase the capabilities of this platform, we assemble several active vdW heterostructures, with an hBN layer acting as the quantum sensor. We demonstrate time-resolved, simultaneous temperature and magnetic imaging near the Curie temperature of a vdW ferromagnet as well as apply this unique microscope to map out charge currents and Joule heating in graphene. By enabling intimate proximity between sensor and sample, potentially down to a single atomic layer, the hBN quantum sensor represents a paradigm shift for nanoscale quantum sensing and microscopy. Moreover, given the ubiquitous use of hBN in modern materials and condensed matter physics research, we expect our technique to find rapid and broad adoption in these fields, further motivated by the prospect of performing in-situ chemical analysis and noise spectroscopy using advanced quantum sensing protocols.
Submitted 6 December, 2021;
originally announced December 2021.
-
A Survey of Visual Transformers
Authors:
Yang Liu,
Yao Zhang,
Yixin Wang,
Feng Hou,
Jin Yuan,
Jiang Tian,
Yang Zhang,
Zhongchao Shi,
Jianping Fan,
Zhiqiang He
Abstract:
Transformer, an attention-based encoder-decoder model, has already revolutionized the field of natural language processing (NLP). Inspired by such significant achievements, some pioneering works have recently employed Transformer-like architectures in the computer vision (CV) field, demonstrating their effectiveness on three fundamental CV tasks (classification, detection, and segmentation) as well as on multiple sensory data streams (images, point clouds, and vision-language data). Because of their competitive modeling capabilities, visual Transformers have achieved impressive performance improvements over multiple benchmarks as compared with modern Convolutional Neural Networks (CNNs). In this survey, we comprehensively review over one hundred different visual Transformers according to three fundamental CV tasks and different data stream types, and propose a taxonomy to organize the representative methods according to their motivations, structures, and application scenarios. Because of their differences in training settings and dedicated vision tasks, we have also evaluated and compared all these existing visual Transformers under different configurations. Furthermore, we have revealed a series of essential but unexploited aspects that may empower such visual Transformers to stand out from numerous architectures, e.g., slack high-level semantic embeddings to bridge the gap between the visual Transformers and the sequential ones. Finally, three promising research directions are suggested for future investment. We will continue to update the latest articles and their released source code at https://github.com/liuyang-ict/awesome-visual-transformers.
Submitted 6 December, 2022; v1 submitted 11 November, 2021;
originally announced November 2021.
-
Organization and Understanding of a Tactile Information Dataset TacAct During Physical Human-Robot Interactions
Authors:
Peng Wang,
Jixiao Liu,
Funing Hou,
Dicai Chen,
Zihou Xia,
Shijie Guo
Abstract:
Advanced service robots require superior tactile intelligence to guarantee human-contact safety and to provide essential supplements to visual and auditory information for human-robot interaction, especially when a robot is in physical contact with a human. Tactile intelligence is an essential capability of perception and recognition from tactile information, based on the learning from a large amount of tactile data and the understanding of the physical meaning behind the data. This report introduces a recently collected and organized dataset "TacAct" that encloses real-time pressure distribution when a human subject touches the arms of a nursing-care robot. The dataset consists of information from 50 subjects who performed a total of 24,000 touch actions. Furthermore, the details of the dataset are described, the data are preliminarily analyzed, and the validity of the collected information is tested through a convolutional neural network LeNet-5 classifying different types of touch actions. We believe that the TacAct dataset would be more than beneficial for the community of human interactive robots to understand the tactile profile under various circumstances.
Submitted 11 August, 2021; v1 submitted 8 August, 2021;
originally announced August 2021.
-
Trinity: A No-Code AI platform for complex spatial datasets
Authors:
C. V. Krishnakumar Iyer,
Feili Hou,
Henry Wang,
Yonghong Wang,
Kay Oh,
Swetava Ganguli,
Vipul Pandey
Abstract:
We present a no-code Artificial Intelligence (AI) platform called Trinity with the main design goal of enabling both machine learning researchers and non-technical geospatial domain experts to experiment with domain-specific signals and datasets for solving a variety of complex problems on their own. This versatility to solve diverse problems is achieved by transforming complex spatio-temporal datasets to make them consumable by standard deep learning models, in this case Convolutional Neural Networks (CNNs), and by giving the ability to formulate disparate problems in a standard way, e.g., as semantic segmentation. With an intuitive user interface, a feature store that hosts derivatives of complex feature engineering, a deep learning kernel, and a scalable data processing mechanism, Trinity provides a powerful platform for domain experts to share the stage with scientists and engineers in solving business-critical problems. It enables quick prototyping and rapid experimentation, and reduces the time to production by standardizing model building and deployment. In this paper, we present our motivation behind Trinity and its design, along with sample applications that motivate the idea of lowering the bar to using AI.
Submitted 1 July, 2021; v1 submitted 21 June, 2021;
originally announced June 2021.
-
Improving Entity Linking through Semantic Reinforced Entity Embeddings
Authors:
Feng Hou,
Ruili Wang,
Jun He,
Yi Zhou
Abstract:
Entity embeddings, which represent different aspects of each entity with a single vector like word embeddings, are a key component of neural entity linking models. Existing entity embeddings are learned from canonical Wikipedia articles and local contexts surrounding target entities. Such entity embeddings are effective, but too distinctive for linking models to learn contextual commonality. We propose a simple yet effective method, FGS2EE, to inject fine-grained semantic information into entity embeddings to reduce the distinctiveness and facilitate the learning of contextual commonality. FGS2EE first uses the embeddings of semantic type words to generate semantic embeddings, and then combines them with existing entity embeddings through linear aggregation. Extensive experiments show the effectiveness of such embeddings. Based on our entity embeddings, we achieved new state-of-the-art performance on entity linking.
Submitted 15 June, 2021;
originally announced June 2021.
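The two steps named in the abstract above (building a semantic embedding from type-word embeddings, then linearly aggregating it with an existing entity embedding) might look roughly like the sketch below. The abstract does not specify the details, so averaging the type-word vectors, the `weight` parameter and its default value, and all function names are assumptions made only for illustration.

```python
def mean_vector(vectors):
    """Element-wise average of equally sized vectors (assumed way to pool type words)."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]

def aggregate(entity_vec, type_word_vecs, weight=0.5):
    """Linear aggregation: (1 - weight) * entity embedding + weight * semantic embedding.

    `weight` is a hypothetical mixing coefficient, not a value from the paper.
    """
    semantic_vec = mean_vector(type_word_vecs)
    return [(1.0 - weight) * e + weight * s for e, s in zip(entity_vec, semantic_vec)]

# Toy 2-dimensional example with made-up numbers.
combined = aggregate([1.0, 0.0], [[0.0, 1.0], [0.0, 3.0]], weight=0.5)
```

A larger `weight` pulls entities that share semantic types closer together, which is one way to read the abstract's goal of reducing distinctiveness.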
-
Delayed singularity formation for the three dimensional compressible Euler equations with non-zero vorticity
Authors:
Fei Hou,
Huicheng Yin
Abstract:
For the 3D compressible isentropic Euler equations with an initial perturbation of size $\varepsilon$ of a rest state, if the initial vorticity is of size $\delta$ with $0<\delta\le \varepsilon$ and $\varepsilon$ is small, we establish that the lifespan of the smooth solutions is $T_{\delta}=O(\min\{e^{\frac{1}{\varepsilon}},\frac{1}{\delta}\})$ for the polytropic gases, and $T_{\delta}=O(\frac{1}{\delta})$ for the Chaplygin gases. For example, when $\delta=e^{-\frac{1}{\varepsilon^2}}$ is chosen, then $T_{\delta}=O(e^{\frac{1}{\varepsilon}})$ for the polytropic gases and $T_{\delta}=O(e^{\frac{1}{\varepsilon^2}})$ for the Chaplygin gases, although the perturbations of the initial density and the divergence of the initial velocity are only of order $O(\varepsilon)$. Our result illustrates that the time of existence of smooth solutions depends crucially on the size of the vorticity of the initial data, as long as the initial data is sufficiently close to a constant. The main ingredients in the paper are: introducing some suitably weighted energies, deriving the pointwise space-time decay estimates of solutions, looking for the good unknown instead of the velocity, and establishing the required weighted estimates on the vorticity.
Submitted 5 March, 2021;
originally announced March 2021.
-
Long time existence of smooth solutions to 2D compressible Euler equations of Chaplygin gases with non-zero vorticity
Authors:
Fei Hou,
Huicheng Yin
Abstract:
For the 2D compressible isentropic Euler equations of polytropic gases with an initial perturbation of size $\varepsilon$ of a rest state, it has been known that if the initial data are rotationally invariant or irrotational, then the lifespan $T_{\varepsilon}$ of the classical solutions is of order $O(\frac{1}{\varepsilon^2})$; if the initial vorticity is of size $\varepsilon^{1+\alpha}$ ($0\le\alpha\le 1$), then $T_{\varepsilon}$ is of order $O(\frac{1}{\varepsilon^{1+\alpha}})$. In the present paper, for the 2D compressible isentropic Euler equations of Chaplygin gases, if the initial data are a perturbation of size $\varepsilon$, and the initial vorticity is of any size $\delta$ with $0<\delta\le \varepsilon$, we will establish the lifespan $T_{\delta}=O(\frac{1}{\delta})$. For example, if $\delta=e^{-\frac{1}{\varepsilon^2}}$ or $\delta=e^{-e^{\frac{1}{\varepsilon^2}}}$ is chosen, then $T_{\delta}=O(e^{\frac{1}{\varepsilon^2}})$ or $T_{\delta}=O(e^{e^{\frac{1}{\varepsilon^2}}})$, respectively, although the perturbations of the initial density and the divergence of the initial velocity are only of order $O(\varepsilon)$. Our main ingredients are: finding the null condition structures in 2D compressible Euler equations of Chaplygin gases and looking for the good unknown; establishing a new class of weighted space-time $L^\infty$-$L^\infty$ estimates for the solution itself and its gradients of 2D linear wave equations; introducing some suitably weighted energies and taking the $L^p$ $(1<p<\infty)$ estimates on the vorticity.
Submitted 23 February, 2021;
originally announced February 2021.
-
TrustSECO: An Interview Survey into Software Trust
Authors:
Floris Jansen,
Slinger Jansen,
Fang Hou
Abstract:
The software ecosystem is a trust-rich part of the world. Collaboratively, software engineers trust major hubs in the ecosystem, such as package managers, repository services, and programming language ecosystems. This trust, however, is often broken by vulnerabilities, ransomware, and abuse from malignant actors.
But what is trust? In this paper we explore, through twelve in-depth interviews with software engineers, how they perceive trust in their daily work. From the interviews we conclude three things. First, software engineers make a distinction between an adoption factor and a trust factor when selecting a package. Secondly, while in literature mostly technical factors are considered as the main trust factors, the software engineers in this study conclude that organizational factors are more important. Finally, we find that different kinds of software engineers require different views on trust, and that it is impossible to create one unified perception of trust.
Keywords: software ecosystem trust, empirical software engineering, TrustSECO, external software adoption, cross-sectional exploratory interview analysis, trust perception.
Submitted 15 January, 2021;
originally announced January 2021.
-
Semi-supervised Cardiac Image Segmentation via Label Propagation and Style Transfer
Authors:
Yao Zhang,
Jiawei Yang,
Feng Hou,
Yang Liu,
Yixin Wang,
Jiang Tian,
Cheng Zhong,
Yang Zhang,
Zhiqiang He
Abstract:
Accurate segmentation of cardiac structures can help doctors diagnose diseases and improve treatment planning, and is in high demand in clinical practice. However, the shortage of annotations and the variance of the data among different vendors and medical centers restrict the performance of advanced deep learning methods. In this work, we present a fully automatic method to segment cardiac structures, including the left (LV) and right ventricle (RV) blood pools as well as the left ventricular myocardium (MYO), in MRI volumes. Specifically, we design a semi-supervised learning method to leverage unlabelled MRI sequence timeframes by label propagation. Then we exploit style transfer to reduce the variance among different centers and vendors for more robust cardiac image segmentation. We evaluate our method in the M&Ms challenge, ranking 2nd among 14 competitive teams.
Submitted 4 August, 2022; v1 submitted 29 December, 2020;
originally announced December 2020.
-
Modality-Pairing Learning for Brain Tumor Segmentation
Authors:
Yixin Wang,
Yao Zhang,
Feng Hou,
Yang Liu,
Jiang Tian,
Cheng Zhong,
Yang Zhang,
Zhiqiang He
Abstract:
Automatic brain tumor segmentation from multi-modality Magnetic Resonance Images (MRI) using deep learning methods plays an important role in assisting the diagnosis and treatment of brain tumors. However, previous methods mostly ignore the latent relationship among different modalities. In this work, we propose a novel end-to-end Modality-Pairing learning method for brain tumor segmentation. Parallel branches are designed to exploit different modality features, and a series of layer connections are utilized to capture complex relationships and abundant information among modalities. We also use a consistency loss to minimize the prediction variance between the two branches. In addition, a learning rate warmup strategy is adopted to address training instability and early over-fitting. Lastly, we use an average ensemble of multiple models and some post-processing techniques to get the final results. Our method is tested on the BraTS 2020 online testing dataset, obtaining promising segmentation performance, with average Dice scores of 0.891, 0.842, 0.816 for the whole tumor, tumor core and enhancing tumor, respectively. We won second place in the BraTS 2020 Challenge for the tumor segmentation task.
Submitted 28 December, 2020; v1 submitted 19 October, 2020;
originally announced October 2020.
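The Dice scores reported in the abstract above measure the overlap between a predicted and a reference segmentation, 2|P ∩ T| / (|P| + |T|). A minimal sketch follows, representing each mask as a set of voxel indices; the function name and the convention of returning 1.0 when both masks are empty are assumptions for illustration, not details from the paper or the challenge's evaluation code.

```python
def dice_score(pred_voxels, target_voxels):
    """Dice coefficient between two masks given as collections of voxel indices."""
    pred, target = set(pred_voxels), set(target_voxels)
    if not pred and not target:
        # Assumed convention: two empty masks count as a perfect match.
        return 1.0
    return 2 * len(pred & target) / (len(pred) + len(target))

# Toy masks with made-up voxel indices: two of three voxels overlap.
example = dice_score({1, 2, 3}, {2, 3, 4})
```

The score ranges from 0 (no overlap) to 1 (identical masks), so the averages quoted above are directly comparable across the three tumor sub-regions.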
-
Calibration Venus: An Interactive Camera Calibration Method Based on Search Algorithm and Pose Decomposition
Authors:
Wentai Lei,
Mengdi Xu,
Feifei Hou,
Wensi Jiang
Abstract:
In many scenarios where cameras are applied, such as robot positioning and unmanned driving, camera calibration is one of the most important preparatory steps. The interactive calibration method based on a planar board is becoming popular in the camera calibration field due to its repeatability and ease of operation. However, the existing methods select suggestions from a fixed dataset of pre-defined poses based on subjective experience, which leads to a certain degree of one-sidedness. Moreover, they do not give users clear instructions on how to place the board in the specified pose.
Submitted 13 September, 2020;
originally announced September 2020.