
Showing 1–39 of 39 results for author: Yi, D

  1. arXiv:2406.06110

    cs.CL cs.AI

    Recurrent Context Compression: Efficiently Expanding the Context Window of LLM

    Authors: Chensen Huang, Guibo Zhu, Xuepeng Wang, Yifei Luo, Guojing Ge, Haoran Chen, Dong Yi, Jinqiao Wang

    Abstract: To extend the context length of Transformer-based large language models (LLMs) and improve comprehension capabilities, we often face limitations due to computational resources and bounded memory storage capacity. This work introduces a method called Recurrent Context Compression (RCC), designed to efficiently expand the context window length of LLMs within constrained storage space. We also invest…

    Submitted 10 June, 2024; originally announced June 2024.

  2. arXiv:2402.03047

    cs.CV cs.LG

    PFDM: Parser-Free Virtual Try-on via Diffusion Model

    Authors: Yunfang Niu, Dong Yi, Lingxiang Wu, Zhiwei Liu, Pengxiang Cai, Jinqiao Wang

    Abstract: Virtual try-on can significantly improve the garment shopping experiences in both online and in-store scenarios, attracting broad interest in computer vision. However, to achieve high-fidelity try-on performance, most state-of-the-art methods still rely on accurate segmentation masks, which are often produced by near-perfect parsers or manual labeling. To overcome the bottleneck, we propose a pars…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: Accepted by IEEE ICASSP 2024

  3. arXiv:2312.10920

    cs.LG stat.ME

    Domain adaption and physical constrains transfer learning for shale gas production

    Authors: Zhaozhong Yang, Liangjie Gou, Chao Min, Duo Yi, Xiaogang Li, Guoquan Wen

    Abstract: Effective prediction of shale gas production is crucial for strategic reservoir development. However, in new shale gas blocks, two main challenges are encountered: (1) the occurrence of negative transfer due to insufficient data, and (2) the limited interpretability of deep learning (DL) models. To tackle these problems, we propose a novel transfer learning methodology that utilizes domain adaptat…

    Submitted 17 December, 2023; originally announced December 2023.

  4. arXiv:2311.01149

    cs.CL

    ChineseWebText: Large-scale High-quality Chinese Web Text Extracted with Effective Evaluation Model

    Authors: Jianghao Chen, Pu Jian, Tengxiao Xi, Dongyi Yi, Qianlong Du, Chenglin Ding, Guibo Zhu, Chengqing Zong, Jinqiao Wang, Jiajun Zhang

    Abstract: During the development of large language models (LLMs), the scale and quality of the pre-training data play a crucial role in shaping LLMs' capabilities. To accelerate the research of LLMs, several large-scale datasets, such as C4 [1], Pile [2], RefinedWeb [3] and WanJuan [4], have been released to the public. However, most of the released corpus focus mainly on English, and there is still lack of…

    Submitted 10 November, 2023; v1 submitted 2 November, 2023; originally announced November 2023.

  5. arXiv:2306.12108

    cs.CY cs.RO

    Complex accident, clear responsibility

    Authors: Dexin Yi

    Abstract: The problem of allocating accident responsibility for autonomous driving is a difficult issue in the field of autonomous driving. Due to the complexity of autonomous driving technology, most of the research on the responsibility of autonomous driving accidents has remained at the theoretical level. When encountering actual autonomous driving accidents, a proven and fair solution is needed. To addr…

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: 7 pages, 7 figures

  6. arXiv:2211.09027

    cs.LG cs.CV

    LLEDA -- Lifelong Self-Supervised Domain Adaptation

    Authors: Mamatha Thota, Dewei Yi, Georgios Leontidis

    Abstract: Humans and animals have the ability to continuously learn new information over their lifetime without losing previously acquired knowledge. However, artificial neural networks struggle with this due to new information conflicting with old knowledge, resulting in catastrophic forgetting. The complementary learning systems (CLS) theory suggests that the interplay between hippocampus and neocortex sy…

    Submitted 7 August, 2023; v1 submitted 12 November, 2022; originally announced November 2022.

    Comments: 19 pages, 6 figures, 6 tables; V2 added more experiments on more domains and fixed typos

  7. arXiv:2209.04811

    cs.CL

    Probing for Understanding of English Verb Classes and Alternations in Large Pre-trained Language Models

    Authors: David K. Yi, James V. Bruno, Jiayu Han, Peter Zukerman, Shane Steinert-Threlkeld

    Abstract: We investigate the extent to which verb alternation classes, as described by Levin (1993), are encoded in the embeddings of Large Pre-trained Language Models (PLMs) such as BERT, RoBERTa, ELECTRA, and DeBERTa using selectively constructed diagnostic classifiers for word and sentence-level prediction tasks. We follow and expand upon the experiments of Kann et al. (2019), which aim to probe whether…

    Submitted 11 September, 2022; originally announced September 2022.

    Comments: 8 pages, 6 figures

  8. arXiv:2111.00042

    cs.CV

    CvS: Classification via Segmentation For Small Datasets

    Authors: Nooshin Mojab, Philip S. Yu, Joelle A. Hallak, Darvin Yi

    Abstract: Deep learning models have shown promising results in a wide range of computer vision applications across various domains. The success of deep learning methods relies heavily on the availability of a large amount of data. Deep neural networks are prone to overfitting when data is scarce. This problem becomes even more severe for neural network with classification head with access to only a few data…

    Submitted 29 October, 2021; originally announced November 2021.

  9. Desk Organization: Effect of Multimodal Inputs on Spatial Relational Learning

    Authors: Ryan Rowe, Shivam Singhal, Daqing Yi, Tapomayukh Bhattacharjee, Siddhartha S. Srinivasa

    Abstract: For robots to operate in a three dimensional world and interact with humans, learning spatial relationships among objects in the surrounding is necessary. Reasoning about the state of the world requires inputs from many different sensory modalities including vision ($V$) and haptics ($H$). We examine the problem of desk organization: learning how humans spatially position different objects on a pl…

    Submitted 2 August, 2021; originally announced August 2021.

    Comments: 8 pages, 7 figures

    ACM Class: I.2.9

    Journal ref: 2019 28th IEEE International Conference on Robot and Human Interactive Communication (RO-MAN) (pp. 1-8). IEEE

  10. arXiv:2107.14654

    cs.AI

    Brain-Inspired Deep Imitation Learning for Autonomous Driving Systems

    Authors: Hasan Bayarov Ahmedov, Dewei Yi, Jie Sui

    Abstract: Autonomous driving has attracted great attention from both academics and industries. To realise autonomous driving, Deep Imitation Learning (DIL) is treated as one of the most promising solutions, because it improves autonomous driving systems by automatically learning a complex mapping from human driving data, compared to manually designing the driving policy. However, existing DIL methods cannot…

    Submitted 30 July, 2021; originally announced July 2021.

  11. arXiv:2106.03905

    eess.IV cs.CV cs.LG

    AutoPtosis

    Authors: Abdullah Aleem, Manoj Prabhakar Nallabothula, Pete Setabutr, Joelle A. Hallak, Darvin Yi

    Abstract: Blepharoptosis, or ptosis as it is more commonly referred to, is a condition of the eyelid where the upper eyelid droops. The current diagnosis for ptosis involves cumbersome manual measurements that are time-consuming and prone to human error. In this paper, we present AutoPtosis, an artificial intelligence based system with interpretable results for rapid diagnosis of ptosis. We utilize a divers…

    Submitted 9 June, 2021; v1 submitted 7 June, 2021; originally announced June 2021.

  12. arXiv:2104.02609

    eess.IV cs.CV

    I-ODA, Real-World Multi-modal Longitudinal Data for Ophthalmic Applications

    Authors: Nooshin Mojab, Vahid Noroozi, Abdullah Aleem, Manoj P. Nallabothula, Joseph Baker, Dimitri T. Azar, Mark Rosenblatt, RV Paul Chan, Darvin Yi, Philip S. Yu, Joelle A. Hallak

    Abstract: Data from clinical real-world settings is characterized by variability in quality, machine-type, setting, and source. One of the primary goals of medical computer vision is to develop and validate artificial intelligence (AI) based algorithms on real-world data enabling clinical translations. However, despite the exponential growth in AI based applications in healthcare, specifically in ophthalmol…

    Submitted 29 March, 2021; originally announced April 2021.

  13. arXiv:2007.12672

    cs.CV

    Real-World Multi-Domain Data Applications for Generalizations to Clinical Settings

    Authors: Nooshin Mojab, Vahid Noroozi, Darvin Yi, Manoj Prabhakar Nallabothula, Abdullah Aleem, Phillip S. Yu, Joelle A. Hallak

    Abstract: With promising results of machine learning based models in computer vision, applications on medical imaging data have been increasing exponentially. However, generalizations to complex real-world clinical data is a persistent problem. Deep learning models perform well when trained on standardized datasets from artificial settings, such as clinical trials. However, real-world data is different and…

    Submitted 24 July, 2020; originally announced July 2020.

  14. arXiv:2002.09809

    cs.CV

    Random Bundle: Brain Metastases Segmentation Ensembling through Annotation Randomization

    Authors: Darvin Yi, Endre Grøvik, Michael Iv, Elizabeth Tong, Greg Zaharchuk, Daniel Rubin

    Abstract: We introduce a novel ensembling method, Random Bundle (RB), that improves performance for brain metastases segmentation. We create our ensemble by training each network on our dataset with 50% of our annotated lesions censored out. We also apply a lopsided bootstrap loss to recover performance after inducing an in silico 50% false negative rate and make our networks more sensitive. We improve our…

    Submitted 28 April, 2020; v1 submitted 22 February, 2020; originally announced February 2020.

  15. arXiv:2001.09501

    cs.CV eess.IV

    Brain Metastasis Segmentation Network Trained with Robustness to Annotations with Multiple False Negatives

    Authors: Darvin Yi, Endre Grøvik, Michael Iv, Elizabeth Tong, Greg Zaharchuk, Daniel Rubin

    Abstract: Deep learning has proven to be an essential tool for medical image analysis. However, the need for accurately labeled input data, often requiring time- and labor-intensive annotation by experts, is a major limitation to the use of deep learning. One solution to this challenge is to allow for use of coarse or noisy labels, which could permit more efficient and scalable labeling of images. In this w…

    Submitted 26 January, 2020; originally announced January 2020.

  16. arXiv:1912.11966

    eess.IV cs.CV

    Handling Missing MRI Input Data in Deep Learning Segmentation of Brain Metastases: A Multi-Center Study

    Authors: Endre Grøvik, Darvin Yi, Michael Iv, Elizabeth Tong, Line Brennhaug Nilsen, Anna Latysheva, Cathrine Saxhaug, Kari Dolven Jacobsen, Åslaug Helland, Kyrre Eeg Emblem, Daniel Rubin, Greg Zaharchuk

    Abstract: The purpose was to assess the clinical value of a novel DropOut model for detecting and segmenting brain metastases, in which a neural network is trained on four distinct MRI sequences using an input dropout layer, thus simulating the scenario of missing MRI data by training on the full set and all possible subsets of the input data. This retrospective, multi-center study, evaluated 165 patients w…

    Submitted 26 December, 2019; originally announced December 2019.

  17. arXiv:1912.08775

    eess.IV cs.CV

    MRI Pulse Sequence Integration for Deep-Learning Based Brain Metastasis Segmentation

    Authors: Darvin Yi, Endre Grøvik, Michael Iv, Elizabeth Tong, Kyrre Eeg Emblem, Line Brennhaug Nilsen, Cathrine Saxhaug, Anna Latysheva, Kari Dolven Jacobsen, Åslaug Helland, Greg Zaharchuk, Daniel Rubin

    Abstract: Magnetic resonance (MR) imaging is an essential diagnostic tool in clinical medicine. Recently, a variety of deep learning methods have been applied to segmentation tasks in medical images, with promising results for computer-aided diagnosis. For MR images, effectively integrating different pulse sequences is important to optimize performance. However, the best way to integrate different pulse seq…

    Submitted 18 December, 2019; originally announced December 2019.

    Comments: In the IEEE transactions format for submission to IEEE-TMI

  18. Latent Complete Row Space Recovery for Multi-view Subspace Clustering

    Authors: Hong Tao, Chenping Hou, Yuhua Qian, Jubo Zhu, Dongyun Yi

    Abstract: Multi-view subspace clustering has been applied to applications such as image processing and video surveillance, and has attracted increasing attention. Most existing methods learn view-specific self-representation matrices, and construct a combined affinity matrix from multiple views. The affinity construction process is time-consuming, and the combined affinity matrix is not guaranteed to reflec…

    Submitted 16 December, 2019; originally announced December 2019.

  19. arXiv:1905.04457

    cs.CV cs.LG

    Triplet Distillation for Deep Face Recognition

    Authors: Yushu Feng, Huan Wang, Daniel T. Yi, Roland Hu

    Abstract: Convolutional neural networks (CNNs) have achieved a great success in face recognition, which unfortunately comes at the cost of massive computation and storage consumption. Many compact face recognition networks are thus proposed to resolve this problem. Triplet loss is effective to further improve the performance of those compact models. However, it normally employs a fixed margin to all the sam…

    Submitted 19 May, 2019; v1 submitted 11 May, 2019; originally announced May 2019.

    Comments: 5 pages, 2 tables, accepted by ICML 2019 ODML-CDNNR Workshop

  20. arXiv:1904.11595

    cs.CV

    DeepPerimeter: Indoor Boundary Estimation from Posed Monocular Sequences

    Authors: Ameya Phalak, Zhao Chen, Darvin Yi, Khushi Gupta, Vijay Badrinarayanan, Andrew Rabinovich

    Abstract: We present DeepPerimeter, a deep learning based pipeline for inferring a full indoor perimeter (i.e. exterior boundary map) from a sequence of posed RGB images. Our method relies on robust deep methods for depth estimation and wall segmentation to generate an exterior boundary point cloud, and then uses deep unsupervised clustering to fit wall planes to obtain a final boundary map of the room. We…

    Submitted 1 July, 2019; v1 submitted 25 April, 2019; originally announced April 2019.

  21. arXiv:1903.07988

    eess.IV cs.LG stat.ML

    Deep Learning Enables Automatic Detection and Segmentation of Brain Metastases on Multi-Sequence MRI

    Authors: Endre Grøvik, Darvin Yi, Michael Iv, Elisabeth Tong, Daniel L. Rubin, Greg Zaharchuk

    Abstract: Detecting and segmenting brain metastases is a tedious and time-consuming task for many radiologists, particularly with the growing use of multi-sequence 3D imaging. This study demonstrates automated detection and segmentation of brain metastases on multi-sequence MRI using a deep learning approach based on a fully convolution neural network (CNN). In this retrospective study, a total of 156 patie…

    Submitted 18 March, 2019; originally announced March 2019.

  22. Joint Embedding Learning and Low-Rank Approximation: A Framework for Incomplete Multi-view Learning

    Authors: Hong Tao, Chenping Hou, Dongyun Yi, Jubo Zhu, Dewen Hu

    Abstract: In real-world applications, not all instances in multi-view data are fully represented. To deal with incomplete data, Incomplete Multi-view Learning (IML) rises. In this paper, we propose the Joint Embedding Learning and Low-Rank Approximation (JELLA) framework for IML. The JELLA framework approximates the incomplete data by a set of low-rank matrices and learns a full and common embedding by line…

    Submitted 16 December, 2019; v1 submitted 24 December, 2018; originally announced December 2018.

  23. arXiv:1811.11226

    cs.NE cs.CV

    CT organ segmentation using GPU data augmentation, unsupervised labels and IOU loss

    Authors: Blaine Rister, Darvin Yi, Kaushik Shivakumar, Tomomi Nobashi, Daniel L. Rubin

    Abstract: Fully-convolutional neural networks have achieved superior performance in a variety of image segmentation tasks. However, their training requires laborious manual annotation of large datasets, as well as acceleration by parallel processors with high-bandwidth memory, such as GPUs. We show that simple models can achieve competitive accuracy for organ segmentation on CT images when trained with exte…

    Submitted 27 November, 2018; originally announced November 2018.

    Comments: Journal submission pre-print

  24. arXiv:1806.03018

    cs.CV

    Large-scale Bisample Learning on ID Versus Spot Face Recognition

    Authors: Xiangyu Zhu, Hao Liu, Zhen Lei, Hailin Shi, Fan Yang, Dong Yi, Guojun Qi, Stan Z. Li

    Abstract: In real-world face recognition applications, there is a tremendous amount of data with two images for each person. One is an ID photo for face enrollment, and the other is a probe photo captured on spot. Most existing methods are designed for training data with limited breadth (a relatively small number of classes) and sufficient depth (many samples for each class). They would meet great challenge…

    Submitted 13 February, 2019; v1 submitted 8 June, 2018; originally announced June 2018.

    Comments: Accepted by special issue on Deep Learning for Face Analysis. International Journal of Computer Vision (IJCV), 2019

  25. arXiv:1805.07719

    cs.RO cs.CL

    Balancing Shared Autonomy with Human-Robot Communication

    Authors: Rosario Scalise, Yonatan Bisk, Maxwell Forbes, Daqing Yi, Yejin Choi, Siddhartha Srinivasa

    Abstract: Robotic agents that share autonomy with a human should leverage human domain knowledge and account for their preferences when completing a task. This extra knowledge can dramatically improve plan efficiency and user-satisfaction, but these gains are lost if communicating with a robot is taxing and unnatural. In this paper, we show how viewing human-robot language through the lens of shared autonomy…

    Submitted 20 May, 2018; originally announced May 2018.

  26. arXiv:1710.06092

    cs.RO

    Generalizing Informed Sampling for Asymptotically Optimal Sampling-based Kinodynamic Planning via Markov Chain Monte Carlo

    Authors: Daqing Yi, Rohan Thakker, Cole Gulino, Oren Salzman, Siddhartha Srinivasa

    Abstract: Asymptotically-optimal motion planners such as RRT* have been shown to incrementally approximate the shortest path between start and goal states. Once an initial solution is found, their performance can be dramatically improved by restricting subsequent samples to regions of the state space that can potentially improve the current solution. When the motion planning problem lies in a Euclidean spac…

    Submitted 17 October, 2017; originally announced October 2017.

  27. arXiv:1709.05929

    cs.CV cs.LG physics.med-ph

    Institutionally Distributed Deep Learning Networks

    Authors: Ken Chang, Niranjan Balachandar, Carson K Lam, Darvin Yi, James M Brown, Andrew Beers, Bruce R Rosen, Daniel L Rubin, Jayashree Kalpathy-Cramer

    Abstract: Deep learning has become a promising approach for automated medical diagnoses. When medical data samples are limited, collaboration among multiple institutions is necessary to achieve high algorithm performance. However, sharing patient data often has limitations due to technical, legal, or ethical concerns. In such cases, sharing a deep learning model is a more attractive alternative. The best me…

    Submitted 10 September, 2017; originally announced September 2017.

  28. arXiv:1705.06362

    cs.CV

    Optimizing and Visualizing Deep Learning for Benign/Malignant Classification in Breast Tumors

    Authors: Darvin Yi, Rebecca Lynn Sawyer, David Cohn III, Jared Dunnmon, Carson Lam, Xuerong Xiao, Daniel Rubin

    Abstract: Breast cancer has the highest incidence and second highest mortality rate for women in the US. Our study aims to utilize deep learning for benign/malignant classification of mammogram tumors using a subset of cases from the Digital Database of Screening Mammography (DDSM). Though it was a small dataset from the view of Deep Learning (about 1000 patients), we show that currently state of the art ar…

    Submitted 17 May, 2017; originally announced May 2017.

  29. arXiv:1702.05663

    cs.CV

    The Game Imitation: Deep Supervised Convolutional Networks for Quick Video Game AI

    Authors: Zhao Chen, Darvin Yi

    Abstract: We present a vision-only model for gaming AI which uses a late integration deep convolutional network architecture trained in a purely supervised imitation learning context. Although state-of-the-art deep learning models for video game tasks generally rely on more complex methods such as deep-Q learning, we show that a supervised model which requires substantially fewer resources and training time…

    Submitted 18 February, 2017; originally announced February 2017.

    Comments: 11 pages, 12 figures

  30. arXiv:1611.04534

    cs.CV

    3-D Convolutional Neural Networks for Glioblastoma Segmentation

    Authors: Darvin Yi, Mu Zhou, Zhao Chen, Olivier Gevaert

    Abstract: Convolutional Neural Networks (CNN) have emerged as powerful tools for learning discriminative image features. In this paper, we propose a framework of 3-D fully CNN models for Glioblastoma segmentation from multi-modality MRI data. By generalizing CNN models to true 3-D convolutions in learning 3-D tumor MRI data, the proposed approach utilizes a unique network architecture to decouple image pixe…

    Submitted 14 November, 2016; originally announced November 2016.

  31. arXiv:1607.04884

    physics.soc-ph cs.DL

    Modeling the coevolution between citations and coauthorships in scientific papers

    Authors: Zheng Xie, Zonglin Xie, Miao Li, Jianping Li, Dongyun Yi

    Abstract: Collaborations and citations within scientific research grow simultaneously and interact dynamically. Modelling the coevolution between them helps to study many phenomena that can be approached only through combining citation and coauthorship data. A geometric graph for the coevolution is proposed, the mechanism of which synthetically expresses the interactive impacts of authors and papers in a ge…

    Submitted 28 September, 2017; v1 submitted 17 July, 2016; originally announced July 2016.

    Journal ref: Scientometrics (2017) 112: 483-507

  32. arXiv:1604.08891

    physics.soc-ph cs.SI

    Modelling transition phenomena of scientific coauthorship networks

    Authors: Zheng Xie, Enming Dong, Dongyun Yi, Ouyang Zhenzheng, Jianping Li

    Abstract: In a range of scientific coauthorship networks, transitions emerge in degree distributions, correlations between degrees and local clustering coefficients, etc. The existence of those transitions could be regarded as a result of the diversity in collaboration behaviours of scientific fields. A growing geometric hypergraph built on a cluster of concentric circles is proposed to model two specific c…

    Submitted 15 June, 2018; v1 submitted 29 April, 2016; originally announced April 2016.

    Comments: 19 Pages, 8 figures

    Journal ref: Journal of the Association for Information Science & Technology, 69(2):305-317 (2016)

  33. arXiv:1504.05408

    cs.LG

    Effective Discriminative Feature Selection with Non-trivial Solutions

    Authors: Hong Tao, Chenping Hou, Feiping Nie, Yuanyuan Jiao, Dongyun Yi

    Abstract: Feature selection and feature transformation, the two main ways to reduce dimensionality, are often presented separately. In this paper, a feature selection method is proposed by combining the popular transformation based dimensionality reduction method Linear Discriminant Analysis (LDA) and sparsity regularization. We impose row sparsity on the transformation matrix of LDA through ${\ell}_{2,1}$-…

    Submitted 21 April, 2015; originally announced April 2015.

  34. arXiv:1504.02351

    cs.CV cs.LG cs.NE

    When Face Recognition Meets with Deep Learning: an Evaluation of Convolutional Neural Networks for Face Recognition

    Authors: Guosheng Hu, Yongxin Yang, Dong Yi, Josef Kittler, William Christmas, Stan Z. Li, Timothy Hospedales

    Abstract: Deep learning, in particular Convolutional Neural Network (CNN), has achieved promising results in face recognition recently. However, it remains an open question: why CNNs work well and how to design a 'good' architecture. The existing works tend to focus on reporting CNN architectures that work well for face recognition rather than investigate the reason. In this work, we conduct an extensive ev…

    Submitted 9 April, 2015; originally announced April 2015.

    Comments: 7 pages, 4 figures, 7 tables

  35. arXiv:1411.7923

    cs.CV

    Learning Face Representation from Scratch

    Authors: Dong Yi, Zhen Lei, Shengcai Liao, Stan Z. Li

    Abstract: Pushing by big data and deep convolutional neural network (CNN), the performance of face recognition is becoming comparable to human. Using private large scale training datasets, several groups achieve very high performance on LFW, i.e., 97% to 99%. While there are many open source implementations of CNN, none of large scale face dataset is publicly available. The current situation in the field of…

    Submitted 28 November, 2014; originally announced November 2014.

  36. arXiv:1407.4979

    cs.CV cs.LG cs.NE

    Deep Metric Learning for Practical Person Re-Identification

    Authors: Dong Yi, Zhen Lei, Stan Z. Li

    Abstract: Various hand-crafted features and metric learning methods prevail in the field of person re-identification. Compared to these methods, this paper proposes a more general way that can learn a similarity metric from image pixels directly. By using a "siamese" deep neural network, the proposed method can jointly learn the color feature, texture feature and metric in a unified framework. The network h…

    Submitted 18 July, 2014; originally announced July 2014.

  37. arXiv:1406.1247

    cs.CV

    Shared Representation Learning for Heterogeneous Face Recognition

    Authors: Dong Yi, Zhen Lei, Shengcai Liao, Stan Z. Li

    Abstract: After intensive research, heterogenous face recognition is still a challenging problem. The main difficulties are owing to the complex relationship between heterogenous face image spaces. The heterogeneity is always tightly coupled with other variations, which makes the relationship of heterogenous face images highly nonlinear. Many excellent methods have been proposed to model the nonlinear relat…

    Submitted 4 June, 2014; originally announced June 2014.

  38. arXiv:1307.1253

    physics.soc-ph cond-mat.stat-mech cs.SI

    Network robustness of multiplex networks with interlayer degree correlations

    Authors: Byungjoon Min, Su Do Yi, Kyu-Min Lee, K. -I. Goh

    Abstract: We study the robustness properties of multiplex networks consisting of multiple layers of distinct types of links, focusing on the role of correlations between degrees of a node in different layers. We use generating function formalism to address various notions of the network robustness relevant to multiplex networks such as the resilience of ordinary- and mutual connectivity under random or targ…

    Submitted 8 April, 2014; v1 submitted 4 July, 2013; originally announced July 2013.

    Comments: 9 pages, 9 figures, accepted for publication in Phys. Rev. E

    Journal ref: Phys. Rev. E 89, 042811 (2014)

  39. arXiv:1302.7180

    cs.CV

    Fast Matching by 2 Lines of Code for Large Scale Face Recognition Systems

    Authors: Dong Yi, Zhen Lei, Yang Hu, Stan Z. Li

    Abstract: In this paper, we propose a method to apply the popular cascade classifier into face recognition to improve the computational efficiency while keeping high recognition rate. In large scale face recognition systems, because the probability of feature templates coming from different subjects is very high, most of the matching pairs will be rejected by the early stages of the cascade. Therefore, the…

    Submitted 28 February, 2013; originally announced February 2013.