Skip to main content

Showing 251–300 of 338 results for author: Cai, D

  1. arXiv:1904.12987  [pdf, ps, other

    astro-ph.IM cs.LG

    Optical Transient Object Classification in Wide Field Small Aperture Telescopes with Neural Networks

    Authors: Peng Jia, Yifei Zhao, Gang Xue, Dongmei Cai

    Abstract: Wide field small aperture telescopes are working horses for fast sky surveying. Transient discovery is one of their main tasks. Classification of candidate transient images between real sources and artifacts with high accuracy is an important step for transient discovery. In this paper, we propose two transient classification methods based on neural networks. The first method uses the convolutiona… ▽ More

    Submitted 29 April, 2019; originally announced April 2019.

    Comments: 13 pages, 10 figures. Accepted by AJ and all the code can be downloaded from aojp.lamost.org. Comments welcome

  2. arXiv:1902.07374  [pdf, other

    eess.AS cs.LG cs.SD

    Utterance-level end-to-end language identification using attention-based CNN-BLSTM

    Authors: Weicheng Cai, Danwei Cai, Shen Huang, Ming Li

    Abstract: In this paper, we present an end-to-end language identification framework, the attention-based Convolutional Neural Network-Bidirectional Long-short Term Memory (CNN-BLSTM). The model is performed on the utterance level, which means the utterance-level decision can be directly obtained from the output of the neural network. To handle speech utterances with entire arbitrary and potentially long dur… ▽ More

    Submitted 19 February, 2019; originally announced February 2019.

    Comments: Accepted for ICASSP 2019

  3. arXiv:1901.06079  [pdf

    cs.CL

    Chinese Word Segmentation: Another Decade Review (2007-2017)

    Authors: Hai Zhao, Deng Cai, Changning Huang, Chunyu Kit

    Abstract: This paper reviews the development of Chinese word segmentation (CWS) in the most recent decade, 2007-2017. Special attention was paid to the deep learning technologies that has already permeated into most areas of natural language processing (NLP). The basic view we have arrived at is that compared to traditional supervised learning methods, neural network based methods have not shown any superio… ▽ More

    Submitted 17 January, 2019; originally announced January 2019.

    Comments: in Chinese

  4. arXiv:1812.03808  [pdf

    physics.app-ph physics.optics

    Ultrasensitive hybrid optical skin

    Authors: Lei Zhang, Jing Pan, Zhang Zhang, Hao Wu, Ni Yao, Dawei Cai, Yingxin Xu, Jin Zhang, Guofei Sun, Liqiang Wang, Weidong Geng, Wenguang Jin, Wei Fang, Dawei Di, Limin Tong

    Abstract: Electronic skin, a class of wearable electronic sensors that mimic the functionalities of human skin, has made remarkable success in applications including health monitoring, human-machine interaction and electronic-biological interfaces. While electronic skin continues to achieve higher sensitivity and faster response, its ultimate performance is fundamentally limited by the nature of low-frequen… ▽ More

    Submitted 25 October, 2018; originally announced December 2018.

  5. arXiv:1811.12314  [pdf, ps, other

    q-bio.NC stat.AP

    Swift Two-sample Test on High-dimensional Neural Spiking Data

    Authors: Zhi-Qin John Xu, Douglas Zhou, David Cai

    Abstract: To understand how neural networks process information, it is important to investigate how neural network dynamics varies with respect to different stimuli. One challenging task is to design efficient statistical approaches to analyze multiple spike train data obtained from a short recording time. Based on the development of high-dimensional statistical methods, it is able to deal with data whose d… ▽ More

    Submitted 11 November, 2018; originally announced November 2018.

    Comments: 10 pages, 6 figures

    MSC Class: 62H15; 62H30; 92B15 ACM Class: G.3

  6. arXiv:1811.05632  [pdf, other

    cs.CL

    Translating a Math Word Problem to an Expression Tree

    Authors: Lei Wang, Yan Wang, Deng Cai, Dongxiang Zhang, Xiaojiang Liu

    Abstract: Sequence-to-sequence (SEQ2SEQ) models have been successfully applied to automatic math word problem solving. Despite its simplicity, a drawback still remains: a math word problem can be correctly solved by more than one equations. This non-deterministic transduction harms the performance of maximum likelihood estimation. In this paper, by considering the uniqueness of expression tree, we propose a… ▽ More

    Submitted 14 November, 2018; v1 submitted 13 November, 2018; originally announced November 2018.

    Comments: 6 pages, 2 figures

  7. arXiv:1811.00185  [pdf, ps, other

    cs.CL

    Dial2Desc: End-to-end Dialogue Description Generation

    Authors: Haojie Pan, Junpei Zhou, Zhou Zhao, Yan Liu, Deng Cai, Min Yang

    Abstract: We first propose a new task named Dialogue Description (Dial2Desc). Unlike other existing dialogue summarization tasks such as meeting summarization, we do not maintain the natural flow of a conversation but describe an object or an action of what people are talking about. The Dial2Desc system takes a dialogue text as input, then outputs a concise description of the object or the action involved i… ▽ More

    Submitted 31 October, 2018; originally announced November 2018.

  8. arXiv:1810.10226  [pdf, other

    cs.IR cs.LG cs.MM

    Textually Guided Ranking Network for Attentional Image Retweet Modeling

    Authors: Zhou Zhao, Hanbing Zhan, Lingtao Meng, Jun Xiao, Jun Yu, Min Yang, Fei Wu, Deng Cai

    Abstract: Retweet prediction is a challenging problem in social media sites (SMS). In this paper, we study the problem of image retweet prediction in social media, which predicts the image sharing behavior that the user reposts the image tweets from their followees. Unlike previous studies, we learn user preference ranking model from their past retweeted image tweets in SMS. We first propose heterogeneous i… ▽ More

    Submitted 24 October, 2018; originally announced October 2018.

    Comments: 12 pages, 9 figures

  9. arXiv:1809.05296  [pdf, other

    cs.CL

    Skeleton-to-Response: Dialogue Generation Guided by Retrieval Memory

    Authors: Deng Cai, Yan Wang, Victoria Bi, Zhaopeng Tu, Xiaojiang Liu, Wai Lam, Shuming Shi

    Abstract: For dialogue response generation, traditional generative models generate responses solely from input queries. Such models rely on insufficient information for generating a specific response since a certain query could be answered in multiple ways. Consequentially, those models tend to output generic and dull responses, impeding the generation of informative utterances. Recently, researchers have a… ▽ More

    Submitted 28 February, 2020; v1 submitted 14 September, 2018; originally announced September 2018.

    Comments: accepted to NAACL2019

  10. arXiv:1809.02906  [pdf, other

    eess.AS cs.AI cs.SD eess.SP

    End-to-end Language Identification using NetFV and NetVLAD

    Authors: Jinkun Chen, Weicheng Cai, Danwei Cai, Zexin Cai, Haibin Zhong, Ming Li

    Abstract: In this paper, we apply the NetFV and NetVLAD layers for the end-to-end language identification task. NetFV and NetVLAD layers are the differentiable implementations of the standard Fisher Vector and Vector of Locally Aggregated Descriptors (VLAD) methods, respectively. Both of them can encode a sequence of feature vectors into a fixed dimensional vector which is very important to process those va… ▽ More

    Submitted 8 September, 2018; originally announced September 2018.

    Comments: Accepted for ISCSLP 2018

  11. arXiv:1808.10506  [pdf, ps, other

    physics.bio-ph cs.IT physics.data-an stat.ME

    Maximum Entropy Principle Analysis in Network Systems with Short-time Recordings

    Authors: Zhi-Qin John Xu, Jennifer Crodelle, Douglas Zhou, David Cai

    Abstract: In many realistic systems, maximum entropy principle (MEP) analysis provides an effective characterization of the probability distribution of network states. However, to implement the MEP analysis, a sufficiently long-time data recording in general is often required, e.g., hours of spiking recordings of neurons in neuronal networks. The issue of whether the MEP analysis can be successfully applied… ▽ More

    Submitted 30 August, 2018; originally announced August 2018.

    Comments: 10 pages, 5 figures

    MSC Class: 92B15; 92B20

    Journal ref: Phys. Rev. E 99, 022409 (2019)

  12. arXiv:1808.04499  [pdf, other

    physics.bio-ph q-bio.NC

    Dynamical and Coupling Structure of Pulse-Coupled Networks in Maximum Entropy Analysis

    Authors: Zhi-Qin John Xu, Douglas Zhou, David Cai

    Abstract: Maximum entropy principle (MEP) analysis with few non-zero effective interactions successfully characterizes the distribution of dynamical states of pulse-coupled networks in many experiments, e.g., in neuroscience. To better understand the underlying mechanism, we found a relation between the dynamical structure, i.e., effective interactions in MEP analysis, and the coupling structure of pulse-co… ▽ More

    Submitted 13 August, 2018; originally announced August 2018.

    Comments: 4 pages, 3 figures

    MSC Class: 92B15; 92B20

  13. arXiv:1808.04071  [pdf, other

    cs.CL

    Language Style Transfer from Sentences with Arbitrary Unknown Styles

    Authors: Yanpeng Zhao, Wei Bi, Deng Cai, Xiaojiang Liu, Kewei Tu, Shuming Shi

    Abstract: Language style transfer is the problem of migrating the content of a source sentence to a target style. In many of its applications, parallel training data are not available and source sentences to be transferred may have arbitrary and unknown styles. First, each sentence is encoded into its content and style latent representations. Then, by recombining the content with the target style, we decode… ▽ More

    Submitted 13 August, 2018; originally announced August 2018.

  14. arXiv:1807.03745  [pdf, other

    cs.IT

    On the Performance of NOMA with Hybrid ARQ

    Authors: Donghong Cai, Zhiguo Ding, Pingzhi Fan, Zheng Yang

    Abstract: In this paper, we investigate the outage performance of hybrid automatic repeat request with chase combining (HARQ-CC) assisted downlink non-orthogonal multiple access (NOMA) systems. A closed-form expression of the individual outage probability and the diversity gain are obtained firstly. Based on the developed analytical outage probability, a tradeoff between the minimum number of retransmission… ▽ More

    Submitted 10 July, 2018; originally announced July 2018.

  15. arXiv:1806.04860  [pdf, other

    cs.CV cs.CL

    Learning Visual Knowledge Memory Networks for Visual Question Answering

    Authors: Zhou Su, Chen Zhu, Yinpeng Dong, Dongqi Cai, Yurong Chen, Jianguo Li

    Abstract: Visual question answering (VQA) requires joint comprehension of images and natural language questions, where many questions can't be directly or clearly answered from visual content but require reasoning from structured human knowledge with confirmation from visual content. This paper proposes visual knowledge memory network (VKMN) to address this issue, which seamlessly incorporates structured hu… ▽ More

    Submitted 13 June, 2018; originally announced June 2018.

    Comments: Supplementary to CVPR 2018 version

  16. arXiv:1805.09023  [pdf, other

    cs.IR cs.LG stat.ML

    Addressing the Item Cold-start Problem by Attribute-driven Active Learning

    Authors: Yu Zhu, Jinhao Lin, Shibi He, Beidou Wang, Ziyu Guan, Haifeng Liu, Deng Cai

    Abstract: In recommender systems, cold-start issues are situations where no previous events, e.g. ratings, are known for certain users or items. In this paper, we focus on the item cold-start problem. Both content information (e.g. item attributes) and initial user ratings are valuable for seizing users' preferences on a new item. However, previous methods for the item cold-start problem either 1) incorpora… ▽ More

    Submitted 23 May, 2018; originally announced May 2018.

    Comments: 14 pages, 7 figures, 9 tables. Submitted to TKDE

    ACM Class: H.3.3

  17. arXiv:1805.08958  [pdf, other

    cs.IR cs.LG

    A Brand-level Ranking System with the Customized Attention-GRU Model

    Authors: Yu Zhu, Junxiong Zhu, Jie Hou, Yongliang Li, Beidou Wang, Ziyu Guan, Deng Cai

    Abstract: In e-commerce websites like Taobao, brand is playing a more important role in influencing users' decision of click/purchase, partly because users are now attaching more importance to the quality of products and brand is an indicator of quality. However, existing ranking systems are not specifically designed to satisfy this kind of demand. Some design tricks may partially alleviate this problem, bu… ▽ More

    Submitted 11 August, 2018; v1 submitted 23 May, 2018; originally announced May 2018.

    Comments: 7 pages, 6 figures, 3 tables. Published in IJCAI 2018. Make some figures and tables more clear

    ACM Class: H.3.3

    Journal ref: International Joint Conferences on Artificial Intelligence, 2018: 3947-3953

  18. arXiv:1801.01315  [pdf, other

    cs.CV

    PixelLink: Detecting Scene Text via Instance Segmentation

    Authors: Dan Deng, Haifeng Liu, Xuelong Li, Deng Cai

    Abstract: Most state-of-the-art scene text detection algorithms are deep learning based methods that depend on bounding box regression and perform at least two kinds of predictions: text/non-text classification and location regression. Regression plays a key role in the acquisition of bounding boxes in these methods, but it is not indispensable because text/non-text prediction can also be considered as a ki… ▽ More

    Submitted 4 January, 2018; originally announced January 2018.

    Comments: AAAI-2018

  19. arXiv:1712.07329  [pdf, other

    cs.CV

    On the Diversity of Realistic Image Synthesis

    Authors: Zichen Yang, Haifeng Liu, Deng Cai

    Abstract: Many image processing tasks can be formulated as translating images between two image domains, such as colorization, super resolution and conditional image synthesis. In most of these tasks, an input image may correspond to multiple outputs. However, current existing approaches only show very minor diversity of the outputs. In this paper, we present a novel approach to synthesize diverse realistic… ▽ More

    Submitted 20 December, 2017; originally announced December 2017.

    Comments: 10 pages, 11 figures

  20. arXiv:1711.06016  [pdf, other

    cs.CV

    A Revisit on Deep Hashings for Large-scale Content Based Image Retrieval

    Authors: Deng Cai, Xiuye Gu, Chaoqi Wang

    Abstract: There is a growing trend in studying deep hashing methods for content-based image retrieval (CBIR), where hash functions and binary codes are learnt using deep convolutional neural networks and then the binary codes can be used to do approximate nearest neighbor (ANN) search. All the existing deep hashing papers report their methods' superior performance over the traditional hashing methods accord… ▽ More

    Submitted 16 November, 2017; originally announced November 2017.

  21. arXiv:1711.05568  [pdf, other

    cs.CL

    Dialogue Act Recognition via CRF-Attentive Structured Network

    Authors: Zheqian Chen, Rongqin Yang, Zhou Zhao, Deng Cai, Xiaofei He

    Abstract: Dialogue Act Recognition (DAR) is a challenging problem in dialogue interpretation, which aims to attach semantic labels to utterances and characterize the speaker's intention. Currently, many existing approaches formulate the DAR problem ranging from multi-classification to structured prediction, which suffer from handcrafted feature extensions and attentive contextual structural dependencies. In… ▽ More

    Submitted 15 November, 2017; originally announced November 2017.

    Comments: 10 pages, 4figures

  22. arXiv:1711.00179  [pdf, other

    cs.CL

    Keyword-based Query Comprehending via Multiple Optimized-Demand Augmentation

    Authors: Boyuan Pan, Hao Li, Zhou Zhao, Deng Cai, Xiaofei He

    Abstract: In this paper, we consider the problem of machine reading task when the questions are in the form of keywords, rather than natural language. In recent years, researchers have achieved significant success on machine reading comprehension tasks, such as SQuAD and TriviaQA. These datasets provide a natural language question sentence and a pre-selected passage, and the goal is to answer the question a… ▽ More

    Submitted 31 October, 2017; originally announced November 2017.

  23. arXiv:1710.05201  [pdf, other

    q-bio.NC

    Emergence of a Balanced Core through Dynamical Competition in Heterogeneous Neuronal Networks

    Authors: Qing-long L. Gu, Songting Li, Wei P. Dai, Douglas Zhou, David Cai

    Abstract: The balance between excitation and inhibition is crucial for neuronal computation. It is observed that the balanced state of neuronal networks exists in many experiments, yet its underlying mechanism remains to be fully clarified. Theoretical studies of the balanced state mainly focus on the analysis of the homogeneous Erd$\ddot{\text{o}}$s-Rényi network. However, neuronal networks have been found… ▽ More

    Submitted 14 October, 2017; originally announced October 2017.

  24. arXiv:1710.05113  [pdf, other

    q-bio.NC q-bio.QM

    A New Framework for Determination of Excitatory and Inhibitory Conductances Using Somatic Clamp

    Authors: Songting Li, Xiaohui Zhang, Douglas Zhou, David Cai

    Abstract: The interaction between excitation and inhibition is crucial for brain computation. To understand synaptic mechanisms underlying brain function, it is important to separate excitatory and inhibitory inputs to a target neuron. In the traditional method, after applying somatic current or voltage clamp, the excitatory and inhibitory conductances are determined from the synaptic current-voltage (I-V)… ▽ More

    Submitted 13 October, 2017; originally announced October 2017.

  25. arXiv:1710.05098  [pdf, ps, other

    q-bio.NC q-bio.QM

    Determination of Effective Synaptic Conductances Using Somatic Voltage Clamp

    Authors: Songting Li, Nan Liu, Xiaohui Zhang, Douglas Zhou, David Cai

    Abstract: The interplay between excitatory and inhibitory neurons imparts rich functions of the brain. To understand the underlying synaptic mechanisms, a fundamental approach is to study the dynamics of excitatory and inhibitory conductances of each neuron. The traditional method of determining conductance employs the synaptic current-voltage (I-V) relation obtained via voltage clamp. Using theoretical ana… ▽ More

    Submitted 13 October, 2017; originally announced October 2017.

  26. arXiv:1710.02772  [pdf, other

    cs.CL cs.IR

    Smarnet: Teaching Machines to Read and Comprehend Like Human

    Authors: Zheqian Chen, Rongqin Yang, Bin Cao, Zhou Zhao, Deng Cai, Xiaofei He

    Abstract: Machine Comprehension (MC) is a challenging task in Natural Language Processing field, which aims to guide the machine to comprehend a passage and answer the given question. Many existing approaches on MC task are suffering the inefficiency in some bottlenecks, such as insufficient lexical understanding, complex question-passage interaction, incorrect answer extraction and so on. In this paper, we… ▽ More

    Submitted 7 October, 2017; originally announced October 2017.

    Comments: 8 pages, paper for SQuAD machine comprehension

  27. arXiv:1709.10002  [pdf, ps, other

    math.DG

    Some Inequalities Related to Ricci Curvatures for Lagrangian Submanifolds of Kahler QCH-manifolds

    Authors: Liang Zhang, Xudong Liu, Dandan Cai

    Abstract: By establishing two general quadratic inequalities, we obtain some inequalities related to Ricci curvatures for Lagrangian submanifolds of K$\ddot{\mathrm{a}}$hler QCH-manifolds, which generalize some results for Lagrangian submanifolds of complex space forms.

    Submitted 28 September, 2017; originally announced September 2017.

  28. arXiv:1709.03741  [pdf, other

    cs.LG stat.ML

    Learning Graph-Level Representation for Drug Discovery

    Authors: Junying Li, Deng Cai, Xiaofei He

    Abstract: Predicating macroscopic influences of drugs on human body, like efficacy and toxicity, is a central problem of small-molecule based drug discovery. Molecules can be represented as an undirected graph, and we can utilize graph convolution networks to predication molecular properties. However, graph convolutional networks and other graph neural networks all focus on learning node-level representatio… ▽ More

    Submitted 15 September, 2017; v1 submitted 12 September, 2017; originally announced September 2017.

    Comments: arXiv admin note: text overlap with arXiv:1703.00564, arXiv:1611.03199 by other authors

  29. arXiv:1707.09098  [pdf, other

    cs.AI cs.CL

    MEMEN: Multi-layer Embedding with Memory Networks for Machine Comprehension

    Authors: Boyuan Pan, Hao Li, Zhou Zhao, Bin Cao, Deng Cai, Xiaofei He

    Abstract: Machine comprehension(MC) style question answering is a representative problem in natural language processing. Previous methods rarely spend time on the improvement of encoding layer, especially the embedding of syntactic information and name entity of the words, which are very crucial to the quality of encoding. Moreover, existing attention methods represent each query word as a vector or use a s… ▽ More

    Submitted 27 July, 2017; originally announced July 2017.

  30. Nonlinear dance motion analysis and motion editing using Hilbert-Huang transform

    Authors: Ran Dong, Dongsheng Cai, Nobuyoshi Asai

    Abstract: Human motions (especially dance motions) are very noisy, and it is hard to analyze and edit the motions. To resolve this problem, we propose a new method to decompose and modify the motions using the Hilbert-Huang transform (HHT). First, HHT decomposes a chromatic signal into "monochromatic" signals that are the so-called Intrinsic Mode Functions (IMFs) using an Empirical Mode Decomposition (EMD)… ▽ More

    Submitted 6 July, 2017; originally announced July 2017.

    Comments: 6 pages, 10 figures, Computer Graphics International 2017, Conference short paper

    Report number: Article No. 35

  31. arXiv:1707.00143  [pdf, other

    cs.LG

    Fast Approximate Nearest Neighbor Search With The Navigating Spreading-out Graph

    Authors: Cong Fu, Chao Xiang, Changxu Wang, Deng Cai

    Abstract: Approximate nearest neighbor search (ANNS) is a fundamental problem in databases and data mining. A scalable ANNS algorithm should be both memory-efficient and fast. Some early graph-based approaches have shown attractive theoretical guarantees on search time complexity, but they all suffer from the problem of high indexing time complexity. Recently, some graph-based methods have been proposed to… ▽ More

    Submitted 11 December, 2018; v1 submitted 1 July, 2017; originally announced July 2017.

  32. "Synchronize" to VR Body: Full Body Illusion in VR Space

    Authors: Peikun Xiong, Chen Sun, Dongsheng Cai

    Abstract: Virtual Reality (VR) becomes accessible to mimic a "real-like" world now. People who have a VR experience usually can be impressed by the immersive feeling, they might consider themselves are actually existed in the VR space. Self-consciousness is important for people to identify their own characters in VR space, and illusory ownership can help people to "build" their "bodies". The rubber hand ill… ▽ More

    Submitted 20 June, 2017; originally announced June 2017.

    Comments: 4 pages, 4 figures, Eurographics 2017,Conference short paper

    Report number: 009-012 ACM Class: H.5.1

    Journal ref: Eurographics 2017, 4pp,(2017)

  33. arXiv:1706.00722  [pdf, other

    eess.SY

    Understanding the Inefficiency of Security-Constrained Economic Dispatch

    Authors: Mohammad H. Hajiesmaili, Desmond Cai, Enrique Mallada

    Abstract: The security-constrained economic dispatch (SCED) problem tries to maintain the reliability of a power network by ensuring that a single failure does not lead to a global outage. The previous research has mainly investigated SCED by formulating the problem in different modalities, e.g. preventive or corrective, and devising efficient solutions for SCED. In this paper, we tackle a novel and importa… ▽ More

    Submitted 2 June, 2017; originally announced June 2017.

  34. arXiv:1705.08623  [pdf, other

    cs.CV

    Deep Rotation Equivariant Network

    Authors: Junying Li, Zichen Yang, Haifeng Liu, Deng Cai

    Abstract: Recently, learning equivariant representations has attracted considerable research attention. Dieleman et al. introduce four operations which can be inserted into convolutional neural network to learn deep representations equivariant to rotation. However, feature maps should be copied and rotated four times in each layer in their approach, which causes much running time and memory overhead. In ord… ▽ More

    Submitted 28 February, 2018; v1 submitted 24 May, 2017; originally announced May 2017.

  35. arXiv:1705.05443  [pdf, other

    math.NA

    SMASH: Structured matrix approximation by separation and hierarchy

    Authors: Difeng Cai, Edmond Chow, Yousef Saad, Yuanzhe Xi

    Abstract: This paper presents an efficient method to perform Structured Matrix Approximation by Separation and Hierarchy (SMASH), when the original dense matrix is associated with a kernel function. Given points in a domain, a tree structure is first constructed based on an adaptive partitioning of the computational domain to facilitate subsequent approximation procedures. In contrast to existing schemes ba… ▽ More

    Submitted 15 May, 2017; originally announced May 2017.

  36. arXiv:1705.01253  [pdf, other

    cs.CV cs.CL

    The Forgettable-Watcher Model for Video Question Answering

    Authors: Hongyang Xue, Zhou Zhao, Deng Cai

    Abstract: A number of visual question answering approaches have been proposed recently, aiming at understanding the visual scenes by answering the natural language questions. While the image question answering has drawn significant attention, video question answering is largely unexplored. Video-QA is different from Image-QA since the information and the events are scattered among multiple frames. In orde… ▽ More

    Submitted 3 May, 2017; originally announced May 2017.

  37. arXiv:1704.07047  [pdf, other

    cs.CL

    Fast and Accurate Neural Word Segmentation for Chinese

    Authors: Deng Cai, Hai Zhao, Zhisong Zhang, Yuan Xin, Yongjian Wu, Feiyue Huang

    Abstract: Neural models with minimal feature engineering have achieved competitive performance against traditional methods for the task of Chinese word segmentation. However, both training and working procedures of the current neural models are computationally inefficient. This paper presents a greedy neural word segmenter with balanced word and character embedding inputs to alleviate the existing drawbacks… ▽ More

    Submitted 24 April, 2017; originally announced April 2017.

    Comments: To appear in ACL2017

  38. arXiv:1703.05393  [pdf, other

    cs.CV

    Convolutional Low-Resolution Fine-Grained Classification

    Authors: Dingding Cai, Ke Chen, Yanlin Qian, Joni-Kristian Kämäräinen

    Abstract: Successful fine-grained image classification methods learn subtle details between visually similar (sub-)classes, but the problem becomes significantly more challenging if the details are missing due to low resolution. Encouraged by the recent success of Convolutional Neural Network (CNN) architectures in image classification, we propose a novel resolution-aware deep model which combines convoluti… ▽ More

    Submitted 16 October, 2017; v1 submitted 15 March, 2017; originally announced March 2017.

  39. arXiv:1701.08896  [pdf, other

    cs.GT

    On the Role of a Market Maker in Networked Cournot Competition

    Authors: Desmond Cai, Subhonmesh Bose, Adam Wierman

    Abstract: We study Cournot competition among firms in a networked marketplace that is centrally managed by a market maker. In particular, we study a situation in which a market maker facilitates trade between geographically separate markets via a constrained transport network. Our focus is on understanding the consequences of the design of the market maker and on providing tools for optimal design. To that… ▽ More

    Submitted 19 April, 2019; v1 submitted 30 January, 2017; originally announced January 2017.

  40. arXiv:1612.07545  [pdf, other

    cs.CV

    A Revisit of Hashing Algorithms for Approximate Nearest Neighbor Search

    Authors: Deng Cai

    Abstract: Approximate Nearest Neighbor Search (ANNS) is a fundamental problem in many areas of machine learning and data mining. During the past decade, numerous hashing algorithms are proposed to solve this problem. Every proposed algorithm claims outperform other state-of-the-art hashing methods. However, the evaluation of these hashing papers was not thorough enough, and those claims should be re-examine… ▽ More

    Submitted 18 June, 2019; v1 submitted 22 December, 2016; originally announced December 2016.

  41. arXiv:1612.05519  [pdf, other

    stat.ML

    Edge-exchangeable graphs and sparsity (NIPS 2016)

    Authors: Diana Cai, Trevor Campbell, Tamara Broderick

    Abstract: Many popular network models rely on the assumption of (vertex) exchangeability, in which the distribution of the graph is invariant to relabelings of the vertices. However, the Aldous-Hoover theorem guarantees that these graphs are dense or empty with probability one, whereas many real-world graphs are sparse. We present an alternative notion of exchangeability for random graphs, which we call edg… ▽ More

    Submitted 3 February, 2017; v1 submitted 16 December, 2016; originally announced December 2016.

    Comments: In the proceedings of the Advances in Neural Information Processing Systems 29 (NIPS), 2016. Preliminary work appeared in the 2015 NIPS workshops on Networks in the Social and Information Sciences (http://stanford.edu/~jugander/NetworksNIPS2015/) and Bayesian Nonparametrics: The Next Generation (https://sites.google.com/site/nipsbnp2015/). 26 pages, 4 figures

  42. arXiv:1611.08135  [pdf, other

    cs.IR cs.CL

    Question Retrieval for Community-based Question Answering via Heterogeneous Network Integration Learning

    Authors: Zheqian Chen, Chi Zhang, Zhou Zhao, Deng Cai

    Abstract: Community based question answering platforms have attracted substantial users to share knowledge and learn from each other. As the rapid enlargement of CQA platforms, quantities of overlapped questions emerge, which makes users confounded to select a proper reference. It is urgent for us to take effective automated algorithms to reuse historical questions with corresponding answers. In this paper… ▽ More

    Submitted 24 November, 2016; originally announced November 2016.

  43. arXiv:1611.08096  [pdf, other

    cs.IR cs.CL

    User Personalized Satisfaction Prediction via Multiple Instance Deep Learning

    Authors: Zheqian Chen, Ben Gao, Huimin Zhang, Zhou Zhao, Deng Cai

    Abstract: Community based question answering services have arisen as a popular knowledge sharing pattern for netizens. With abundant interactions among users, individuals are capable of obtaining satisfactory information. However, it is not effective for users to attain answers within minutes. Users have to check the progress over time until the satisfying answers submitted. We address this problem as a use… ▽ More

    Submitted 24 November, 2016; originally announced November 2016.

    Comments: draft for www

  44. Relational Multi-Manifold Co-Clustering

    Authors: Ping Li, Jiajun Bu, Chun Chen, Zhanying He, Deng Cai

    Abstract: Co-clustering targets on grouping the samples (e.g., documents, users) and the features (e.g., words, ratings) simultaneously. It employs the dual relation and the bilateral information between the samples and features. In many realworld applications, data usually reside on a submanifold of the ambient Euclidean space, but it is nontrivial to estimate the intrinsic manifold of the data space in a… ▽ More

    Submitted 16 November, 2016; originally announced November 2016.

    Comments: 11 pages, 4 figures, published in IEEE Transactions on Cybernetics (TCYB)

    Journal ref: IEEE Transactions on Cybernetics, 43(6): 1871-1881, 2013

  45. Constrained Low-Rank Learning Using Least Squares-Based Regularization

    Authors: Ping Li, Jun Yu, Meng Wang, Luming Zhang, Deng Cai, Xuelong Li

    Abstract: Low-rank learning has attracted much attention recently due to its efficacy in a rich variety of real-world tasks, e.g., subspace segmentation and image categorization. Most low-rank methods are incapable of capturing low-dimensional subspace for supervised learning tasks, e.g., classification and regression. This paper aims to learn both the discriminant low-rank representation (LRR) and the robu… ▽ More

    Submitted 15 November, 2016; originally announced November 2016.

    Comments: 14 pages, 7 figures, accepted to appear in IEEE Transactions on Cybernetics

    Journal ref: IEEE Transactions on Cybernetics, 2016

  46. arXiv:1611.00388  [pdf, other

    q-bio.NC q-bio.SC

    Automated scalable segmentation of neurons from multispectral images

    Authors: Uygar Sümbül, Douglas Roussien Jr., Fei Chen, Nicholas Barry, Edward S. Boyden, Dawen Cai, John P. Cunningham, Liam Paninski

    Abstract: Reconstruction of neuroanatomy is a fundamental problem in neuroscience. Stochastic expression of colors in individual cells is a promising tool, although its use in the nervous system has been limited due to various sources of variability in expression. Moreover, the intermingled anatomy of neuronal trees is challenging for existing segmentation algorithms. Here, we propose a method to automate t… ▽ More

    Submitted 21 January, 2017; v1 submitted 1 November, 2016; originally announced November 2016.

    Comments: main text: 9 pages and 5 figures, supplementary text: 11 pages and 8 figures (NIPS 2016)

  47. arXiv:1609.09147  [pdf, other

    math.ST

    Exchangeable Trait Allocations

    Authors: Trevor Campbell, Diana Cai, Tamara Broderick

    Abstract: Trait allocations are a class of combinatorial structures in which data may belong to multiple groups and may have different levels of belonging in each group. Often the data are also exchangeable, i.e., their joint distribution is invariant to reordering. In clustering---a special case of trait allocation---exchangeability implies the existence of both a de Finetti representation and an exchangea… ▽ More

    Submitted 5 July, 2018; v1 submitted 28 September, 2016; originally announced September 2016.

    Comments: 30 pages, 2 figures

    MSC Class: 60G09; 60C05; 62H30

    Journal ref: Electronic Journal of Statistics 12(2), 2018, 2290-2322

  48. arXiv:1609.07228  [pdf, other

    cs.CV

    EFANNA : An Extremely Fast Approximate Nearest Neighbor Search Algorithm Based on kNN Graph

    Authors: Cong Fu, Deng Cai

    Abstract: Approximate nearest neighbor (ANN) search is a fundamental problem in many areas of data mining, machine learning and computer vision. The performance of traditional hierarchical structure (tree) based methods decreases as the dimensionality of data grows, while hashing based methods usually lack efficiency in practice. Recently, the graph based methods have drawn considerable attention. The main… ▽ More

    Submitted 3 December, 2016; v1 submitted 23 September, 2016; originally announced September 2016.

  49. arXiv:1607.06996  [pdf, other

    stat.ML cs.LG

    Scaling Up Sparse Support Vector Machines by Simultaneous Feature and Sample Reduction

    Authors: Weizhong Zhang, Bin Hong, Wei Liu, Jieping Ye, Deng Cai, Xiaofei He, Jie Wang

    Abstract: Sparse support vector machine (SVM) is a popular classification technique that can simultaneously learn a small set of the most interpretable features and identify the support vectors. It has achieved great successes in many real-world applications. However, for large-scale problems involving a huge number of samples and ultra-high dimensional features, solving sparse SVMs remains challenging. By… ▽ More

    Submitted 18 July, 2019; v1 submitted 24 July, 2016; originally announced July 2016.

    Comments: accepted by JMLR

  50. arXiv:1606.08604  [pdf, other

    math.OC

    On the Inefficiency of Forward Markets in Leader-Follower Competition

    Authors: Desmond Cai, Anish Agarwal, Adam Wierman

    Abstract: Motivated by electricity markets, this paper studies the impact of forward contracting in situations where firms have capacity constraints and heterogeneous production lead times. We consider a model with two types of firms - leaders and followers - that choose production at two different times. Followers choose productions in the second stage but can sell forward contracts in the first stage. Our… ▽ More

    Submitted 28 June, 2016; originally announced June 2016.