Skip to main content

Showing 101–150 of 344 results for author: Choi, D

  1. arXiv:2203.07814  [pdf, other

    cs.PL cs.AI cs.LG

    Competition-Level Code Generation with AlphaCode

    Authors: Yujia Li, David Choi, Junyoung Chung, Nate Kushman, Julian Schrittwieser, Rémi Leblond, Tom Eccles, James Keeling, Felix Gimeno, Agustin Dal Lago, Thomas Hubert, Peter Choy, Cyprien de Masson d'Autume, Igor Babuschkin, Xinyun Chen, Po-Sen Huang, Johannes Welbl, Sven Gowal, Alexey Cherepanov, James Molloy, Daniel J. Mankowitz, Esme Sutherland Robson, Pushmeet Kohli, Nando de Freitas, Koray Kavukcuoglu , et al. (1 additional authors not shown)

    Abstract: Programming is a powerful and ubiquitous problem-solving tool. Developing systems that can assist programmers or even generate programs independently could make programming more productive and accessible, yet so far incorporating innovations in AI has proven challenging. Recent large-scale language models have demonstrated an impressive ability to generate code, and are now able to complete simple… ▽ More

    Submitted 8 February, 2022; originally announced March 2022.

    Comments: 74 pages

  2. arXiv:2203.05934  [pdf, other

    cond-mat.supr-con cond-mat.mes-hall

    Moiré dispersion of edge states in spin chains on superconductors

    Authors: Cristina Mier, Deung-Jang Choi, Nicolás Lorente

    Abstract: Our calculations of ferromagnetic spin chains on s-wave superconductors show that the energy oscillations of edge states with the chain's length are due to a moiré pattern emerging from Friedel-like oscillations and the discreteness of the spin-chain lattice. By modifying the spin lattice, the moiré dispersion of edge states can be controlled. In particular, we can engineer non-dispersive edge sta… ▽ More

    Submitted 11 March, 2022; originally announced March 2022.

    Journal ref: Phys. Rev. Research 4, L032010 (2022)

  3. arXiv:2203.01426  [pdf, other

    cs.NE cs.AI cs.ET

    SPICEprop: Backpropagating Errors Through Memristive Spiking Neural Networks

    Authors: Peng Zhou, Jason K. Eshraghian, Dong-Uk Choi, Sung-Mo Kang

    Abstract: We present a fully memristive spiking neural network (MSNN) consisting of novel memristive neurons trained using the backpropagation through time (BPTT) learning rule. Gradient descent is applied directly to the memristive integrated-and-fire (MIF) neuron designed using analog SPICE circuit models, which generates distinct depolarization, hyperpolarization, and repolarization voltage waveforms. Sy… ▽ More

    Submitted 9 March, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

  4. arXiv:2203.01416  [pdf, other

    cs.NE cs.AI cs.ET

    A Fully Memristive Spiking Neural Network with Unsupervised Learning

    Authors: Peng Zhou, Dong-Uk Choi, Jason K. Eshraghian, Sung-Mo Kang

    Abstract: We present a fully memristive spiking neural network (MSNN) consisting of physically-realizable memristive neurons and memristive synapses to implement an unsupervised Spiking Time Dependent Plasticity (STDP) learning rule. The system is fully memristive in that both neuronal and synaptic dynamics can be realized by using memristors. The neuron is implemented using the SPICE-level memristive integ… ▽ More

    Submitted 9 March, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

  5. arXiv:2201.07459  [pdf, other

    cs.CV

    PT4AL: Using Self-Supervised Pretext Tasks for Active Learning

    Authors: John Seon Keun Yi, Minseok Seo, Jongchan Park, Dong-Geol Choi

    Abstract: Labeling a large set of data is expensive. Active learning aims to tackle this problem by asking to annotate only the most informative data from the unlabeled set. We propose a novel active learning approach that utilizes self-supervised pretext tasks and a unique data sampler to select data that are both difficult and representative. We discover that the loss of a simple self-supervised pretext t… ▽ More

    Submitted 26 July, 2022; v1 submitted 19 January, 2022; originally announced January 2022.

    Comments: Code is available at https://github.com/johnsk95/PT4AL Updated for ECCV 2022 submission

  6. arXiv:2112.02721  [pdf, other

    cs.CL cs.AI cs.LG

    NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation

    Authors: Kaustubh D. Dhole, Varun Gangal, Sebastian Gehrmann, Aadesh Gupta, Zhenhao Li, Saad Mahamood, Abinaya Mahendiran, Simon Mille, Ashish Shrivastava, Samson Tan, Tongshuang Wu, Jascha Sohl-Dickstein, Jinho D. Choi, Eduard Hovy, Ondrej Dusek, Sebastian Ruder, Sajant Anand, Nagender Aneja, Rabin Banjade, Lisa Barthe, Hanna Behnke, Ian Berlot-Attwell, Connor Boyle, Caroline Brun, Marco Antonio Sobrevilla Cabezudo , et al. (101 additional authors not shown)

    Abstract: Data augmentation is an important component in the robustness evaluation of models in natural language processing (NLP) and in enhancing the diversity of the data they are trained on. In this paper, we present NL-Augmenter, a new participatory Python-based natural language augmentation framework which supports the creation of both transformations (modifications to the data) and filters (data split… ▽ More

    Submitted 11 October, 2022; v1 submitted 5 December, 2021; originally announced December 2021.

    Comments: 39 pages, repository at https://github.com/GEM-benchmark/NL-Augmenter

  7. arXiv:2112.02026  [pdf, other

    astro-ph.GA astro-ph.IM

    The Seventeenth Data Release of the Sloan Digital Sky Surveys: Complete Release of MaNGA, MaStar and APOGEE-2 Data

    Authors: Abdurro'uf, Katherine Accetta, Conny Aerts, Victor Silva Aguirre, Romina Ahumada, Nikhil Ajgaonkar, N. Filiz Ak, Shadab Alam, Carlos Allende Prieto, Andres Almeida, Friedrich Anders, Scott F. Anderson, Brett H. Andrews, Borja Anguiano, Erik Aquino-Ortiz, Alfonso Aragon-Salamanca, Maria Argudo-Fernandez, Metin Ata, Marie Aubert, Vladimir Avila-Reese, Carles Badenes, Rodolfo H. Barba, Kat Barger, Jorge K. Barrera-Ballesteros, Rachael L. Beaton , et al. (316 additional authors not shown)

    Abstract: This paper documents the seventeenth data release (DR17) from the Sloan Digital Sky Surveys; the fifth and final release from the fourth phase (SDSS-IV). DR17 contains the complete release of the Mapping Nearby Galaxies at Apache Point Observatory (MaNGA) survey, which reached its goal of surveying over 10,000 nearby galaxies. The complete release of the MaNGA Stellar Library (MaStar) accompanies… ▽ More

    Submitted 13 January, 2022; v1 submitted 3 December, 2021; originally announced December 2021.

    Comments: 40 pages, 8 figures, 6 tables. In press at ApJSS (arxiv v2 corrects some minor typos and updates references)

  8. arXiv:2112.00503  [pdf, other

    cs.CL cs.LG

    Zero-Shot Cross-Lingual Machine Reading Comprehension via Inter-sentence Dependency Graph

    Authors: Liyan Xu, Xuchao Zhang, Bo Zong, Yanchi Liu, Wei Cheng, Jingchao Ni, Haifeng Chen, Liang Zhao, Jinho D. Choi

    Abstract: We target the task of cross-lingual Machine Reading Comprehension (MRC) in the direct zero-shot setting, by incorporating syntactic features from Universal Dependencies (UD), and the key features we use are the syntactic relations within each sentence. While previous work has demonstrated effective syntax-guided MRC models, we propose to adopt the inter-sentence syntactic relations, in addition to… ▽ More

    Submitted 15 March, 2022; v1 submitted 1 December, 2021; originally announced December 2021.

    Comments: Accepted to AAAI 2022

  9. arXiv:2111.00572  [pdf, other

    cs.CL cs.AI

    What Went Wrong? Explaining Overall Dialogue Quality through Utterance-Level Impacts

    Authors: James D. Finch, Sarah E. Finch, Jinho D. Choi

    Abstract: Improving user experience of a dialogue system often requires intensive developer effort to read conversation logs, run statistical analyses, and intuit the relative importance of system shortcomings. This paper presents a novel approach to automated analysis of conversation logs that learns the relationship between user-system interactions and overall dialogue quality. Unlike prior work on uttera… ▽ More

    Submitted 31 October, 2021; originally announced November 2021.

    Comments: Accepted at the 3rd Workshop on NLP for ConvAI

  10. arXiv:2111.00570  [pdf, other

    cs.CL cs.AI

    An Approach to Inference-Driven Dialogue Management within a Social Chatbot

    Authors: Sarah E. Finch, James D. Finch, Daniil Huryn, William Hutsell, Xiaoyuan Huang, Han He, Jinho D. Choi

    Abstract: We present a chatbot implementing a novel dialogue management approach based on logical inference. Instead of framing conversation a sequence of response generation tasks, we model conversation as a collaborative inference process in which speakers share information to synthesize new knowledge in real time. Our chatbot pipeline accomplishes this modelling in three broad stages. The first stage tra… ▽ More

    Submitted 31 October, 2021; originally announced November 2021.

    Comments: Published in 4th Proceedings of Alexa Prize (Alexa Prize 2020)

  11. arXiv:2110.00238  [pdf, other

    cs.RO cs.CV

    Improving Object Permanence using Agent Actions and Reasoning

    Authors: Ying Siu Liang, Chen Zhang, Dongkyu Choi, Kenneth Kwok

    Abstract: Object permanence in psychology means knowing that objects still exist even if they are no longer visible. It is a crucial concept for robots to operate autonomously in uncontrolled environments. Existing approaches learn object permanence from low-level perception, but perform poorly on more complex scenarios, like when objects are contained and carried by others. Knowledge about manipulation act… ▽ More

    Submitted 1 October, 2021; originally announced October 2021.

    Comments: 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2021)

  12. arXiv:2109.09858  [pdf, other

    cs.CL

    Intensionalizing Abstract Meaning Representations: Non-Veridicality and Scope

    Authors: Gregor Williamson, Patrick Elliott, Yuxin Ji, Jinho D. Choi

    Abstract: Abstract Meaning Representation (AMR) is a graphical meaning representation language designed to represent propositional information about argument structure. However, at present it is unable to satisfyingly represent non-veridical intensional contexts, often licensing inappropriate inferences. In this paper, we show how to resolve the problem of non-veridicality without appealing to layered graph… ▽ More

    Submitted 20 September, 2021; originally announced September 2021.

    Comments: LAW-DMR'21, 8 pages (excl. refs)

  13. arXiv:2109.09853  [pdf, other

    cs.CL

    StreamSide: A Fully-Customizable Open-Source Toolkit for Efficient Annotation of Meaning Representations

    Authors: Jinho D. Choi, Gregor Williamson

    Abstract: This demonstration paper presents StreamSide, an open-source toolkit for annotating multiple kinds of meaning representations. StreamSide supports frame-based annotation schemes e.g., Abstract Meaning Representation (AMR) and frameless annotation schemes e.g., Widely Interpretable Semantic Representation (WISeR). Moreover, it supports both sentence-level and document-level annotation by allowing a… ▽ More

    Submitted 20 September, 2021; originally announced September 2021.

    Comments: demo paper, 6 pages (excl. refs), 6 figures

  14. arXiv:2109.06939  [pdf, other

    cs.CL cs.AI

    The Stem Cell Hypothesis: Dilemma behind Multi-Task Learning with Transformer Encoders

    Authors: Han He, Jinho D. Choi

    Abstract: Multi-task learning with transformer encoders (MTL) has emerged as a powerful technique to improve performance on closely-related tasks for both accuracy and efficiency while a question still remains whether or not it would perform as well on tasks that are distinct in nature. We first present MTL results on five NLP tasks, POS, NER, DEP, CON, and SRL, and depict its deficiency over single-task le… ▽ More

    Submitted 14 September, 2021; originally announced September 2021.

    Comments: Accepted to EMNLP 2021: The 2021 Conference on Empirical Methods in Natural Language Processing

  15. arXiv:2109.03903  [pdf, other

    cs.CL

    ELIT: Emory Language and Information Toolkit

    Authors: Han He, Liyan Xu, Jinho D. Choi

    Abstract: We introduce ELIT, the Emory Language and Information Toolkit, which is a comprehensive NLP framework providing transformer-based end-to-end models for core tasks with a special focus on memory efficiency while maintaining state-of-the-art accuracy and speed. Compared to existing toolkits, ELIT features an efficient Multi-Task Learning (MTL) model with many downstream tasks that include lemmatizat… ▽ More

    Submitted 8 September, 2021; originally announced September 2021.

  16. arXiv:2109.00194  [pdf, other

    cs.CL cs.LG

    Boosting Cross-Lingual Transfer via Self-Learning with Uncertainty Estimation

    Authors: Liyan Xu, Xuchao Zhang, Xujiang Zhao, Haifeng Chen, Feng Chen, Jinho D. Choi

    Abstract: Recent multilingual pre-trained language models have achieved remarkable zero-shot performance, where the model is only finetuned on one source language and directly evaluated on target languages. In this work, we propose a self-learning framework that further utilizes unlabeled data of target languages, combined with uncertainty estimation in the process to select high-quality silver labels. Thre… ▽ More

    Submitted 23 September, 2021; v1 submitted 1 September, 2021; originally announced September 2021.

    Comments: Accepted to EMNLP 2021

  17. arXiv:2109.00185  [pdf, other

    cs.CL cs.LG

    Adapted End-to-End Coreference Resolution System for Anaphoric Identities in Dialogues

    Authors: Liyan Xu, Jinho D. Choi

    Abstract: We present an effective system adapted from the end-to-end neural coreference resolution model, targeting on the task of anaphora resolution in dialogues. Three aspects are specifically addressed in our approach, including the support of singletons, encoding speakers and turns throughout dialogue interactions, and knowledge transfer utilizing existing resources. Despite the simplicity of our adapt… ▽ More

    Submitted 23 September, 2021; v1 submitted 1 September, 2021; originally announced September 2021.

    Comments: Accepted to CODI-CRAC 2021

  18. arXiv:2108.11146  [pdf, other

    cond-mat.supr-con cond-mat.mes-hall

    Calculations of in-gap states of ferromagnetic spin chains on \textit{s}-wave wide-band superconductors

    Authors: Cristina Mier, deung-Jang Choi, Nicolás Lorente

    Abstract: Magnetic impurities create in-gap states on superconductors. Recent experiments explore the topological properties of one-dimensional arrays of magnetic impurities on superconductors, because in certain regimes p-wave pairing can be locally induced leading to new topological phases. A by-product of the new accessible phases is the appearance of zero-energy edge states that have non-Abelian exchang… ▽ More

    Submitted 25 August, 2021; originally announced August 2021.

  19. arXiv:2108.09880  [pdf

    cond-mat.mes-hall quant-ph

    An electron-spin qubit platform assembled atom-by-atom on a surface

    Authors: Yu Wang, Yi Chen, Hong T. Bui, Christoph Wolf, Masahiro Haze, Cristina Mier, Jinkyung Kim, Deung-jang Choi, Christopher P. Lutz, Yujeong Bae, Soo-Hyon Phark, Andreas J. Heinrich

    Abstract: Creating a quantum-coherent architecture at the atomic scale has long been an ambition in quantum science and nanotechnology. This ultimate length scale requires the use of fundamental quantum properties of atoms, such as the spin of electrons, which naturally occurs in many solid-state environments and allows high-fidelity operations and readout by electromagnetic means. Despite decades of effort… ▽ More

    Submitted 5 August, 2022; v1 submitted 22 August, 2021; originally announced August 2021.

  20. Exploiting Features with Split-and-Share Module

    Authors: Jaemin Lee, Minseok Seo, Jongchan Park, Dong-Geol Choi

    Abstract: Deep convolutional neural networks (CNNs) have shown state-of-the-art performances in various computer vision tasks. Advances on CNN architectures have focused mainly on designing convolutional blocks of the feature extractors, but less on the classifiers that exploit extracted features. In this work, we propose Split-and-Share Module (SSM),a classifier that splits a given feature into parts, whic… ▽ More

    Submitted 10 August, 2021; v1 submitted 10 August, 2021; originally announced August 2021.

    Journal ref: Electronics 2022

  21. arXiv:2108.02400  [pdf, other

    cs.CV

    Security and Privacy Enhanced Gait Authentication with Random Representation Learning and Digital Lockers

    Authors: Lam Tran, Thuc Nguyen, Hyunil Kim, Deokjai Choi

    Abstract: Gait data captured by inertial sensors have demonstrated promising results on user authentication. However, most existing approaches stored the enrolled gait pattern insecurely for matching with the validating pattern, thus, posed critical security and privacy issues. In this study, we present a gait cryptosystem that generates from gait data the random key for user authentication, meanwhile, secu… ▽ More

    Submitted 5 August, 2021; originally announced August 2021.

  22. arXiv:2107.13227  [pdf

    physics.optics cond-mat.mes-hall

    Nonlinear imaging of nanoscale topological corner states

    Authors: Sergey S. Kruk, Wenlong Gao, Duk-Yong Choi, Thomas Zentgraf, Shuang Zhang, Yuri Kivshar

    Abstract: Topological states of light represent counterintuitive optical modes localized at boundaries of finite-size optical structures that originate from the properties of the bulk. Being defined by bulk properties, such boundary states are insensitive to certain types of perturbations, thus naturally enhancing robustness of photonic circuitries. Conventionally, the N-dimensional bulk modes correspond to… ▽ More

    Submitted 28 July, 2021; originally announced July 2021.

    Comments: arXiv admin note: text overlap with arXiv:2011.10164

  23. arXiv:2107.10916  [pdf, other

    cond-mat.mes-hall quant-ph

    A flexible design platform for Si/SiGe exchange-only qubits with low disorder

    Authors: Wonill Ha, Sieu D. Ha, Maxwell D. Choi, Yan Tang, Adele E. Schmitz, Mark P. Levendorf, Kangmu Lee, James M. Chappell, Tower S. Adams, Daniel R. Hulbert, Edwin Acuna, Ramsey S. Noah, Justine W. Matten, Michael P. Jura, Jeffrey A. Wright, Matthew T. Rakher, Matthew G. Borselli

    Abstract: Spin-based silicon quantum dots are an attractive qubit technology for quantum information processing with respect to coherence time, control, and engineering. Here we present an exchange-only Si qubit device platform that combines the throughput of CMOS-like wafer processing with the versatility of direct-write lithography. The technology, which we coin "SLEDGE," features dot-shaped gates that ar… ▽ More

    Submitted 22 July, 2021; originally announced July 2021.

  24. arXiv:2107.04152  [pdf, other

    cs.CL cs.AI

    Levi Graph AMR Parser using Heterogeneous Attention

    Authors: Han He, Jinho D. Choi

    Abstract: Coupled with biaffine decoders, transformers have been effectively adapted to text-to-graph transduction and achieved state-of-the-art performance on AMR parsing. Many prior works, however, rely on the biaffine decoder for either or both arc and label predictions although most features used by the decoder may be learned by the transformer already. This paper presents a novel approach to AMR parsin… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

    Comments: Accepted in IWPT 2021: The 17th International Conference on Parsing Technologies

  25. arXiv:2107.03038  [pdf, other

    cs.RO cs.CV

    Maintaining a Reliable World Model using Action-aware Perceptual Anchoring

    Authors: Ying Siu Liang, Dongkyu Choi, Kenneth Kwok

    Abstract: Reliable perception is essential for robots that interact with the world. But sensors alone are often insufficient to provide this capability, and they are prone to errors due to various conditions in the environment. Furthermore, there is a need for robots to maintain a model of its surroundings even when objects go out of view and are no longer visible. This requires anchoring perceptual informa… ▽ More

    Submitted 7 July, 2021; originally announced July 2021.

    Comments: 7 pages, 3 figures

    Journal ref: 2021 International Conference on Robotics and Automation (ICRA 2021)

  26. arXiv:2107.01354  [pdf, other

    cs.DB cs.AI cs.LG

    Pool of Experts: Realtime Querying Specialized Knowledge in Massive Neural Networks

    Authors: Hakbin Kim, Dong-Wan Choi

    Abstract: In spite of the great success of deep learning technologies, training and delivery of a practically serviceable model is still a highly time-consuming process. Furthermore, a resulting model is usually too generic and heavyweight, and hence essentially goes through another expensive model compression phase to fit in a resource-limited device like embedded systems. Inspired by the fact that a machi… ▽ More

    Submitted 3 July, 2021; originally announced July 2021.

    Comments: In SIGMOD/PODS 2021

    Journal ref: SIGMOD Conference 2021: 2244-2252

  27. arXiv:2107.01349  [pdf, other

    cs.LG cs.AI cs.CV

    Split-and-Bridge: Adaptable Class Incremental Learning within a Single Neural Network

    Authors: Jong-Yeong Kim, Dong-Wan Choi

    Abstract: Continual learning has been a major problem in the deep learning community, where the main challenge is how to effectively learn a series of newly arriving tasks without forgetting the knowledge of previous tasks. Initiated by Learning without Forgetting (LwF), many of the existing works report that knowledge distillation is effective to preserve the previous knowledge, and hence they commonly use… ▽ More

    Submitted 3 July, 2021; originally announced July 2021.

    Comments: In AAAI-2021

    Journal ref: In Proceedings of the AAAI Conference on Artificial Intelligence (Vol. 35, No. 9, pp. 8137-8145) 2021

  28. arXiv:2107.00248  [pdf, ps, other

    stat.ME

    New Estimands for Experiments with Strong Interference

    Authors: David Choi

    Abstract: In experiments that study social phenomena, such as peer influence or herd immunity, the treatment of one unit may influence the outcomes of others. Such "interference between units" violates traditional approaches for causal inference, so that additional assumptions are often imposed to model or limit the underlying social mechanism. For binary outcomes, we propose new estimands that can be estim… ▽ More

    Submitted 29 August, 2023; v1 submitted 1 July, 2021; originally announced July 2021.

    Comments: new title, expanded discussion of interpretation and limitations, consolidation of central limit theorem results

  29. arXiv:2106.12767  [pdf, other

    cs.CL cs.DB cs.HC cs.LG

    TagRuler: Interactive Tool for Span-Level Data Programming by Demonstration

    Authors: Dongjin Choi, Sara Evensen, Çağatay Demiralp, Estevam Hruschka

    Abstract: Despite rapid developments in the field of machine learning research, collecting high-quality labels for supervised learning remains a bottleneck for many applications. This difficulty is exacerbated by the fact that state-of-the-art models for NLP tasks are becoming deeper and more complex, often increasing the amount of training data required even for fine-tuning. Weak supervision methods, inclu… ▽ More

    Submitted 24 June, 2021; originally announced June 2021.

    Comments: WWW'21 Demo

  30. Non-archimedean Sendov's Conjecture

    Authors: Daebeom Choi, Seewoo Lee

    Abstract: We prove non-archimedean analogue of Sendov's conjecure. We also provide complete list of polynomials over an algebraically closed non-archimedean field $K$ that satisfy the optimal bound in the Sendov's conjecture.

    Submitted 4 July, 2021; v1 submitted 21 June, 2021; originally announced June 2021.

    Comments: 4 pages, criterion for polynomials of degree $n$ with $v(n) = 0$ is added

    MSC Class: 11C08; 11S05

    Journal ref: P-Adic Num Ultrametr Anal Appl 14, 77-80 (2022)

  31. arXiv:2106.07610  [pdf, other

    q-bio.QM physics.bio-ph

    Measuring the repertoire of age-related behavioral changes in Drosophila melanogaster

    Authors: Katherine E. Overman, Daniel M. Choi, Kawai Leung, Joshua W. Shaevitz, Gordon J. Berman

    Abstract: Aging affects almost all aspects of an organism -- its morphology, its physiology, its behavior. Isolating which biological mechanisms are regulating these changes, however, has proven difficult, potentially due to our inability to characterize the full repertoire of an animal's behavior across the lifespan. Using data from fruit flies (D. melanogaster) we measure the full repertoire of behaviors… ▽ More

    Submitted 15 June, 2021; v1 submitted 14 June, 2021; originally announced June 2021.

  32. arXiv:2106.05478  [pdf, other

    cs.CR cs.LG

    Semantic-aware Binary Code Representation with BERT

    Authors: Hyungjoon Koo, Soyeon Park, Daejin Choi, Taesoo Kim

    Abstract: A wide range of binary analysis applications, such as bug discovery, malware analysis and code clone detection, require recovery of contextual meanings on a binary code. Recently, binary analysis techniques based on machine learning have been proposed to automatically reconstruct the code representation of a binary instead of manually crafting specifics of the analysis algorithm. However, the exis… ▽ More

    Submitted 9 June, 2021; originally announced June 2021.

    Comments: 16 pages

  33. arXiv:2105.11354  [pdf, other

    cs.CL cs.LG

    View Distillation with Unlabeled Data for Extracting Adverse Drug Effects from User-Generated Data

    Authors: Payam Karisani, Jinho D. Choi, Li Xiong

    Abstract: We present an algorithm based on multi-layer transformers for identifying Adverse Drug Reactions (ADR) in social media data. Our model relies on the properties of the problem and the characteristics of contextual word embeddings to extract two views from documents. Then a classifier is trained on each view to label a set of unlabeled documents to be used as an initializer for a new classifier in t… ▽ More

    Submitted 24 May, 2021; originally announced May 2021.

    Comments: NAACL 2021 (workshops)

  34. arXiv:2105.05601  [pdf, other

    cs.CL cs.LG

    OutFlip: Generating Out-of-Domain Samples for Unknown Intent Detection with Natural Language Attack

    Authors: DongHyun Choi, Myeong Cheol Shin, EungGyun Kim, Dong Ryeol Shin

    Abstract: Out-of-domain (OOD) input detection is vital in a task-oriented dialogue system since the acceptance of unsupported inputs could lead to an incorrect response of the system. This paper proposes OutFlip, a method to generate out-of-domain samples using only in-domain training dataset automatically. A white-box natural language attack method HotFlip is revised to generate out-of-domain samples inste… ▽ More

    Submitted 12 May, 2021; originally announced May 2021.

    Comments: 9 pages, 3 figures; to be appear in ACL Findings of ACL-IJCNLP 2021

  35. arXiv:2105.02381  [pdf, other

    stat.AP

    Balancing weights for region-level analysis: the effect of Medicaid Expansion on the uninsurance rate among states that did not expand Medicaid

    Authors: Max Rubinstein, Amelia Haviland, David Choi

    Abstract: We predict the average effect of Medicaid expansion on the non-elderly adult uninsurance rate among states that did not expand Medicaid in 2014 as if they had expanded their Medicaid eligibility requirements. Using American Community Survey data aggregated to the region level, we estimate this effect by finding weights that approximately reweights the expansion regions to match the covariate distr… ▽ More

    Submitted 23 May, 2022; v1 submitted 5 May, 2021; originally announced May 2021.

  36. arXiv:2104.10117  [pdf, other

    cs.CL

    Enhancing Cognitive Models of Emotions with Representation Learning

    Authors: Yuting Guo, Jinho Choi

    Abstract: We present a novel deep learning-based framework to generate embedding representations of fine-grained emotions that can be used to computationally describe psychological models of emotions. Our framework integrates a contextualized embedding encoder with a multi-head probing model that enables to interpret dynamically learned representations optimized for an emotion classification task. Our model… ▽ More

    Submitted 20 April, 2021; originally announced April 2021.

    Comments: Accepted by the NAACL Workshop on Cognitive Modeling and Computational Linguistics 2021

  37. arXiv:2104.06924  [pdf, other

    cs.CL

    Evaluation of Unsupervised Entity and Event Salience Estimation

    Authors: Jiaying Lu, Jinho D. Choi

    Abstract: Salience Estimation aims to predict term importance in documents. Due to few existing human-annotated datasets and the subjective notion of salience, previous studies typically generate pseudo-ground truth for evaluation. However, our investigation reveals that the evaluation protocol proposed by prior work is difficult to replicate, thus leading to few follow-up studies existing. Moreover, the ev… ▽ More

    Submitted 14 April, 2021; originally announced April 2021.

    Journal ref: Proceedings of the 34rd International Florida Artificial Intelligence Research Society Conference, 2021

  38. Atomic Manipulation of In-gap States on the $β$-Bi$_2$Pd Superconductor

    Authors: Cristina Mier, Jiyoon Hwang, Jinkyung Kim, Yujeong Bae, Fuyuki Nabeshima, Yoshinori Imai, Atsutaka Maeda, Nicolás Lorente, Andreas Heinrich, Deung-Jang Choi

    Abstract: Electronic states in the gap of a superconductor inherit intriguing many-body properties from the superconductor. Here, we create these in-gap states by manipulating Cr atomic chains on the $β$-Bi$_2$Pd superconductor. We find that the topological properties of the in-gap states can greatly vary depending on the crafted spin chain. These systems make an ideal platform for non-trivial topological p… ▽ More

    Submitted 6 May, 2021; v1 submitted 13 April, 2021; originally announced April 2021.

    Journal ref: Phys. Rev. B 104, 045406 (2021)

  39. arXiv:2104.00924  [pdf, other

    cs.CV

    Video Prediction Recalling Long-term Motion Context via Memory Alignment Learning

    Authors: Sangmin Lee, Hak Gu Kim, Dae Hwi Choi, Hyung-Il Kim, Yong Man Ro

    Abstract: Our work addresses long-term motion context issues for predicting future frames. To predict the future precisely, it is required to capture which long-term motion context (e.g., walking or running) the input motion (e.g., leg movement) belongs to. The bottlenecks arising when dealing with the long-term motion context are: (i) how to predict the long-term motion context naturally matching input seq… ▽ More

    Submitted 2 April, 2021; originally announced April 2021.

    Comments: CVPR 2021 (Oral)

  40. arXiv:2103.12533   

    math.NT

    Multiplicity one bound for cohomological automorphic representations with a fixed level

    Authors: Dohoon Choi

    Abstract: Let $F$ be a totally real field, and $\mathbb{A}_F$ be the adele ring of $F$. Let us fix $N$ to be a positive integer. Let $π_1=\otimesπ_{1,v}$ and $π_2=\otimesπ_{2,v}$ be distinct cohomological cuspidal automorphic representations of $\mathrm{GL}_n(\mathbb{A}_{F})$ with levels less than or equal to $N$. Let $\mathcal{N}(π_1,π_2)$ be the minimum of the absolute norm of $v \nmid \infty$ such that… ▽ More

    Submitted 12 March, 2022; v1 submitted 23 March, 2021; originally announced March 2021.

    Comments: A new paper preparing with other collaborators will include the results of this paper

  41. Spin Resonance Amplitude and Frequency of a Single Atom on a Surface in a Vector Magnetic Field

    Authors: Jinkyung Kim, Won-jun Jang, Thi Hong Bui, Deung-jang Choi, Christoph Wolf, Fernando Delgado, Denis Krylov, Soonhyeong Lee, Sangwon Yoon, Christopher P. Lutz, Andreas J. Heinrich, Yujeong Bae

    Abstract: We used electron spin resonance (ESR) combined with scanning tunneling microscopy (STM) to measure hydrogenated Ti (spin-1/2) atoms at low-symmetry binding sites on MgO in vector magnetic fields. We found strongly anisotropic g-values in all three spatial directions. Interestingly, the amplitude and lineshape of the ESR signals are also strongly dependent on the angle of the field. We conclude tha… ▽ More

    Submitted 17 March, 2021; originally announced March 2021.

  42. arXiv:2103.04044  [pdf, other

    cs.CL cs.AI cs.HC cs.LG

    Putting Humans in the Natural Language Processing Loop: A Survey

    Authors: Zijie J. Wang, Dongjin Choi, Shenyu Xu, Diyi Yang

    Abstract: How can we design Natural Language Processing (NLP) systems that learn from human feedback? There is a growing research body of Human-in-the-loop (HITL) NLP frameworks that continuously integrate human feedback to improve the model itself. HITL NLP research is nascent but multifarious -- solving various NLP problems, collecting diverse feedback from different people, and applying different methods… ▽ More

    Submitted 6 March, 2021; originally announced March 2021.

    Comments: The paper is accepted to the HCI+NLP workshop at EACL 2021

  43. arXiv:2103.01655  [pdf, other

    cs.RO

    Run Your Visual-Inertial Odometry on NVIDIA Jetson: Benchmark Tests on a Micro Aerial Vehicle

    Authors: Jinwoo Jeon, Sungwook Jung, Eungchang Lee, Duckyu Choi, Hyun Myung

    Abstract: This paper presents benchmark tests of various visual(-inertial) odometry algorithms on NVIDIA Jetson platforms. The compared algorithms include mono and stereo, covering Visual Odometry (VO) and Visual-Inertial Odometry (VIO): VINS-Mono, VINS-Fusion, Kimera, ALVIO, Stereo-MSCKF, ORB-SLAM2 stereo, and ROVIO. As these methods are mainly used for unmanned aerial vehicles (UAVs), they must perform we… ▽ More

    Submitted 2 March, 2021; originally announced March 2021.

    Comments: 8 pages, 5 figures

  44. Peacock Exploration: A Lightweight Exploration for UAV using Control-Efficient Trajectory

    Authors: EungChang Mason Lee, Duckyu Choi, Hyun Myung

    Abstract: Unmanned Aerial Vehicles have received much attention in recent years due to its wide range of applications, such as exploration of an unknown environment to acquire a 3D map without prior knowledge of it. Existing exploration methods have been largely challenged by computationally heavy probabilistic path planning. Similarly, kinodynamic constraints or proper sensors considering the payload for U… ▽ More

    Submitted 29 December, 2020; originally announced December 2020.

    Comments: 10 pages

  45. Creating a Physicist: The Impact of Informal Programs on University Student Development

    Authors: Callie Rethman, Jonathan Perry, Jonan Donaldson, Daniel Choi, Tatiana Erukhimova

    Abstract: Physics outreach programs provide a critical context for informal experiences that promote the transition from new student to contributing physicist. Prior studies have suggested a positive link between participation in informal physics outreach programs and the development of a student's physics identity. In this study, we adopt a student-focused investigation to explore the effects of informal p… ▽ More

    Submitted 29 May, 2021; v1 submitted 27 December, 2020; originally announced December 2020.

    Comments: 15 pages, 6 figures

    Journal ref: Phys. Rev. Phys. Educ. Res. 17, 020110 (2021)

  46. arXiv:2011.10897   

    cs.AI eess.SY

    Reinforcement learning with distance-based incentive/penalty (DIP) updates for highly constrained industrial control systems

    Authors: Hyungjun Park, Daiki Min, Jong-hyun Ryu, Dong Gu Choi

    Abstract: Typical reinforcement learning (RL) methods show limited applicability for real-world industrial control problems because industrial systems involve various constraints and simultaneously require continuous and discrete control. To overcome these challenges, we devise a novel RL algorithm that enables an agent to handle a highly constrained action space. This algorithm has two main features. First… ▽ More

    Submitted 19 May, 2021; v1 submitted 21 November, 2020; originally announced November 2020.

    Comments: We request withdrawal of this article due to a definition error on methodology and problem definition (Section 3-4; pages 2-5)

  47. Nonlinear imaging of nanoscale topological corner states

    Authors: Sergey Kruk, Wenlong Gao, Duk Yong Choi, Thomas Zentgraf, Shuang Zhang, Yuri Kivshar

    Abstract: Topological states of light represent counterintuitive optical modes localized at boundaries of finite-size optical structures that originate from the properties of the bulk. Being defined by bulk properties, such boundary states are insensitive to certain types of perturbations, thus naturally enhancing robustness of photonic circuitries. Conventionally, the N-dimensional bulk modes correspond to… ▽ More

    Submitted 1 September, 2022; v1 submitted 19 November, 2020; originally announced November 2020.

    Journal ref: Nano Lett. 2021, 21, 11, 4592-4597

  48. arXiv:2011.04803  [pdf, other

    cs.LG

    Self-Tuning Stochastic Optimization with Curvature-Aware Gradient Filtering

    Authors: Ricky T. Q. Chen, Dami Choi, Lukas Balles, David Duvenaud, Philipp Hennig

    Abstract: Standard first-order stochastic optimization algorithms base their updates solely on the average mini-batch gradient, and it has been shown that tracking additional quantities such as the curvature can help de-sensitize common hyperparameters. Based on this intuition, we explore the use of exact per-sample Hessian-vector products and gradients to construct optimizers that are self-tuning and hyper… ▽ More

    Submitted 9 November, 2020; originally announced November 2020.

  49. arXiv:2011.02998  [pdf, other

    cs.CL

    Competence-Level Prediction and Resume & Job Description Matching Using Context-Aware Transformer Models

    Authors: Changmao Li, Elaine Fisher, Rebecca Thomas, Steve Pittard, Vicki Hertzberg, Jinho D. Choi

    Abstract: This paper presents a comprehensive study on resume classification to reduce the time and labor needed to screen an overwhelming number of applications significantly, while improving the selection of suitable candidates. A total of 6,492 resumes are extracted from 24,933 job applications for 252 positions designated into four levels of experience for Clinical Research Coordinators (CRC). Each resu… ▽ More

    Submitted 5 November, 2020; originally announced November 2020.

    Comments: Accepted by the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020)

    ACM Class: I.2.7

  50. arXiv:2011.02207  [pdf, other

    cs.CL

    Extracting Chemical-Protein Interactions via Calibrated Deep Neural Network and Self-training

    Authors: Dongha Choi, Hyunju Lee

    Abstract: The extraction of interactions between chemicals and proteins from several biomedical articles is important in many fields of biomedical research such as drug development and prediction of drug side effects. Several natural language processing methods, including deep neural network (DNN) models, have been applied to address this problem. However, these methods were trained with hard-labeled data,… ▽ More

    Submitted 4 November, 2020; originally announced November 2020.

    Comments: 10 pages, 4 figures, accepted for the Findings of EMNLP