-
Transferable Adversarial Examples for Anchor Free Object Detection
Authors:
Quanyu Liao,
Xin Wang,
Bin Kong,
Siwei Lyu,
Bin Zhu,
Youbing Yin,
Qi Song,
Xi Wu
Abstract:
Deep neural networks have been demonstrated to be vulnerable to adversarial attacks: subtle perturbations can completely change the prediction result. The vulnerability has led to a surge of research in this direction, including adversarial attacks on object detection networks. However, previous studies are dedicated to attacking anchor-based object detectors. In this paper, we present the first adversarial attack on anchor-free object detectors. It conducts category-wise, instead of the previous instance-wise, attacks on object detectors, and leverages high-level semantic information to efficiently generate transferable adversarial examples, which can also be transferred to attack other object detectors, even anchor-based detectors such as Faster R-CNN. Experimental results on two benchmark datasets demonstrate that our proposed method achieves state-of-the-art performance and transferability.
Submitted 3 June, 2021; v1 submitted 3 June, 2021;
originally announced June 2021.
-
Imperceptible Adversarial Examples for Fake Image Detection
Authors:
Quanyu Liao,
Yuezun Li,
Xin Wang,
Bin Kong,
Bin Zhu,
Siwei Lyu,
Youbing Yin,
Qi Song,
Xi Wu
Abstract:
Fooling people with highly realistic fake images generated with Deepfake or GANs causes great social disturbance. Many methods have been proposed to detect fake images, but they are vulnerable to adversarial perturbations -- intentionally designed noise that can lead to wrong predictions. Existing methods of attacking fake image detectors usually generate adversarial perturbations that perturb almost the entire image. This is redundant and increases the perceptibility of the perturbations. In this paper, we propose a novel method to disrupt fake image detection by determining the pixels that are key to a fake image detector and attacking only those key pixels, which results in $L_0$ and $L_2$ norms of adversarial perturbations much smaller than those of existing works. Experiments on two public datasets with three fake image detectors indicate that our proposed method achieves state-of-the-art performance in both white-box and black-box attacks.
Submitted 3 June, 2021;
originally announced June 2021.
-
Uncertainty Quantification 360: A Holistic Toolkit for Quantifying and Communicating the Uncertainty of AI
Authors:
Soumya Ghosh,
Q. Vera Liao,
Karthikeyan Natesan Ramamurthy,
Jiri Navratil,
Prasanna Sattigeri,
Kush R. Varshney,
Yunfeng Zhang
Abstract:
In this paper, we describe an open source Python toolkit named Uncertainty Quantification 360 (UQ360) for the uncertainty quantification of AI models. The goal of this toolkit is twofold: first, to provide a broad range of capabilities to streamline as well as foster the common practices of quantifying, evaluating, improving, and communicating uncertainty in the AI application development lifecycle; second, to encourage further exploration of UQ's connections to other pillars of trustworthy AI such as fairness and transparency through the dissemination of the latest research and education materials. Beyond the Python package (https://github.com/IBM/UQ360), we have developed an interactive experience (http://uq360.mybluemix.net) and guidance materials as educational tools to aid researchers and developers in producing and communicating high-quality uncertainties in an effective manner.
Submitted 3 June, 2021; v1 submitted 2 June, 2021;
originally announced June 2021.
-
A recursion formula for the generalized Euler function $ \varphi_e(n) $
Authors:
Canze Zhu,
Qunying Liao
Abstract:
In this paper, based on linear algebra methods and elementary techniques, for any positive integers $ e $ and $ n $, we obtain a recursion formula for the generalized Euler function $ \varphi_e(n) $, which is determined by some matrices related to a congruence equation modulo $ e $. Furthermore, through the recursion formula, we get the explicit formula for $ \varphi_5(n) $. Our results generalize the corresponding results in \cite{A4,A8,A10,A11}.
Submitted 24 August, 2022; v1 submitted 23 May, 2021;
originally announced May 2021.
-
ER-IQA: Boosting Perceptual Quality Assessment Using External Reference Images
Authors:
Jingyu Guo,
Wei Wang,
Wenming Yang,
Qingmin Liao,
Jie Zhou
Abstract:
Recently, image quality assessment (IQA) has achieved remarkable progress with the success of deep learning. However, the strict pre-condition of full-reference (FR) methods has limited their application in real scenarios. The no-reference (NR) scheme is also inconvenient due to its unsatisfactory performance, a result of ignoring the essence of image quality. In this paper, we introduce a brand new scheme, namely external-reference image quality assessment (ER-IQA), which uses external reference images to bridge the gap between FR and NR-IQA. As the first implementation and a new baseline of ER-IQA, we propose a new Unpaired-IQA network to process images in a content-unpaired manner. A Mutual Attention-based Feature Enhancement (MAFE) module is designed for the unpaired features in ER-IQA. The MAFE module allows the network to extract quality-discriminative features from distorted images and content variability-robust features from external reference ones. Extensive experiments demonstrate that the proposed model outperforms the state-of-the-art NR-IQA methods, verifying the effectiveness of ER-IQA and the possibility of narrowing the gap between the two existing categories.
Submitted 16 September, 2021; v1 submitted 6 May, 2021;
originally announced May 2021.
-
Leveraging Machine Learning for Industrial Wireless Communications
Authors:
Ilaria Malanchini,
Patrick Agostini,
Khurshid Alam,
Michael Baumgart,
Martin Kasparick,
Qi Liao,
Fabian Lipp,
Nikolaj Marchenko,
Nicola Michailow,
Rastin Pries,
Hans Schotten,
Slawomir Stanczak,
Stanislaw Strzyz
Abstract:
Two main trends characterize today's communication landscape and are finding their way into industrial facilities: the rollout of 5G with its distinct support for vertical industries and the increasing success of machine learning (ML). The combination of those two technologies opens the door to many exciting industrial applications, and its impact is expected to rapidly increase in the coming years, given the abundant data growth and the availability of powerful edge computers in production facilities. Unlike most previous work that has considered the application of 5G and ML in industrial environments separately, this paper highlights the potential and synergies that result from combining them. The overall vision presented here originates from the KICK project, a collaboration of several partners from the manufacturing and communication industry as well as research institutes. This unprecedented blend of 5G and ML expertise creates a unique perspective on ML-supported industrial communications and their role in facilitating industrial automation. The paper identifies key open industrial challenges that are grouped into four use cases: wireless connectivity and edge-cloud integration, flexibility in network reconfiguration, dynamicity of heterogeneous network services, and mobility of robots and vehicles. Moreover, the paper provides insights into the advantages of ML-based industrial communications and discusses current challenges of data acquisition in real systems.
Submitted 5 May, 2021;
originally announced May 2021.
-
SCNet: Enhancing Few-Shot Semantic Segmentation by Self-Contrastive Background Prototypes
Authors:
Jiacheng Chen,
Bin-Bin Gao,
Zongqing Lu,
Jing-Hao Xue,
Chengjie Wang,
Qingmin Liao
Abstract:
Few-shot semantic segmentation aims to segment novel-class objects in a query image with only a few annotated examples in support images. Most advanced solutions exploit a metric learning framework that performs segmentation by matching each pixel to a learned foreground prototype. However, this framework suffers from biased classification due to incomplete construction of sample pairs with the foreground prototype only. To address this issue, in this paper, we introduce a complementary self-contrastive task into few-shot semantic segmentation. Our new model is able to associate the pixels in a region with the prototype of this region, regardless of whether they are in the foreground or background. To this end, we generate self-contrastive background prototypes directly from the query image, with which we enable the construction of complete sample pairs and thus a complementary and auxiliary segmentation task to train a better segmentation model. Extensive experiments on PASCAL-5$^i$ and COCO-20$^i$ clearly demonstrate the superiority of our proposal. At no expense of inference efficiency, our model achieves state-of-the-art results in both 1-shot and 5-shot settings for few-shot semantic segmentation.
Submitted 28 April, 2021; v1 submitted 19 April, 2021;
originally announced April 2021.
-
Model LineUpper: Supporting Interactive Model Comparison at Multiple Levels for AutoML
Authors:
Shweta Narkar,
Yunfeng Zhang,
Q. Vera Liao,
Dakuo Wang,
Justin D Weisz
Abstract:
Automated Machine Learning (AutoML) is a rapidly growing set of technologies that automate the model development pipeline by searching model space and generating candidate models. A critical, final step of AutoML is human selection of a final model from dozens of candidates. In current AutoML systems, selection is supported only by performance metrics. Prior work has shown that in practice, people evaluate ML models based on additional criteria, such as the way a model makes predictions. Comparison may happen at multiple levels, from types of errors, to feature importance, to how the model makes predictions for specific instances. We developed Model LineUpper to support interactive model comparison for AutoML by integrating multiple Explainable AI (XAI) and visualization techniques. We conducted a user study in which we both evaluated the system and used it as a technology probe to understand how users perform model comparison in an AutoML system. We discuss design implications for utilizing XAI techniques for model comparison and supporting the unique needs of data scientists in comparing AutoML models.
Submitted 9 April, 2021;
originally announced April 2021.
-
Question-Driven Design Process for Explainable AI User Experiences
Authors:
Q. Vera Liao,
Milena Pribić,
Jaesik Han,
Sarah Miller,
Daby Sow
Abstract:
A pervasive design issue of AI systems is their explainability--how to provide appropriate information to help users understand the AI. The technical field of explainable AI (XAI) has produced a rich toolbox of techniques. Designers are now tasked with the challenges of how to select the most suitable XAI techniques and translate them into UX solutions. Informed by our previous work studying design challenges around XAI UX, this work proposes a design process to tackle these challenges. We review our and related prior work to identify requirements that the process should fulfill, and accordingly, propose a Question-Driven Design Process that grounds the user needs, choices of XAI techniques, design, and evaluation of XAI UX all in the user questions. We provide a mapping guide between prototypical user questions and exemplars of XAI techniques to reframe the technical space of XAI, also serving as boundary objects to support collaboration between designers and AI engineers. We demonstrate it with a use case of designing XAI for healthcare adverse events prediction, and discuss lessons learned for tackling design challenges of AI systems.
Submitted 3 September, 2021; v1 submitted 7 April, 2021;
originally announced April 2021.
-
A Multistakeholder Approach Towards Evaluating AI Transparency Mechanisms
Authors:
Ana Lucic,
Madhulika Srikumar,
Umang Bhatt,
Alice Xiang,
Ankur Taly,
Q. Vera Liao,
Maarten de Rijke
Abstract:
Given that there are a variety of stakeholders involved in, and affected by, decisions from machine learning (ML) models, it is important to consider that different stakeholders have different transparency needs. Previous work found that the majority of deployed transparency mechanisms primarily serve technical stakeholders. In our work, we want to investigate how well transparency mechanisms might work in practice for a more diverse set of stakeholders by conducting a large-scale, mixed-methods user study across a range of organizations, within a particular industry such as health care, criminal justice, or content moderation. In this paper, we outline the setup for our study.
Submitted 1 June, 2021; v1 submitted 27 March, 2021;
originally announced March 2021.
-
Adaptive deep density approximation for Fokker-Planck equations
Authors:
Kejun Tang,
Xiaoliang Wan,
Qifeng Liao
Abstract:
In this paper we present an adaptive deep density approximation strategy based on KRnet (ADDA-KR) for solving the steady-state Fokker-Planck (F-P) equations. F-P equations are usually high-dimensional and defined on an unbounded domain, which limits the application of traditional grid based numerical methods. With the Knothe-Rosenblatt rearrangement, our newly proposed flow-based generative model, called KRnet, provides a family of probability density functions to serve as effective solution candidates for the Fokker-Planck equations, which has a weaker dependence on dimensionality than traditional computational approaches and can efficiently estimate general high-dimensional density functions. To obtain effective stochastic collocation points for the approximation of the F-P equation, we develop an adaptive sampling procedure, where samples are generated iteratively using the approximate density function at each iteration. We present a general framework of ADDA-KR, validate its accuracy and demonstrate its efficiency with numerical experiments.
Submitted 15 December, 2021; v1 submitted 20 March, 2021;
originally announced March 2021.
-
Double Asymptotic Structures of Topologically Interlocked Molecules
Authors:
Jiang-Tao Li,
Fang Gu,
Ning Yao,
Hai-Jun Wang,
Qi Liao
Abstract:
The mean square size of topologically interlocked molecules (TIMs) is presented as a linear combination of contributions from the backbone and subcomponents. Using scaling analyses and extensive molecular dynamics simulations of polycatenanes, as a typical example of TIMs, we show that the effective exponent $ν(m)$ for the size dependence of the backbone on the monomer number of subcomponent $m$ is asymptotic to a value $ν$ (approximately 0.588 in good solvents) with a correction of $m^{-0.47}$, which is the same as for the covalently linked polymer. However, the effective exponent for the size dependence of subcomponents on $m$ is asymptotic to the same value $ν$ but with a new correction of $m^{-1.0}$. The different corrections to the scaling on the backbone and subcomponent structure induce a surprising double asymptotic behavior for the architecture of the TIMs. The scaling model that takes into account the double asymptotic behavior is in good quantitative agreement with the simulation result that the effective exponent for the size dependence of TIMs on $m$ increases with the subcomponent number $n$. The full scaling functional form of the size dependence on $m$ and $n$ for polycatenanes in a good solvent is well described by a simple sum of two limiting behaviors with different corrections.
Submitted 11 March, 2021;
originally announced March 2021.
-
A region-based descriptor network for uniformly sampled keypoints
Authors:
Kai Lv,
Zongqing Lu,
Qingmin Liao
Abstract:
Matching keypoint pairs of different images is a basic task of computer vision. Most methods require customized extremum point schemes to obtain the coordinates of feature points with high confidence. These schemes often need complex algorithmic design or a network that is harder to train, and they ignore the possibility that flat regions can be used as candidate regions of matching points. In this paper, we design a region-based descriptor by combining the context features of a deep network. The new descriptor can give a robust representation of a point even in flat regions. With the new descriptor, we can obtain more high-confidence matching points without an extremum operation. The experimental results show that our proposed method achieves performance comparable to the state of the art.
Submitted 26 January, 2021;
originally announced March 2021.
-
Realization of exciton-mediated optical spin-orbit interaction in organic microcrystalline resonators
Authors:
Jiahuan Ren,
Qing Liao,
Xuekai Ma,
Stefan Schumacher,
Jiannian Yao,
Hongbing Fu
Abstract:
The ability to control the spin-orbit interaction of light in optical microresonators is of fundamental importance for future photonics. Organic microcrystals, due to their giant optical anisotropy, play a crucial role in spin-optics and topological photonics. Here we realize controllable and wavelength-dependent Rashba-Dresselhaus spin-orbit interaction, attributed to the anisotropic excitonic response in an optical microcavity filled with an organic microcrystal. We also investigate the transition of the spin-orbit interaction from the dominant photonic type, caused by the splitting of the transverse-electric and transverse-magnetic modes, to spin-orbit interaction of the Rashba-Dresselhaus type. The interplay of the two allows us to engineer the spin-orbit interaction of light in organic microcavities, which, besides its fundamental interest, promises applications in spin-controlled on-chip integrated nanophotonic elements, towards exploiting non-magnetic and low-cost spin-photonic devices.
Submitted 24 February, 2021;
originally announced February 2021.
-
Task-oriented Communication Design in Cyber-Physical Systems: A Survey on Theory and Applications
Authors:
Arsham Mostaani,
Thang X. Vu,
Shree Krishna Sharma,
Van-Dinh Nguyen,
Qi Liao,
Symeon Chatzinotas
Abstract:
Communications system design has been traditionally guided by task-agnostic principles, which aim at efficiently transmitting as many correct bits as possible through a given channel. However, in the era of cyber-physical systems, the effectiveness of communications is not dictated simply by the bit rate, but most importantly by the efficient completion of the task at hand, e.g., remotely controlling a robot, automating a production line or collaboratively sensing through a drone swarm. In parallel, it is projected that by 2023, half of the worldwide network connections will be among machines rather than humans. In this context, it is crucial to establish a new paradigm for designing communications strategies for multi-agent cyber-physical systems. This is a daunting task, since it requires a combination of principles from information theory, communication theory, control theory and computer science in order to formalize a general framework for task-oriented communication design. In this direction, this paper reviews and structures the relevant theoretical work across a wide range of scientific communities. Subsequently, it proposes a general conceptual framework for task-oriented communication design, along with its specializations according to the targeted use case. Furthermore, it provides a survey of relevant contributions in dominant applications, such as industrial internet of things, multi-UAV systems, tactile internet, autonomous vehicles, distributed learning systems, smart manufacturing plants and 5G and beyond self-organizing networks. Finally, it highlights the most important open research topics from both the theoretical framework and application points of view.
Submitted 25 May, 2023; v1 submitted 14 February, 2021;
originally announced February 2021.
-
Facilitating Knowledge Sharing from Domain Experts to Data Scientists for Building NLP Models
Authors:
Soya Park,
April Wang,
Ban Kawas,
Q. Vera Liao,
David Piorkowski,
Marina Danilevsky
Abstract:
Data scientists face a steep learning curve in understanding a new domain for which they want to build machine learning (ML) models. While input from domain experts could offer valuable help, such input is often limited, expensive, and generally not in a form readily consumable by a model development pipeline. In this paper, we propose Ziva, a framework to guide domain experts in sharing essential domain knowledge with data scientists for building NLP models. With Ziva, experts are able to distill and share their domain knowledge using domain concept extractors and five types of label justification over a representative data sample. The design of Ziva is informed by preliminary interviews with data scientists, conducted to understand current practices in the domain knowledge acquisition process for ML development projects. To assess our design, we run a mixed-methods case study to evaluate how Ziva can facilitate interaction between domain experts and data scientists. Our results highlight that (1) domain experts are able to use Ziva to provide rich domain knowledge, while maintaining low mental load and stress levels; and (2) data scientists find Ziva's output helpful for learning essential information about the domain, offering scalability of information, and lowering the burden on domain experts to share knowledge. We conclude this work by experimenting with building NLP models using the Ziva output from our case study.
Submitted 29 January, 2021;
originally announced February 2021.
-
Rheological similarities between dense self-propelled and sheared particulate systems
Authors:
Ruoyang Mo,
Qinyi Liao,
Ning Xu
Abstract:
Unlike previous models of self-propelled particles, we develop a method to propel the particles with a constant average velocity instead of a constant force. This constant propulsion velocity (CPV) approach is validated by its agreement with the conventional constant propulsion force (CPF) approach in the flowing regime. However, the CPV approach has the advantage of accessing quasistatic flows of yield stress fluids with a vanishing propulsion velocity, which the CPF approach is usually unable to do because of finite system size. Exploiting this advantage, we realize cyclic self-propulsion and study the evolution of the propulsion force with propelled particle displacement, both in the quasistatic flow regime. By mapping shear stress and shear rate to propulsion force and propulsion velocity, we find that self-propelled systems exhibit rheological behaviors similar to those of sheared systems, including the yield force gap between the CPF and CPV approaches, propulsion force overshoot, reversible-irreversible transition under cyclic propulsion, and propulsion bands in plastic flows. These similarities suggest underlying connections between self-propulsion and shear, although they act on systems in different ways.
Submitted 25 January, 2021;
originally announced January 2021.
-
Strong coupling between excitons and magnetic dipole quasi-bound states in the continuum in WS$_2$-TiO$_2$ hybrid metasurfaces
Authors:
Meibao Qin,
Shuyuan Xiao,
Wenxing Liu,
Mingyu Ouyang,
Tianbao Yu,
Tongbiao Wang,
Qinghua Liao
Abstract:
Enhancing the light-matter interactions in two-dimensional materials via optical metasurfaces has attracted much attention due to its potential to enable breakthroughs in advanced compact photonic and quantum information devices. Here, we theoretically investigate strong coupling between excitons in monolayer WS$_2$ and quasi-bound states in the continuum (quasi-BIC). In the hybrid structure composed of WS$_2$ coupled with asymmetric titanium dioxide nanobars, a remarkable spectral splitting and typical anticrossing behavior of the Rabi splitting can be observed, and this strong coupling effect can be modulated by shaping the thickness and asymmetry parameter of the proposed metasurfaces. It is found that the balance between the line width of the quasi-BIC mode and the local electric field enhancement should be considered, since both affect the strong coupling; this is crucial to the design and optimization of metasurface devices. This work provides a promising way of controlling light-matter interactions in the strong coupling regime and opens the door to future novel quantum, low-energy, distinctive nanodevices through advanced meta-optical engineering.
Submitted 16 January, 2021;
originally announced January 2021.
-
Expanding Explainability: Towards Social Transparency in AI systems
Authors:
Upol Ehsan,
Q. Vera Liao,
Michael Muller,
Mark O. Riedl,
Justin D. Weisz
Abstract:
As AI-powered systems increasingly mediate consequential decision-making, their explainability is critical for end-users to take informed and accountable actions. Explanations in human-human interactions are socially-situated. AI systems are often socio-organizationally embedded. However, Explainable AI (XAI) approaches have been predominantly algorithm-centered. We take a developmental step towards socially-situated XAI by introducing and exploring Social Transparency (ST), a sociotechnically informed perspective that incorporates the socio-organizational context into explaining AI-mediated decision-making. To explore ST conceptually, we conducted interviews with 29 AI users and practitioners grounded in a speculative design scenario. We suggested constitutive design elements of ST and developed a conceptual framework to unpack ST's effect and implications at the technical, decision-making, and organizational level. The framework showcases how ST can potentially calibrate trust in AI, improve decision-making, facilitate organizational collective actions, and cultivate holistic explainability. Our work contributes to the discourse of Human-Centered XAI by expanding the design space of XAI.
Submitted 12 January, 2021;
originally announced January 2021.
-
How Much Automation Does a Data Scientist Want?
Authors:
Dakuo Wang,
Q. Vera Liao,
Yunfeng Zhang,
Udayan Khurana,
Horst Samulowitz,
Soya Park,
Michael Muller,
Lisa Amini
Abstract:
Data science and machine learning (DS/ML) are at the heart of the recent advancements of many Artificial Intelligence (AI) applications. There is an active research thread in AI, AutoAI, that aims to develop systems for automating the end-to-end DS/ML lifecycle. However, do DS and ML workers really want to automate their DS/ML workflow? To answer this question, we first synthesize a human-centered AutoML framework with 6 User Role/Personas, 10 Stages and 43 Sub-Tasks, 5 Levels of Automation, and 5 Types of Explanation, through reviewing research literature and marketing reports. Secondly, we use the framework to guide the design of an online survey study with 217 DS/ML workers who had varying degrees of experience, and different user roles "matching" our 6 roles/personas. We found that different user personas participated in distinct stages of the lifecycle -- but not all stages. Their desired levels of automation and types of explanation for AutoML also varied significantly depending on the DS/ML stage and the user persona. Based on the survey results, we argue there is no rationale from user needs for complete automation of the end-to-end DS/ML lifecycle. We propose new next steps for user-controlled DS/ML automation.
Submitted 6 January, 2021;
originally announced January 2021.
-
Explicit regularization and implicit bias in deep network classifiers trained with the square loss
Authors:
Tomaso Poggio,
Qianli Liao
Abstract:
Deep ReLU networks trained with the square loss have been observed to perform well in classification tasks. We provide here a theoretical justification based on analysis of the associated gradient flow. We show that convergence to a solution with the absolute minimum norm is expected when normalization techniques such as Batch Normalization (BN) or Weight Normalization (WN) are used together with Weight Decay (WD). The main property of the minimizers that bounds their expected error is the norm: we prove that among all the close-to-interpolating solutions, the ones associated with smaller Frobenius norms of the unnormalized weight matrices have better margin and better bounds on the expected classification error. With BN but in the absence of WD, the dynamical system is singular. Implicit dynamical regularization -- that is zero-initial conditions biasing the dynamics towards high margin solutions -- is also possible in the no-BN and no-WD case. The theory yields several predictions, including the role of BN and weight decay, aspects of Papyan, Han and Donoho's Neural Collapse and the constraints induced by BN on the network weights.
Submitted 31 December, 2020;
originally announced January 2021.
-
Exploiting Shared Knowledge from Non-COVID Lesions for Annotation-Efficient COVID-19 CT Lung Infection Segmentation
Authors:
Yichi Zhang,
Qingcheng Liao,
Lin Yuan,
He Zhu,
Jiezhen Xing,
Jicong Zhang
Abstract:
The novel Coronavirus disease (COVID-19) is a highly contagious virus and has spread all over the world, posing an extremely serious threat to all countries. Automatic lung infection segmentation from computed tomography (CT) plays an important role in the quantitative analysis of COVID-19. However, the major challenge lies in the inadequacy of annotated COVID-19 datasets. Currently, there are several public non-COVID lung lesion segmentation datasets, providing the potential for generalizing useful information to the related COVID-19 segmentation task. In this paper, we propose a novel relation-driven collaborative learning model to exploit shared knowledge from non-COVID lesions for annotation-efficient COVID-19 CT lung infection segmentation. The model consists of a general encoder to capture general lung lesion features based on multiple non-COVID lesions, and a target encoder to focus on task-specific features based on COVID-19 infections. Features extracted from the two parallel encoders are concatenated for the subsequent decoder part. We develop a collaborative learning scheme to regularize feature-level relation consistency of given input and encourage the model to learn more general and discriminative representation of COVID-19 infections. Extensive experiments demonstrate that trained with limited COVID-19 data, exploiting shared knowledge from non-COVID lesions can further improve state-of-the-art performance with up to 3.0% in dice similarity coefficient and 4.2% in normalized surface dice. Our proposed method promotes new insights into annotation-efficient deep learning for COVID-19 infection segmentation and illustrates strong potential for real-world applications in the global fight against COVID-19 in the absence of sufficient high-quality annotations.
Submitted 27 July, 2021; v1 submitted 31 December, 2020;
originally announced December 2020.
-
Experimental measurement of the divergent quantum metric of an exceptional point
Authors:
Qing Liao,
Charly Leblanc,
Jiahuan Ren,
Feng Li,
Yiming Li,
Dmitry Solnyshkov,
Guillaume Malpuech,
Jiannian Yao,
Hongbing Fu
Abstract:
The geometry of Hamiltonian's eigenstates is encoded in the quantum geometric tensor (QGT). It contains both the Berry curvature, central to the description of topological matter and the quantum metric. So far the full QGT has been measured only in Hermitian systems, where the role of the quantum metric is mostly shown to determine corrections to physical effects. On the contrary, in non-Hermitian systems, and in particular near exceptional points, the quantum metric is expected to diverge and to often play a dominant role, for example on the enhanced sensing and on wave packet dynamics. In this work, we report the first experimental measurement of the quantum metric in a non-Hermitian system. The specific platform under study is an organic microcavity with exciton-polariton eigenstates, which demonstrate exceptional points. We measure the quantum metric's divergence and we determine the scaling exponent $n=-1.01\pm0.08$, which is in agreement with theoretical predictions for the second-order exceptional points.
Submitted 24 November, 2020;
originally announced November 2020.
-
Uncertainty as a Form of Transparency: Measuring, Communicating, and Using Uncertainty
Authors:
Umang Bhatt,
Javier Antorán,
Yunfeng Zhang,
Q. Vera Liao,
Prasanna Sattigeri,
Riccardo Fogliato,
Gabrielle Gauthier Melançon,
Ranganath Krishnan,
Jason Stanley,
Omesh Tickoo,
Lama Nachman,
Rumi Chunara,
Madhulika Srikumar,
Adrian Weller,
Alice Xiang
Abstract:
Algorithmic transparency entails exposing system properties to various stakeholders for purposes that include understanding, improving, and contesting predictions. Until now, most research into algorithmic transparency has predominantly focused on explainability. Explainability attempts to provide reasons for a machine learning model's behavior to stakeholders. However, understanding a model's specific behavior alone might not be enough for stakeholders to gauge whether the model is wrong or lacks sufficient knowledge to solve the task at hand. In this paper, we argue for considering a complementary form of transparency by estimating and communicating the uncertainty associated with model predictions. First, we discuss methods for assessing uncertainty. Then, we characterize how uncertainty can be used to mitigate model unfairness, augment decision-making, and build trustworthy systems. Finally, we outline methods for displaying uncertainty to stakeholders and recommend how to collect information required for incorporating uncertainty into existing ML pipelines. This work constitutes an interdisciplinary review drawn from literature spanning machine learning, visualization/HCI, design, decision-making, and fairness. We aim to encourage researchers and practitioners to measure, communicate, and use uncertainty as a form of transparency.
Submitted 4 May, 2021; v1 submitted 15 November, 2020;
originally announced November 2020.
-
Fast Local Attack: Generating Local Adversarial Examples for Object Detectors
Authors:
Quanyu Liao,
Xin Wang,
Bin Kong,
Siwei Lyu,
Youbing Yin,
Qi Song,
Xi Wu
Abstract:
Deep neural networks are vulnerable to adversarial examples: adding imperceptible adversarial perturbations to images is enough to make them fail. Most existing research focuses on attacking image classifiers or anchor-based object detectors, but these attacks generate global perturbations over the whole image, which is unnecessary. In our work, we leverage higher-level semantic information to generate highly aggressive local perturbations for anchor-free object detectors. As a result, our method is less computationally intensive and achieves higher black-box attack and transfer attack performance. The adversarial examples generated by our method are not only capable of attacking anchor-free object detectors, but can also be transferred to attack anchor-based object detectors.
Submitted 27 October, 2020;
originally announced October 2020.
-
Tensor Train Random Projection
Authors:
Yani Feng,
Kejun Tang,
Lianxing He,
Pingqiang Zhou,
Qifeng Liao
Abstract:
This work proposes a novel tensor train random projection (TTRP) method for dimension reduction, where pairwise distances can be approximately preserved. Our TTRP is systematically constructed through a tensor train (TT) representation with TT-ranks equal to one. Based on the tensor train format, this new random projection method can speed up the dimension reduction procedure for high-dimensional datasets and requires less storage costs with little loss in accuracy, compared with existing methods. We provide a theoretical analysis of the bias and the variance of TTRP, which shows that this approach is an expected isometric projection with bounded variance, and we show that the Rademacher distribution is an optimal choice for generating the corresponding TT-cores. Detailed numerical experiments with synthetic datasets and the MNIST dataset are conducted to demonstrate the efficiency of TTRP.
Submitted 20 October, 2021; v1 submitted 21 October, 2020;
originally announced October 2020.
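A rough illustration of the idea in the TTRP abstract above, not the authors' implementation: a tensor train with all TT-ranks equal to one is a Kronecker product of small matrices, so the projection can be applied one mode at a time without ever forming the full matrix. The sketch assumes each core is an m_k x n_k matrix with i.i.d. Rademacher (+1/-1) entries and that the map is scaled by 1/sqrt(M), where M is the product of the m_k; the function name `ttrp_rank_one` is hypothetical.

```python
import numpy as np

def ttrp_rank_one(x, cores):
    """Apply (1/sqrt(M)) * (R_1 kron ... kron R_d) to x without building the full matrix.

    cores: list of 2-D arrays, cores[k] has shape (m_k, n_k)  (assumed layout)
    x:     1-D array of length prod(n_k)
    """
    in_dims = [c.shape[1] for c in cores]
    out_dims = [c.shape[0] for c in cores]
    # View x as a d-way tensor (C order matches np.kron's index ordering).
    t = np.asarray(x).reshape(in_dims)
    for k, core in enumerate(cores):
        # Contract mode k with the k-th core, then move the new axis back to position k.
        t = np.moveaxis(np.tensordot(core, t, axes=(1, k)), 0, k)
    return t.reshape(-1) / np.sqrt(np.prod(out_dims))

# Usage: project a 60-dimensional vector down to 12 dimensions.
rng = np.random.default_rng(0)
shapes = [(2, 4), (3, 5), (2, 3)]  # (m_k, n_k) per core, chosen only for illustration
cores = [rng.choice([-1.0, 1.0], size=s) for s in shapes]
x = rng.standard_normal(60)
y = ttrp_rank_one(x, cores)
```

Only the small cores are stored (8 + 15 + 6 entries here instead of 12 x 60), which is where the storage saving mentioned in the abstract would come from under these assumptions.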
-
Bridging 2D and 3D Segmentation Networks for Computation Efficient Volumetric Medical Image Segmentation: An Empirical Study of 2.5D Solutions
Authors:
Yichi Zhang,
Qingcheng Liao,
Le Ding,
Jicong Zhang
Abstract:
Recently, deep convolutional neural networks have achieved great success for medical image segmentation. However, unlike segmentation of natural images, most medical images such as MRI and CT are volumetric data. In order to make full use of volumetric information, 3D CNNs are widely used. However, 3D CNNs suffer from higher inference time and computation cost, which hinders their further clinical applications. Additionally, with the increased number of parameters, the risk of overfitting is higher, especially for medical images where data and annotations are expensive to acquire. To address this problem, many 2.5D segmentation methods have been proposed to make use of volumetric spatial information with less computation cost. Although these works lead to improvements on a variety of segmentation tasks, to the best of our knowledge, there has not previously been a large-scale empirical comparison of these methods. In this paper, we aim to present a review of the latest developments of 2.5D methods for volumetric medical image segmentation. Additionally, to compare the performance and effectiveness of these methods, we provide an empirical study of these methods on three representative segmentation tasks involving different modalities and targets. Our experimental results highlight that 3D CNNs may not always be the best choice. Although all these 2.5D methods can bring performance gains to the 2D baseline, not all of them retain these benefits across different datasets. We hope the results and conclusions of our study will prove useful for the community in exploring and developing efficient volumetric medical image segmentation methods.
Submitted 7 February, 2022; v1 submitted 13 October, 2020;
originally announced October 2020.
-
Quantum metric and wavepackets at exceptional points in non-Hermitian systems
Authors:
D. D. Solnyshkov,
C. Leblanc,
L. Bessonart,
A. Nalitov,
J. Ren,
Q. Liao,
F. Li,
G. Malpuech
Abstract:
The usual concepts of topological physics, such as the Berry curvature, cannot be applied directly to non-Hermitian systems. We show that another object, the quantum metric, which often plays a secondary role in Hermitian systems, becomes a crucial quantity near exceptional points in non-Hermitian systems, where it diverges in a way that fully controls the description of wavepacket trajectories. The quantum metric behaviour is responsible for a constant acceleration with a fixed direction, and for a non-vanishing constant velocity with a controllable direction. Both contributions are independent of the wavepacket size.
Submitted 15 September, 2020;
originally announced September 2020.
-
Attention Cube Network for Image Restoration
Authors:
Yucheng Hang,
Qingmin Liao,
Wenming Yang,
Yupeng Chen,
Jie Zhou
Abstract:
Recently, deep convolutional neural networks (CNNs) have been widely used in image restoration and have achieved great success. However, most existing methods are limited to a local receptive field and to equal treatment of different types of information. Besides, existing methods always use a multi-supervised method to aggregate different feature maps, which cannot effectively aggregate hierarchical feature information. To address these issues, we propose an attention cube network (A-CubeNet) for image restoration for more powerful feature expression and feature correlation learning. Specifically, we design a novel attention mechanism from three dimensions, namely spatial dimension, channel-wise dimension and hierarchical dimension. The adaptive spatial attention branch (ASAB) and the adaptive channel attention branch (ACAB) constitute the adaptive dual attention module (ADAM), which can capture the long-range spatial and channel-wise contextual information to expand the receptive field and distinguish different types of information for more effective feature representations. Furthermore, the adaptive hierarchical attention module (AHAM) can capture the long-range hierarchical contextual information to flexibly aggregate different feature maps by weights depending on the global context. The ADAM and AHAM cooperate to form an "attention in attention" structure, which means AHAM's inputs are enhanced by ASAB and ACAB. Experiments demonstrate the superiority of our method over state-of-the-art image restoration methods in both quantitative comparison and visual analysis. Code is available at https://github.com/YCHang686/A-CubeNet.
Submitted 24 January, 2021; v1 submitted 12 September, 2020;
originally announced September 2020.
-
Active Learning++: Incorporating Annotator's Rationale using Local Model Explanation
Authors:
Bhavya Ghai,
Q. Vera Liao,
Yunfeng Zhang,
Klaus Mueller
Abstract:
We propose a new active learning (AL) framework, Active Learning++, which can utilize an annotator's labels as well as its rationale. Annotators can provide their rationale for choosing a label by ranking input features based on their importance for a given query. To incorporate this additional input, we modified the disagreement measure for a bagging-based Query by Committee (QBC) sampling strategy. Instead of weighing all committee models equally to select the next instance, we assign higher weight to the committee model with higher agreement with the annotator's ranking. Specifically, we generated a feature importance-based local explanation for each committee model. The similarity score between feature rankings provided by the annotator and the local model explanation is used to assign a weight to each corresponding committee model. This approach is applicable to any kind of ML model using model-agnostic techniques to generate local explanation such as LIME. With a simulation study, we show that our framework significantly outperforms a QBC based vanilla AL framework.
Submitted 6 September, 2020;
originally announced September 2020.
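The weighting scheme described in the Active Learning++ abstract above can be sketched roughly as follows. The abstract does not say which similarity score or disagreement measure is used, so this example assumes Spearman rank correlation rescaled to [0, 1] and a weighted vote entropy; both choices and the function names are illustrative assumptions, not the paper's method.

```python
import numpy as np

def spearman_similarity(rank_a, rank_b):
    """Spearman correlation between two tie-free rankings, rescaled from [-1, 1] to [0, 1]."""
    a = np.asarray(rank_a, dtype=float)
    b = np.asarray(rank_b, dtype=float)
    n = a.size
    rho = 1.0 - 6.0 * np.sum((a - b) ** 2) / (n * (n ** 2 - 1))
    return (rho + 1.0) / 2.0

def weighted_vote_entropy(votes, weights, n_classes):
    """Committee disagreement where each member's vote counts in proportion to its weight.

    Assumes at least one weight is positive.
    """
    w = np.asarray(weights, dtype=float)
    w = w / w.sum()
    # Weighted share of the committee voting for each class.
    p = np.bincount(np.asarray(votes), weights=w, minlength=n_classes)
    p = p[p > 0]
    return float(-(p * np.log(p)).sum())

# Usage: three committee models, one annotator ranking over four features.
annotator_rank = [1, 2, 3, 4]
model_ranks = [[1, 2, 3, 4], [2, 1, 3, 4], [4, 3, 2, 1]]  # hypothetical local explanations
weights = [spearman_similarity(annotator_rank, r) for r in model_ranks]
votes = np.array([0, 0, 1])  # predicted label per committee model for one candidate instance
score = weighted_vote_entropy(votes, weights, n_classes=2)
```

In this toy case the third model ranks the features in exactly the opposite order to the annotator, so its weight is zero and its dissenting vote no longer contributes to the disagreement score.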
-
Circumventing spin glass traps by microcanonical spontaneous symmetry breaking
Authors:
Hai-Jun Zhou,
Qinyi Liao
Abstract:
The planted p-spin interaction model is a paradigm of random-graph systems possessing both a ferromagnetic phase and a disordered phase, with the latter splitting into many spin glass states at low temperatures. Conventional simulated annealing dynamics is easily blocked by these low-energy spin glass states. Here we demonstrate that this planted system is in fact exponentially dominated by a microcanonical polarized phase at intermediate energy densities. There is a discontinuous microcanonical spontaneous symmetry breaking transition from the paramagnetic phase to the microcanonical polarized phase. This transition can serve as a mechanism to avoid all the spin glass traps, and it is accelerated by the restart strategy of microcanonical random walk. We also propose an unsupervised learning problem on microcanonically sampled configurations for inferring the planted ground state.
Submitted 6 April, 2021; v1 submitted 1 July, 2020;
originally announced July 2020.
-
Hierarchically Compositional Tasks and Deep Convolutional Networks
Authors:
Arturo Deza,
Qianli Liao,
Andrzej Banburski,
Tomaso Poggio
Abstract:
The main success stories of deep learning, starting with ImageNet, depend on deep convolutional networks, which on certain tasks perform significantly better than traditional shallow classifiers, such as support vector machines, and also better than deep fully connected networks; but what is so special about deep convolutional networks? Recent results in approximation theory proved an exponential advantage of deep convolutional networks with or without shared weights in approximating functions with hierarchical locality in their compositional structure. More recently, the hierarchical structure was proved to be hard to learn from data, suggesting that it is a powerful prior embedded in the architecture of the network. These mathematical results, however, do not say which real-life tasks correspond to input-output functions with hierarchical locality. To evaluate this, we consider a set of visual tasks where we disrupt the local organization of images via "deterministic scrambling" to later perform a visual task on these images structurally-altered in the same way for training and testing. For object recognition we find, as expected, that scrambling does not affect the performance of shallow or deep fully connected networks contrary to the out-performance of convolutional networks. Not all tasks involving images are however affected. Texture perception and global color estimation are much less sensitive to deterministic scrambling showing that the underlying functions corresponding to these tasks are not hierarchically local; and also counter-intuitively showing that these tasks are better approximated by networks that are not deep (texture) nor convolutional (color). Altogether, these results shed light into the importance of matching a network architecture with its embedded prior of the task to be learned.
Submitted 25 March, 2021; v1 submitted 24 June, 2020;
originally announced June 2020.
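One simple way to picture the "deterministic scrambling" mentioned in the abstract above is a single fixed pixel permutation applied identically to every training and test image. The abstract does not specify the scrambling scheme, so the permutation, the seed handling, and the name `make_scrambler` are assumptions made only for illustration.

```python
import numpy as np

def make_scrambler(height, width, seed=0):
    """Return a function that applies one fixed pixel permutation to image batches."""
    # The permutation is drawn once, so every image is scrambled in the same way.
    perm = np.random.default_rng(seed).permutation(height * width)

    def scramble(images):
        # images: array of shape (batch, height, width, channels)
        b, h, w, c = images.shape
        flat = images.reshape(b, h * w, c)
        return flat[:, perm, :].reshape(b, h, w, c)

    return scramble

# Usage: the same scrambler is used for training and test data.
scramble = make_scrambler(4, 4, seed=1)
train_batch = np.arange(2 * 4 * 4 * 3).reshape(2, 4, 4, 3)
scrambled = scramble(train_batch)
```

Such a permutation destroys spatial locality while keeping every pixel value, so a fully connected network sees an equivalent problem up to a relabeling of its inputs, whereas a convolutional network loses the local structure its architecture relies on.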
-
Review of Quadruped Robots for Dynamic Locomotion
Authors:
Qiayuan Liao
Abstract:
This review introduces quadruped robots: MITCheetah, HyQ, ANYmal, BigDog, and their mechanical structure, actuation, and control.
Submitted 25 September, 2022; v1 submitted 5 May, 2020;
originally announced May 2020.
-
Noise-Sampling Cross Entropy Loss: Improving Disparity Regression Via Cost Volume Aware Regularizer
Authors:
Yang Chen,
Zongqing Lu,
Xuechen Zhang,
Lei Chen,
Qingmin Liao
Abstract:
Recent end-to-end deep neural networks for disparity regression have achieved state-of-the-art performance. However, many well-acknowledged specific properties of disparity estimation are omitted in these deep learning algorithms. In particular, the matching cost volume, one of the most important procedures, is treated as a normal intermediate feature for the following softargmin regression, lacking explicit constraints compared with those of traditional algorithms. In this paper, inspired by the previous canonical definition of cost volume, we propose the noise-sampling cross entropy loss function to regularize the cost volume produced by deep neural networks to be unimodal and coherent. Extensive experiments validate that the proposed noise-sampling cross entropy loss can not only help neural networks learn a more informative cost volume, but also lead to better stereo matching performance compared with several representative algorithms.
Submitted 28 May, 2020; v1 submitted 18 May, 2020;
originally announced May 2020.
-
The Role of the Hercules Autonomous Vehicle During the COVID-19 Pandemic: An Autonomous Logistic Vehicle for Contactless Goods Transportation
Authors:
Tianyu Liu,
Qinghai Liao,
Lu Gan,
Fulong Ma,
Jie Cheng,
Xupeng Xie,
Zhe Wang,
Yingbing Chen,
Yilong Zhu,
Shuyang Zhang,
Zhengyong Chen,
Yang Liu,
Meng Xie,
Yang Yu,
Zitong Guo,
Guang Li,
Peidong Yuan,
Dong Han,
Yuying Chen,
Haoyang Ye,
Jianhao Jiao,
Peng Yun,
Zhenhua Xu,
Hengli Wang,
Huaiyang Huang
, et al. (6 additional authors not shown)
Abstract:
Since early 2020, the coronavirus disease 2019 (COVID-19) has spread rapidly across the world. As at the date of writing this article, the disease has been globally reported in 223 countries and regions, infected over 108 million people and caused over 2.4 million deaths (https://covid19.who.int/, accessed on Feb. 17, 2021). Avoiding person-to-person transmission is an effective approach to control and prevent the pandemic. However, many daily activities, such as transporting goods in our daily life, inevitably involve person-to-person contact. Using an autonomous logistic vehicle to achieve contact-less goods transportation could alleviate this issue. For example, it can reduce the risk of virus transmission between the driver and customers. Moreover, many countries have imposed tough lockdown measures to reduce the virus transmission (e.g., retail, catering) during the pandemic, which causes inconveniences for human daily life. Autonomous vehicle can deliver the goods bought by humans, so that humans can get the goods without going out. These demands motivate us to develop an autonomous vehicle, named as Hercules, for contact-less goods transportation during the COVID-19 pandemic. The vehicle is evaluated through real-world delivering tasks under various traffic conditions.
Submitted 16 February, 2021; v1 submitted 16 April, 2020;
originally announced April 2020.
-
Measuring Social Biases of Crowd Workers using Counterfactual Queries
Authors:
Bhavya Ghai,
Q. Vera Liao,
Yunfeng Zhang,
Klaus Mueller
Abstract:
Social biases based on gender, race, etc. have been shown to pollute machine learning (ML) pipeline predominantly via biased training datasets. Crowdsourcing, a popular cost-effective measure to gather labeled training datasets, is not immune to the inherent social biases of crowd workers. To ensure such social biases aren't passed onto the curated datasets, it's important to know how biased each crowd worker is. In this work, we propose a new method based on counterfactual fairness to quantify the degree of inherent social bias in each crowd worker. This extra information can be leveraged together with individual worker responses to curate a less biased dataset.
Submitted 4 April, 2020;
originally announced April 2020.
-
Real-MFF: A Large Realistic Multi-focus Image Dataset with Ground Truth
Authors:
Juncheng Zhang,
Qingmin Liao,
Shaojun Liu,
Haoyu Ma,
Wenming Yang,
Jing-Hao Xue
Abstract:
Multi-focus image fusion, a technique to generate an all-in-focus image from two or more partially-focused source images, can benefit many computer vision tasks. However, currently there is no large and realistic dataset to perform convincing evaluation and comparison of algorithms in multi-focus image fusion. Moreover, it is difficult to train a deep neural network for multi-focus image fusion without a suitable dataset. In this letter, we introduce a large and realistic multi-focus dataset called Real-MFF, which contains 710 pairs of source images with corresponding ground truth images. The dataset is generated by light field images, and both the source images and the ground truth images are realistic. To serve as both a well-established benchmark for existing multi-focus image fusion algorithms and an appropriate training dataset for future development of deep-learning-based methods, the dataset contains a variety of scenes, including buildings, plants, humans, shopping malls, squares and so on. We also evaluate 10 typical multi-focus algorithms on this dataset for the purpose of illustration.
Submitted 28 August, 2020; v1 submitted 28 March, 2020;
originally announced March 2020.
-
Category-wise Attack: Transferable Adversarial Examples for Anchor Free Object Detection
Authors:
Quanyu Liao,
Xin Wang,
Bin Kong,
Siwei Lyu,
Youbing Yin,
Qi Song,
Xi Wu
Abstract:
Deep neural networks have been demonstrated to be vulnerable to adversarial attacks: subtle perturbations can completely change the classification results. Their vulnerability has led to a surge of research in this direction. However, most works are dedicated to attacking anchor-based object detection models. In this work, we aim to present an effective and efficient algorithm to generate adversarial examples to attack anchor-free object detection models based on two approaches. First, we conduct category-wise instead of instance-wise attacks on the object detectors. Second, we leverage the high-level semantic information to generate the adversarial examples. Surprisingly, the generated adversarial examples are not only able to effectively attack the targeted anchor-free object detector but can also be transferred to attack other object detectors, even anchor-based detectors such as Faster R-CNN.
Submitted 22 June, 2020; v1 submitted 9 February, 2020;
originally announced March 2020.
-
CUBE -- Towards an Optimal Scaling of Cosmological N-body Simulations
Authors:
Shenggan Cheng,
Hao-Ran Yu,
Derek Inman,
Qiucheng Liao,
Qiaoya Wu,
James Lin
Abstract:
N-body simulations are essential tools in physical cosmology to understand the large-scale structure (LSS) formation of the Universe. Large-scale simulations with high resolution are important for exploring the substructure of the universe and for determining fundamental physical parameters like neutrino mass. However, traditional particle-mesh (PM) based algorithms use considerable amounts of memory, which limits the scalability of simulations. Therefore, we designed a two-level PM algorithm CUBE towards optimal performance in memory consumption reduction. By using the fixed-point compression technique, CUBE reduces the memory consumption per N-body particle toward 6 bytes, an order of magnitude lower than the traditional PM-based algorithms. We scaled CUBE to 512 nodes (20,480 cores) on an Intel Cascade Lake based supercomputer with $\simeq$95\% weak-scaling efficiency. This scaling test was performed in "Cosmo-$π$" -- a cosmological LSS simulation using $\simeq$4.4 trillion particles, tracing the evolution of the universe over $\simeq$13.7 billion years. To the best of our knowledge, Cosmo-$π$ is the largest completed cosmological N-body simulation. We believe CUBE has huge potential to scale on exascale supercomputers for larger simulations.
Submitted 9 March, 2020;
originally announced March 2020.
-
XSepConv: Extremely Separated Convolution
Authors:
Jiarong Chen,
Zongqing Lu,
Jing-Hao Xue,
Qingmin Liao
Abstract:
Depthwise convolution has gradually become an indispensable operation for modern efficient neural networks, and larger kernel sizes ($\ge5$) have been applied to it recently. In this paper, we propose a novel extremely separated convolutional block (XSepConv), which fuses spatially separable convolutions into depthwise convolution to further reduce both the computational cost and parameter size of large kernels. Furthermore, an extra $2\times2$ depthwise convolution coupled with an improved symmetric padding strategy is employed to compensate for the side effect brought by spatially separable convolutions. XSepConv is designed to be an efficient alternative to vanilla depthwise convolution with large kernel sizes. To verify this, we use XSepConv for the state-of-the-art architecture MobileNetV3-Small and carry out extensive experiments on four highly competitive benchmark datasets (CIFAR-10, CIFAR-100, SVHN and Tiny-ImageNet) to demonstrate that XSepConv can indeed strike a better trade-off between accuracy and efficiency.
Submitted 27 February, 2020;
originally announced February 2020.
-
Explainable Active Learning (XAL): An Empirical Study of How Local Explanations Impact Annotator Experience
Authors:
Bhavya Ghai,
Q. Vera Liao,
Yunfeng Zhang,
Rachel Bellamy,
Klaus Mueller
Abstract:
The wide adoption of Machine Learning technologies has created a rapidly growing demand for people who can train ML models. Some have advocated the term "machine teacher" to refer to the role of people who inject domain knowledge into ML models. One promising learning paradigm is Active Learning (AL), by which the model intelligently selects instances to query the machine teacher for labels. However, in current AL settings, the human-AI interface remains minimal and opaque. We begin considering AI explanations as a core element of the human-AI interface for teaching machines. When a human student learns, it is a common pattern to present one's own reasoning and solicit feedback from the teacher. When an ML model learns and still makes mistakes, the human teacher should be able to understand the reasoning underlying the mistakes. When the model matures, the machine teacher should be able to recognize its progress in order to trust and feel confident about their teaching outcome. Toward this vision, we propose a novel paradigm of explainable active learning (XAL), by introducing techniques from the recently surging field of explainable AI (XAI) into an AL setting. We conducted an empirical study comparing the model learning outcomes, feedback content and experience with XAL, to that of traditional AL and coactive learning (providing the model's prediction without the explanation). Our study shows benefits of AI explanation as interfaces for machine teaching--supporting trust calibration and enabling rich forms of teaching feedback, and potential drawbacks--anchoring effect with the model judgment and cognitive workload. Our study also reveals important individual factors that mediate a machine teacher's reception to AI explanations, including task knowledge, AI experience and need for cognition. By reflecting on the results, we suggest future directions and design implications for XAL.
Submitted 30 September, 2020; v1 submitted 24 January, 2020;
originally announced January 2020.
-
Questioning the AI: Informing Design Practices for Explainable AI User Experiences
Authors:
Q. Vera Liao,
Daniel Gruen,
Sarah Miller
Abstract:
A surge of interest in explainable AI (XAI) has led to a vast collection of algorithmic work on the topic. While many recognize the necessity to incorporate explainability features in AI systems, how to address real-world user needs for understanding AI remains an open question. By interviewing 20 UX and design practitioners working on various AI products, we seek to identify gaps between the current XAI algorithmic work and practices to create explainable AI products. To do so, we develop an algorithm-informed XAI question bank in which user needs for explainability are represented as prototypical questions users might ask about the AI, and use it as a study probe. Our work contributes insights into the design space of XAI, informs efforts to support design practices in this space, and identifies opportunities for future XAI work. We also provide an extended XAI question bank and discuss how it can be used for creating user-centered XAI.
Submitted 3 September, 2021; v1 submitted 8 January, 2020;
originally announced January 2020.
-
Effect of Confidence and Explanation on Accuracy and Trust Calibration in AI-Assisted Decision Making
Authors:
Yunfeng Zhang,
Q. Vera Liao,
Rachel K. E. Bellamy
Abstract:
Today, AI is being increasingly used to help human experts make decisions in high-stakes scenarios. In these scenarios, full automation is often undesirable, not only due to the significance of the outcome, but also because human experts can draw on their domain knowledge complementary to the model's to ensure task success. We refer to these scenarios as AI-assisted decision making, where the individual strengths of the human and the AI come together to optimize the joint decision outcome. A key to their success is to appropriately calibrate human trust in the AI on a case-by-case basis; knowing when to trust or distrust the AI allows the human expert to appropriately apply their knowledge, improving decision outcomes in cases where the model is likely to perform poorly. This research conducts a case study of AI-assisted decision making in which humans and AI have comparable performance alone, and explores whether features that reveal case-specific model information can calibrate trust and improve the joint performance of the human and AI. Specifically, we study the effect of showing a confidence score and a local explanation for a particular prediction. Through two human experiments, we show that a confidence score can help calibrate people's trust in an AI model, but trust calibration alone is not sufficient to improve AI-assisted decision making, which may also depend on whether the human can bring in enough unique knowledge to complement the AI's errors. We also highlight the problems in using local explanation for AI-assisted decision making scenarios and invite the research community to explore new approaches to explainability for calibrating human trust in AI.
Submitted 7 January, 2020;
originally announced January 2020.
-
Two Generalizations for Quadratic Residue Codes over Finite Fields
Authors:
Qunying Liao,
Yuanbo Liu
Abstract:
It is well known that quadratic residue codes over finite fields form an interesting class of cyclic codes because of their high minimum distance. Let $g$ be a positive integer and let $p,p_{1},\ldots, p_{g}$ be distinct odd primes. The present paper generalizes the construction of quadratic residue codes of length $p$ to length $n=p_{1}\cdots p_{g}$, and to $m$-th residue codes of length $p$ over finite fields, where $m\geq 2$ is a positive integer. Furthermore, a criterion for these codes to be self-orthogonal or complementary dual is obtained, and the corresponding counting formulas are given. In particular, the minimum distances of all 24 quaternary quadratic residue codes $[15,8]$ are determined.
Submitted 7 January, 2020;
originally announced January 2020.
-
Tunable optical second-order sideband effects in a parity-time symmetric optomechanical system
Authors:
Xing Xiao,
Qinghong Liao,
Nanrun Zhou,
Wenjie Nie,
Yongchun Liu
Abstract:
We theoretically investigate the optical second-order sideband generation (OSSG) in an optical parity-time (PT) symmetric system, which consists of a passive cavity trapping the atomic ensemble and an active cavity. It is found that near the exceptional point (EP), the efficiency of the OSSG increases sharply not only for the blue probe-pump detuning resonant case but also for the red one. Using experimentally achievable parameters, we study the effect of the atomic ensemble on the efficiency of the OSSG. The numerical results show that the efficiency of the OSSG is 30% higher than that of the first-order sideband, which is realized easily by simultaneously modulating the atom-cavity coupling strength and detuning. Moreover, the efficiency of the OSSG can also be tuned effectively by the pump power, and the efficiency is robust when the pump power is strong enough. This study may provide some guidance for modulating the nonlinear optical properties and controlling light propagation, which may stimulate further applications in optical communications.
Submitted 8 January, 2020; v1 submitted 19 December, 2019;
originally announced December 2019.
-
Enabling Value Sensitive AI Systems through Participatory Design Fictions
Authors:
Q. Vera Liao,
Michael Muller
Abstract:
Two general routes have been followed to develop artificial agents that are sensitive to human values---a top-down approach to encode values into the agents, and a bottom-up approach to learn from human actions, whether from real-world interactions or stories. Although both approaches have made exciting scientific progress, they may face challenges when applied to the current development practices of AI systems, which require the understanding of the specific domains and specific stakeholders involved. In this work, we bring together perspectives from the human-computer interaction (HCI) community, where designing technologies sensitive to user values has been a longstanding focus. We highlight several well-established areas focusing on developing empirical methods for inquiring into user values. Based on these methods, we propose participatory design fictions to study user values involved in AI systems and present preliminary results from a case study. With this paper, we invite the consideration of user-centered value inquiry and value learning.
Submitted 12 December, 2019;
originally announced December 2019.
-
Nontrivial band geometry in an optically active system
Authors:
Jiahuan Ren,
Qing Liao,
Feng Li,
Yiming Li,
Olivier Bleu,
Guillaume Malpuech,
Jiannian Yao,
Hongbing Fu,
Dmitry Solnyshkov
Abstract:
Optical activity (OA), also called circular birefringence, has been known for two hundred years, but its applications for topological photonics remain unexplored. Unlike the Faraday effect, OA provokes rotation of the linear polarization of light without magnetic effects, thus preserving the time-reversal symmetry. Here, we report a direct measurement of the Berry curvature and quantum metric of the photonic modes of a planar cavity containing an optically active organic microcrystal (perylene). Photonic spin-orbit-coupling induced by the cavity results in the action of a non-Abelian gauge field on photons. The addition of high OA gives rise to geometrically non-trivial bands containing two gapped Dirac cones with opposite topological charges. This experiment, performed at room temperature and at visible wavelengths, establishes the potential of optically active organic materials for implementing non-magnetic and low-cost topological photonic devices.
Submitted 12 December, 2019;
originally announced December 2019.
-
Full Characterization of Minimal Linear Codes as Cutting Blocking Sets
Authors:
Chunming Tang,
Yan Qiu,
Qunying Liao,
Zhengchun Zhou
Abstract:
In this paper, we first study in detail the relationship between minimal linear codes and cutting blocking sets, which were recently introduced by Bonini and Borello, and then completely characterize minimal linear codes as cutting blocking sets. As a direct result, minimal projective codes of dimension $3$ and $t$-fold blocking sets with $t\ge 2$ in projective planes are identical objects. Some bounds on the parameters of minimal codes are derived from this characterization. This confirms a recent conjecture by Alfarano, Borello and Neri in [a geometric characterization of minimal codes and their asymptotic performance, arXiv:1911.11738, 2019] about a lower bound of the minimum distance of a minimal code. Using this new link between minimal codes and blocking sets, we also present new general primary and secondary constructions of minimal linear codes. As a result, infinite families of minimal linear codes not satisfying the Ashikhmin-Barg condition are obtained. In addition, the weight distributions of two subfamilies of the proposed minimal linear codes are established. Open problems are also presented.
Submitted 25 April, 2020; v1 submitted 22 November, 2019;
originally announced November 2019.
-
ANOVA Gaussian process modeling for high-dimensional stochastic computational models
Authors:
Chen Chen,
Qifeng Liao
Abstract:
In this paper we present a novel analysis of variance Gaussian process (ANOVA-GP) emulator for models governed by partial differential equations (PDEs) with high-dimensional random inputs. Gaussian process (GP) is a widely used surrogate modeling strategy, but it can become invalid when the inputs are high-dimensional. In this new ANOVA-GP strategy, high-dimensional inputs are decomposed into unions of local low-dimensional inputs, and principal component analysis (PCA) is applied to provide dimension reduction for each ANOVA term. We then systematically build local GP models for PCA coefficients based on ANOVA decomposition to provide an emulator for the overall high-dimensional problem. We present a general mathematical framework of ANOVA-GP, validate its accuracy and demonstrate its efficiency with numerical experiments.
Submitted 13 November, 2019;
originally announced November 2019.
-
An α-Matte Boundary Defocus Model Based Cascaded Network for Multi-focus Image Fusion
Authors:
Haoyu Ma,
Qingmin Liao,
Juncheng Zhang,
Shaojun Liu,
Jing-Hao Xue
Abstract:
Capturing an all-in-focus image with a single camera is difficult since the depth of field of the camera is usually limited. An alternative method to obtain the all-in-focus image is to fuse several images focusing at different depths. However, existing multi-focus image fusion methods cannot obtain clear results for areas near the focused/defocused boundary (FDB). In this paper, a novel α-matte boundary defocus model is proposed to generate realistic training data with the defocus spread effect precisely modeled, especially for areas near the FDB. Based on this α-matte defocus model and the generated data, a cascaded boundary aware convolutional network termed MMF-Net is proposed and trained, aiming to achieve clearer fusion results around the FDB. More specifically, the MMF-Net consists of two cascaded sub-nets for initial fusion and boundary fusion, respectively; these two sub-nets are designed to first obtain a guidance map of FDB and then refine the fusion near the FDB. Experiments demonstrate that with the help of the new α-matte boundary defocus model, the proposed MMF-Net outperforms the state-of-the-art methods both qualitatively and quantitatively.
Submitted 29 October, 2019; v1 submitted 29 October, 2019;
originally announced October 2019.