Skip to main content

Showing 151–200 of 615 results for author: Yu, Q

  1. arXiv:2304.10333  [pdf, other

    cs.CV

    Noisy Universal Domain Adaptation via Divergence Optimization for Visual Recognition

    Authors: Qing Yu, Atsushi Hashimoto, Yoshitaka Ushiku

    Abstract: To transfer the knowledge learned from a labeled source domain to an unlabeled target domain, many studies have worked on universal domain adaptation (UniDA), where there is no constraint on the label sets of the source domain and target domain. However, the existing UniDA methods rely on source samples with correct annotations. Due to the limited resources in the real world, it is difficult to ob… ▽ More

    Submitted 20 April, 2023; originally announced April 2023.

  2. arXiv:2304.04694  [pdf, other

    cs.CV

    Video-kMaX: A Simple Unified Approach for Online and Near-Online Video Panoptic Segmentation

    Authors: Inkyu Shin, Dahun Kim, Qihang Yu, Jun Xie, Hong-Seok Kim, Bradley Green, In So Kweon, Kuk-Jin Yoon, Liang-Chieh Chen

    Abstract: Video Panoptic Segmentation (VPS) aims to achieve comprehensive pixel-level scene understanding by segmenting all pixels and associating objects in a video. Current solutions can be categorized into online and near-online approaches. Evolving over the time, each category has its own specialized designs, making it nontrivial to adapt models between different categories. To alleviate the discrepancy… ▽ More

    Submitted 10 April, 2023; originally announced April 2023.

  3. arXiv:2304.04521  [pdf, other

    cs.CV

    Zero-Shot In-Distribution Detection in Multi-Object Settings Using Vision-Language Foundation Models

    Authors: Atsuyuki Miyai, Qing Yu, Go Irie, Kiyoharu Aizawa

    Abstract: Extracting in-distribution (ID) images from noisy images scraped from the Internet is an important preprocessing for constructing datasets, which has traditionally been done manually. Automating this preprocessing with deep learning techniques presents two key challenges. First, images should be collected using only the name of the ID class without training on the ID data. Second, as we can see wh… ▽ More

    Submitted 23 August, 2023; v1 submitted 10 April, 2023; originally announced April 2023.

    Comments: v3: I fixed some typos from v2

  4. arXiv:2304.04052  [pdf, other

    cs.CL cs.AI cs.LG

    Decoder-Only or Encoder-Decoder? Interpreting Language Model as a Regularized Encoder-Decoder

    Authors: Zihao Fu, Wai Lam, Qian Yu, Anthony Man-Cho So, Shengding Hu, Zhiyuan Liu, Nigel Collier

    Abstract: The sequence-to-sequence (seq2seq) task aims at generating the target sequence based on the given input source sequence. Traditionally, most of the seq2seq task is resolved by the Encoder-Decoder framework which requires an encoder to encode the source sequence and a decoder to generate the target text. Recently, a bunch of new approaches have emerged that apply decoder-only language models direct… ▽ More

    Submitted 8 April, 2023; originally announced April 2023.

  5. Partial measurements of the total field gradient and the field gradient tensor using an atomic magnetic gradiometer

    Authors: Qianqian Yu, Siqi Liu, Xueke Wang, Dong Sheng

    Abstract: Magnetic gradiometers have wide practical and academic applications, and two important types of field gradient observables are the total field gradient and field gradient tensor. However, measurements of the field gradient tensor have not been the focus of previous researches on atomic magnetic gradiometers. In this work, we develop an atomic magnetic gradiometer based on two separately optically… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

    Comments: Accepted by Physical Review A

  6. arXiv:2304.00479  [pdf, other

    math.OC

    Mixed-Integer Programming Approaches to Generalized Submodular Optimization and its Applications

    Authors: Simge Küçükyavuz, Qimeng Yu

    Abstract: Submodularity is an important concept in integer and combinatorial optimization. A classical submodular set function models the utility of selecting homogenous items from a single ground set, and such selections can be represented by binary variables. In practice, many problem contexts involve choosing heterogenous items from more than one ground set or selecting multiple copies of homogenous item… ▽ More

    Submitted 4 April, 2023; v1 submitted 2 April, 2023; originally announced April 2023.

  7. arXiv:2303.17376  [pdf, other

    cs.CV cs.AI cs.LG

    A Study of Autoregressive Decoders for Multi-Tasking in Computer Vision

    Authors: Lucas Beyer, Bo Wan, Gagan Madan, Filip Pavetic, Andreas Steiner, Alexander Kolesnikov, André Susano Pinto, Emanuele Bugliarello, Xiao Wang, Qihang Yu, Liang-Chieh Chen, Xiaohua Zhai

    Abstract: There has been a recent explosion of computer vision models which perform many tasks and are composed of an image encoder (usually a ViT) and an autoregressive decoder (usually a Transformer). However, most of this work simply presents one system and its results, leaving many questions regarding design decisions and trade-offs of such systems unanswered. In this work, we aim to provide such answer… ▽ More

    Submitted 30 March, 2023; originally announced March 2023.

  8. arXiv:2303.15790  [pdf, other

    hep-ex hep-ph physics.ins-det

    STCF Conceptual Design Report: Volume 1 -- Physics & Detector

    Authors: M. Achasov, X. C. Ai, R. Aliberti, L. P. An, Q. An, X. Z. Bai, Y. Bai, O. Bakina, A. Barnyakov, V. Blinov, V. Bobrovnikov, D. Bodrov, A. Bogomyagkov, A. Bondar, I. Boyko, Z. H. Bu, F. M. Cai, H. Cai, J. J. Cao, Q. H. Cao, Z. Cao, Q. Chang, K. T. Chao, D. Y. Chen, H. Chen , et al. (413 additional authors not shown)

    Abstract: The Super $τ$-Charm facility (STCF) is an electron-positron collider proposed by the Chinese particle physics community. It is designed to operate in a center-of-mass energy range from 2 to 7 GeV with a peak luminosity of $0.5\times 10^{35}{\rm cm}^{-2}{\rm s}^{-1}$ or higher. The STCF will produce a data sample about a factor of 100 larger than that by the present $τ$-Charm factory -- the BEPCII,… ▽ More

    Submitted 5 October, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

    Journal ref: Front. Phys. 19(1), 14701 (2024)

  9. arXiv:2303.14086  [pdf, other

    cs.IT

    Finite Field Multiple Access

    Authors: Qi-yue Yu, Jiang-xuan Li, Shu Lin

    Abstract: In the past several decades, various techniques have been developed and used for multiple-access (MA) communications. With the new applications for 6G, it is desirable to find new resources, physical or virtual, to confront the fast development of MA communication systems. For binary source transmission, this paper proposes an element-pair (EP) coding scheme for supporting massive users with short… ▽ More

    Submitted 26 May, 2024; v1 submitted 24 March, 2023; originally announced March 2023.

    Comments: 38 pages, 11 figures

  10. arXiv:2303.13233  [pdf, other

    cs.CV

    Visually-Prompted Language Model for Fine-Grained Scene Graph Generation in an Open World

    Authors: Qifan Yu, Juncheng Li, Yu Wu, Siliang Tang, Wei Ji, Yueting Zhuang

    Abstract: Scene Graph Generation (SGG) aims to extract <subject, predicate, object> relationships in images for vision understanding. Although recent works have made steady progress on SGG, they still suffer long-tail distribution issues that tail-predicates are more costly to train and hard to distinguish due to a small amount of annotated data compared to frequent predicates. Existing re-balancing strateg… ▽ More

    Submitted 19 August, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: Accepted by ICCV 2023

  11. arXiv:2303.13090  [pdf, other

    cs.CV

    Orthogonal Annotation Benefits Barely-supervised Medical Image Segmentation

    Authors: Heng Cai, Shumeng Li, Lei Qi, Qian Yu, Yinghuan Shi, Yang Gao

    Abstract: Recent trends in semi-supervised learning have significantly boosted the performance of 3D semi-supervised medical image segmentation. Compared with 2D images, 3D medical volumes involve information from different directions, e.g., transverse, sagittal, and coronal planes, so as to naturally provide complementary views. These complementary views and the intrinsic similarity among adjacent 3D slice… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: Accepted to CVPR 2023

  12. arXiv:2303.09273  [pdf, other

    cs.LG

    Adaptive Modeling of Uncertainties for Traffic Forecasting

    Authors: Ying Wu, Yongchao Ye, Adnan Zeb, James J. Q. Yu, Zheng Wang

    Abstract: Deep neural networks (DNNs) have emerged as a dominant approach for developing traffic forecasting models. These models are typically trained to minimize error on averaged test cases and produce a single-point prediction, such as a scalar value for traffic speed or travel time. However, single-point predictions fail to account for prediction uncertainty that is critical for many transportation man… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

    Comments: 14 pages, 5 figures

  13. arXiv:2303.07184  [pdf, other

    cs.LG cs.AI

    Traffic Prediction with Transfer Learning: A Mutual Information-based Approach

    Authors: Yunjie Huang, Xiaozhuang Song, Yuanshao Zhu, Shiyao Zhang, James J. Q. Yu

    Abstract: In modern traffic management, one of the most essential yet challenging tasks is accurately and timely predicting traffic. It has been well investigated and examined that deep learning-based Spatio-temporal models have an edge when exploiting Spatio-temporal relationships in traffic data. Typically, data-driven models require vast volumes of data, but gathering data in small cities can be difficul… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: submited to T-ITS, 16 pages, 13 figures in color

  14. arXiv:2303.06095  [pdf, other

    cs.IR cs.AI

    HiNet: Novel Multi-Scenario & Multi-Task Learning with Hierarchical Information Extraction

    Authors: Jie Zhou, Xianshuai Cao, Wenhao Li, Lin Bo, Kun Zhang, Chuan Luo, Qian Yu

    Abstract: Multi-scenario & multi-task learning has been widely applied to many recommendation systems in industrial applications, wherein an effective and practical approach is to carry out multi-scenario transfer learning on the basis of the Mixture-of-Expert (MoE) architecture. However, the MoE-based method, which aims to project all information in the same feature space, cannot effectively deal with the… ▽ More

    Submitted 13 March, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

  15. arXiv:2303.05475  [pdf, other

    cs.CV

    Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature Mimicking

    Authors: Peng Gao, Renrui Zhang, Rongyao Fang, Ziyi Lin, Hongyang Li, Hongsheng Li, Qiao Yu

    Abstract: Masked Autoencoders (MAE) have been popular paradigms for large-scale vision representation pre-training. However, MAE solely reconstructs the low-level RGB signals after the decoder and lacks supervision upon high-level semantics for the encoder, thus suffering from sub-optimal learned representations and long pre-training epochs. To alleviate this, previous methods simply replace the pixel recon… ▽ More

    Submitted 9 March, 2023; originally announced March 2023.

    Comments: 12 pages, 3 figures

  16. arXiv:2303.04354  [pdf, other

    physics.atom-ph physics.ins-det

    A sensitive and stable atomic vector magnetometer for weak field detections using double orthogonal multipass cavities

    Authors: Siqi Liu, Qianqian Yu, Hao Zhou, Dong Sheng

    Abstract: This paper presents a compact low-temperature atomic vector magnetometer for weak field measurements, using an atomic cell containing two orthogonal multipass cavities. At the working temperature of 75 $^\circ$C, the magnetic field sensitivities at all three axes are better than 45 fT/Hz$^{1/2}$ at 10~Hz limited by photon noise, and 85 fT/Hz$^{1/2}$ at 0.1~Hz. This sensor also shows measurement st… ▽ More

    Submitted 7 March, 2023; originally announced March 2023.

    Comments: Submitted to Physical Review Applied

  17. arXiv:2303.04351  [pdf, other

    cs.RO cs.CV

    ElC-OIS: Ellipsoidal Clustering for Open-World Instance Segmentation on LiDAR Data

    Authors: Wenbang Deng, Kaihong Huang, Qinghua Yu, Huimin Lu, Zhiqiang Zheng, Xieyuanli Chen

    Abstract: Open-world Instance Segmentation (OIS) is a challenging task that aims to accurately segment every object instance appearing in the current observation, regardless of whether these instances have been labeled in the training set. This is important for safety-critical applications such as robust autonomous navigation. In this paper, we present a flexible and effective OIS framework for LiDAR point… ▽ More

    Submitted 7 March, 2023; originally announced March 2023.

  18. Region and Spatial Aware Anomaly Detection for Fundus Images

    Authors: Jingqi Niu, Shiwen Dong, Qinji Yu, Kang Dang, Xiaowei Ding

    Abstract: Recently anomaly detection has drawn much attention in diagnosing ocular diseases. Most existing anomaly detection research in fundus images has relatively large anomaly scores in the salient retinal structures, such as blood vessels, optical cups and discs. In this paper, we propose a Region and Spatial Aware Anomaly Detection (ReSAD) method for fundus images, which obtains local region and long-… ▽ More

    Submitted 7 March, 2023; originally announced March 2023.

    Report number: 2303.03817

    Journal ref: 2023 IEEE 20th International Symposium on Biomedical Imaging (ISBI), Cartagena, Colombia, 2023, pp. 1-5

  19. Qubit Energy Tuner Based on Single Flux Quantum Circuits

    Authors: Xiao Geng, Rutian Huang, Yongcheng He, Kaiyong He, Genting Dai, Liangliang Yang, Xinyu Wu, Qing Yu, Mingjun Cheng, Guodong Chen, Jianshe Liu, Wei Chen

    Abstract: A device called qubit energy tuner (QET) based on single flux quantum (SFQ) circuits is proposed for Z control of superconducting qubits. Created from the improvement of flux digital-to-analog converters (flux DACs), a QET is able to set the energy levels or the frequencies of qubits, especially flux-tunable transmons, and perform gate operations requiring Z control. The circuit structure of QET i… ▽ More

    Submitted 3 March, 2023; originally announced March 2023.

  20. arXiv:2302.14671  [pdf, other

    cond-mat.mtrl-sci physics.comp-ph

    Semiconducting nonperovskite ferroelectric oxynitride designed ab initio

    Authors: Qisheng Yu, Jiawei Huang, Changming Ke, Zhuang Qian, Liyang Ma, Shi Liu

    Abstract: Recent discovery of HfO2-based and nitride-based ferroelectrics that are compatible to the semiconductor manufacturing process have revitalized the field of ferroelectric-based nanoelectronics. Guided by a simple design principle of charge compensation and density functional theory calculations, we discover HfO2-like mixed-anion materials, TaON and NbON, can crystallize in the polar Pca21 phase wi… ▽ More

    Submitted 28 February, 2023; originally announced February 2023.

  21. arXiv:2302.11770  [pdf, other

    physics.bio-ph cond-mat.stat-mech q-bio.MN

    Resolving the binding-kinase discrepancy in bacterial chemotaxis: A nonequilibrium allosteric model and the role of energy dissipation

    Authors: David Hathcock, Qiwei Yu, Bernardo A. Mello, Divya N. Amin, Gerald L. Hazelbauer, Yuhai Tu

    Abstract: The Escherichia coli chemotaxis signaling pathway has served as a model system for studying the adaptive sensing of environmental signals by large protein complexes. The chemoreceptors control the kinase activity of CheA in response to the extracellular ligand concentration and adapt across a wide concentration range by undergoing methylation and demethylation. Methylation shifts the kinase respon… ▽ More

    Submitted 22 February, 2023; originally announced February 2023.

    Comments: 12 (main text) + 4 (supplemental information) pages, 6+4 figures

    Journal ref: Proc. Natl. Acad. Sci. U.S.A. 120, e2303115120 (2023)

  22. arXiv:2302.10473  [pdf, other

    cs.CV

    Oriented Object Detection in Optical Remote Sensing Images using Deep Learning: A Survey

    Authors: Kun Wang, Zi Wang, Zhang Li, Ang Su, Xichao Teng, Minhao Liu, Qifeng Yu

    Abstract: Oriented object detection is one of the most fundamental and challenging tasks in remote sensing, aiming to locate and classify objects with arbitrary orientations. Recent years have witnessed remarkable progress in oriented object detection using deep learning techniques. Given the rapid development of this field, this paper aims to provide a comprehensive survey of recent advances in oriented ob… ▽ More

    Submitted 9 April, 2024; v1 submitted 21 February, 2023; originally announced February 2023.

  23. Thermodynamics and Microstructures of Euler-Heisenberg Black Hole in a Cavity

    Authors: Qin Yu, Qi Xu, Jun Tao

    Abstract: The Euler-Heisenberg black holes with quantum electrodynamics (QED) correction are embraced by a cavity in this paper, which serves as a boundary of the black hole spacetime and contributes to the equilibrium of the system. We explore the thermodynamic properties of the black hole, including the phase transitions and phase structures. The small/large black hole phase transition occurs for a negati… ▽ More

    Submitted 20 February, 2023; originally announced February 2023.

    Comments: 26 pages, 10 figures

    Journal ref: 2023 Commun. Theor. Phys. 75 095402

  24. arXiv:2302.08888  [pdf, other

    cs.LG cs.AI

    Multimodal Federated Learning via Contrastive Representation Ensemble

    Authors: Qiying Yu, Yang Liu, Yimu Wang, Ke Xu, Jingjing Liu

    Abstract: With the increasing amount of multimedia data on modern mobile systems and IoT infrastructures, harnessing these rich multimodal data without breaching user privacy becomes a critical issue. Federated learning (FL) serves as a privacy-conscious alternative to centralized machine learning. However, existing FL methods extended to multimodal data all rely on model aggregation on single modality leve… ▽ More

    Submitted 5 May, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

    Comments: ICLR 2023, update

  25. arXiv:2302.08092  [pdf, other

    cs.CL cs.IR

    Product Question Answering in E-Commerce: A Survey

    Authors: Yang Deng, Wenxuan Zhang, Qian Yu, Wai Lam

    Abstract: Product question answering (PQA), aiming to automatically provide instant responses to customer's questions in E-Commerce platforms, has drawn increasing attention in recent years. Compared with typical QA problems, PQA exhibits unique challenges such as the subjectivity and reliability of user-generated contents in E-commerce platforms. Therefore, various problem settings and novel methods have b… ▽ More

    Submitted 3 May, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

    Comments: Accepted by ACL 2023 main conference

  26. arXiv:2302.06077  [pdf, ps, other

    math.PR

    Derivative of self-intersection local time for multidimensional fractional Brownian motion

    Authors: Qian Yu, Xianye Yu

    Abstract: The existence condition $H<1/d$ for first-order derivative of self-intersection local time for $d\geq3$ dimensional fractional Brownian motion can be obtained in Yu (2021). In this paper, we show a limit theorem under the non-existence critical condition $H=1/d$.

    Submitted 12 February, 2023; originally announced February 2023.

    Comments: 15 pages. arXiv admin note: substantial text overlap with arXiv:2008.05633

  27. arXiv:2302.05031  [pdf, other

    cs.IR cs.AI

    Feature Decomposition for Reducing Negative Transfer: A Novel Multi-task Learning Method for Recommender System

    Authors: Jie Zhou, Qian Yu, Chuan Luo, Jing Zhang

    Abstract: In recent years, thanks to the rapid development of deep learning (DL), DL-based multi-task learning (MTL) has made significant progress, and it has been successfully applied to recommendation systems (RS). However, in a recommender system, the correlations among the involved tasks are complex. Therefore, the existing MTL models designed for RS suffer from negative transfer to different degrees, w… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

    Comments: This paper has been accepted by AAAI-23

  28. arXiv:2302.02371  [pdf, other

    eess.SY quant-ph

    Model-free Quantum Gate Design and Calibration using Deep Reinforcement Learning

    Authors: Omar Shindi, Qi Yu, Parth Girdhar, Daoyi Dong

    Abstract: High-fidelity quantum gate design is important for various quantum technologies, such as quantum computation and quantum communication. Numerous control policies for quantum gate design have been proposed given a dynamical model of the quantum system of interest. However, a quantum system is often highly sensitive to noise, and obtaining its accurate modeling can be difficult for many practical ap… ▽ More

    Submitted 7 February, 2023; v1 submitted 5 February, 2023; originally announced February 2023.

    Comments: 12 pages, 17 figures, accepted for publication in the IEEE Transactions on Artificial Intelligence, in press

  29. arXiv:2302.01478  [pdf, other

    cs.AI cs.LG

    Clustered Embedding Learning for Recommender Systems

    Authors: Yizhou Chen, Guangda Huzhang, Anxiang Zeng, Qingtao Yu, Hui Sun, Heng-yi Li, Jingyi Li, Yabo Ni, Han Yu, Zhiming Zhou

    Abstract: In recent years, recommender systems have advanced rapidly, where embedding learning for users and items plays a critical role. A standard method learns a unique embedding vector for each user and item. However, such a method has two important limitations in real-world applications: 1) it is hard to learn embeddings that generalize well for users and items with rare interactions on their own; and… ▽ More

    Submitted 10 February, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

  30. arXiv:2301.12291  [pdf, other

    eess.IV cs.CV

    CancerUniT: Towards a Single Unified Model for Effective Detection, Segmentation, and Diagnosis of Eight Major Cancers Using a Large Collection of CT Scans

    Authors: Jieneng Chen, Yingda Xia, Jiawen Yao, Ke Yan, Jianpeng Zhang, Le Lu, Fakai Wang, Bo Zhou, Mingyan Qiu, Qihang Yu, Mingze Yuan, Wei Fang, Yuxing Tang, Minfeng Xu, Jian Zhou, Yuqian Zhao, Qifeng Wang, Xianghua Ye, Xiaoli Yin, Yu Shi, Xin Chen, Jingren Zhou, Alan Yuille, Zaiyi Liu, Ling Zhang

    Abstract: Human readers or radiologists routinely perform full-body multi-organ multi-disease detection and diagnosis in clinical practice, while most medical AI systems are built to focus on single organs with a narrow list of a few diseases. This might severely limit AI's clinical adoption. A certain number of AI models need to be assembled non-trivially to match the diagnostic process of a human reading… ▽ More

    Submitted 6 October, 2023; v1 submitted 28 January, 2023; originally announced January 2023.

    Comments: ICCV 2023 Camera Ready Version

  31. arXiv:2301.07085  [pdf, other

    cs.CL cs.AI

    Are Language Models Worse than Humans at Following Prompts? It's Complicated

    Authors: Albert Webson, Alyssa Marie Loo, Qinan Yu, Ellie Pavlick

    Abstract: Prompts have been the center of progress in advancing language models' zero-shot and few-shot performance. However, recent work finds that models can perform surprisingly well when given intentionally irrelevant or misleading prompts. Such results may be interpreted as evidence that model behavior is not "human like". In this study, we challenge a central assumption in such work: that humans would… ▽ More

    Submitted 11 November, 2023; v1 submitted 17 January, 2023; originally announced January 2023.

    Comments: EMNLP 2023

  32. arXiv:2301.05931  [pdf, other

    cs.LG q-bio.QM

    Drug Synergistic Combinations Predictions via Large-Scale Pre-Training and Graph Structure Learning

    Authors: Zhihang Hu, Qinze Yu, Yucheng Guo, Taifeng Wang, Irwin King, Xin Gao, Le Song, Yu Li

    Abstract: Drug combination therapy is a well-established strategy for disease treatment with better effectiveness and less safety degradation. However, identifying novel drug combinations through wet-lab experiments is resource intensive due to the vast combinatorial search space. Recently, computational approaches, specifically deep learning models have emerged as an efficient way to discover synergistic c… ▽ More

    Submitted 14 January, 2023; originally announced January 2023.

  33. Orbit: A Unified Simulation Framework for Interactive Robot Learning Environments

    Authors: Mayank Mittal, Calvin Yu, Qinxi Yu, Jingzhou Liu, Nikita Rudin, David Hoeller, Jia Lin Yuan, Ritvik Singh, Yunrong Guo, Hammad Mazhar, Ajay Mandlekar, Buck Babich, Gavriel State, Marco Hutter, Animesh Garg

    Abstract: We present Orbit, a unified and modular framework for robot learning powered by NVIDIA Isaac Sim. It offers a modular design to easily and efficiently create robotic environments with photo-realistic scenes and high-fidelity rigid and deformable body simulation. With Orbit, we provide a suite of benchmark tasks of varying difficulty -- from single-stage cabinet opening and cloth folding to multi-s… ▽ More

    Submitted 16 February, 2024; v1 submitted 10 January, 2023; originally announced January 2023.

    Comments: Project website: https://isaac-orbit.github.io/

    Journal ref: IEEE Robotics and Automation Letters (Volume: 8, Issue: 6, June 2023)

  34. arXiv:2212.12103  [pdf, other

    cs.CV

    Bridging the Domain Gap in Satellite Pose Estimation: a Self-Training Approach based on Geometrical Constraints

    Authors: Zi Wang, Minglin Chen, Yulan Guo, Zhang Li, Qifeng Yu

    Abstract: Recently, unsupervised domain adaptation in satellite pose estimation has gained increasing attention, aiming at alleviating the annotation cost for training deep models. To this end, we propose a self-training framework based on the domain-agnostic geometrical constraints. Specifically, we train a neural network to predict the 2D keypoints of a satellite and then use PnP to estimate the pose. The… ▽ More

    Submitted 22 December, 2022; originally announced December 2022.

    Comments: 11 pages, 5 figures. Submitted to IEEE TAES, major revision

  35. arXiv:2212.10537  [pdf, other

    cs.CV cs.AI cs.CL

    Does CLIP Bind Concepts? Probing Compositionality in Large Image Models

    Authors: Martha Lewis, Nihal V. Nayak, Peilin Yu, Qinan Yu, Jack Merullo, Stephen H. Bach, Ellie Pavlick

    Abstract: Large-scale neural network models combining text and images have made incredible progress in recent years. However, it remains an open question to what extent such models encode compositional representations of the concepts over which they operate, such as correctly identifying ''red cube'' by reasoning over the constituents ''red'' and ''cube''. In this work, we focus on the ability of a large pr… ▽ More

    Submitted 29 March, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

  36. arXiv:2212.10441  [pdf, other

    cs.DC

    First CE Matters: On the Importance of Long Term Properties on Memory Failure Prediction

    Authors: Jasmin Bogatinovski, Qiao Yu, Jorge Cardoso, Odej Kao

    Abstract: Dynamic random access memory failures are a threat to the reliability of data centres as they lead to data loss and system crashes. Timely predictions of memory failures allow for taking preventive measures such as server migration and memory replacement. Thereby, memory failure prediction prevents failures from externalizing, and it is a vital task to improve system reliability. In this paper, we… ▽ More

    Submitted 21 November, 2022; originally announced December 2022.

    Comments: This paper is accepted to appear in the proceedings of IEEE Big Data 2022. All publishing licenses belong to IEEE

  37. arXiv:2212.09613  [pdf, other

    cs.RO

    Model Predictive Spherical Image-Based Visual Servoing On $SO(3)$ for Aggressive Aerial Tracking

    Authors: Chao Qin, Qiuyu Yu, Hugh H. T. Liu

    Abstract: This paper presents an image-based visual servo control (IBVS) method for a first-person-view (FPV) quadrotor to conduct aggressive aerial tracking. There are three major challenges to maneuvering an underactuated vehicle using IBVS: (i) finding a visual feature representation that is robust to large rotations and is suited to be an optimization variable; (ii) keeping the target visible without sa… ▽ More

    Submitted 19 December, 2022; originally announced December 2022.

  38. arXiv:2212.00131  [pdf, other

    cs.LG cs.AI

    Evidential Conditional Neural Processes

    Authors: Deep Shankar Pandey, Qi Yu

    Abstract: The Conditional Neural Process (CNP) family of models offer a promising direction to tackle few-shot problems by achieving better scalability and competitive predictive performance. However, the current CNP models only capture the overall uncertainty for the prediction made on a target data point. They lack a systematic fine-grained quantification on the distinct sources of uncertainty that are es… ▽ More

    Submitted 30 November, 2022; originally announced December 2022.

    Comments: To appear in AAAI2023 Conference

  39. arXiv:2211.15425  [pdf

    cs.CV cs.AI

    FAF: A novel multimodal emotion recognition approach integrating face, body and text

    Authors: Zhongyu Fang, Aoyun He, Qihui Yu, Baopeng Gao, Weiping Ding, Tong Zhang, Lei Ma

    Abstract: Multimodal emotion analysis performed better in emotion recognition depending on more comprehensive emotional clues and multimodal emotion dataset. In this paper, we developed a large multimodal emotion dataset, named "HED" dataset, to facilitate the emotion recognition task, and accordingly propose a multimodal emotion recognition method. To promote recognition accuracy, "Feature After Feature" f… ▽ More

    Submitted 20 November, 2022; originally announced November 2022.

  40. arXiv:2211.15242  [pdf, other

    cs.IT math.PR math.ST

    Ising Model on Locally Tree-like Graphs: Uniqueness of Solutions to Cavity Equations

    Authors: Qian Yu, Yury Polyanskiy

    Abstract: In the study of Ising models on large locally tree-like graphs, in both rigorous and non-rigorous methods one is often led to understanding the so-called belief propagation distributional recursions and its fixed points. We prove that there is at most one non-trivial fixed point for Ising models with zero or certain random external fields. Previously this was only known for sufficiently ``low-temp… ▽ More

    Submitted 31 July, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

  41. arXiv:2211.13702  [pdf, other

    cs.CV

    CasFusionNet: A Cascaded Network for Point Cloud Semantic Scene Completion by Dense Feature Fusion

    Authors: Jinfeng Xu, Xianzhi Li, Yuan Tang, Qiao Yu, Yixue Hao, Long Hu, Min Chen

    Abstract: Semantic scene completion (SSC) aims to complete a partial 3D scene and predict its semantics simultaneously. Most existing works adopt the voxel representations, thus suffering from the growth of memory and computation cost as the voxel resolution increases. Though a few works attempt to solve SSC from the perspective of 3D point clouds, they have not fully exploited the correlation and complemen… ▽ More

    Submitted 24 November, 2022; originally announced November 2022.

  42. Slow Motion Matters: A Slow Motion Enhanced Network for Weakly Supervised Temporal Action Localization

    Authors: Weiqi Sun, Rui Su, Qian Yu, Dong Xu

    Abstract: Weakly supervised temporal action localization (WTAL) aims to localize actions in untrimmed videos with only weak supervision information (e.g. video-level labels). Most existing models handle all input videos with a fixed temporal scale. However, such models are not sensitive to actions whose pace of the movements is different from the ``normal" speed, especially slow-motion action instances, whi… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.

    Journal ref: IEEE Transactions on Circuits and Systems for Video Technology, 2022

  43. arXiv:2211.07726  [pdf, other

    math.OC

    On Constrained Mixed-Integer DR-Submodular Minimization

    Authors: Qimeng Yu, Simge Küçükyavuz

    Abstract: DR-submodular functions encompass a broad class of functions which are generally non-convex and non-concave. We study the problem of minimizing any DR-submodular function, with continuous and general integer variables, under box constraints and possibly additional monotonicity constraints. We propose valid linear inequalities for the epigraph of any DR-submodular function under the constraints. We… ▽ More

    Submitted 5 September, 2023; v1 submitted 14 November, 2022; originally announced November 2022.

  44. arXiv:2210.16152  [pdf, ps, other

    math.PR

    Limit laws for functionals of self-intersection symmetric alpha-stable processes

    Authors: Minhao Hong, Qian Yu

    Abstract: In this paper, we prove two limit laws for functionals of self-intersection symmetric alpha-stable processes with alpha\in(1,2). The results are obtained based on the method of moments, the sample configuration and the chaining argument introduced in (Nualart and Xu 2013) are employed.

    Submitted 28 October, 2022; originally announced October 2022.

    Comments: 18 pages

  45. arXiv:2210.12681  [pdf, other

    cs.CV

    Rethinking Rotation in Self-Supervised Contrastive Learning: Adaptive Positive or Negative Data Augmentation

    Authors: Atsuyuki Miyai, Qing Yu, Daiki Ikami, Go Irie, Kiyoharu Aizawa

    Abstract: Rotation is frequently listed as a candidate for data augmentation in contrastive learning but seldom provides satisfactory improvements. We argue that this is because the rotated image is always treated as either positive or negative. The semantics of an image can be rotation-invariant or rotation-variant, so whether the rotated image is treated as positive or negative should be determined based… ▽ More

    Submitted 24 November, 2022; v1 submitted 23 October, 2022; originally announced October 2022.

    Comments: Accepted at the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2023

  46. arXiv:2210.04379  [pdf, other

    cs.CV

    Unsupervised Domain Adaptive Fundus Image Segmentation with Few Labeled Source Data

    Authors: Qianbi Yu, Dongnan Liu, Chaoyi Zhang, Xinwen Zhang, Weidong Cai

    Abstract: Deep learning-based segmentation methods have been widely employed for automatic glaucoma diagnosis and prognosis. In practice, fundus images obtained by different fundus cameras vary significantly in terms of illumination and intensity. Although recent unsupervised domain adaptation (UDA) methods enhance the models' generalization ability on the unlabeled target fundus datasets, they always requi… ▽ More

    Submitted 9 October, 2022; originally announced October 2022.

    Comments: Accepted by The 33rd British Machine Vision Conference (BMVC) 2022

  47. arXiv:2210.01820  [pdf, other

    cs.CV

    MOAT: Alternating Mobile Convolution and Attention Brings Strong Vision Models

    Authors: Chenglin Yang, Siyuan Qiao, Qihang Yu, Xiaoding Yuan, Yukun Zhu, Alan Yuille, Hartwig Adam, Liang-Chieh Chen

    Abstract: This paper presents MOAT, a family of neural networks that build on top of MObile convolution (i.e., inverted residual blocks) and ATtention. Unlike the current works that stack separate mobile convolution and transformer blocks, we effectively merge them into a MOAT block. Starting with a standard Transformer block, we replace its multi-layer perceptron with a mobile convolution block, and furthe… ▽ More

    Submitted 30 January, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: ICLR 2023. arXiv v2: add ImageNet-1K-V2, tiny-MOAT on COCO detection and ADE20K segmentation

  48. arXiv:2209.15296  [pdf, other

    cs.SD eess.AS

    Wake Word Detection Based on Res2Net

    Authors: Qiuchen Yu, Ruohua Zhou

    Abstract: This letter proposes a new wake word detection system based on Res2Net. As a variant of ResNet, Res2Net was first applied to objection detection. Res2Net realizes multiple feature scales by increasing possible receptive fields. This multiple scaling mechanism significantly improves the detection ability of wake words with different durations. Compared with the ResNet-based model, Res2Net also sign… ▽ More

    Submitted 30 September, 2022; originally announced September 2022.

  49. arXiv:2209.13947  [pdf, ps, other

    nucl-ex physics.plasm-ph

    $^{197}$Au($γ,\,xn;\,x\,=\,1\thicksim9$) Reaction Cross Section Measurements using Laser-Driven Ultra-Intense $γ$-Ray Source

    Authors: D. Wu, H. Y. Lan, J. Y. Zhang, J. X. Liu, H. G. Lu, J. F. Lv, X. Z. Wu, H. Zhang, J. Cai, Q. Y. Ma, Y. H. Xia, Z. N. Wang, M. Z. Wang, Z. Y. Yang, X. L. Xu, Y. X. Geng, Y. Y. Zhao, C. Lin, W. J. Ma, J. Q. Yu, H. R. Wang, F. L. Liu, C. Y. He, B. Guo, P. Zhu , et al. (4 additional authors not shown)

    Abstract: We present a new method for the measurements of photonuclear reaction flux-weighted average cross sections and isomeric ratios using a laser-driven bremsstrahlung $γ$-ray source. An ultra-bright ultra-fast 60$\,\thicksim\,$250 MeV bremsstrahlung $γ$-ray source was established using the 200 TW laser facility in the Compact Laser Plasma Accelerator Laboratory, Peking University, which could cover th… ▽ More

    Submitted 23 November, 2023; v1 submitted 28 September, 2022; originally announced September 2022.

  50. arXiv:2209.12141  [pdf, other

    astro-ph.SR astro-ph.HE

    A dynamically discovered and characterized non-accreting neutron star -- M dwarf binary candidate

    Authors: Tuan Yi, Wei-Min Gu, Zhi-Xiang Zhang, Ling-Lin Zheng, Mouyuan Sun, Junfeng Wang, Zhongrui Bai, Pei Wang, Jianfeng Wu, Yu Bai, Song Wang, Haotong Zhang, Yize Dong, Yong Shao, Xiang-Dong Li, Jia Zhang, Yang Huang, Fan Yang, Qingzheng Yu, Hui-Jun Mu, Jin-Bo Fu, Senyu Qi, Jing Guo, Xuan Fang, Chuanjie Zheng , et al. (4 additional authors not shown)

    Abstract: Optical time-domain surveys can unveil and characterize exciting but less-explored non-accreting and/or non-beaming neutron stars (NS) in binaries. Here we report the discovery of such a NS candidate using the LAMOST spectroscopic survey. The candidate, designated LAMOST J112306.9+400736 (hereafter J1123), is in a single-lined spectroscopic binary containing an optically visible M star. The star's… ▽ More

    Submitted 25 September, 2022; originally announced September 2022.

    Comments: 53 pages, 15 figures, publication in Nature Astronomy