Showing 1–27 of 27 results for author: Mi, L

  1. arXiv:2405.03373  [pdf, other]

    cs.CV

    Knowledge-aware Text-Image Retrieval for Remote Sensing Images

    Authors: Li Mi, Xianjie Dai, Javiera Castillo-Navarro, Devis Tuia

    Abstract: Image-based retrieval in large Earth observation archives is challenging because one needs to navigate across thousands of candidate matches only with the query image as a guide. By using text as information supporting the visual query, the retrieval system gains in usability, but at the same time faces difficulties due to the diversity of visual signals that cannot be summarized by a short captio…

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: Under review

  2. arXiv:2403.13965  [pdf, other]

    cs.CV

    ConGeo: Robust Cross-view Geo-localization across Ground View Variations

    Authors: Li Mi, Chang Xu, Javiera Castillo-Navarro, Syrielle Montariol, Wen Yang, Antoine Bosselut, Devis Tuia

    Abstract: Cross-view geo-localization aims at localizing a ground-level query image by matching it to its corresponding geo-referenced aerial view. In real-world scenarios, the task requires accommodating diverse ground images captured by users with varying orientations and reduced field of views (FoVs). However, existing learning pipelines are orientation-specific or FoV-specific, demanding separate model…

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: Project page at https://chasel-tsui.github.io/ConGeo/

  3. arXiv:2402.12846  [pdf, other]

    cs.CV cs.AI

    ConVQG: Contrastive Visual Question Generation with Multimodal Guidance

    Authors: Li Mi, Syrielle Montariol, Javiera Castillo-Navarro, Xianjie Dai, Antoine Bosselut, Devis Tuia

    Abstract: Asking questions about visual environments is a crucial way for intelligent agents to understand rich multi-faceted scenes, raising the importance of Visual Question Generation (VQG) systems. Apart from being grounded to the image, existing VQG systems can use textual constraints, such as expected answers or knowledge triplets, to generate focused questions. These constraints allow VQG systems to…

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: AAAI 2024. Project page at https://limirs.github.io/ConVQG

  4. arXiv:2401.14142  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Energy-Based Concept Bottleneck Models: Unifying Prediction, Concept Intervention, and Probabilistic Interpretations

    Authors: Xinyue Xu, Yi Qin, Lu Mi, Hao Wang, Xiaomeng Li

    Abstract: Existing methods, such as concept bottleneck models (CBMs), have been successful in providing concept-based interpretations for black-box deep learning models. They typically work by predicting concepts given the input and then predicting the final class label given the predicted concepts. However, (1) they often fail to capture the high-order, nonlinear interaction between concepts, e.g., correct…

    Submitted 26 February, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: Accepted by ICLR 2024

  5. arXiv:2312.15740  [pdf, other]

    cs.NI cs.CV cs.LG

    BiSwift: Bandwidth Orchestrator for Multi-Stream Video Analytics on Edge

    Authors: Lin Sun, Weijun Wang, Tingting Yuan, Liang Mi, Haipeng Dai, Yunxin Liu, Xiaoming Fu

    Abstract: High-definition (HD) cameras for surveillance and road traffic have experienced tremendous growth, demanding intensive computation resources for real-time analytics. Recently, offloading frames from the front-end device to the back-end edge server has shown great promise. In multi-stream competitive environments, efficient bandwidth management and proper scheduling are crucial to ensure both high…

    Submitted 4 February, 2024; v1 submitted 25 December, 2023; originally announced December 2023.

    Comments: Accepted by 2024 IEEE INFOCOM

  6. arXiv:2311.06928  [pdf, other]

    cs.LG stat.ME

    Attention for Causal Relationship Discovery from Biological Neural Dynamics

    Authors: Ziyu Lu, Anika Tabassum, Shruti Kulkarni, Lu Mi, J. Nathan Kutz, Eric Shea-Brown, Seung-Hwan Lim

    Abstract: This paper explores the potential of the transformer models for learning Granger causality in networks with complex nonlinear dynamics at every node, as in neurobiological and biophysical networks. Our study primarily focuses on a proof-of-concept investigation based on simulated neural dynamics, for which the ground-truth causality is known through the underlying connectivity matrix. For transfor…

    Submitted 23 November, 2023; v1 submitted 12 November, 2023; originally announced November 2023.

    Comments: Accepted to the NeurIPS 2023 Workshop on Causal Representation Learning

  7. arXiv:2311.02258  [pdf, other]

    q-bio.NC cs.LG

    Learning Time-Invariant Representations for Individual Neurons from Population Dynamics

    Authors: Lu Mi, Trung Le, Tianxing He, Eli Shlizerman, Uygar Sümbül

    Abstract: Neurons can display highly variable dynamics. While such variability presumably supports the wide range of behaviors generated by the organism, their gene expressions are relatively stable in the adult brain. This suggests that neuronal activity is a combination of its time-invariant identity and the inputs the neuron receives from the rest of the circuit. Here, we propose a self-supervised learni…

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: Accepted at NeurIPS 2023

  8. arXiv:2309.17157  [pdf, other]

    cs.CL

    LatticeGen: A Cooperative Framework which Hides Generated Text in a Lattice for Privacy-Aware Generation on Cloud

    Authors: Mengke Zhang, Tianxing He, Tianle Wang, Lu Mi, Fatemehsadat Mireshghallah, Binyi Chen, Hao Wang, Yulia Tsvetkov

    Abstract: In the current user-server interaction paradigm of prompted generation with large language models (LLM) on cloud, the server fully controls the generation process, which leaves zero options for users who want to keep the generated text to themselves. We propose LatticeGen, a cooperative framework in which the server still handles most of the computation while the user controls the sampling operati…

    Submitted 5 April, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

  9. arXiv:2303.00882  [pdf, other]

    eess.IV cs.CV cs.LG q-bio.QM

    X-Ray2EM: Uncertainty-Aware Cross-Modality Image Reconstruction from X-Ray to Electron Microscopy in Connectomics

    Authors: Yicong Li, Yaron Meirovitch, Aaron T. Kuan, Jasper S. Phelps, Alexandra Pacureanu, Wei-Chung Allen Lee, Nir Shavit, Lu Mi

    Abstract: Comprehensive, synapse-resolution imaging of the brain will be crucial for understanding neuronal computations and function. In connectomics, this has been the sole purview of volume electron microscopy (EM), which entails an excruciatingly difficult process because it requires cutting tissue into many thin, fragile slices that then need to be imaged, aligned, and reconstructed. Unlike EM, hard X-…

    Submitted 1 March, 2023; originally announced March 2023.

    Comments: Accepted by ISBI 2023 conference. Supplementary material is available in this arXiv version

  10. arXiv:2302.03819  [pdf, other]

    cs.CV cs.LG q-bio.NC

    The XPRESS Challenge: Xray Projectomic Reconstruction -- Extracting Segmentation with Skeletons

    Authors: Tri Nguyen, Mukul Narwani, Mark Larson, Yicong Li, Shuhan Xie, Hanspeter Pfister, Donglai Wei, Nir Shavit, Lu Mi, Alexandra Pacureanu, Wei-Chung Lee, Aaron T. Kuan

    Abstract: The wiring and connectivity of neurons form a structural basis for the function of the nervous system. Advances in volume electron microscopy (EM) and image segmentation have enabled mapping of circuit diagrams (connectomics) within local regions of the mouse brain. However, applying volume EM over the whole brain is not currently feasible due to technological challenges. As a result, comprehensiv…

    Submitted 24 February, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: 6 pages, 2 figures

  11. arXiv:2301.08664  [pdf, other]

    cs.CV cs.LG cs.MM

    AccDecoder: Accelerated Decoding for Neural-enhanced Video Analytics

    Authors: Tingting Yuan, Liang Mi, Weijun Wang, Haipeng Dai, Xiaoming Fu

    Abstract: The quality of the video stream is key to neural network-based video analytics. However, low-quality video is inevitably collected by existing surveillance systems because of poor quality cameras or over-compressed/pruned video streaming protocols, e.g., as a result of upstream bandwidth limit. To address this issue, existing studies use quality enhancers (e.g., neural super-resolution) to improve…

    Submitted 24 January, 2023; v1 submitted 20 January, 2023; originally announced January 2023.

    Comments: Accepted by 2023 IEEE INFOCOM

  12. arXiv:2209.04061  [pdf, other]

    cs.CV

    im2nerf: Image to Neural Radiance Field in the Wild

    Authors: Lu Mi, Abhijit Kundu, David Ross, Frank Dellaert, Noah Snavely, Alireza Fathi

    Abstract: We propose im2nerf, a learning framework that predicts a continuous neural object representation given a single input image in the wild, supervised by only segmentation output from off-the-shelf recognition methods. The standard approach to constructing neural radiance fields takes advantage of multi-view consistency and requires many calibrated views of a scene, a requirement that cannot be satis…

    Submitted 8 September, 2022; originally announced September 2022.

    Comments: 12 pages, 8 figures, 4 tables

  13. arXiv:2207.06684  [pdf, other]

    cs.LG cs.AI cs.CV cs.SI stat.ML

    Subgraph Frequency Distribution Estimation using Graph Neural Networks

    Authors: Zhongren Chen, Xinyue Xu, Shengyi Jiang, Hao Wang, Lu Mi

    Abstract: Small subgraphs (graphlets) are important features to describe fundamental units of a large network. The calculation of the subgraph frequency distributions has a wide application in multiple domains including biology and engineering. Unfortunately due to the inherent complexity of this task, most of the existing methods are computationally intensive and inefficient. In this work, we propose GNNS,…

    Submitted 14 July, 2022; originally announced July 2022.

    Comments: accepted by KDD 2022 Workshop on Deep Learning on Graphs

  14. arXiv:2110.06421  [pdf, other]

    cs.LG

    Revisiting Latent-Space Interpolation via a Quantitative Evaluation Framework

    Authors: Lu Mi, Tianxing He, Core Francisco Park, Hao Wang, Yue Wang, Nir Shavit

    Abstract: Latent-space interpolation is commonly used to demonstrate the generalization ability of deep latent variable models. Various algorithms have been proposed to calculate the best trajectory between two encodings in the latent space. In this work, we show how data labeled with semantically continuous attributes can be utilized to conduct a quantitative evaluation of latent-space interpolation algori…

    Submitted 12 October, 2021; originally announced October 2021.

    Comments: 11 pages

  15. Predicate correlation learning for scene graph generation

    Authors: Leitian Tao, Li Mi, Nannan Li, Xianhang Cheng, Yaosi Hu, Zhenzhong Chen

    Abstract: For a typical Scene Graph Generation (SGG) method, there is often a large gap in the performance of the predicates' head classes and tail classes. This phenomenon is mainly caused by the semantic overlap between different predicates as well as the long-tailed data distribution. In this paper, a Predicate Correlation Learning (PCL) method for SGG is proposed to address the above two problems by tak…

    Submitted 6 July, 2021; originally announced July 2021.

  16. arXiv:2107.01181  [pdf, other]

    cs.CV cs.AI

    Visual Relationship Forecasting in Videos

    Authors: Li Mi, Yangjun Ou, Zhenzhong Chen

    Abstract: Real-world scenarios often require the anticipation of object interactions in unknown future, which would assist the decision-making process of both humans and agents. To meet this challenge, we present a new task named Visual Relationship Forecasting (VRF) in videos to explore the prediction of visual relationships in a reasoning manner. Specifically, given a subject-object pair with H existing f…

    Submitted 2 July, 2021; originally announced July 2021.

  17. arXiv:2106.14880  [pdf, other]

    cs.CV

    HDMapGen: A Hierarchical Graph Generative Model of High Definition Maps

    Authors: Lu Mi, Hang Zhao, Charlie Nash, Xiaohan Jin, Jiyang Gao, Chen Sun, Cordelia Schmid, Nir Shavit, Yuning Chai, Dragomir Anguelov

    Abstract: High Definition (HD) maps are maps with precise definitions of road lanes with rich semantics of the traffic rules. They are critical for several key stages in an autonomous driving system, including motion forecasting and planning. However, there are only a small amount of real-world road topologies and geometries, which significantly limits our ability to test out the self-driving stack to gener…

    Submitted 28 June, 2021; originally announced June 2021.

  18. Learning Guided Electron Microscopy with Active Acquisition

    Authors: Lu Mi, Hao Wang, Yaron Meirovitch, Richard Schalek, Srinivas C. Turaga, Jeff W. Lichtman, Aravinthan D. T. Samuel, Nir Shavit

    Abstract: Single-beam scanning electron microscopes (SEM) are widely used to acquire massive data sets for biomedical study, material analysis, and fabrication inspection. Datasets are typically acquired with uniform acquisition: applying the electron beam with the same power and duration to all image pixels, even if there is great variety in the pixels' importance for eventual use. Many SEMs are now able t…

    Submitted 7 January, 2021; originally announced January 2021.

    Comments: MICCAI 2020

  19. arXiv:2002.10543  [pdf, other]

    cs.LG stat.ML

    Variational Wasserstein Barycenters for Geometric Clustering

    Authors: Liang Mi

    Abstract: We propose to compute Wasserstein barycenters (WBs) by solving for Monge maps with variational principle. We discuss the metric properties of WBs and explore their connections, especially the connections of Monge WBs, to K-means clustering and co-clustering. We also discuss the feasibility of Monge WBs on unbalanced measures and spherical domains. We propose two new problems -- regularized K-means…

    Submitted 29 March, 2023; v1 submitted 24 February, 2020; originally announced February 2020.

  20. arXiv:2001.11114  [pdf, other]

    cs.LG cs.DM math.FA stat.ML

    A Family of Pairwise Multi-Marginal Optimal Transports that Define a Generalized Metric

    Authors: Liang Mi, Azadeh Sheikholeslami, José Bento

    Abstract: The Optimal transport (OT) problem is rapidly finding its way into machine learning. Favoring its use are its metric properties. Many problems admit solutions with guarantees only for objects embedded in metric spaces, and the use of non-metrics can complicate solving them. Multi-marginal OT (MMOT) generalizes OT to simultaneously transporting multiple distributions. It captures important relation…

    Submitted 22 December, 2022; v1 submitted 29 January, 2020; originally announced January 2020.

    Comments: Machine Learning (2022)

  21. arXiv:1910.04858  [pdf, other]

    cs.CV cs.LG

    Training-Free Uncertainty Estimation for Dense Regression: Sensitivity as a Surrogate

    Authors: Lu Mi, Hao Wang, Yonglong Tian, Hao He, Nir Shavit

    Abstract: Uncertainty estimation is an essential step in the evaluation of the robustness for deep learning models in computer vision, especially when applied in risk-sensitive areas. However, most state-of-the-art deep learning models either fail to obtain uncertainty estimation or need significant modification (e.g., formulating a proper Bayesian treatment) to obtain it. Most previous methods are not able…

    Submitted 10 January, 2022; v1 submitted 27 September, 2019; originally announced October 2019.

    Comments: In proceedings of the 36th AAAI Conference on Artificial Intelligence

  22. arXiv:1907.12188

    cs.HC

    Hand-Gesture-Recognition Based Text Input Method for AR/VR Wearable Devices

    Authors: Nizamuddin Maitlo, Yanbo Wang, Chao Ping Chen, Lantian Mi, Wenbo Zhang

    Abstract: Static and dynamic hand movements are basic way for human-machine interactions. To recognize and classify these movements, first these movements are captured by the cameras mounted on the augmented reality (AR) or virtual reality (VR) wearable devices. The hand is segmented using segmentation method and its gestures are passed to hand gesture recognition algorithm, which depends on depth-wise sepa…

    Submitted 2 April, 2020; v1 submitted 28 July, 2019; originally announced July 2019.

    Comments: Information is not correct need to rewrite

  23. arXiv:1812.05676  [pdf, other]

    cs.LG stat.ML

    A Probe Towards Understanding GAN and VAE Models

    Authors: Lu Mi, Macheng Shen, Jingzhao Zhang

    Abstract: This project report compares some known GAN and VAE models proposed prior to 2017. There has been significant progress after we finished this report. We upload this report as an introduction to generative models and provide some personal interpretations supported by empirical evidence. Both generative adversarial network models and variational autoencoders have been widely used to approximate prob…

    Submitted 17 December, 2018; v1 submitted 13 December, 2018; originally announced December 2018.

    Comments: 9 pages, 8 figures

  24. arXiv:1812.01157  [pdf, other]

    cs.CV

    Cross-Classification Clustering: An Efficient Multi-Object Tracking Technique for 3-D Instance Segmentation in Connectomics

    Authors: Yaron Meirovitch, Lu Mi, Hayk Saribekyan, Alexander Matveev, David Rolnick, Nir Shavit

    Abstract: Pixel-accurate tracking of objects is a key element in many computer vision applications, often solved by iterated individual object tracking or instance segmentation followed by object matching. Here we introduce cross-classification clustering (3C), a technique that simultaneously tracks complex, interrelated objects in an image stack. The key idea in cross-classification is to efficiently turn…

    Submitted 15 June, 2019; v1 submitted 3 December, 2018; originally announced December 2018.

    Comments: 11 figures

    Journal ref: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019, pp. 8425-8435

  25. arXiv:1812.00338  [pdf, other]

    cs.LG stat.ML

    Regularized Wasserstein Means for Aligning Distributional Data

    Authors: Liang Mi, Wen Zhang, Yalin Wang

    Abstract: We propose to align distributional data from the perspective of Wasserstein means. We raise the problem of regularizing Wasserstein means and propose several terms tailored to tackle different problems. Our formulation is based on the variational transportation to distribute a sparse discrete measure into the target domain. The resulting sparse representation well captures the desired property of…

    Submitted 20 February, 2020; v1 submitted 2 December, 2018; originally announced December 2018.

  26. arXiv:1806.09045  [pdf, other]

    cs.CV

    Variational Wasserstein Clustering

    Authors: Liang Mi, Wen Zhang, Xianfeng Gu, Yalin Wang

    Abstract: We propose a new clustering method based on optimal transportation. We solve optimal transportation with variational principles, and investigate the use of power diagrams as transportation plans for aggregating arbitrary domains into a fixed number of clusters. We iteratively drive centroids through target domains while maintaining the minimum clustering energy by adjusting the power diagrams. Thu…

    Submitted 26 July, 2018; v1 submitted 23 June, 2018; originally announced June 2018.

    Comments: Accepted to ECCV 2018

  27. arXiv:1801.07548  [pdf, ps, other]

    cs.DC astro-ph.IM

    A hybrid architecture for astronomical computing

    Authors: Changhua Li, Chenzhou Cui, Boliang He, Dongwei Fan, Linying Mi, Shanshan Li, Sisi Yang, Yunfei Xu, Jun Han, Junyi Chen, Hailong Zhang, Ce Yu, Jian Xiao, Chuanjun Wang, Zihuang Cao, Yufeng Fan, Liang Liu, Xiao Chen, Wenming Song, Kangyu Du

    Abstract: With many large science equipment constructing and putting into use, astronomy has stepped into the big data era. The new method and infrastructure of big data processing has become a new requirement of many astronomers. Cloud computing, Map/Reduce, Hadoop, Spark, etc. many new technology has sprung up in recent years. Comparing to the high performance computing(HPC), Data is the center of these n…

    Submitted 18 January, 2018; originally announced January 2018.

    Comments: 4 pages, 2 figures, ADASS XXVI conference