Skip to main content

Showing 1–50 of 295 results for author: Yin, S

  1. arXiv:2407.10416  [pdf, other

    cs.AR

    SOFA: A Compute-Memory Optimized Sparsity Accelerator via Cross-Stage Coordinated Tiling

    Authors: Huizheng Wang, Jiahao Fang, Xinru Tang, Zhiheng Yue, Jinxi Li, Yubin Qin, Sihan Guan, Qize Yang, Yang Wang, Chao Li, Yang Hu, Shouyi Yin

    Abstract: Benefiting from the self-attention mechanism, Transformer models have attained impressive contextual comprehension capabilities for lengthy texts. The requirements of high-throughput inference arise as the large language models (LLMs) become increasingly prevalent, which calls for large-scale token parallel processing (LTPP). However, existing dynamic sparse accelerators struggle to effectively ha… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

  2. arXiv:2407.07896  [pdf, other

    physics.optics cond-mat.mes-hall cs.LG physics.app-ph physics.space-ph

    Pentagonal Photonic Crystal Mirrors: Scalable Lightsails with Enhanced Acceleration via Neural Topology Optimization

    Authors: L. Norder, S. Yin, M. J. de Jong, F. Stallone, H. Aydogmus, P. M. Sberna, M. A. Bessa, R. A. Norte

    Abstract: The Starshot Breakthrough Initiative aims to send one-gram microchip probes to Alpha Centauri within 20 years, using gram-scale lightsails propelled by laser-based radiation pressure, reaching velocities nearing a fifth of light speed. This mission requires lightsail materials that challenge the fundamentals of nanotechnology, requiring innovations in optics, material science and structural engine… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  3. arXiv:2407.03966  [pdf, other

    cs.SD cs.AI eess.AS

    Serialized Output Training by Learned Dominance

    Authors: Ying Shi, Lantian Li, Shi Yin, Dong Wang, Jiqing Han

    Abstract: Serialized Output Training (SOT) has showcased state-of-the-art performance in multi-talker speech recognition by sequentially decoding the speech of individual speakers. To address the challenging label-permutation issue, prior methods have relied on either the Permutation Invariant Training (PIT) or the time-based First-In-First-Out (FIFO) rule. This study presents a model-based serialization st… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: accepted by INTERSPEECH 2024

  4. arXiv:2406.10635  [pdf, other

    cs.RO cs.DB cs.OS

    ROSfs: A User-Level File System for ROS

    Authors: Zijun Xu, Xuanjun Wen, Yanjie Song, Shu Yin

    Abstract: We present ROSfs, a novel user-level file system for the Robot Operating System (ROS). ROSfs interprets a robot file as a group of sub-files, with each having a distinct label. ROSfs applies a time index structure to enhance the flexible data query while the data file is under modification. It provides multi-robot systems (MRS) with prompt cross-robot data acquisition and collaboration. We impleme… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

  5. arXiv:2406.08148  [pdf, other

    cs.LG cs.AI

    Probing Implicit Bias in Semi-gradient Q-learning: Visualizing the Effective Loss Landscapes via the Fokker--Planck Equation

    Authors: Shuyu Yin, Fei Wen, Peilin Liu, Tao Luo

    Abstract: Semi-gradient Q-learning is applied in many fields, but due to the absence of an explicit loss function, studying its dynamics and implicit bias in the parameter space is challenging. This paper introduces the Fokker--Planck equation and employs partial data obtained through sampling to construct and visualize the effective loss landscape within a two-dimensional parameter space. This visualizatio… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  6. arXiv:2406.07421  [pdf, other

    cs.SD eess.AS

    A Comprehensive Investigation on Speaker Augmentation for Speaker Recognition

    Authors: Zhenyu Zhou, Shibiao Xu, Shi Yin, Lantian Li, Dong Wang

    Abstract: Data augmentation (DA) has played a pivotal role in the success of deep speaker recognition. Current DA techniques primarily focus on speaker-preserving augmentation, which does not change the speaker trait of the speech and does not create new speakers. Recent research has shed light on the potential of speaker augmentation, which generates new speakers to enrich the training dataset. In this stu… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: to be published in INTERSPEECH 2024

  7. arXiv:2406.03868  [pdf, other

    cs.DC

    PALM: A Efficient Performance Simulator for Tiled Accelerators with Large-scale Model Training

    Authors: Jiahao Fang, Huizheng Wang, Qize Yang, Dehao Kong, Xu Dai, Jinyi Deng, Yang Hu, Shouyi Yin

    Abstract: Deep learning (DL) models are piquing high interest and scaling at an unprecedented rate. To this end, a handful of tiled accelerators have been proposed to support such large-scale training tasks. However, these accelerators often incorporate numerous cores or tiles even extending to wafer-scale, substantial on-chip bandwidth, and distributed memory systems. This results in an exceedingly complex… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 11 pages

  8. arXiv:2405.18132  [pdf, other

    cs.CV

    EG4D: Explicit Generation of 4D Object without Score Distillation

    Authors: Qi Sun, Zhiyang Guo, Ziyu Wan, Jing Nathan Yan, Shengming Yin, Wengang Zhou, Jing Liao, Houqiang Li

    Abstract: In recent years, the increasing demand for dynamic 3D assets in design and gaming applications has given rise to powerful generative pipelines capable of synthesizing high-quality 4D objects. Previous methods generally rely on score distillation sampling (SDS) algorithm to infer the unseen views and motion of 4D objects, thus leading to unsatisfactory results with defects like over-saturation and… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  9. arXiv:2405.17221  [pdf, other

    cs.AI cs.AR

    Efficient Orchestrated AI Workflows Execution on Scale-out Spatial Architecture

    Authors: Jinyi Deng, Xinru Tang, Zhiheng Yue, Guangyang Lu, Qize Yang, Jiahao Zhang, Jinxi Li, Chao Li, Shaojun Wei, Yang Hu, Shouyi Yin

    Abstract: Given the increasing complexity of AI applications, traditional spatial architectures frequently fall short. Our analysis identifies a pattern of interconnected, multi-faceted tasks encompassing both AI and general computational processes. In response, we have conceptualized "Orchestrated AI Workflows," an approach that integrates various tasks with logic-driven decisions into dynamic, sophisticat… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  10. arXiv:2405.15223  [pdf, other

    cs.CV cs.LG cs.RO

    iVideoGPT: Interactive VideoGPTs are Scalable World Models

    Authors: Jialong Wu, Shaofeng Yin, Ningya Feng, Xu He, Dong Li, Jianye Hao, Mingsheng Long

    Abstract: World models empower model-based agents to interactively explore, reason, and plan within imagined environments for real-world decision-making. However, the high demand for interactivity poses challenges in harnessing recent advancements in video generative models for developing world models at scale. This work introduces Interactive VideoGPT (iVideoGPT), a scalable autoregressive transformer fram… ▽ More

    Submitted 2 June, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: Project website: https://thuml.github.io/iVideoGPT

  11. arXiv:2405.10463  [pdf, other

    physics.optics eess.IV physics.bio-ph

    Single-shot volumetric fluorescence imaging with neural fields

    Authors: Oumeng Zhang, Haowen Zhou, Brandon Y. Feng, Elin M. Larsson, Reinaldo E. Alcalde, Siyuan Yin, Catherine Deng, Changhuei Yang

    Abstract: Single-shot volumetric fluorescence (SVF) imaging offers a significant advantage over traditional imaging methods that require scanning across multiple axial planes as it can capture biological processes with high temporal resolution across a large field of view. The key challenges in SVF imaging include requiring sparsity constraints to meet the multiplexing requirements of compressed sensing, el… ▽ More

    Submitted 4 June, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

  12. arXiv:2405.07551  [pdf, other

    cs.CL cs.AI

    MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical Reasoning

    Authors: Shuo Yin, Weihao You, Zhilong Ji, Guoqiang Zhong, Jinfeng Bai

    Abstract: The tool-use Large Language Models (LLMs) that integrate with external Python interpreters have significantly enhanced mathematical reasoning capabilities for open-source LLMs, while tool-free methods chose another track: augmenting math reasoning data. However, a great method to integrate the above two research paths and combine their advantages remains to be explored. In this work, we firstly in… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: The state-of-the-art open-source tool-use LLMs for mathematical reasoning

  13. arXiv:2405.06887  [pdf, other

    cs.CV

    FineParser: A Fine-grained Spatio-temporal Action Parser for Human-centric Action Quality Assessment

    Authors: Jinglin Xu, Sibo Yin, Guohao Zhao, Zishuo Wang, Yuxin Peng

    Abstract: Existing action quality assessment (AQA) methods mainly learn deep representations at the video level for scoring diverse actions. Due to the lack of a fine-grained understanding of actions in videos, they harshly suffer from low credibility and interpretability, thus insufficient for stringent applications, such as Olympic diving events. We argue that a fine-grained understanding of actions requi… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: Accepted by CVPR 2024

  14. arXiv:2405.05722  [pdf, other

    cs.LG cond-mat.mtrl-sci physics.chem-ph

    A Framework of SO(3)-equivariant Non-linear Representation Learning and its Application to Electronic-Structure Hamiltonian Prediction

    Authors: Shi Yin, Xinyang Pan, Fengyan Wang, Feng Wu, Lixin He

    Abstract: We present both a theoretical and a methodological framework that addresses a critical challenge in applying deep learning to physical systems: the reconciliation of non-linear expressiveness with SO(3)-equivariance in predictions of SO(3)-equivariant quantities. Inspired by covariant theory in physics, we address this problem by exploring the mathematical relationships between SO(3)-invariant and… ▽ More

    Submitted 18 June, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

  15. arXiv:2405.02155  [pdf, other

    cs.CV

    Multi-method Integration with Confidence-based Weighting for Zero-shot Image Classification

    Authors: Siqi Yin, Lifan Jiang

    Abstract: This paper introduces a novel framework for zero-shot learning (ZSL), i.e., to recognize new categories that are unseen during training, by using a multi-model and multi-alignment integration method. Specifically, we propose three strategies to enhance the model's performance to handle ZSL: 1) Utilizing the extensive knowledge of ChatGPT and the powerful image generation capabilities of DALL-E to… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  16. arXiv:2404.18612  [pdf

    cs.RO

    Enhancing Prosthetic Safety and Environmental Adaptability: A Visual-Inertial Prosthesis Motion Estimation Approach on Uneven Terrains

    Authors: Chuheng Chen, Xinxing Chen, Shucong Yin, Yuxuan Wang, Binxin Huang, Yuquan Leng, Chenglong Fu

    Abstract: Environment awareness is crucial for enhancing walking safety and stability of amputee wearing powered prosthesis when crossing uneven terrains such as stairs and obstacles. However, existing environmental perception systems for prosthesis only provide terrain types and corresponding parameters, which fails to prevent potential collisions when crossing uneven terrains and may lead to falls and oth… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  17. Social Force Embedded Mixed Graph Convolutional Network for Multi-class Trajectory Prediction

    Authors: Quancheng Du, Xiao Wang, Shouguo Yin, Lingxi Li, Huansheng Ning

    Abstract: Accurate prediction of agent motion trajectories is crucial for autonomous driving, contributing to the reduction of collision risks in human-vehicle interactions and ensuring ample response time for other traffic participants. Current research predominantly focuses on traditional deep learning methods, including convolutional neural networks (CNNs) and recurrent neural networks (RNNs). These meth… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: 11 pages,3 figures, published to IEEE Transactions on Intelligent vehicles

  18. arXiv:2404.12104  [pdf, other

    cs.CV cs.CL cs.LG

    Ethical-Lens: Curbing Malicious Usages of Open-Source Text-to-Image Models

    Authors: Yuzhu Cai, Sheng Yin, Yuxi Wei, Chenxin Xu, Weibo Mao, Felix Juefei-Xu, Siheng Chen, Yanfeng Wang

    Abstract: The burgeoning landscape of text-to-image models, exemplified by innovations such as Midjourney and DALLE 3, has revolutionized content creation across diverse sectors. However, these advancements bring forth critical ethical concerns, particularly with the misuse of open-source models to generate content that violates societal norms. Addressing this, we introduce Ethical-Lens, a framework designe… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 42 pages, 17 figures, 29 tables

  19. arXiv:2404.06762  [pdf, other

    cs.CL cs.HC

    Personality-aware Student Simulation for Conversational Intelligent Tutoring Systems

    Authors: Zhengyuan Liu, Stella Xin Yin, Geyu Lin, Nancy F. Chen

    Abstract: Intelligent Tutoring Systems (ITSs) can provide personalized and self-paced learning experience. The emergence of large language models (LLMs) further enables better human-machine interaction, and facilitates the development of conversational ITSs in various disciplines such as math and language learning. In dialogic teaching, recognizing and adapting to individual characteristics can significantl… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  20. arXiv:2404.06194  [pdf, other

    cs.CV

    Exploring the Potential of Large Foundation Models for Open-Vocabulary HOI Detection

    Authors: Ting Lei, Shaofeng Yin, Yang Liu

    Abstract: Open-vocabulary human-object interaction (HOI) detection, which is concerned with the problem of detecting novel HOIs guided by natural language, is crucial for understanding human-centric scenes. However, prior zero-shot HOI detectors often employ the same levels of feature maps to model HOIs with varying distances, leading to suboptimal performance in scenes containing human-object pairs with a… ▽ More

    Submitted 10 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  21. arXiv:2404.05412  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Valley edge states as bound states in the continuum

    Authors: Shunda Yin, Liping Ye, Hailong He, Xueqin Huang, Manzhu Ke, Weiyin Deng, Jiuyang Lu, Zhengyou Liu

    Abstract: Bound states in the continuum (BICs) are spatially localized states with energy embedded in the continuum spectrum of extended states. The combination of BICs physics and nontrivial band topology theory giving rise to topological BICs, which are robust against disorders and meanwhile of the merit of conventional BICs, is attracting wide attention recently. Here, we report valley edge states as top… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: A revised version has been accepted by Science Bulletin

  22. arXiv:2404.04449  [pdf

    physics.optics cond-mat.mtrl-sci physics.app-ph physics.space-ph

    Self-referencing photothermal common-path interferometry to measure absorption of Si3N4 membranes for laser-light sails

    Authors: Demeng Feng, Tanuj Kumar, Shenwei Yin, Merlin Mah, Phyo Lin, Margaret Fortman, Gabriel R. Jaffe, Chenghao Wan, Hongyan Mei, Yuzhe Xiao, Ron Synowicki, Ronald J. Warzoha, Victor W. Brar, Joseph J. Talghader, Mikhail A. Kats

    Abstract: Laser-light sails are a spacecraft concept wherein lightweight "sails" are propelled to high speeds by lasers with high intensities. The sails must comprise materials with low optical loss, to minimize the risk of laser damage. Stoichiometric silicon nitride (Si$_3$N$_4$) is a candidate material with low loss in the near infrared, but the precise absorption coefficient has not been characterized i… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: Main text + supplementary

  23. arXiv:2404.03681  [pdf, other

    physics.ins-det hep-ex

    Muon beamtest results of high-density glass scintillator tiles

    Authors: Dejing Du, Yong Liu, Hua Cai, Danping Chen, Zhehao Hua, Jifeng Han, Jifeng Han, Baohua Qi, Sen Qian, Jing Ren, Xinyuan Sun, Xinyuan Sun, Dong Yang, Shenghua Yin, Minghui Zhang

    Abstract: To achieve the physics goal of precisely measure the Higgs, Z, W bosons and the top quark, future electron-positron colliders require that their detector system has excellent jet energy resolution. One feasible technical option is the high granular calorimetery based on the particle flow algorithm (PFA). A new high-granularity hadronic calorimeter with glass scintillator tiles (GSHCAL) has been pr… ▽ More

    Submitted 9 May, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

  24. arXiv:2404.03429  [pdf, other

    cs.CL

    Scaffolding Language Learning via Multi-modal Tutoring Systems with Pedagogical Instructions

    Authors: Zhengyuan Liu, Stella Xin Yin, Carolyn Lee, Nancy F. Chen

    Abstract: Intelligent tutoring systems (ITSs) that imitate human tutors and aim to provide immediate and customized instructions or feedback to learners have shown their effectiveness in education. With the emergence of generative artificial intelligence, large language models (LLMs) further entitle the systems to complex and coherent conversational interactions. These systems would be of great help in lang… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  25. arXiv:2403.19258  [pdf, other

    cond-mat.str-el cond-mat.stat-mech

    Finite-time Scaling beyond the Kibble-Zurek Prerequisite: Driven Critical Dynamics in Strongly Interacting Dirac Systems

    Authors: Zhi Zeng, Yin-Kai Yu, Zhi-Xuan Li, Zi-Xiang Li, Shuai Yin

    Abstract: In conventional quantum critical point (QCP) characterized by order parameter fluctuations, the celebrated Kibble-Zurek mechanism (KZM) and finite-time scaling (FTS) theory provide universal descriptions of the driven critical dynamics. However, in strongly correlated fermionic systems where gapless fermions are usually present in vicinity of QCP, the driven dynamics has rarely been explored. In t… ▽ More

    Submitted 29 March, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: 9+3 pages, 5+2 figures

  26. arXiv:2403.09084  [pdf, other

    cond-mat.str-el cond-mat.stat-mech

    Imaginary-time relaxation quantum critical dynamics in two-dimensional dimerized Heisenberg model

    Authors: Jia-Qi Cai, Yu-Rong Shu, Xue-Qing Rao, Shuai Yin

    Abstract: We study the imaginary-time relaxation critical dynamics of the Neel-paramagnetic quantum phase transition in the two-dimensional (2D) dimerized S = 1/2 Heisenberg model. We focus on the scaling correction in the short-time region. A unified scaling form including both short-time and finite-size corrections is proposed. According to this full scaling form, improved short-imaginary-time scaling rel… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: 10 pages, 8 figures

    Journal ref: Phys. Rev. B 109, 184303(2024)

  27. arXiv:2403.08459  [pdf, other

    quant-ph cond-mat.dis-nn cond-mat.stat-mech

    Symmetry restoration and quantum Mpemba effect in symmetric random circuits

    Authors: Shuo Liu, Hao-Kai Zhang, Shuai Yin, Shi-Xin Zhang

    Abstract: Entanglement asymmetry, which serves as a diagnostic tool for symmetry breaking and a proxy for thermalization, has recently been proposed and studied in the context of symmetry restoration for quantum many-body systems undergoing a quench. In this Letter, we investigate symmetry restoration in various symmetric random quantum circuits, particularly focusing on the U(1) symmetry case. In contrast… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 4.5 pages, 5 figures, and Supplemental Material

  28. arXiv:2403.06770  [pdf, other

    hep-ph

    Estimates on the convergence of expansions at finite baryon chemical potentials

    Authors: Rui Wen, Shi Yin, Wei-jie Fu

    Abstract: Convergence of three different expansion schemes at finite baryon chemical potentials, including the conventional Taylor expansion, the Padé approximants, and the $T'$ expansion proposed recently in lattice QCD simulations, have been investigated in a low energy effective theory within the fRG approach. It is found that the $T'$ expansion or the Padé approximants would hardly improve the convergen… ▽ More

    Submitted 19 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: 9 pages, 6 figures

  29. arXiv:2403.04481  [pdf, other

    cs.CL cs.AI

    Do Large Language Model Understand Multi-Intent Spoken Language ?

    Authors: Shangjian Yin, Peijie Huang, Yuhong Xu, Haojing Huang, Jiatian Chen

    Abstract: This research signifies a considerable breakthrough in leveraging Large Language Models (LLMs) for multi-intent spoken language understanding (SLU). Our approach re-imagines the use of entity slots in multi-intent SLU applications, making the most of the generative potential of LLMs within the SLU landscape, leading to the development of the EN-LLM series. Furthermore, we introduce the concept of… ▽ More

    Submitted 15 April, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

  30. arXiv:2403.03742  [pdf, other

    cs.HC

    Mitigating Ageism through Virtual Reality: Intergenerational Collaborative Escape Room Design

    Authors: Ruotong Zou, Shuyu Yin, Tianqi Song, Peinuan Qin, Yi-Chieh Lee

    Abstract: As virtual reality (VR) becomes more popular for intergenerational collaboration, there is still a significant gap in research regarding understanding the potential for reducing ageism. Our study aims to address this gap by analyzing ageism levels before and after VR escape room collaborative experiences. We recruited 28 participants to collaborate with an older player in a challenging VR escape r… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  31. arXiv:2403.00019  [pdf, other

    cs.LG stat.ML

    Transformer-based Parameter Estimation in Statistics

    Authors: Xiaoxin Yin, David S. Yin

    Abstract: Parameter estimation is one of the most important tasks in statistics, and is key to helping people understand the distribution behind a sample of observations. Traditionally parameter estimation is done either by closed-form solutions (e.g., maximum likelihood estimation for Gaussian distribution), or by iterative numerical methods such as Newton-Raphson method when closed-form solution does not… ▽ More

    Submitted 27 February, 2024; originally announced March 2024.

  32. arXiv:2402.16899  [pdf, other

    cs.LG cs.AI

    A priori Estimates for Deep Residual Network in Continuous-time Reinforcement Learning

    Authors: Shuyu Yin, Qixuan Zhou, Fei Wen, Tao Luo

    Abstract: Deep reinforcement learning excels in numerous large-scale practical applications. However, existing performance analyses ignores the unique characteristics of continuous-time control problems, is unable to directly estimate the generalization error of the Bellman optimal loss and require a boundedness assumption. Our work focuses on continuous-time control problems and proposes a method that is a… ▽ More

    Submitted 7 March, 2024; v1 submitted 24 February, 2024; originally announced February 2024.

  33. arXiv:2402.16272  [pdf, other

    physics.ins-det hep-ex

    Mass production and performance study on the 20-inch PMT acrylic protection covers in JUNO

    Authors: Miao He, Zhonghua Qin, Diru Wu, Meihang Xu, Wan Xie, Fang Chen, Xiaoping Jing, Genhua Yin, Shengjiong Yin, Linhua Gu, Xiaofeng Xia, Qinchang Wang

    Abstract: The Jiangmen Underground Neutrino Observatory is a neutrino experiment that incorporates 20,012 20-inch photomultiplier tubes (PMTs) and 25,600 3-inch PMTs. A dedicated system was designed to protect the PMTs from an implosion chain reaction underwater. As a crucial element of the protection system, over 20,000 acrylic covers were manufactured through injection molding, ensuring high dimensional p… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

    Comments: 12 pages, 10 figures

  34. GazeTrak: Exploring Acoustic-based Eye Tracking on a Glass Frame

    Authors: Ke Li, Ruidong Zhang, Boao Chen, Siyuan Chen, Sicheng Yin, Saif Mahmud, Qikang Liang, François Guimbretière, Cheng Zhang

    Abstract: In this paper, we present GazeTrak, the first acoustic-based eye tracking system on glasses. Our system only needs one speaker and four microphones attached to each side of the glasses. These acoustic sensors capture the formations of the eyeballs and the surrounding areas by emitting encoded inaudible sound towards eyeballs and receiving the reflected signals. These reflected signals are further… ▽ More

    Submitted 23 February, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: 16 pages, 5 figures, 7 tables, The 30th Annual International Conference on Mobile Computing and Networking (ACM MobiCom 2024)

  35. arXiv:2402.12823  [pdf, other

    nucl-th hep-ph nucl-ex

    The influence of hadronic rescatterings on the net-baryon number fluctuations

    Authors: Qian Chen, Rui Wen, Shi Yin, Wei-jie Fu, Zi-Wei Lin, Guo-Liang Ma

    Abstract: Fluctuations of conserved charges, such as the net-baryon number fluctuations, are influenced by different dynamical evolution processes. In this paper, we investigate the influence of hadronic rescatterings on different orders of cumulants of the net-baryon number distribution. At the start of hadronic rescatterings, we introduce net-baryon number distributions reconstructed based on net-baryon c… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: 10 pages, 8 figures

  36. arXiv:2402.10534  [pdf, other

    cs.CV

    Using Left and Right Brains Together: Towards Vision and Language Planning

    Authors: Jun Cen, Chenfei Wu, Xiao Liu, Shengming Yin, Yixuan Pei, Jinglong Yang, Qifeng Chen, Nan Duan, Jianguo Zhang

    Abstract: Large Language Models (LLMs) and Large Multi-modality Models (LMMs) have demonstrated remarkable decision masking capabilities on a variety of tasks. However, they inherently operate planning within the language space, lacking the vision and spatial imagination ability. In contrast, humans utilize both left and right hemispheres of the brain for language and visual planning during the thinking pro… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: 19 pages, 13 figures

  37. arXiv:2402.02140  [pdf, other

    cs.CV eess.IV

    Generative Visual Compression: A Review

    Authors: Bolin Chen, Shanzhi Yin, Peilin Chen, Shiqi Wang, Yan Ye

    Abstract: Artificial Intelligence Generated Content (AIGC) is leading a new technical revolution for the acquisition of digital content and impelling the progress of visual compression towards competitive performance gains and diverse functionalities over traditional codecs. This paper provides a thorough review on the recent advances of generative visual compression, illustrating great potentials and promi… ▽ More

    Submitted 3 February, 2024; originally announced February 2024.

  38. arXiv:2402.01271  [pdf, other

    eess.AS cs.SD

    An Intra-BRNN and GB-RVQ Based END-TO-END Neural Audio Codec

    Authors: Linping Xu, Jiawei Jiang, Dejun Zhang, Xianjun Xia, Li Chen, Yijian Xiao, Piao Ding, Shenyi Song, Sixing Yin, Ferdous Sohel

    Abstract: Recently, neural networks have proven to be effective in performing speech coding task at low bitrates. However, under-utilization of intra-frame correlations and the error of quantizer specifically degrade the reconstructed audio quality. To improve the coding quality, we present an end-to-end neural speech codec, namely CBRC (Convolutional and Bidirectional Recurrent neural Codec). An interleave… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: INTERSPEECH 2023

  39. arXiv:2401.17840  [pdf, other

    cs.SI

    Propagation Dynamics of Rumor vs. Non-rumor across Multiple Social Media Platforms Driven by User Characteristics

    Authors: Dongpeng Hou, Shu Yin, Chao Gao, Xianghua Li, Zhen Wang

    Abstract: Studying information propagation dynamics in social media can elucidate user behaviors and patterns. However, previous research often focuses on single platforms and fails to differentiate between the nuanced roles of source users and other participants in cascades. To address these limitations, we analyze propagation cascades on Twitter and Weibo combined with a crawled dataset of nearly one mill… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

  40. EchoWrist: Continuous Hand Pose Tracking and Hand-Object Interaction Recognition Using Low-Power Active Acoustic Sensing On a Wristband

    Authors: Chi-Jung Lee, Ruidong Zhang, Devansh Agarwal, Tianhong Catherine Yu, Vipin Gunda, Oliver Lopez, James Kim, Sicheng Yin, Boao Dong, Ke Li, Mose Sakashita, Francois Guimbretiere, Cheng Zhang

    Abstract: Our hands serve as a fundamental means of interaction with the world around us. Therefore, understanding hand poses and interaction context is critical for human-computer interaction. We present EchoWrist, a low-power wristband that continuously estimates 3D hand pose and recognizes hand-object interactions using active acoustic sensing. EchoWrist is equipped with two speakers emitting inaudible s… ▽ More

    Submitted 29 March, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

  41. arXiv:2401.17093  [pdf, other

    cs.CV cs.CL

    StrokeNUWA: Tokenizing Strokes for Vector Graphic Synthesis

    Authors: Zecheng Tang, Chenfei Wu, Zekai Zhang, Mingheng Ni, Shengming Yin, Yu Liu, Zhengyuan Yang, Lijuan Wang, Zicheng Liu, Juntao Li, Nan Duan

    Abstract: To leverage LLMs for visual synthesis, traditional methods convert raster image information into discrete grid tokens through specialized visual modules, while disrupting the model's ability to capture the true semantic representation of visual scenes. This paper posits that an alternative representation of images, vector graphics, can effectively surmount this limitation by enabling a more natura… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

  42. arXiv:2401.13886  [pdf

    cond-mat.mes-hall

    Observation of possible excitonic charge density waves and metal-insulator transitions in atomically thin semimetals

    Authors: Qiang Gao, Yang-hao Chan, Pengfei Jiao, Haiyang Chen, Shuaishuai Yin, Kanjanaporn Tangprapha, Yichen Yang, Xiaolong Li, Zhengtai Liu, Dawei Shen, Shengwei Jiang, Peng Chen

    Abstract: Charge density wave (CDW) is a collective quantum phenomenon with a charge modulation in solids1-2. Condensation of electron and hole pairs with finite momentum will lead to such an ordered state3-7. However, lattice symmetry breaking manifested as the softening of phonon modes can occur simultaneously, which makes it difficult to disentangle the origin of the transition8-14. Here, we report a con… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

    Comments: https://www.nature.com/articles/s41567-023-02349-0 published in Nature Physics

  43. arXiv:2401.00744  [pdf, other

    physics.comp-ph cond-mat.mtrl-sci cs.LG

    Towards Harmonization of SO(3)-Equivariance and Expressiveness: a Hybrid Deep Learning Framework for Electronic-Structure Hamiltonian Prediction

    Authors: Shi Yin, Xinyang Pan, Xudong Zhu, Tianyu Gao, Haochong Zhang, Feng Wu, Lixin He

    Abstract: Deep learning for predicting the electronic-structure Hamiltonian of quantum systems necessitates satisfying the covariance laws, among which achieving SO(3)-equivariance without sacrificing the non-linear expressive capability of networks remains unsolved. To navigate the harmonization between equivariance and expressiveness, we propose a deep learning method synergizing two distinct categories o… ▽ More

    Submitted 21 June, 2024; v1 submitted 1 January, 2024; originally announced January 2024.

  44. arXiv:2312.13986  [pdf

    physics.optics physics.app-ph

    Deep Learning Enabled Design of Terahertz High-Q Metamaterials

    Authors: Shan Yin, Haotian Zhong, Wei Huang, Wentao Zhang, Jiaguang Han

    Abstract: Metamaterials open up a new way to manipulate electromagnetic waves and realize various functional devices. Metamaterials with high-quality (Q) resonance responses are widely employed in sensing, detection, and other applications. Traditional design of metamaterials involves laborious simulation-optimization and limits the efficiency. The high-Q metamaterials with abrupt spectral change are even h… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: 17 pages, 6 figures

  45. arXiv:2312.11820  [pdf, other

    cs.AR

    SoC-Tuner: An Importance-guided Exploration Framework for DNN-targeting SoC Design

    Authors: Shixin Chen, Su Zheng, Chen Bai, Wenqian Zhao, Shuo Yin, Yang Bai, Bei Yu

    Abstract: Designing a system-on-chip (SoC) for deep neural network (DNN) acceleration requires balancing multiple metrics such as latency, power, and area. However, most existing methods ignore the interactions among different SoC components and rely on inaccurate and error-prone evaluation tools, leading to inferior SoC design. In this paper, we present SoC-Tuner, a DNN-targeting exploration framework to f… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: ASP-DAC 2024

  46. arXiv:2312.05739  [pdf, other

    cs.SI cs.AI

    GAMC: An Unsupervised Method for Fake News Detection using Graph Autoencoder with Masking

    Authors: Shu Yin, Chao Gao, Zhen Wang

    Abstract: With the rise of social media, the spread of fake news has become a significant concern, potentially misleading public perceptions and impacting social stability. Although deep learning methods like CNNs, RNNs, and Transformer-based models like BERT have enhanced fake news detection, they primarily focus on content, overlooking social context during news propagation. Graph-based techniques have in… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

    Journal ref: the Thirty-Eighth AAAI Conference on Artificial Intelligence,2024

  47. arXiv:2311.15534  [pdf, other

    physics.optics quant-ph

    Analogue of collectively induced transparency in metamaterials

    Authors: Wei Huang, Shi-Ting Cao, Xiaowei Qu, Shan Yin, Wentao Zhang

    Abstract: Most recently, a brand new optical phenomenon, collectively induced transparency (CIT) has already been proposed in the cavity quantum electrodynamics system, which comes from the coupling between the cavity and ions and the quantum interference of collective ions. Due to the equivalent analogue of quantum optics, metamaterial also is a good platform to realize collectively induced transparency (C… ▽ More

    Submitted 26 November, 2023; originally announced November 2023.

  48. arXiv:2311.15030  [pdf, other

    cs.RO

    Tuning-free Quasi-stiffness Control Framework of a Powered Transfemoral Prosthesis for Task-adaptive Walking

    Authors: Teng Ma, Shucong Yin, Zhimin Hou, Binxin Huang, Haoyong Yu, Chenglong Fu

    Abstract: Impedance-based control represents a prevalent strategy in the development of powered transfemoral prostheses. However, creating a task-adaptive, tuning-free controller that effectively generalizes across diverse locomotion modes and terrain conditions continues to be a significant challenge. This letter proposes a tuning-free and task-adaptive quasi-stiffness control framework for powered prosthe… ▽ More

    Submitted 26 March, 2024; v1 submitted 25 November, 2023; originally announced November 2023.

    Comments: 8 pages, 10 figures. This work has been submitted to the IEEE-RAL for possible publication

  49. arXiv:2311.12259  [pdf, ps, other

    gr-qc astro-ph.CO hep-ph hep-th

    Analytical models of supermassive black holes in galaxies surrounded by dark matter halos

    Authors: Zibo Shen, Anzhong Wang, Yungui Gong, Shaoyu Yin

    Abstract: In this Letter, we present five analytical models in closed forms, each representing a supermassive black hole (SMBH) located at the center of a galaxy surrounded by dark matter (DM) halo. The density profile of the halo vanishes inside twice the Schwarzschild radius of the hole and satisfies the weak, strong, and dominant energy conditions. The spacetime are asymptotically flat, and the differenc… ▽ More

    Submitted 19 June, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

    Comments: revtex4-2, no figures. Version to appear in Phys. Lett. B 855 (2024) 138797

    Journal ref: Phys. Lett. B 855 (2024) 138797

  50. arXiv:2311.06203  [pdf, other

    cond-mat.stat-mech

    Relaxation Critical Dynamics with Emergent Symmetry

    Authors: Yu-Rong Shu, Shuai Yin

    Abstract: Different from usual critical point characterized by a single length scale, critical point with emergent symmetry exhibits intriguing critical properties characterized by two relevant length scales, attracting long-term investigations from both theoretical and experimental aspects. A natural question is how the critical dynamics is affected by the presence of two relevant length scales. Here we st… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

    Comments: 7 pages, 4 figures