Showing 1–50 of 69 results for author: Yamamoto, K

  1. arXiv:2406.03605  [pdf, other]

    cs.RO

    Towards the Development of a Tendon-Actuated Galvanometer for Endoscopic Surgical Laser Scanning

    Authors: Kent K. Yamamoto, Tanner J. Zachem, Behnam Moradkhani, Yash Chitalia, Patrick J. Codd

    Abstract: There is a need for precision pathological sensing, imaging, and tissue manipulation in neurosurgical procedures, such as brain tumor resection. Precise tumor margin identification and resection can prevent further growth and protect critical structures. Surgical lasers with small laser diameters and steering capabilities can allow for new minimally invasive procedures by traversing through comple…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 6 pages, 7 figures, conference paper at the 2024 International Symposium on Medical Robotics

  2. arXiv:2405.20509  [pdf, ps, other]

    cs.RO

    An FBG-based Stiffness Estimation Sensor for In-vivo Diagnostics

    Authors: Behnam Moradkhani, Pejman Kheradmand, Harshith Jella, Kent K. Yamamoto, Alireza Tofangchi, Patrick J. Codd, Yash Chitalia

    Abstract: In-vivo tissue stiffness identification can be useful in pulmonary fibrosis diagnostics and minimally invasive tumor identification, among many other applications. In this work, we propose a palpation-based method for tissue stiffness estimation that uses a sensorized beam buckled onto the surface of a tissue. Fiber Bragg Gratings (FBGs) are used in our sensor as a shape-estimation modality to get…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 6 pages (excluding the references), 5 figures

  3. arXiv:2404.03161  [pdf, other]

    cs.CV cs.CL cs.MM

    BioVL-QR: Egocentric Biochemical Video-and-Language Dataset Using Micro QR Codes

    Authors: Taichi Nishimura, Koki Yamamoto, Yuto Haneji, Keiya Kajimura, Chihiro Nishiwaki, Eriko Daikoku, Natsuko Okuda, Fumihito Ono, Hirotaka Kameko, Shinsuke Mori

    Abstract: This paper introduces a biochemical vision-and-language dataset, which consists of 24 egocentric experiment videos, corresponding protocols, and video-and-language alignments. The key challenge in the wet-lab domain is that detecting equipment, reagents, and containers is difficult because the lab environment is scattered by filling objects on the table and some objects are indistinguishable. Therefore…

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: 6 pages

  4. arXiv:2401.16971  [pdf, other]

    cs.DC

    Autonomy Loops for Monitoring, Operational Data Analytics, Feedback, and Response in HPC Operations

    Authors: Francieli Boito, Jim Brandt, Valeria Cardellini, Philip Carns, Florina M. Ciorba, Hilary Egan, Ahmed Eleliemy, Ann Gentile, Thomas Gruber, Jeff Hanson, Utz-Uwe Haus, Kevin Huck, Thomas Ilsche, Thomas Jakobsche, Terry Jones, Sven Karlsson, Abdullah Mueen, Michael Ott, Tapasya Patki, Ivy Peng, Krishnan Raghavan, Stephen Simms, Kathleen Shoga, Michael Showerman, Devesh Tiwari, et al. (2 additional authors not shown)

    Abstract: Many High Performance Computing (HPC) facilities have developed and deployed frameworks in support of continuous monitoring and operational data analytics (MODA) to help improve efficiency and throughput. Because of the complexity and scale of systems and workflows and the need for low-latency response to address dynamic circumstances, automated feedback and response have the potential to be more…

    Submitted 30 January, 2024; originally announced January 2024.

  5. arXiv:2401.08821  [pdf]

    eess.IV cs.LG cs.RO

    Surface-Enhanced Raman Spectroscopy and Transfer Learning Toward Accurate Reconstruction of the Surgical Zone

    Authors: Ashutosh Raman, Ren A. Odion, Kent K. Yamamoto, Weston Ross, Tuan Vo-Dinh, Patrick J. Codd

    Abstract: Raman spectroscopy, a photonic modality based on the inelastic backscattering of coherent light, is a valuable asset to the intraoperative sensing space, offering non-ionizing potential and highly-specific molecular fingerprint-like spectroscopic signatures that can be used for diagnosis of pathological tissue in the dynamic surgical field. Though Raman suffers from weakness in intensity, Surface-…

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: Accepted to Hamlyn Symposium on Medical Robotics, 2023

  6. arXiv:2312.13787  [pdf, other]

    cs.HC

    User-adaptive Tourist Information Dialogue System with Yes/No Classifier and Sentiment Estimator

    Authors: Ryo Yanagimoto, Yunosuke Kubo, Miki Oshio, Mikio Nakano, Kenta Yamamoto, Kazunori Komatani

    Abstract: We introduce our system developed for Dialogue Robot Competition 2023 (DRC2023). First, rule-based utterance selection and utterance generation using a large language model (LLM) are combined. We ensure the quality of system utterances while also being able to respond to unexpected user utterances. Second, dialogue flow is controlled by considering the results of the BERT-based yes/no classifier a…

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: This paper is part of the proceedings of the Dialogue Robot Competition 2023

  7. arXiv:2311.04323  [pdf, other]

    cs.RO

    Incident Angle Study for Designing an Endoscopic Tool for Intraoperative Brain Tumor Detection

    Authors: Kent K. Yamamoto, Tanner J. Zachem, Weston A. Ross, Patrick J. Codd

    Abstract: In neurosurgical procedures maximizing the resection of tumor tissue while avoiding healthy tissue is of paramount importance and a difficult task due to many factors, such as surrounding eloquent brain. Swiftly identifying tumor tissue for removal could increase surgical outcomes. The TumorID is a laser-induced fluorescence spectroscopy device that utilizes endogenous fluorophores such as NADH an…

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: Accepted for publication in Hamlyn Symposium on Medical Robotics, 2023

  8. arXiv:2310.12404  [pdf, other]

    cs.SD cs.CL cs.HC cs.LG eess.AS

    Loop Copilot: Conducting AI Ensembles for Music Generation and Iterative Editing

    Authors: Yixiao Zhang, Akira Maezawa, Gus Xia, Kazuhiko Yamamoto, Simon Dixon

    Abstract: Creating music is iterative, requiring varied methods at each stage. However, existing AI music systems fall short in orchestrating multiple subsystems for diverse needs. To address this gap, we introduce Loop Copilot, a novel system that enables users to generate and iteratively refine music through an interactive, multi-round dialogue interface. The system uses a large language model to interpre…

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: Source code and demo video are available at https://sites.google.com/view/loop-copilot

  9. arXiv:2309.12547  [pdf, other]

    cs.RO

    Real-time Motion Generation and Data Augmentation for Grasping Moving Objects with Dynamic Speed and Position Changes

    Authors: Kenjiro Yamamoto, Hiroshi Ito, Hideyuki Ichiwara, Hiroki Mori, Tetsuya Ogata

    Abstract: While deep learning enables real robots to perform complex tasks that had been difficult to implement in the past, the challenge is the enormous amount of trial-and-error and motion teaching in a real environment. The manipulation of moving objects, due to their dynamic properties, requires learning a wide range of factors such as the object's position, movement speed, and grasping timing. We propose a…

    Submitted 21 September, 2023; originally announced September 2023.

  10. arXiv:2307.13007  [pdf, other]

    cs.NE cs.LG

    Sparse-firing regularization methods for spiking neural networks with time-to-first spike coding

    Authors: Yusuke Sakemi, Kakei Yamamoto, Takeo Hosomi, Kazuyuki Aihara

    Abstract: The training of multilayer spiking neural networks (SNNs) using the error backpropagation algorithm has made significant progress in recent years. Among the various training schemes, the error backpropagation method that directly uses the firing time of neurons has attracted considerable attention because it can realize ideal temporal coding. This method uses time-to-first spike (TTFS) coding, in…

    Submitted 24 July, 2023; originally announced July 2023.

  11. arXiv:2211.16113  [pdf, other]

    cs.NE cs.LG

    Timing-Based Backpropagation in Spiking Neural Networks Without Single-Spike Restrictions

    Authors: Kakei Yamamoto, Yusuke Sakemi, Kazuyuki Aihara

    Abstract: We propose a novel backpropagation algorithm for training spiking neural networks (SNNs) that encodes information in the relative multiple spike timing of individual neurons without single-spike restrictions. The proposed algorithm inherits the advantages of conventional timing-based methods in that it computes accurate gradients with respect to spike timing, which promotes ideal temporal coding.…

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: 10 pages, 5 figures

    ACM Class: I.5.1

  12. arXiv:2207.05902  [pdf, other]

    cs.CV

    Verifying Attention Robustness of Deep Neural Networks against Semantic Perturbations

    Authors: Satoshi Munakata, Caterina Urban, Haruki Yokoyama, Koji Yamamoto, Kazuki Munakata

    Abstract: It is known that deep neural networks (DNNs) classify an input image by paying particular attention to certain specific pixels; a graphical representation of the magnitude of attention to each pixel is called a saliency-map. Saliency-maps are used to check the validity of the classification decision basis, e.g., it is not a valid basis for classification if a DNN pays more attention to the backgro…

    Submitted 12 July, 2022; originally announced July 2022.

    Comments: 25 pages, 12 figures

    ACM Class: D.2.4; I.1.4

  13. arXiv:2112.09407  [pdf, other]

    cs.LG cs.NI

    Communication-oriented Model Fine-tuning for Packet-loss Resilient Distributed Inference under Highly Lossy IoT Networks

    Authors: Sohei Itahara, Takayuki Nishio, Yusuke Koda, Koji Yamamoto

    Abstract: The distributed inference (DI) framework has gained traction as a technique for real-time applications empowered by cutting-edge deep machine learning (ML) on resource-constrained Internet of things (IoT) devices. In DI, computational tasks are offloaded from the IoT device to the edge server via lossy IoT networks. However, generally, there is a communication system-level trade-off between commun…

    Submitted 17 December, 2021; originally announced December 2021.

    Comments: Submitted to IEEE Access

  14. arXiv:2112.06695  [pdf, other]

    cs.NI eess.SP

    Bi-directional Beamforming Feedback-based Firmware-agnostic WiFi Sensing: An Empirical Study

    Authors: S. Kondo, S. Itahara, K. Yamashita, K. Yamamoto, Y. Koda, T. Nishio, A. Taya

    Abstract: In the field of WiFi sensing, as an alternative sensing source of the channel state information (CSI) matrix, the use of a beamforming feedback matrix (BFM) that is a right singular matrix of the CSI matrix has attracted significant interest owing to its wide availability regarding the underlying WiFi systems. In the IEEE 802.11ac/ax standard, the station (STA) transmits a BFM to an access point (A…

    Submitted 27 February, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

    Comments: 10 pages, 7 figures

  15. arXiv:2112.06442  [pdf, ps, other]

    cs.RO

    Contact-Rich Manipulation of a Flexible Object based on Deep Predictive Learning using Vision and Tactility

    Authors: Hideyuki Ichiwara, Hiroshi Ito, Kenjiro Yamamoto, Hiroki Mori, Tetsuya Ogata

    Abstract: We achieved contact-rich flexible object manipulation, which was difficult to control with vision alone. In the unzipping task we chose as a validation task, the gripper grasps the puller, which hides the bag state such as the direction and amount of deformation behind it, making it difficult to obtain information to perform the task by vision alone. Additionally, the flexible fabric bag state con…

    Submitted 10 May, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

  16. Frame-Capture-Based CSI Recomposition Pertaining to Firmware-Agnostic WiFi Sensing

    Authors: Ryosuke Hanahara, Sohei Itahara, Kota Yamashita, Yusuke Koda, Akihito Taya, Takayuki Nishio, Koji Yamamoto

    Abstract: With regard to the implementation of WiFi sensing agnostic according to the availability of channel state information (CSI), we investigate the possibility of estimating a CSI matrix based on its compressed version, which is known as beamforming feedback matrix (BFM). Being different from the CSI matrix that is processed and discarded in physical layer components, the BFM can be captured using a m…

    Submitted 29 October, 2021; originally announced October 2021.

    Journal ref: Proc. IEEE 19th Annual Consumer Communications & Networking Conference (CCNC 2022)

  17. arXiv:2110.14211  [pdf, other]

    eess.SP cs.NI

    Beamforming Feedback-based Model-Driven Angle of Departure Estimation Toward Legacy Support in WiFi Sensing: An Experimental Study

    Authors: Sohei Itahara, Sota Kondo, Kota Yamashita, Takayuki Nishio, Koji Yamamoto, Yusuke Koda

    Abstract: This study experimentally validated the possibility of angle of departure (AoD) estimation using multiple signal classification (MUSIC) with only WiFi control frames for beamforming feedback (BFF), defined in IEEE 802.11ac/ax. The examined BFF-based MUSIC is a model-driven algorithm, which does not require a pre-obtained database. This contrasts with most existing BFF-based sensing techniques, whi…

    Submitted 2 February, 2022; v1 submitted 27 October, 2021; originally announced October 2021.

    Comments: Submitted to IEEE Access

  18. arXiv:2107.05043  [pdf, other]

    cs.CV cs.GR

    A Projector-Camera System Using Hybrid Pixels with Projection and Capturing Capabilities

    Authors: Kenta Yamamoto, Daisuke Iwai, Kosuke Sato

    Abstract: We propose a novel projector-camera system (ProCams) in which each pixel has both projection and capturing capabilities. Our proposed ProCams solves the difficulty of obtaining precise pixel correspondence between the projector and the camera. We implemented a proof-of-concept ProCams prototype and demonstrated its applicability to a dynamic projection mapping.

    Submitted 11 July, 2021; originally announced July 2021.

    Comments: Author's version of a paper published at IDW (International Display Workshops) 2020

    Journal ref: In Proceedings of the International Display Workshops, pp. 655-658, 2020

  19. arXiv:2107.04770  [pdf, other]

    cs.MM

    Computer Vision-assisted Single-antenna and Single-anchor RSSI Localization Harnessing Dynamic Blockage Events

    Authors: Tomoya Sunami, Sohei Itahara, Yusuke Koda, Takayuki Nishio, Koji Yamamoto

    Abstract: This paper demonstrates the feasibility of single-antenna and single-RF (radio frequency)-anchor received power strength indicator (RSSI) localization (SARR-LOC) with the assistance of the computer vision (CV) technique. Generally, to perform radio frequency (RF)-based device localization, either 1) fine-grained channel state information or 2) RSSIs from multiple antenna elements or multiple RF a…

    Submitted 6 December, 2021; v1 submitted 10 July, 2021; originally announced July 2021.

    Comments: Submitted to IEEE Internet of Things journal

  20. arXiv:2104.13629  [pdf, other]

    eess.SP cs.DC cs.LG

    Packet-Loss-Tolerant Split Inference for Delay-Sensitive Deep Learning in Lossy Wireless Networks

    Authors: Sohei Itahara, Takayuki Nishio, Koji Yamamoto

    Abstract: The distributed inference framework is an emerging technology for real-time applications empowered by cutting-edge deep machine learning (ML) on resource-constrained Internet of things (IoT) devices. In distributed inference, computational tasks are offloaded from the IoT device to other devices or the edge server via lossy IoT networks. However, narrow-band and lossy IoT networks cause non-neglig…

    Submitted 28 April, 2021; originally announced April 2021.

  21. ACK-Less Rate Adaptation for IEEE 802.11bc Enhanced Broadcast Services Using Sim-to-Real Deep Reinforcement Learning

    Authors: T. Kanda, Y. Koda, K. Yamamoto, T. Nishio

    Abstract: In IEEE 802.11bc, the broadcast mode on wireless local area networks (WLANs), data rate control that is based on acknowledgement (ACK) mechanism similar to the one in the current IEEE 802.11 WLANs is not applicable because ACK mechanism is not implemented. This paper addresses this challenge by proposing ACK-less data rate adaptation methods by capturing non-broadcast uplink frames of STAs. In IEE…

    Submitted 23 April, 2021; originally announced April 2021.

    Journal ref: Proc. IEEE 19th Annual Consumer Communications & Networking Conference (CCNC 2022)

  22. Decentralized and Model-Free Federated Learning: Consensus-Based Distillation in Function Space

    Authors: Akihito Taya, Takayuki Nishio, Masahiro Morikura, Koji Yamamoto

    Abstract: This paper proposes a fully decentralized federated learning (FL) scheme for Internet of Everything (IoE) devices that are connected via multi-hop networks. Because FL algorithms hardly converge the parameters of machine learning (ML) models, this paper focuses on the convergence of ML models in function spaces. Considering that the representative loss functions of ML tasks e.g., mean squared error…

    Submitted 3 October, 2022; v1 submitted 1 April, 2021; originally announced April 2021.

    Journal ref: IEEE Transactions on Signal and Information Processing over Networks, vol. 8, pp. 799-814, 2022

  23. arXiv:2103.07156  [pdf, other]

    cs.CV cs.LG

    Learnable Companding Quantization for Accurate Low-bit Neural Networks

    Authors: Kohei Yamamoto

    Abstract: Quantizing deep neural networks is an effective method for reducing memory consumption and improving inference speed, and is thus useful for implementation in resource-constrained devices. However, it is still hard for extremely low-bit models to achieve accuracy comparable with that of full-precision models. To address this issue, we propose learnable companding quantization (LCQ) as a novel non-…

    Submitted 12 March, 2021; originally announced March 2021.

    Comments: Accepted at CVPR 2021

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021, pp. 5029-5038

  24. arXiv:2103.01598  [pdf, ps, other]

    cs.RO

    Spatial Attention Point Network for Deep-learning-based Robust Autonomous Robot Motion Generation

    Authors: Hideyuki Ichiwara, Hiroshi Ito, Kenjiro Yamamoto, Hiroki Mori, Tetsuya Ogata

    Abstract: Deep learning provides a powerful framework for automated acquisition of complex robotic motions. However, despite a certain degree of generalization, the need for vast amounts of training data depending on the work-object position is an obstacle to industrial applications. Therefore, a robot motion-generation model that can respond to a variety of work-object positions with a small amount of trai…

    Submitted 2 March, 2021; originally announced March 2021.

  25. arXiv:2102.08055  [pdf, other]

    cs.LG cs.NI

    Zero-Shot Adaptation for mmWave Beam-Tracking on Overhead Messenger Wires through Robust Adversarial Reinforcement Learning

    Authors: Masao Shinzaki, Yusuke Koda, Koji Yamamoto, Takayuki Nishio, Masahiro Morikura, Yushi Shirato, Daisei Uchida, Naoki Kita

    Abstract: Millimeter wave (mmWave) beam-tracking based on machine learning enables the development of accurate tracking policies while obviating the need to periodically solve beam-optimization problems. However, its applicability is still arguable when training-test gaps exist in terms of environmental parameters that affect the node dynamics. From this skeptical point of view, the contribution of this stu…

    Submitted 10 July, 2021; v1 submitted 16 February, 2021; originally announced February 2021.

    Comments: 13 pages, 13 figures, 3 tables, under submission for possible publication for IEEE

  26. arXiv:2101.11326  [pdf, other]

    cs.HC

    See-Through Captions: Real-Time Captioning on Transparent Display for Deaf and Hard-of-Hearing People

    Authors: Kenta Yamamoto, Ippei Suzuki, Akihisa Shitara, Yoichi Ochiai

    Abstract: Real-time captioning is a useful technique for deaf and hard-of-hearing (DHH) people to talk to hearing people. With the improvement in device performance and the accuracy of automatic speech recognition (ASR), real-time captioning is becoming an important tool for helping DHH people in their daily lives. To realize higher-quality communication and overcome the limitations of mobile and augmented-…

    Submitted 27 January, 2021; originally announced January 2021.

  27. arXiv:2012.02431  [pdf]

    cs.SD eess.AS physics.app-ph

    Acoustic Hologram Optimisation Using Automatic Differentiation

    Authors: Tatsuki Fushimi, Kenta Yamamoto, Yoichi Ochiai

    Abstract: Acoustic holograms are the keystone of modern acoustics. They encode three-dimensional acoustic fields in two dimensions, and their quality determines the performance of acoustic systems. Optimisation methods that control only the phase of an acoustic wave are considered inferior to methods that control both the amplitude and phase of the wave. In this paper, we present Diff-PAT, an acoustic hologram…

    Submitted 4 December, 2020; originally announced December 2020.

    Comments: 25 pages, 5 figures, manuscript

  28. arXiv:2012.00982  [pdf, other]

    cs.NI

    Millimeter Wave Communications on Overhead Messenger Wire: Deep Reinforcement Learning-Based Predictive Beam Tracking

    Authors: Yusuke Koda, Masao Shinzaki, Koji Yamamoto, Takayuki Nishio, Masahiro Morikura, Yushi Shirato, Daisei Uchida, Naoki Kita

    Abstract: This paper discusses the feasibility of beam tracking against dynamics in millimeter wave (mmWave) nodes placed on overhead messenger wires, including wind-forced perturbations and disturbances caused by impulsive forces to wires. Our main contribution is to answer whether or not historical positions and velocities of a mmWave node are useful to track directional beams given the complicated on-wire…

    Submitted 2 December, 2020; originally announced December 2020.

    Comments: 12 pages, 18 figures

  29. arXiv:2009.13879  [pdf, other]

    cs.NI

    MAB-based Client Selection for Federated Learning with Uncertain Resources in Mobile Networks

    Authors: Naoya Yoshida, Takayuki Nishio, Masahiro Morikura, Koji Yamamoto

    Abstract: This paper proposes a client selection method for federated learning (FL) when the computation and communication resource of clients cannot be estimated; the method trains a machine learning (ML) model using the rich data and computational resources of mobile clients without collecting their data in central systems. Conventional FL with client selection estimates the required time for an FL round…

    Submitted 29 September, 2020; originally announced September 2020.

  30. Online Trainable Wireless Link Quality Prediction System using Camera Imagery

    Authors: Sohei Itahara, Takayuki Nishio, Masahiro Morikura, Koji Yamamoto

    Abstract: Machine-learning-based prediction of future wireless link quality is an emerging technique that can potentially improve the reliability of wireless communications, especially at higher frequencies (e.g., millimeter-wave and terahertz technologies), through predictive handover and beamforming to solve line-of-sight (LOS) blockage problem. In this study, a real-time online trainable wireless link qu…

    Submitted 29 September, 2020; originally announced September 2020.

  31. Distillation-Based Semi-Supervised Federated Learning for Communication-Efficient Collaborative Training with Non-IID Private Data

    Authors: Sohei Itahara, Takayuki Nishio, Yusuke Koda, Masahiro Morikura, Koji Yamamoto

    Abstract: This study develops a federated learning (FL) framework overcoming largely incremental communication costs due to model sizes in typical frameworks without compromising model performance. To this end, based on the idea of leveraging an unlabeled open dataset, we propose a distillation-based semi-supervised FL (DS-FL) algorithm that exchanges the outputs of local models among mobile devices, instea…

    Submitted 20 January, 2021; v1 submitted 13 August, 2020; originally announced August 2020.

    Journal ref: IEEE Transactions on Mobile Computing (2021) 1-15

  32. arXiv:2008.01645  [pdf, other]

    cs.HC cs.CV cs.GR cs.LG

    A Visual Analytics Framework for Reviewing Multivariate Time-Series Data with Dimensionality Reduction

    Authors: Takanori Fujiwara, Shilpika, Naohisa Sakamoto, Jorji Nonaka, Keiji Yamamoto, Kwan-Liu Ma

    Abstract: Data-driven problem solving in many real-world applications involves analysis of time-dependent multivariate data, for which dimensionality reduction (DR) methods are often used to uncover the intrinsic structure and features of the data. However, DR is usually applied to a subset of data that is either single-time-point multivariate or univariate time-series, resulting in the need to manually exa…

    Submitted 27 October, 2021; v1 submitted 2 August, 2020; originally announced August 2020.

    Comments: This is the author's version of the article that has been published in IEEE Transactions on Visualization and Computer Graphics. The final version of this record is available at: 10.1109/TVCG.2020.3028889

  33. arXiv:2007.08208  [pdf, other]

    cs.NI eess.SP

    Distributed Heteromodal Split Learning for Vision Aided mmWave Received Power Prediction

    Authors: Yusuke Koda, Jihong Park, Mehdi Bennis, Koji Yamamoto, Takayuki Nishio, Masahiro Morikura

    Abstract: The goal of this work is the accurate prediction of millimeter-wave received power leveraging both radio frequency (RF) signals and heterogeneous visual data from multiple distributed cameras, in a communication and energy-efficient manner while preserving data privacy. To this end, firstly focusing on data privacy, we propose heteromodal split learning with feature aggregation (HetSLAgg) that spl…

    Submitted 16 July, 2020; originally announced July 2020.

    Comments: 14 pages, 17 figures

  34. arXiv:2006.01413  [pdf]

    cs.CV

    Resolving Class Imbalance in Object Detection with Weighted Cross Entropy Losses

    Authors: Trong Huy Phan, Kazuma Yamamoto

    Abstract: Object detection is an important task in computer vision which serves a lot of real-world applications such as autonomous driving, surveillance and robotics. Along with the rapid thrive of large-scale data, numerous state-of-the-art generalized object detectors (e.g. Faster R-CNN, YOLO, SSD) were developed in the past decade. Despite continual efforts in model modification and improvement in train…

    Submitted 2 June, 2020; originally announced June 2020.

  35. arXiv:2005.12027  [pdf, other]

    eess.IV cs.CV

    A Preliminary Study for Identification of Additive Manufactured Objects with Transmitted Images

    Authors: Kenta Yamamoto, Ryota Kawamura, Kazuki Takazawa, Hiroyuki Osone, Yoichi Ochiai

    Abstract: Additive manufacturing has the potential to become a standard method for manufacturing products, and product information is indispensable for the item distribution system. While most products are given barcodes to the exterior surfaces, research on embedding barcodes inside products is underway. This is because additive manufacturing makes it possible to carry out manufacturing and information add…

    Submitted 25 May, 2020; originally announced May 2020.

  36. arXiv:2005.00833  [pdf, other]

    cs.NI eess.SP

    Transfer Learning-Based Received Power Prediction with Ray-tracing Simulation and Small Amount of Measurement Data

    Authors: Masahiro Iwasaki, Takayuki Nishio, Masahiro Morikura, Koji Yamamoto

    Abstract: This paper proposes a method to predict received power in urban area deterministically, which can learn a prediction model from small amount of measurement data by a simulation-aided transfer learning and data augmentation. Recent development in machine learning such as artificial neural network (ANN) enables us to predict radio propagation and path loss accurately. However, training a high-perfor…

    Submitted 2 May, 2020; originally announced May 2020.

  37. Lottery Hypothesis based Unsupervised Pre-training for Model Compression in Federated Learning

    Authors: Sohei Itahara, Takayuki Nishio, Masahiro Morikura, Koji Yamamoto

    Abstract: Federated learning (FL) enables a neural network (NN) to be trained using privacy-sensitive data on mobile devices while retaining all the data on their local storages. However, FL asks the mobile devices to perform heavy communication and computation tasks, i.e., devices are requested to upload and download large-volume NN models and train them. This paper proposes a novel unsupervised pre-traini…

    Submitted 21 April, 2020; originally announced April 2020.

  38. Differentially Private AirComp Federated Learning with Power Adaptation Harnessing Receiver Noise

    Authors: Yusuke Koda, Koji Yamamoto, Takayuki Nishio, Masahiro Morikura

    Abstract: Over-the-air computation (AirComp)-based federated learning (FL) enables low-latency uploads and the aggregation of machine learning models by exploiting simultaneous co-channel transmission and the resultant waveform superposition. This study aims at realizing secure AirComp-based FL against various privacy attacks where malicious central servers infer clients' private data from aggregated global…

    Submitted 14 April, 2020; originally announced April 2020.

    Comments: 6 pages, 4 figures

  39. arXiv:2004.00835  [pdf, other]

    cs.NI

    Adversarial Reinforcement Learning-based Robust Access Point Coordination Against Uncoordinated Interference

    Authors: Yuto Kihira, Yusuke Koda, Koji Yamamoto, Takayuki Nishio, Masahiro Morikura

    Abstract: This paper proposes a robust adversarial reinforcement learning (RARL)-based multi-access point (AP) coordination method that is robust even against unexpected decentralized operations of uncoordinated APs. Multi-AP coordination is a promising technique towards IEEE 802.11be, and there are studies that use RL for multi-AP coordination. Indeed, a simple RL-based multi-AP coordination method diminis…

    Submitted 2 April, 2020; originally announced April 2020.

  40. arXiv:2003.10094  [pdf, other]

    cs.NI eess.SP

    Penalized and Decentralized Contextual Bandit Learning for WLAN Channel Allocation with Contention-Driven Feature Extraction

    Authors: Kota Yamashita, Shotaro Kamiya, Koji Yamamoto, Yusuke Koda, Takayuki Nishio, Masahiro Morikura

    Abstract: In this study, a contextual multi-armed bandit (CMAB)-based decentralized channel exploration framework disentangling a channel utility function (i.e., reward) with respect to contending neighboring access points (APs) is proposed. The proposed framework enables APs to evaluate observed rewards compositionally for contending APs, allowing both robustness against reward fluctuation due to neighbori…

    Submitted 1 December, 2021; v1 submitted 23 March, 2020; originally announced March 2020.

    Comments: 12 pages, 6 figures, 3 Tables

  41. arXiv:2003.00645  [pdf, other]

    cs.NI

    Communication-Efficient Multimodal Split Learning for mmWave Received Power Prediction

    Authors: Yusuke Koda, Jihong Park, Mehdi Bennis, Koji Yamamoto, Takayuki Nishio, Masahiro Morikura

    Abstract: The goal of this study is to improve the accuracy of millimeter wave received power prediction by utilizing camera images and radio frequency (RF) signals, while gathering image inputs in a communication-efficient and privacy-preserving manner. To this end, we propose a distributed multimodal machine learning (ML) framework, coined multimodal split learning (MultSL), in which a large neural networ…

    Submitted 2 March, 2020; v1 submitted 1 March, 2020; originally announced March 2020.

    Comments: 5 pages, 7 figures, to be published in IEEE Communications Letters

  42. arXiv:2001.00594  [pdf, ps, other]

    cs.LG cs.SI stat.ML

    Large-scale Gender/Age Prediction of Tumblr Users

    Authors: Yao Zhan, Changwei Hu, Yifan Hu, Tejaswi Kasturi, Shanmugam Ramasamy, Matt Gillingham, Keith Yamamoto

    Abstract: Tumblr, as a leading content provider and social media, attracts 371 million monthly visits, 280 million blogs and 53.3 million daily posts. The popularity of Tumblr provides great opportunities for advertisers to promote their products through sponsored posts. However, it is a challenging task to target specific demographic groups for ads, since Tumblr does not require user information like gende…

    Submitted 2 January, 2020; originally announced January 2020.

    Journal ref: IEEE ICMLA 2019

  43. Random Access with Opportunity Detection in Wireless Networks

    Authors: Jinho Choi, Seung-Woo Ko, Koji Yamamoto, Seong-Lyun Kim

    Abstract: This letter proposes a novel random medium access control (MAC) based on a transmission opportunity prediction, which can be measured in the form of a conditional success probability given transmitter-side interference. A transmission probability depends on the opportunity prediction, preventing indiscriminate transmissions and reducing excessive interference causing collisions. Using stochastic geo…

    Submitted 13 December, 2019; originally announced December 2019.

    Comments: 4 pages, 4 figures

    Journal ref: IEEE Wireless Communications Letters (Volume: 8, Issue: 5, Oct. 2019)

  44. arXiv:1912.03880  [pdf, other]

    cs.RO cs.CV

    Video Motion Capture from the Part Confidence Maps of Multi-Camera Images by Spatiotemporal Filtering Using the Human Skeletal Model

    Authors: Takuya Ohashi, Yosuke Ikegami, Kazuki Yamamoto, Wataru Takano, Yoshihiko Nakamura

    Abstract: This paper discusses video motion capture, namely, 3D reconstruction of human motion from multi-camera images. After the Part Confidence Maps are computed from each camera image, the proposed spatiotemporal filter is applied to deliver the human motion data with accuracy and smoothness for human motion analysis. The spatiotemporal filter uses the human skeleton and mixes temporal smoothing in two-…

    Submitted 10 December, 2019; v1 submitted 9 December, 2019; originally announced December 2019.

    Comments: International Conference on Intelligent Robots and Systems (IROS), 2018

  45. One Pixel Image and RF Signal Based Split Learning for mmWave Received Power Prediction

    Authors: Yusuke Koda, Jihong Park, Mehdi Bennis, Koji Yamamoto, Takayuki Nishio, Masahiro Morikura

    Abstract: Focusing on the received power prediction of millimeter-wave (mmWave) radio-frequency (RF) signals, we propose a multimodal split learning (SL) framework that integrates RF received signal powers and depth-images observed by physically separated entities. To improve its communication efficiency while preserving data privacy, we propose an SL neural network architecture that compresses the communic…

    Submitted 5 November, 2019; originally announced November 2019.

    Comments: 3 pages, Accepted in ACM CoNEXT 2019 Poster Session

  46. arXiv:1906.05694  [pdf, other]

    cs.NI

    Cooperative Sensing in Deep RL-Based Image-to-Decision Proactive Handover for mmWave Networks

    Authors: Yusuke Koda, Koji Yamamoto, Takayuki Nishio, Masahiro Morikura

    Abstract: For reliable millimeter-wave (mmWave) networks, this paper proposes cooperative sensing with multi-camera operation in an image-to-decision proactive handover framework that directly maps images to a handover decision. In the framework, camera images are utilized to allow for the prediction of blockage effects in a mmWave link, whereby a network controller triggers a handover in a proactive fashio…

    Submitted 12 June, 2019; originally announced June 2019.

    Comments: arXiv admin note: text overlap with arXiv:1904.04585

  47. arXiv:1905.07210  [pdf, other]

    cs.LG cs.DC stat.ML

    Hybrid-FL for Wireless Networks: Cooperative Learning Mechanism Using Non-IID Data

    Authors: Naoya Yoshida, Takayuki Nishio, Masahiro Morikura, Koji Yamamoto, Ryo Yonetani

    Abstract: This paper proposes a cooperative mechanism for mitigating the performance degradation due to non-independent-and-identically-distributed (non-IID) data in collaborative machine learning (ML), namely federated learning (FL), which trains an ML model using the rich data and computational resources of mobile clients without gathering their data to central systems. The data of mobile clients is typic…

    Submitted 5 March, 2020; v1 submitted 17 May, 2019; originally announced May 2019.

    Journal ref: Proc. IEEE ICC 2020, Dublin, Ireland, June 2020

  48. arXiv:1905.07144  [pdf, ps, other]

    eess.SP cs.LG cs.NI

    Deep Reinforcement Learning-Based Channel Allocation for Wireless LANs with Graph Convolutional Networks

    Authors: Kota Nakashima, Shotaro Kamiya, Kazuki Ohtsu, Koji Yamamoto, Takayuki Nishio, Masahiro Morikura

    Abstract: Last year, IEEE 802.11 Extremely High Throughput Study Group (EHT Study Group) was established to initiate discussions on new IEEE 802.11 features. Coordinated control methods of the access points (APs) in the wireless local area networks (WLANs) are discussed in EHT Study Group. The present study proposes a deep reinforcement learning-based channel allocation scheme using graph convolutional netw…

    Submitted 17 May, 2019; originally announced May 2019.

  49. arXiv:1904.04585  [pdf, other]

    cs.NI

    Handover Management for mmWave Networks with Proactive Performance Prediction Using Camera Images and Deep Reinforcement Learning

    Authors: Yusuke Koda, Kota Nakashima, Koji Yamamoto, Takayuki Nishio, Masahiro Morikura

    Abstract: For millimeter-wave networks, this paper presents a paradigm shift for leveraging time-consecutive camera images in handover decision problems. While making handover decisions, it is important to predict future long-term performance---e.g., the cumulative sum of time-varying data rates---proactively to avoid making myopic decisions. However, this study experimentally notices that a time-variation…

    Submitted 17 July, 2020; v1 submitted 9 April, 2019; originally announced April 2019.

    Comments: 14 pages, 19 figures, Published at IEEE Transactions on Cognitive Communications and Networking

  50. GEDI: Gammachirp Envelope Distortion Index for Predicting Intelligibility of Enhanced Speech

    Authors: Katsuhiko Yamamoto, Toshio Irino, Shoko Araki, Keisuke Kinoshita, Tomohiro Nakatani

    Abstract: In this study, we propose a new concept, the gammachirp envelope distortion index (GEDI), based on the signal-to-distortion ratio in the auditory envelope, SDRenv, to predict the intelligibility of speech enhanced by nonlinear algorithms. The objective of GEDI is to calculate the distortion between enhanced and clean-speech representations in the domain of a temporal envelope extracted by the gamma…

    Submitted 19 July, 2020; v1 submitted 3 April, 2019; originally announced April 2019.

    Comments: Preprint, 37 pages, 6 tables, 9 figures

    Journal ref: Speech Communication, Vol. 123, pp. 43-58, 2020