Skip to main content

Showing 1–45 of 45 results for author: Furukawa, Y

  1. arXiv:2406.00259  [pdf, other

    cs.CV

    PuzzleFusion++: Auto-agglomerative 3D Fracture Assembly by Denoise and Verify

    Authors: Zhengqing Wang, Jiacheng Chen, Yasutaka Furukawa

    Abstract: This paper proposes a novel "auto-agglomerative" 3D fracture assembly method, PuzzleFusion++, resembling how humans solve challenging spatial puzzles. Starting from individual fragments, the approach 1) aligns and merges fragments into larger groups akin to agglomerative clustering and 2) repeats the process iteratively in completing the assembly akin to auto-regressive methods. Concretely, a diff… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

    Comments: Project page: https://puzzlefusion-plusplus.github.io

  2. RetailOpt: Opt-In, Easy-to-Deploy Trajectory Estimation from Smartphone Motion Data and Retail Facility Information

    Authors: Ryo Yonetani, Jun Baba, Yasutaka Furukawa

    Abstract: We present RetailOpt, a novel opt-in, easy-to-deploy system for tracking customer movements offline in indoor retail environments. The system uses readily accessible information from customer smartphones and retail apps, including motion data, store maps, and purchase records. This eliminates the need for additional hardware installations/maintenance and ensures customers full data control. Specif… ▽ More

    Submitted 15 July, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

  3. arXiv:2403.15951  [pdf, other

    cs.CV

    MapTracker: Tracking with Strided Memory Fusion for Consistent Vector HD Mapping

    Authors: Jiacheng Chen, Yuefan Wu, Jiaqi Tan, Hang Ma, Yasutaka Furukawa

    Abstract: This paper presents a vector HD-mapping algorithm that formulates the mapping as a tracking task and uses a history of memory latents to ensure consistent reconstructions over time. Our method, MapTracker, accumulates a sensor stream into memory buffers of two latent representations: 1) Raster latents in the bird's-eye-view (BEV) space and 2) Vector latents over the road elements (i.e., pedestrian… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

    Comments: Project page: https://map-tracker.github.io

  4. arXiv:2402.12712  [pdf, other

    cs.CV

    MVDiffusion++: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction

    Authors: Shitao Tang, Jiacheng Chen, Dilin Wang, Chengzhou Tang, Fuyang Zhang, Yuchen Fan, Vikas Chandra, Yasutaka Furukawa, Rakesh Ranjan

    Abstract: This paper presents a neural architecture MVDiffusion++ for 3D object reconstruction that synthesizes dense and high-resolution views of an object given one or a few images without camera poses. MVDiffusion++ achieves superior flexibility and scalability with two surprisingly simple ideas: 1) A ``pose-free architecture'' where standard self-attention among 2D latent features learns 3D consistency… ▽ More

    Submitted 30 April, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: 3D generation, project page: https://mvdiffusion-plusplus.github.io/

  5. arXiv:2401.15563  [pdf, other

    cs.CV cs.LG

    BrepGen: A B-rep Generative Diffusion Model with Structured Latent Geometry

    Authors: Xiang Xu, Joseph G. Lambourne, Pradeep Kumar Jayaraman, Zhengqing Wang, Karl D. D. Willis, Yasutaka Furukawa

    Abstract: This paper presents BrepGen, a diffusion-based generative approach that directly outputs a Boundary representation (B-rep) Computer-Aided Design (CAD) model. BrepGen represents a B-rep model as a novel structured latent geometry in a hierarchical tree. With the root node representing a whole CAD solid, each element of a B-rep model (i.e., a face, an edge, or a vertex) progressively turns into a ch… ▽ More

    Submitted 16 May, 2024; v1 submitted 27 January, 2024; originally announced January 2024.

    Comments: Accepted to ACM SIGGRAPH 2024. Code at https://github.com/samxuxiang/BrepGen

  6. arXiv:2311.18166  [pdf, other

    cs.CV

    A-Scan2BIM: Assistive Scan to Building Information Modeling

    Authors: Weilian Song, Jieliang Luo, Dale Zhao, Yan Fu, Chin-Yi Cheng, Yasutaka Furukawa

    Abstract: This paper proposes an assistive system for architects that converts a large-scale point cloud into a standardized digital representation of a building for Building Information Modeling (BIM) applications. The process is known as Scan-to-BIM, which requires many hours of manual work even for a single building floor by a professional architect. Given its challenging nature, the paper focuses on hel… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: BMVC 2023, order evaluation updated after fixing evaluation bug

  7. arXiv:2307.01097  [pdf, other

    cs.CV

    MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion

    Authors: Shitao Tang, Fuyang Zhang, Jiacheng Chen, Peng Wang, Yasutaka Furukawa

    Abstract: This paper introduces MVDiffusion, a simple yet effective method for generating consistent multi-view images from text prompts given pixel-to-pixel correspondences (e.g., perspective crops from a panorama or multi-view images given depth maps and poses). Unlike prior methods that rely on iterative image warping and inpainting, MVDiffusion simultaneously generates all images with a global awareness… ▽ More

    Submitted 24 December, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

    Comments: Project page, https://mvdiffusion.github.io; NeurIPS 2023 (spotlight); Compressed camera-ready version

  8. arXiv:2307.00149  [pdf, other

    cs.CV cs.LG

    Hierarchical Neural Coding for Controllable CAD Model Generation

    Authors: Xiang Xu, Pradeep Kumar Jayaraman, Joseph G. Lambourne, Karl D. D. Willis, Yasutaka Furukawa

    Abstract: This paper presents a novel generative model for Computer Aided Design (CAD) that 1) represents high-level design concepts of a CAD model as a three-level hierarchical tree of neural codes, from global part arrangement down to local curve geometry; and 2) controls the generation or completion of CAD models by specifying the target design using a code tree. Concretely, a novel variant of a vector q… ▽ More

    Submitted 30 June, 2023; originally announced July 2023.

    Comments: Accepted to ICML 2023. Project website at https://hnc-cad.github.io/

  9. arXiv:2306.01461  [pdf, other

    cs.CV

    PolyDiffuse: Polygonal Shape Reconstruction via Guided Set Diffusion Models

    Authors: Jiacheng Chen, Ruizhi Deng, Yasutaka Furukawa

    Abstract: This paper presents PolyDiffuse, a novel structured reconstruction algorithm that transforms visual sensor data into polygonal shapes with Diffusion Models (DM), an emerging machinery amid exploding generative AI, while formulating reconstruction as a generation process conditioned on sensor data. The task of structured reconstruction poses two fundamental challenges to DM: 1) A structured geometr… ▽ More

    Submitted 24 December, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: Project page: https://poly-diffuse.github.io/; NeurIPS 2023 camera-ready version

  10. arXiv:2305.17852  [pdf, other

    cs.CV

    Hierarchical Neural Memory Network for Low Latency Event Processing

    Authors: Ryuhei Hamaguchi, Yasutaka Furukawa, Masaki Onishi, Ken Sakurada

    Abstract: This paper proposes a low latency neural network architecture for event-based dense prediction tasks. Conventional architectures encode entire scene contents at a fixed rate regardless of their temporal characteristics. Instead, the proposed network encodes contents at a proper temporal scale depending on its movement speed. We achieve this by constructing temporal hierarchy using stacked latent m… ▽ More

    Submitted 28 May, 2023; originally announced May 2023.

    Comments: Accepted to CVPR 2023

  11. arXiv:2211.13785  [pdf, other

    cs.AI cs.CV

    PuzzleFusion: Unleashing the Power of Diffusion Models for Spatial Puzzle Solving

    Authors: Sepidehsadat Hosseini, Mohammad Amin Shabani, Saghar Irandoust, Yasutaka Furukawa

    Abstract: This paper presents an end-to-end neural architecture based on Diffusion Models for spatial puzzle solving, particularly jigsaw puzzle and room arrangement tasks. In the latter task, for instance, the proposed system "PuzzleFusion" takes a set of room layouts as polygonal curves in the top-down view and aligns the room layout pieces by estimating their 2D translations and rotations, akin to solvin… ▽ More

    Submitted 3 October, 2023; v1 submitted 24 November, 2022; originally announced November 2022.

  12. arXiv:2211.13287  [pdf, other

    cs.CV cs.AI

    HouseDiffusion: Vector Floorplan Generation via a Diffusion Model with Discrete and Continuous Denoising

    Authors: Mohammad Amin Shabani, Sepidehsadat Hosseini, Yasutaka Furukawa

    Abstract: The paper presents a novel approach for vector-floorplan generation via a diffusion model, which denoises 2D coordinates of room/door corners with two inference objectives: 1) a single-step noise as the continuous quantity to precisely invert the continuous forward process; and 2) the final 2D coordinate as the discrete quantity to establish geometric incident relationships such as parallelism, or… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

  13. arXiv:2211.11177  [pdf, other

    cs.CV

    NeuMap: Neural Coordinate Mapping by Auto-Transdecoder for Camera Localization

    Authors: Shitao Tang, Sicong Tang, Andrea Tagliasacchi, Ping Tan, Yasutaka Furukawa

    Abstract: This paper presents an end-to-end neural mapping method for camera localization, dubbed NeuMap, encoding a whole scene into a grid of latent codes, with which a Transformer-based auto-decoder regresses 3D coordinates of query pixels. State-of-the-art feature matching methods require each scene to be stored as a 3D point cloud with per-point features, consuming several gigabytes of storage per scen… ▽ More

    Submitted 26 March, 2023; v1 submitted 20 November, 2022; originally announced November 2022.

    Comments: CVPR2023

  14. arXiv:2207.04632  [pdf, other

    cs.CV cs.LG

    SkexGen: Autoregressive Generation of CAD Construction Sequences with Disentangled Codebooks

    Authors: Xiang Xu, Karl D. D. Willis, Joseph G. Lambourne, Chin-Yi Cheng, Pradeep Kumar Jayaraman, Yasutaka Furukawa

    Abstract: We present SkexGen, a novel autoregressive generative model for computer-aided design (CAD) construction sequences containing sketch-and-extrude modeling operations. Our model utilizes distinct Transformer architectures to encode topological, geometric, and extrusion variations of construction sequences into disentangled codebooks. Autoregressive Transformer decoders generate CAD construction sequ… ▽ More

    Submitted 11 July, 2022; originally announced July 2022.

    Comments: Accepted to ICML 2022

  15. arXiv:2206.00645  [pdf, other

    cs.CV cs.AI

    Floorplan Restoration by Structure Hallucinating Transformer Cascades

    Authors: Sepidehsadat Hosseini, Yasutaka Furukawa

    Abstract: This paper presents an extreme floorplan reconstruction task, a new benchmark for the task, and a neural architecture as a solution. Given a partial floorplan reconstruction inferred or curated from panorama images, the task is to reconstruct a complete floorplan including invisible architectural structures. The proposed neural network 1) encodes an input partial floorplan into a set of latent vec… ▽ More

    Submitted 3 October, 2023; v1 submitted 1 June, 2022; originally announced June 2022.

    Comments: Published at BMVC 2023

  16. arXiv:2203.15851  [pdf, other

    cs.RO cs.CV

    Neural Inertial Localization

    Authors: Sachini Herath, David Caruso, Chen Liu, Yufan Chen, Yasutaka Furukawa

    Abstract: This paper proposes the inertial localization problem, the task of estimating the absolute location from a sequence of inertial sensor measurements. This is an exciting and unexplored area of indoor localization research, where we present a rich dataset with 53 hours of inertial sensor data and the associated ground truth locations. We developed a solution, dubbed neural inertial localization (NIL… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

  17. arXiv:2111.15344  [pdf, other

    cs.RO cs.LG

    Material Classification Using Active Temperature Controllable Robotic Gripper

    Authors: Yukiko Osawa, Kei Kase, Yukiyasu Domae, Yoshiyuki Furukawa, Abderrahmane Kheddar

    Abstract: Recognition techniques allow robots to make proper planning and control strategies to manipulate various objects. Object recognition is more reliable when made by combining several percepts, e.g., vision and haptics. One of the distinguishing features of each object's material is its heat properties, and classification can exploit heat transfer, similarly to human thermal sensation. Thermal-based… ▽ More

    Submitted 30 November, 2021; originally announced November 2021.

    Journal ref: Proceedings of the 14th IEEE/SICE International Symposium on System Integration (SII 2022), Jan 2022, Narvik, Norway

  18. arXiv:2111.15143  [pdf, other

    cs.CV

    HEAT: Holistic Edge Attention Transformer for Structured Reconstruction

    Authors: Jiacheng Chen, Yiming Qian, Yasutaka Furukawa

    Abstract: This paper presents a novel attention-based neural network for structured reconstruction, which takes a 2D raster image as an input and reconstructs a planar graph depicting an underlying geometric structure. The approach detects corners and classifies edge candidates between corners in an end-to-end manner. Our contribution is a holistic edge classification architecture, which 1) initializes the… ▽ More

    Submitted 19 June, 2022; v1 submitted 30 November, 2021; originally announced November 2021.

    Comments: CVPR 2022 camera-ready

  19. arXiv:2108.07990  [pdf, other

    cs.CV

    Structured Outdoor Architecture Reconstruction by Exploration and Classification

    Authors: Fuyang Zhang, Xiang Xu, Nelson Nauata, Yasutaka Furukawa

    Abstract: This paper presents an explore-and-classify framework for structured architectural reconstruction from an aerial image. Starting from a potentially imperfect building reconstruction by an existing algorithm, our approach 1) explores the space of building models by modifying the reconstruction via heuristic actions; 2) learns to classify the correctness of building models while generating classific… ▽ More

    Submitted 18 August, 2021; originally announced August 2021.

    Comments: 2021 International Conference on Computer Vision (ICCV 2021)

  20. arXiv:2106.05375  [pdf, other

    cs.CV cs.GR

    Plan2Scene: Converting Floorplans to 3D Scenes

    Authors: Madhawa Vidanapathirana, Qirui Wu, Yasutaka Furukawa, Angel X. Chang, Manolis Savva

    Abstract: We address the task of converting a floorplan and a set of associated photos of a residence into a textured 3D mesh model, a task which we call Plan2Scene. Our system 1) lifts a floorplan image to a 3D mesh model; 2) synthesizes surface textures based on the input photos; and 3) infers textures for unobserved surfaces using a graph neural network architecture. To train and evaluate our system we c… ▽ More

    Submitted 9 June, 2021; originally announced June 2021.

    Comments: This paper is accepted to CVPR 2021. For code, data and pretrained models, see https://3dlg-hcvc.github.io/plan2scene/

  21. arXiv:2105.08837  [pdf, other

    cs.RO cs.CV

    Fusion-DHL: WiFi, IMU, and Floorplan Fusion for Dense History of Locations in Indoor Environments

    Authors: Sachini Herath, Saghar Irandoust, Bowen Chen, Yiming Qian, Pyojin Kim, Yasutaka Furukawa

    Abstract: The paper proposes a multi-modal sensor fusion algorithm that fuses WiFi, IMU, and floorplan information to infer an accurate and dense location history in indoor environments. The algorithm uses 1) an inertial navigation algorithm to estimate a relative motion trajectory from IMU sensor data; 2) a WiFi-based localization API in industry to obtain positional constraints and geo-localize the trajec… ▽ More

    Submitted 18 May, 2021; originally announced May 2021.

    Comments: To be published in ICRA 2021. Code and data: https://github.com/Sachini/Fusion-DHL

    Journal ref: ICRA 2021

  22. arXiv:2105.06025  [pdf

    cs.LG

    Machine-learning-based investigation on classifying binary and multiclass behavior outcomes of children with PIMD/SMID

    Authors: Von Ralph Dane Marquez Herbuela, Tomonori Karita, Yoshiya Furukawa, Yoshinori Wada, Yoshihiro Yagi, Shuichiro Senba, Eiko Onishi, Tatsuo Saeki

    Abstract: Recently, the importance of weather parameters and location information to better understand the context of the communication of children with profound intellectual and multiple disabilities (PIMD) or severe motor and intellectual disorders (SMID) has been proposed. However, an investigation on whether these data can be used to classify their behavior for system optimization aimed for predicting t… ▽ More

    Submitted 12 May, 2021; originally announced May 2021.

  23. arXiv:2104.11176  [pdf, other

    cs.CV

    Heterogeneous Grid Convolution for Adaptive, Efficient, and Controllable Computation

    Authors: Ryuhei Hamaguchi, Yasutaka Furukawa, Masaki Onishi, Ken Sakurada

    Abstract: This paper proposes a novel heterogeneous grid convolution that builds a graph-based image representation by exploiting heterogeneity in the image content, enabling adaptive, efficient, and controllable computations in a convolutional architecture. More concretely, the approach builds a data-adaptive graph structure from a convolutional layer by a differentiable clustering method, pools features t… ▽ More

    Submitted 22 April, 2021; originally announced April 2021.

    Comments: Accepted to CVPR 2021

  24. arXiv:2103.02574  [pdf, other

    cs.CV

    House-GAN++: Generative Adversarial Layout Refinement Networks

    Authors: Nelson Nauata, Sepidehsadat Hosseini, Kai-Hung Chang, Hang Chu, Chin-Yi Cheng, Yasutaka Furukawa

    Abstract: This paper proposes a novel generative adversarial layout refinement network for automated floorplan generation. Our architecture is an integration of a graph-constrained relational GAN and a conditional GAN, where a previously generated layout becomes the next input constraint, enabling iterative refinement. A surprising discovery of our research is that a simple non-iterative training process, d… ▽ More

    Submitted 3 March, 2021; originally announced March 2021.

  25. arXiv:2012.09340  [pdf, other

    cs.CV

    Roof-GAN: Learning to Generate Roof Geometry and Relations for Residential Houses

    Authors: Yiming Qian, Hao Zhang, Yasutaka Furukawa

    Abstract: This paper presents Roof-GAN, a novel generative adversarial network that generates structured geometry of residential roof structures as a set of roof primitives and their relationships. Given the number of primitives, the generator produces a structured roof model as a graph, which consists of 1) primitive geometry as raster images at each node, encoding facet segmentation and angles; 2) inter-p… ▽ More

    Submitted 6 May, 2021; v1 submitted 16 December, 2020; originally announced December 2020.

    Comments: CVPR 2021

  26. arXiv:2009.00260  [pdf

    cs.HC

    Children with PIMD/SMID expressive behaviors: Development and testing of ChildSIDE app, the first step for independent communication and mobility

    Authors: Von Ralph Dane Marquez Herbuela, Tomonori Karita, Yoshiya Furukawa, Yoshinori Wada, Shuichiro Senba, Eiko Onishi, Tatsuo Saeki

    Abstract: Children with profound intellectual and multiple disabilities or severe motor and intellectual disabilities only communicate through movements, vocalizations, body postures, muscle tensions, or facial expressions on a pre- or protosymbolic level. Yet, to the best of our knowledge, hardly any system has been developed to interpret their expressive behaviors. This paper describes the design, develop… ▽ More

    Submitted 1 September, 2020; originally announced September 2020.

  27. arXiv:2003.06988  [pdf, other

    cs.CV

    House-GAN: Relational Generative Adversarial Networks for Graph-constrained House Layout Generation

    Authors: Nelson Nauata, Kai-Hung Chang, Chin-Yi Cheng, Greg Mori, Yasutaka Furukawa

    Abstract: This paper proposes a novel graph-constrained generative adversarial network, whose generator and discriminator are built upon relational architecture. The main idea is to encode the constraint into the graph structure of its relational networks. We have demonstrated the proposed architecture for a new house layout generation problem, whose task is to take an architectural constraint as a graph (i… ▽ More

    Submitted 15 March, 2020; originally announced March 2020.

  28. arXiv:1912.05135  [pdf, other

    cs.CV

    Vectorizing World Buildings: Planar Graph Reconstruction by Primitive Detection and Relationship Inference

    Authors: Nelson Nauata, Yasutaka Furukawa

    Abstract: This paper tackles a 2D architecture vectorization problem, whose task is to infer an outdoor building architecture as a 2D planar graph from a single RGB image. We provide a new benchmark with ground-truth annotations for 2,001 complex buildings across the cities of Atlanta, Paris, and Las Vegas. We also propose a novel algorithm utilizing 1) convolutional neural networks (CNNs) that detects geom… ▽ More

    Submitted 14 March, 2020; v1 submitted 11 December, 2019; originally announced December 2019.

  29. arXiv:1912.01756  [pdf, other

    cs.CV

    Conv-MPN: Convolutional Message Passing Neural Network for Structured Outdoor Architecture Reconstruction

    Authors: Fuyang Zhang, Nelson Nauata, Yasutaka Furukawa

    Abstract: This paper proposes a novel message passing neural (MPN) architecture Conv-MPN, which reconstructs an outdoor building as a planar graph from a single RGB image. Conv-MPN is specifically designed for cases where nodes of a graph have explicit spatial embedding. In our problem, nodes correspond to building edges in an image. Conv-MPN is different from MPN in that 1) the feature associated with a no… ▽ More

    Submitted 6 June, 2021; v1 submitted 3 December, 2019; originally announced December 2019.

    Comments: Accepted by CVPR2020

  30. arXiv:1908.06702  [pdf, other

    cs.CV

    Floor-SP: Inverse CAD for Floorplans by Sequential Room-wise Shortest Path

    Authors: Jiacheng Chen, Chen Liu, Jiaye Wu, Yasutaka Furukawa

    Abstract: This paper proposes a new approach for automated floorplan reconstruction from RGBD scans, a major milestone in indoor mapping research. The approach, dubbed Floor-SP, formulates a novel optimization problem, where room-wise coordinate descent sequentially solves dynamic programming to optimize the floorplan graph structure. The objective function consists of data terms guided by deep neural netwo… ▽ More

    Submitted 19 August, 2019; originally announced August 2019.

    Comments: 10 pages, 9 figures, accepted to ICCV 2019

  31. arXiv:1906.04962  [pdf, other

    cs.CV eess.IV

    Synthesizing Diverse Lung Nodules Wherever Massively: 3D Multi-Conditional GAN-based CT Image Augmentation for Object Detection

    Authors: Changhee Han, Yoshiro Kitamura, Akira Kudo, Akimichi Ichinose, Leonardo Rundo, Yujiro Furukawa, Kazuki Umemoto, Yuanzhong Li, Hideki Nakayama

    Abstract: Accurate Computer-Assisted Diagnosis, relying on large-scale annotated pathological images, can alleviate the risk of overlooking the diagnosis. Unfortunately, in medical imaging, most available datasets are small/fragmented. To tackle this, as a Data Augmentation (DA) method, 3D conditional Generative Adversarial Networks (GANs) can synthesize desired realistic/diverse 3D images as additional tra… ▽ More

    Submitted 12 August, 2019; v1 submitted 12 June, 2019; originally announced June 2019.

    Comments: 9 pages, 6 figures, accepted to 3DV 2019

  32. arXiv:1905.13456  [pdf, other

    eess.IV cs.AI cs.CV

    Combining Noise-to-Image and Image-to-Image GANs: Brain MR Image Augmentation for Tumor Detection

    Authors: Changhee Han, Leonardo Rundo, Ryosuke Araki, Yudai Nagano, Yujiro Furukawa, Giancarlo Mauri, Hideki Nakayama, Hideaki Hayashi

    Abstract: Convolutional Neural Networks (CNNs) achieve excellent computer-assisted diagnosis with sufficient annotated training data. However, most medical imaging datasets are small and fragmented. In this context, Generative Adversarial Networks (GANs) can synthesize realistic/diverse additional training images to fill the data lack in the real image distribution; researchers have improved classification… ▽ More

    Submitted 9 October, 2019; v1 submitted 31 May, 2019; originally announced May 2019.

    Comments: 12 pages, 7 figures, accepted to IEEE ACCESS

  33. arXiv:1905.12853  [pdf, other

    cs.CV cs.RO

    RoNIN: Robust Neural Inertial Navigation in the Wild: Benchmark, Evaluations, and New Methods

    Authors: Hang Yan, Sachini Herath, Yasutaka Furukawa

    Abstract: This paper sets a new foundation for data-driven inertial navigation research, where the task is the estimation of positions and orientations of a moving subject from a sequence of IMU sensor measurements. More concretely, the paper presents 1) a new benchmark containing more than 40 hours of IMU sensor data from 100 human subjects with ground-truth 3D trajectories under natural human motions; 2)… ▽ More

    Submitted 30 May, 2019; originally announced May 2019.

  34. arXiv:1903.12564  [pdf, other

    cs.CV cs.AI

    Infinite Brain MR Images: PGGAN-based Data Augmentation for Tumor Detection

    Authors: Changhee Han, Leonardo Rundo, Ryosuke Araki, Yujiro Furukawa, Giancarlo Mauri, Hideki Nakayama, Hideaki Hayashi

    Abstract: Due to the lack of available annotated medical images, accurate computer-assisted diagnosis requires intensive Data Augmentation (DA) techniques, such as geometric/intensity transformations of original images; however, those transformed images intrinsically have a similar distribution to the original ones, leading to limited performance improvement. To fill the data lack in the real image distribu… ▽ More

    Submitted 29 March, 2019; originally announced March 2019.

    Comments: 13 pages, 6 figures, Accepted to Neural Approaches to Dynamics of Signal Exchanges as a Springer book chapter

  35. arXiv:1902.04478  [pdf, other

    cs.CV

    MASC: Multi-scale Affinity with Sparse Convolution for 3D Instance Segmentation

    Authors: Chen Liu, Yasutaka Furukawa

    Abstract: We propose a new approach for 3D instance segmentation based on sparse convolution and point affinity prediction, which indicates the likelihood of two points belonging to the same instance. The proposed network, built upon submanifold sparse convolution [3], processes a voxelized point cloud and predicts semantic scores for each occupied voxel as well as the affinity between neighboring voxels at… ▽ More

    Submitted 12 February, 2019; originally announced February 2019.

  36. arXiv:1812.04072  [pdf, other

    cs.CV

    PlaneRCNN: 3D Plane Detection and Reconstruction from a Single Image

    Authors: Chen Liu, Kihwan Kim, Jinwei Gu, Yasutaka Furukawa, Jan Kautz

    Abstract: This paper proposes a deep neural architecture, PlaneRCNN, that detects and reconstructs piecewise planar surfaces from a single RGB image. PlaneRCNN employs a variant of Mask R-CNN to detect planes with their plane parameters and segmentation masks. PlaneRCNN then jointly refines all the segmentation masks with a novel loss enforcing the consistency with a nearby view during training. The paper a… ▽ More

    Submitted 7 January, 2019; v1 submitted 10 December, 2018; originally announced December 2018.

  37. arXiv:1804.06278  [pdf, other

    cs.CV

    PlaneNet: Piece-wise Planar Reconstruction from a Single RGB Image

    Authors: Chen Liu, Jimei Yang, Duygu Ceylan, Ersin Yumer, Yasutaka Furukawa

    Abstract: This paper proposes a deep neural network (DNN) for piece-wise planar depthmap reconstruction from a single RGB image. While DNNs have brought remarkable progress to single-image depth prediction, piece-wise planar depthmap reconstruction requires a structured geometry representation, and has been a difficult task to master even for DNNs. The proposed end-to-end DNN learns to directly infer a set… ▽ More

    Submitted 17 April, 2018; originally announced April 2018.

    Comments: CVPR 2018

  38. arXiv:1804.00090  [pdf, other

    cs.CV

    FloorNet: A Unified Framework for Floorplan Reconstruction from 3D Scans

    Authors: Chen Liu, Jiaye Wu, Yasutaka Furukawa

    Abstract: The ultimate goal of this indoor mapping research is to automatically reconstruct a floorplan simply by walking through a house with a smartphone in a pocket. This paper tackles this problem by proposing FloorNet, a novel deep neural architecture. The challenge lies in the processing of RGBD streams spanning a large 3D space. FloorNet effectively processes the data through three neural network bra… ▽ More

    Submitted 30 March, 2018; originally announced April 2018.

  39. arXiv:1712.09004  [pdf, other

    cs.CV

    RIDI: Robust IMU Double Integration

    Authors: Hang Yan, Qi Shan, Yasutaka Furukawa

    Abstract: This paper proposes a novel data-driven approach for inertial navigation, which learns to estimate trajectories of natural human motions just from an inertial measurement unit (IMU) in every smartphone. The key observation is that human motions are repetitive and consist of a few major modes (e.g., standing, walking, or turning). Our algorithm regresses a velocity vector from the history of linear… ▽ More

    Submitted 30 December, 2017; v1 submitted 24 December, 2017; originally announced December 2017.

  40. Exploiting 2D Floorplan for Building-scale Panorama RGBD Alignment

    Authors: Erik Wijmans, Yasutaka Furukawa

    Abstract: This paper presents a novel algorithm that utilizes a 2D floorplan to align panorama RGBD scans. While effective panorama RGBD alignment techniques exist, such a system requires extremely dense RGBD image sampling. Our approach can significantly reduce the number of necessary scans with the aid of a floorplan image. We formulate a novel Markov Random Field inference problem as a scan placement ove… ▽ More

    Submitted 8 December, 2016; originally announced December 2016.

  41. arXiv:1612.01256  [pdf, other

    cs.CV

    Panoramic Structure from Motion via Geometric Relationship Detection

    Authors: Satoshi Ikehata, Ivaylo Boyadzhiev, Qi Shan, Yasutaka Furukawa

    Abstract: This paper addresses the problem of Structure from Motion (SfM) for indoor panoramic image streams, extremely challenging even for the state-of-the-art due to the lack of textures and minimal parallax. The key idea is the fusion of single-view and multi-view reconstruction techniques via geometric relationship detection (e.g., detecting 2D lines as coplanar in 3D). Rough geometry suffices to perfo… ▽ More

    Submitted 5 December, 2016; originally announced December 2016.

  42. arXiv:1612.01235  [pdf, other

    cs.CV

    Turning an Urban Scene Video into a Cinemagraph

    Authors: Hang Yan, Yebin Liu, Yasutaka Furukawa

    Abstract: This paper proposes an algorithm that turns a regular video capturing urban scenes into a high-quality endless animation, known as a Cinemagraph. The creation of a Cinemagraph usually requires a static camera in a carefully configured scene. The task becomes challenging for a regular video with a moving camera and objects. Our approach first warps an input video into the viewpoint of a reference c… ▽ More

    Submitted 4 December, 2016; originally announced December 2016.

  43. arXiv:1612.01234  [pdf, other

    cs.CV

    Multi-way Particle Swarm Fusion

    Authors: Chen Liu, Hang Yan, Pushmeet Kohli, Yasutaka Furukawa

    Abstract: This paper proposes a novel MAP inference framework for Markov Random Field (MRF) in parallel computing environments. The inference framework, dubbed Swarm Fusion, is a natural generalization of the Fusion Move method. Every thread (in a case of multi-threading environments) maintains and updates a solution. At each iteration, a thread can generate arbitrary number of solution proposals and take a… ▽ More

    Submitted 4 December, 2016; originally announced December 2016.

  44. arXiv:1612.01225  [pdf, other

    cs.CV

    Deep Multi-Modal Image Correspondence Learning

    Authors: Chen Liu, Jiajun Wu, Pushmeet Kohli, Yasutaka Furukawa

    Abstract: Inference of correspondences between images from different modalities is an extremely important perceptual ability that enables humans to understand and recognize cross-modal concepts. In this paper, we consider an instance of this problem that involves matching photographs of building interiors with their corresponding floorplan. This is a particularly challenging problem because a floorplan, as… ▽ More

    Submitted 4 December, 2016; originally announced December 2016.

  45. arXiv:cs/0111027  [pdf

    cs.NI

    Upgrade of Spring-8 Beamline Network with Vlan Technology Over Gigabit Ethernet

    Authors: M. Ishii, T. Fukui, Y. Furukawa, T. Nakatani, T. Ohata, R. Tanaka

    Abstract: The beamline network system at SPring-8 consists of three LANs; a BL-LAN for beamline component control, a BL-USER-LAN for beamline experimental users and an OA-LAN for the information services. These LANs are interconnected by a firewall system. Since the network traffic and the number of beamlines have increased, we upgraded the backbone of BL-USER-LAN from Fast Ethernet to Gigabit Ethernet. A… ▽ More

    Submitted 17 December, 2001; v1 submitted 9 November, 2001; originally announced November 2001.

    Comments: 3 pages, 2 figure, 8th International Conference on Accelerator and Large Experimental Physics Control Systems (PSN TUAP056), San Jose, CA, USA, November 27-30

    ACM Class: C.2.1

    Journal ref: eConf C011127 (2001) TUAP056