-
Metropolitan Scale and Longitudinal Dataset of Anonymized Human Mobility Trajectories
Authors:
Takahiro Yabe,
Kota Tsubouchi,
Toru Shimizu,
Yoshihide Sekimoto,
Kaoru Sezaki,
Esteban Moro,
Alex Pentland
Abstract:
Modeling and predicting human mobility trajectories in urban areas is an essential task for various applications. The recent availability of large-scale human movement data collected from mobile devices have enabled the development of complex human mobility prediction models. However, human mobility prediction methods are often trained and tested on different datasets, due to the lack of open-sour…
▽ More
Modeling and predicting human mobility trajectories in urban areas is an essential task for various applications. The recent availability of large-scale human movement data collected from mobile devices have enabled the development of complex human mobility prediction models. However, human mobility prediction methods are often trained and tested on different datasets, due to the lack of open-source large-scale human mobility datasets amid privacy concerns, posing a challenge towards conducting fair performance comparisons between methods. To this end, we created an open-source, anonymized, metropolitan scale, and longitudinal (90 days) dataset of 100,000 individuals' human mobility trajectories, using mobile phone location data. The location pings are spatially and temporally discretized, and the metropolitan area is undisclosed to protect users' privacy. The 90-day period is composed of 75 days of business-as-usual and 15 days during an emergency. To promote the use of the dataset, we will host a human mobility prediction data challenge (`HuMob Challenge 2023') using the human mobility dataset, which will be held in conjunction with ACM SIGSPATIAL 2023.
△ Less
Submitted 7 July, 2023;
originally announced July 2023.
-
Crowdsensing-based Road Damage Detection Challenge (CRDDC-2022)
Authors:
Deeksha Arya,
Hiroya Maeda,
Sanjay Kumar Ghosh,
Durga Toshniwal,
Hiroshi Omata,
Takehiro Kashiyama,
Yoshihide Sekimoto
Abstract:
This paper summarizes the Crowdsensing-based Road Damage Detection Challenge (CRDDC), a Big Data Cup organized as a part of the IEEE International Conference on Big Data'2022. The Big Data Cup challenges involve a released dataset and a well-defined problem with clear evaluation metrics. The challenges run on a data competition platform that maintains a real-time online evaluation system for the p…
▽ More
This paper summarizes the Crowdsensing-based Road Damage Detection Challenge (CRDDC), a Big Data Cup organized as a part of the IEEE International Conference on Big Data'2022. The Big Data Cup challenges involve a released dataset and a well-defined problem with clear evaluation metrics. The challenges run on a data competition platform that maintains a real-time online evaluation system for the participants. In the presented case, the data constitute 47,420 road images collected from India, Japan, the Czech Republic, Norway, the United States, and China to propose methods for automatically detecting road damages in these countries. More than 60 teams from 19 countries registered for this competition. The submitted solutions were evaluated using five leaderboards based on performance for unseen test images from the aforementioned six countries. This paper encapsulates the top 11 solutions proposed by these teams. The best-performing model utilizes ensemble learning based on YOLO and Faster-RCNN series models to yield an F1 score of 76% for test data combined from all 6 countries. The paper concludes with a comparison of current and past challenges and provides direction for the future.
△ Less
Submitted 21 November, 2022;
originally announced November 2022.
-
Road Rutting Detection using Deep Learning on Images
Authors:
Poonam Kumari Saha,
Deeksha Arya,
Ashutosh Kumar,
Hiroya Maeda,
Yoshihide Sekimoto
Abstract:
Road rutting is a severe road distress that can cause premature failure of road incurring early and costly maintenance costs. Research on road damage detection using image processing techniques and deep learning are being actively conducted in the past few years. However, these researches are mostly focused on detection of cracks, potholes, and their variants. Very few research has been done on th…
▽ More
Road rutting is a severe road distress that can cause premature failure of road incurring early and costly maintenance costs. Research on road damage detection using image processing techniques and deep learning are being actively conducted in the past few years. However, these researches are mostly focused on detection of cracks, potholes, and their variants. Very few research has been done on the detection of road rutting. This paper proposes a novel road rutting dataset comprising of 949 images and provides both object level and pixel level annotations. Object detection models and semantic segmentation models were deployed to detect road rutting on the proposed dataset, and quantitative and qualitative analysis of model predictions were done to evaluate model performance and identify challenges faced in the detection of road rutting using the proposed method. Object detection model YOLOX-s achieves mAP@IoU=0.5 of 61.6% and semantic segmentation model PSPNet (Resnet-50) achieves IoU of 54.69 and accuracy of 72.67, thus providing a benchmark accuracy for similar work in future. The proposed road rutting dataset and the results of our research study will help accelerate the research on detection of road rutting using deep learning.
△ Less
Submitted 28 September, 2022;
originally announced September 2022.
-
RDD2022: A multi-national image dataset for automatic Road Damage Detection
Authors:
Deeksha Arya,
Hiroya Maeda,
Sanjay Kumar Ghosh,
Durga Toshniwal,
Yoshihide Sekimoto
Abstract:
The data article describes the Road Damage Dataset, RDD2022, which comprises 47,420 road images from six countries, Japan, India, the Czech Republic, Norway, the United States, and China. The images have been annotated with more than 55,000 instances of road damage. Four types of road damage, namely longitudinal cracks, transverse cracks, alligator cracks, and potholes, are captured in the dataset…
▽ More
The data article describes the Road Damage Dataset, RDD2022, which comprises 47,420 road images from six countries, Japan, India, the Czech Republic, Norway, the United States, and China. The images have been annotated with more than 55,000 instances of road damage. Four types of road damage, namely longitudinal cracks, transverse cracks, alligator cracks, and potholes, are captured in the dataset. The annotated dataset is envisioned for developing deep learning-based methods to detect and classify road damage automatically. The dataset has been released as a part of the Crowd sensing-based Road Damage Detection Challenge (CRDDC2022). The challenge CRDDC2022 invites researchers from across the globe to propose solutions for automatic road damage detection in multiple countries. The municipalities and road agencies may utilize the RDD2022 dataset, and the models trained using RDD2022 for low-cost automatic monitoring of road conditions. Further, computer vision and machine learning researchers may use the dataset to benchmark the performance of different algorithms for other image-based applications of the same type (classification, object detection, etc.).
△ Less
Submitted 18 September, 2022;
originally announced September 2022.
-
Pseudo-PFLOW: Development of nationwide synthetic open dataset for people movement based on limited travel survey and open statistical data
Authors:
Takehiro Kashiyama,
Yanbo Pang,
Yoshihide Sekimoto,
Takahiro Yabe
Abstract:
People flow data are utilized in diverse fields such as urban and commercial planning and disaster management. However, people flow data collected from mobile phones, such as using global positioning system and call detail records data, are difficult to obtain because of privacy issues. Even if the data were obtained, they would be difficult to handle. This study developed pseudo-people-flow data…
▽ More
People flow data are utilized in diverse fields such as urban and commercial planning and disaster management. However, people flow data collected from mobile phones, such as using global positioning system and call detail records data, are difficult to obtain because of privacy issues. Even if the data were obtained, they would be difficult to handle. This study developed pseudo-people-flow data covering all of Japan by combining public statistical and travel survey data from limited urban areas. This dataset is not a representation of actual travel movements but of typical weekday movements of people. Therefore it is expected to be useful for various purposes. Additionally, the dataset represents the seamless movement of people throughout Japan, with no restrictions on coverage, unlike the travel surveys. In this paper, we propose a method for generating pseudo-people-flow and describe the development of a "Pseudo-PFLOW" dataset covering the entire population of approximately 130 million people. We then evaluated the accuracy of the dataset using mobile phone and trip survey data from multiple metropolitan areas. The results showed that a coefficient of determination of more than 0.5 was confirmed for comparisons regarding population distribution and trip volume.
△ Less
Submitted 2 May, 2022;
originally announced May 2022.
-
Development of current estimated household data and agent-based simulation of the future population distribution of households in Japan
Authors:
Kajiwara Kento,
Jue Ma,
Toshikazu Seto,
Yoshihide Sekimoto,
Yoshiki Ogawa,
Hiroshi Omata
Abstract:
In response to the declining population and aging infrastructure in Japan, local governments are implementing compact city policies such as the location normalization plan. To optimize the reorganization of urban public infrastructure, it is important to provide detailed and accurate forecasts of the distribution of urban populations and households. However, many local governments do not have the…
▽ More
In response to the declining population and aging infrastructure in Japan, local governments are implementing compact city policies such as the location normalization plan. To optimize the reorganization of urban public infrastructure, it is important to provide detailed and accurate forecasts of the distribution of urban populations and households. However, many local governments do not have the necessary data and forecasting capability. Moreover, current forecasts of gender- and age-based population data only exist at the municipal level, and household data are only available by family type at the prefecture level. Meanwhile, the accuracy is limited with an assumption of same change rate of population in all municipalities and within each city. Therefore, the aim of this study was to develop an agent-based microsimulation household transition model, with the household as the unit and agent, and household data was estimated for all cities in Japan from 2015. Estimated household data comprised the family type, house type, and address, age, and gender of household members, obtained from the national census, and building data. The resulting household transition model was used to forecast the attributes of each household every five years. Simulations in Toyama and Shizuoka Prefectures, Japan from 1980 to 2010 provided highly accurate estimates of municipal-level population by age and household volume by family type. The proposed model was also applied to predict the future distribution of disappearing villages and vacant houses in Japan.
△ Less
Submitted 1 April, 2022;
originally announced April 2022.
-
Global Road Damage Detection: State-of-the-art Solutions
Authors:
Deeksha Arya,
Hiroya Maeda,
Sanjay Kumar Ghosh,
Durga Toshniwal,
Hiroshi Omata,
Takehiro Kashiyama,
Yoshihide Sekimoto
Abstract:
This paper summarizes the Global Road Damage Detection Challenge (GRDDC), a Big Data Cup organized as a part of the IEEE International Conference on Big Data'2020. The Big Data Cup challenges involve a released dataset and a well-defined problem with clear evaluation metrics. The challenges run on a data competition platform that maintains a leaderboard for the participants. In the presented case,…
▽ More
This paper summarizes the Global Road Damage Detection Challenge (GRDDC), a Big Data Cup organized as a part of the IEEE International Conference on Big Data'2020. The Big Data Cup challenges involve a released dataset and a well-defined problem with clear evaluation metrics. The challenges run on a data competition platform that maintains a leaderboard for the participants. In the presented case, the data constitute 26336 road images collected from India, Japan, and the Czech Republic to propose methods for automatically detecting road damages in these countries. In total, 121 teams from several countries registered for this competition. The submitted solutions were evaluated using two datasets test1 and test2, comprising 2,631 and 2,664 images. This paper encapsulates the top 12 solutions proposed by these teams. The best performing model utilizes YOLO-based ensemble learning to yield an F1 score of 0.67 on test1 and 0.66 on test2. The paper concludes with a review of the facets that worked well for the presented challenge and those that could be improved in future challenges.
△ Less
Submitted 17 November, 2020;
originally announced November 2020.
-
Transfer Learning-based Road Damage Detection for Multiple Countries
Authors:
Deeksha Arya,
Hiroya Maeda,
Sanjay Kumar Ghosh,
Durga Toshniwal,
Alexander Mraz,
Takehiro Kashiyama,
Yoshihide Sekimoto
Abstract:
Many municipalities and road authorities seek to implement automated evaluation of road damage. However, they often lack technology, know-how, and funds to afford state-of-the-art equipment for data collection and analysis of road damages. Although some countries, like Japan, have developed less expensive and readily available Smartphone-based methods for automatic road condition monitoring, other…
▽ More
Many municipalities and road authorities seek to implement automated evaluation of road damage. However, they often lack technology, know-how, and funds to afford state-of-the-art equipment for data collection and analysis of road damages. Although some countries, like Japan, have developed less expensive and readily available Smartphone-based methods for automatic road condition monitoring, other countries still struggle to find efficient solutions. This work makes the following contributions in this context. Firstly, it assesses the usability of the Japanese model for other countries. Secondly, it proposes a large-scale heterogeneous road damage dataset comprising 26620 images collected from multiple countries using smartphones. Thirdly, we propose generalized models capable of detecting and classifying road damages in more than one country. Lastly, we provide recommendations for readers, local agencies, and municipalities of other countries when one other country publishes its data and model for automatic road damage detection and classification. Our dataset is available at (https://github.com/sekilab/RoadDamageDetector/).
△ Less
Submitted 30 August, 2020;
originally announced August 2020.
-
City2City: Translating Place Representations across Cities
Authors:
Takahiro Yabe,
Kota Tsubouchi,
Toru Shimizu,
Yoshihide Sekimoto,
Satish V. Ukkusuri
Abstract:
Large mobility datasets collected from various sources have allowed us to observe, analyze, predict and solve a wide range of important urban challenges. In particular, studies have generated place representations (or embeddings) from mobility patterns in a similar manner to word embeddings to better understand the functionality of different places within a city. However, studies have been limited…
▽ More
Large mobility datasets collected from various sources have allowed us to observe, analyze, predict and solve a wide range of important urban challenges. In particular, studies have generated place representations (or embeddings) from mobility patterns in a similar manner to word embeddings to better understand the functionality of different places within a city. However, studies have been limited to generating such representations of cities in an individual manner and has lacked an inter-city perspective, which has made it difficult to transfer the insights gained from the place representations across different cities. In this study, we attempt to bridge this research gap by treating \textit{cities} and \textit{languages} analogously. We apply methods developed for unsupervised machine language translation tasks to translate place representations across different cities. Real world mobility data collected from mobile phone users in 2 cities in Japan are used to test our place representation translation methods. Translated place representations are validated using landuse data, and results show that our methods were able to accurately translate place representations from one city to another.
△ Less
Submitted 26 November, 2019;
originally announced November 2019.
-
Congestion Analysis of Convolutional Neural Network-Based Pedestrian Counting Methods on Helicopter Footage
Authors:
Gergely Csönde,
Yoshihide Sekimoto,
Takehiro Kashiyama
Abstract:
Over the past few years, researchers have presented many different applications for convolutional neural networks, including those for the detection and recognition of objects from images. The desire to understand our own nature has always been an important motivation for research. Thus, the visual recognition of humans is among the most important issues facing machine learning today. Most solutio…
▽ More
Over the past few years, researchers have presented many different applications for convolutional neural networks, including those for the detection and recognition of objects from images. The desire to understand our own nature has always been an important motivation for research. Thus, the visual recognition of humans is among the most important issues facing machine learning today. Most solutions for this task have been developed and tested by using several publicly available datasets. These datasets typically contain images taken from street-level closed-circuit television cameras offering a low-angle view. There are major differences between such images and those taken from the sky. In addition, aerial images are often very congested, containing hundreds of targets. These factors may have significant impact on the quality of the results. In this paper, we investigate state-of-the-art methods for counting pedestrians and the related performance of aerial footage. Furthermore, we analyze this performance with respect to the congestion levels of the images.
△ Less
Submitted 5 November, 2019;
originally announced November 2019.
-
Predicting Evacuation Decisions using Representations of Individuals' Pre-Disaster Web Search Behavior
Authors:
Takahiro Yabe,
Kota Tsubouchi,
Toru Shimizu,
Yoshihide Sekimoto,
Satish V. Ukkusuri
Abstract:
Predicting the evacuation decisions of individuals before the disaster strikes is crucial for planning first response strategies. In addition to the studies on post-disaster analysis of evacuation behavior, there are various works that attempt to predict the evacuation decisions beforehand. Most of these predictive methods, however, require real time location data for calibration, which are becomi…
▽ More
Predicting the evacuation decisions of individuals before the disaster strikes is crucial for planning first response strategies. In addition to the studies on post-disaster analysis of evacuation behavior, there are various works that attempt to predict the evacuation decisions beforehand. Most of these predictive methods, however, require real time location data for calibration, which are becoming much harder to obtain due to the rising privacy concerns. Meanwhile, web search queries of anonymous users have been collected by web companies. Although such data raise less privacy concerns, they have been under-utilized for various applications. In this study, we investigate whether web search data observed prior to the disaster can be used to predict the evacuation decisions. More specifically, we utilize a "session-based query encoder" that learns the representations of each user's web search behavior prior to evacuation. Our proposed approach is empirically tested using web search data collected from users affected by a major flood in Japan. Results are validated using location data collected from mobile phones of the same set of users as ground truth. We show that evacuation decisions can be accurately predicted (84%) using only the users' pre-disaster web search data as input. This study proposes an alternative method for evacuation prediction that does not require highly sensitive location data, which can assist local governments to prepare effective first response strategies.
△ Less
Submitted 18 June, 2019;
originally announced June 2019.
-
Universality of population recovery patterns after disasters
Authors:
Takahiro Yabe,
Kota Tsubouchi,
Naoya Fujiwara,
Yoshihide Sekimoto,
Satish V. Ukkusuri
Abstract:
Despite the rising importance of enhancing community resilience to disasters, our understanding on how communities recover from catastrophic events is limited. Here we study the population recovery dynamics of disaster affected regions by observing the movements of over 2.5 million mobile phone users across three countries before, during and after five major disasters. We find that, although the r…
▽ More
Despite the rising importance of enhancing community resilience to disasters, our understanding on how communities recover from catastrophic events is limited. Here we study the population recovery dynamics of disaster affected regions by observing the movements of over 2.5 million mobile phone users across three countries before, during and after five major disasters. We find that, although the regions affected by the five disasters have significant differences in socio-economic characteristics, we observe a universal recovery pattern where displaced populations return in an exponential manner after all disasters. Moreover, the heterogeneity in initial and long-term displacement rates across communities across the three countries were explained by a set of key universal factors including the community's median income level, population size, housing damage rate, and the connectedness to other cities. These universal properties of recovery dynamics extracted from large scale evidence could impact efforts on urban resilience and sustainability across various disciplines.
△ Less
Submitted 5 May, 2019;
originally announced May 2019.
-
Cross-comparative analysis of evacuation behavior after earthquakes using mobile phone data
Authors:
Takahiro Yabe,
Yoshihide Sekimoto,
Kota Tsubouchi,
Satoshi Ikemoto
Abstract:
Despite the importance of predicting evacuation mobility dynamics after large scale disasters for effective first response and disaster relief, our general understanding of evacuation behavior remains limited because of the lack of empirical evidence on the evacuation movement of individuals across multiple disaster instances. Here we investigate the GPS trajectories of a total of more than 1 mill…
▽ More
Despite the importance of predicting evacuation mobility dynamics after large scale disasters for effective first response and disaster relief, our general understanding of evacuation behavior remains limited because of the lack of empirical evidence on the evacuation movement of individuals across multiple disaster instances. Here we investigate the GPS trajectories of a total of more than 1 million anonymized mobile phone users whose positions are tracked for a period of 2 months before and after four of the major earthquakes that occurred in Japan. Through a cross comparative analysis between the four disaster instances, we find that in contrast with the assumed complexity of evacuation decision making mechanisms in crisis situations, the individuals' evacuation probability is strongly dependent on the seismic intensity that they experience. In fact, we show that the evacuation probabilities in all earthquakes collapse into a similar pattern, with a critical threshold at around seismic intensity 5.5. This indicates that despite the diversity in the earthquakes profiles and urban characteristics, evacuation behavior is similarly dependent on seismic intensity. Moreover, we found that probability density functions of the distances that individuals evacuate are not dependent on seismic intensities that individuals experience. These insights from empirical analysis on evacuation from multiple earthquake instances using large scale mobility data contributes to a deeper understanding of how people react to earthquakes, and can potentially assist decision makers to simulate and predict the number of evacuees in urban areas with little computational time and cost, by using population density information and seismic intensity which can be observed instantaneously after the shock.
△ Less
Submitted 8 November, 2018;
originally announced November 2018.
-
Road Damage Detection Using Deep Neural Networks with Images Captured Through a Smartphone
Authors:
Hiroya Maeda,
Yoshihide Sekimoto,
Toshikazu Seto,
Takehiro Kashiyama,
Hiroshi Omata
Abstract:
Research on damage detection of road surfaces using image processing techniques has been actively conducted, achieving considerably high detection accuracies. Many studies only focus on the detection of the presence or absence of damage. However, in a real-world scenario, when the road managers from a governing body need to repair such damage, they need to clearly understand the type of damage in…
▽ More
Research on damage detection of road surfaces using image processing techniques has been actively conducted, achieving considerably high detection accuracies. Many studies only focus on the detection of the presence or absence of damage. However, in a real-world scenario, when the road managers from a governing body need to repair such damage, they need to clearly understand the type of damage in order to take effective action. In addition, in many of these previous studies, the researchers acquire their own data using different methods. Hence, there is no uniform road damage dataset available openly, leading to the absence of a benchmark for road damage detection. This study makes three contributions to address these issues. First, to the best of our knowledge, for the first time, a large-scale road damage dataset is prepared. This dataset is composed of 9,053 road damage images captured with a smartphone installed on a car, with 15,435 instances of road surface damage included in these road images. In order to generate this dataset, we cooperated with 7 municipalities in Japan and acquired road images for more than 40 hours. These images were captured in a wide variety of weather and illuminance conditions. In each image, we annotated the bounding box representing the location and type of damage. Next, we used a state-of-the-art object detection method using convolutional neural networks to train the damage detection model with our dataset, and compared the accuracy and runtime speed on both, using a GPU server and a smartphone. Finally, we demonstrate that the type of damage can be classified into eight types with high accuracy by applying the proposed object detection method. The road damage dataset, our experimental results, and the developed smartphone application used in this study are publicly available (https://github.com/sekilab/RoadDamageDetector/).
△ Less
Submitted 1 February, 2018; v1 submitted 29 January, 2018;
originally announced January 2018.