WTTFNet: A Weather-Time-Trajectory Fusion Network for Pedestrian Trajectory Prediction in Urban Complex

Read original: arXiv:2405.18945 - Published 5/30/2024 by Ho Chun Wu, Esther Hoi Shan Lau, Paul Yuen, Kevin Hung, John Kwok Tai Chui, Andrew Kwok Fai Lui

🌐

Overview

This paper presents a new method called the "weather-time-trajectory fusion network" (WTTFNet) for improving the performance of baseline deep neural network architectures in modeling pedestrian trajectories in urban environments.
The key challenge is that pedestrians can have many possible destinations, and their behavior can be influenced by weather and time-of-day factors.
By incorporating weather and time-of-day information as an embedding structure, the proposed WTTFNet uses a novel gate multimodal unit to fuse the multimodal information and deep representation of trajectories.
The model also uses a joint loss function based on focal loss to co-optimize both the deep trajectory features and the final classifier, which helps improve accuracy in predicting pedestrian destinations and trajectories, even in scenarios with class imbalances.

Plain English Explanation

Predicting how pedestrians move through urban areas is a complex task, as people can have many different destinations, like shops, escalators, or attractions. Additionally, factors like weather and time-of-day can influence how people walk around. To address this challenge, the researchers developed a new deep learning model called the "weather-time-trajectory fusion network" (WTTFNet).

The key idea behind WTTFNet is to incorporate information about the weather and time-of-day as part of the model, in addition to the data about the pedestrians' actual movements. This allows the model to better understand how these external factors impact people's walking behavior and trajectories. The model uses a specialized "gate multimodal unit" to combine this weather and time-of-day information with the deep learning features extracted from the trajectory data.

The researchers also designed a specialized loss function that helps the model predict pedestrian destinations and trajectories more accurately, even in cases where some destinations are much more common than others. This addresses the problem of "class imbalance," where the model might struggle to recognize less common destinations.

Overall, the WTTFNet model demonstrated significant improvements in predicting pedestrian trajectories compared to other state-of-the-art approaches. This could be useful for applications like pedestrian facility engineering, public space development, and technology-driven retail.

Technical Explanation

The researchers proposed the "weather-time-trajectory fusion network" (WTTFNet) to improve the performance of baseline deep neural network architectures in modeling pedestrian trajectories in urban environments. They identified that weather and time-of-day factors can significantly influence pedestrian behavior and destinations, which poses a challenge for existing models.

The key components of the WTTFNet architecture include:

An embedding structure to incorporate weather and time-of-day information
A novel "gate multimodal unit" to fuse the multimodal information (weather, time, trajectories)
A joint loss function based on focal loss to co-optimize the deep trajectory features and the final classifier

The focal loss function helps the model address class imbalance issues, where some pedestrian destinations are much more common than others. This allows the model to better recognize and predict less common destinations.

Experiments using the Osaka Asia and Pacific Trade Center (ATC) dataset showed that the proposed WTTFNet approach outperformed state-of-the-art algorithms by a significant margin. Specifically, it achieved a 23.67% increase in classification accuracy, a 9.16% reduction in average displacement error, and a 7.07% reduction in final displacement error.

Critical Analysis

The paper provides a comprehensive approach to incorporating weather and time-of-day information into a deep learning model for pedestrian trajectory prediction. The use of the gate multimodal unit and the focal loss function are innovative techniques that help the model handle the challenges of pedestrian behavior in urban environments.

However, the paper does not address the potential limitations of the dataset used for evaluation. The Osaka ATC dataset may not be representative of all urban environments, and the results may not generalize well to other locations with different pedestrian behaviors and infrastructure.

Additionally, the paper does not discuss the computational complexity and inference time of the WTTFNet model. In real-world applications, such factors may be important considerations, especially for applications that require real-time trajectory prediction.

Further research could explore the robustness of the WTTFNet approach in the face of incomplete or noisy sensor data, as well as its performance on larger and more diverse datasets. Comparisons to other multimodal fusion techniques, such as time-evidence fusion networks or deep multi-view channel-wise spatio-temporal models, could also provide additional insights.

Conclusion

The "weather-time-trajectory fusion network" (WTTFNet) proposed in this paper represents a significant advancement in pedestrian trajectory modeling for urban environments. By incorporating weather and time-of-day information, along with novel architectural and optimization techniques, the model demonstrates substantial improvements in predicting pedestrian destinations and trajectories compared to state-of-the-art approaches.

This research has important implications for a variety of applications, such as pedestrian facility engineering, public space development, and technology-driven retail. The ability to accurately forecast pedestrian movements can inform infrastructure design, urban planning, and business strategies, ultimately enhancing the experiences and safety of people in public spaces.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🌐

WTTFNet: A Weather-Time-Trajectory Fusion Network for Pedestrian Trajectory Prediction in Urban Complex

Ho Chun Wu, Esther Hoi Shan Lau, Paul Yuen, Kevin Hung, John Kwok Tai Chui, Andrew Kwok Fai Lui

Pedestrian trajectory modelling in an urban complex is challenging because pedestrians can have many possible destinations, such as shops, escalators, and attractions. Moreover, weather and time-of-day may affect pedestrian behavior. In this paper, a new weather-time-trajectory fusion network (WTTFNet) is proposed to improve the performance of baseline deep neural network architecture. By incorporating weather and time-of-day information as an embedding structure, a novel WTTFNet based on gate multimodal unit is used to fuse the multimodal information and deep representation of trajectories. A joint loss function based on focal loss is used to co-optimize both the deep trajectory features and final classifier, which helps to improve the accuracy in predicting the intended destination of pedestrians and hence the trajectories under possible scenarios of class imbalances. Experimental results using the Osaka Asia and Pacific Trade Center (ATC) dataset shows improved performance of the proposed approach over state-of-the-art algorithms by 23.67% increase in classification accuracy, 9.16% and 7.07% reduction of average and final displacement error. The proposed approach may serve as an attractive approach for improving existing baseline trajectory prediction models when they are applied to scenarios with influences of weather-time conditions. It can be employed in numerous applications such as pedestrian facility engineering, public space development and technology-driven retail.

5/30/2024

Urban Traffic Forecasting with Integrated Travel Time and Data Availability in a Conformal Graph Neural Network Framework

Mayur Patil, Qadeer Ahmed, Shawn Midlam-Mohler

Traffic flow prediction is a big challenge for transportation authorities as it helps in planning and developing better infrastructure. State-of-the-art models often struggle to consider the data in the best way possible, intrinsic uncertainties, and the actual physics of the traffic. In this study, we propose a novel framework to incorporate travel times between stations into a weighted adjacency matrix of a Graph Neural Network (GNN) architecture with information from traffic stations based on their data availability. To handle uncertainty, we utilized the Adaptive Conformal Prediction (ACP) method that adjusts prediction intervals based on real-time validation residuals. To validate our results, we model a microscopic traffic scenario and perform a Monte-Carlo simulation to get a travel time distribution for a Vehicle Under Test (VUT) while it is navigating the traffic scenario, and this distribution is compared against the actual data. Experiments show that the proposed model outperformed the next-best model by approximately 24% in MAE and 8% in RMSE and validation showed the simulated travel time closely matches the 95th percentile of the observed travel time value.

7/18/2024

Context-aware Multi-task Learning for Pedestrian Intent and Trajectory Prediction

Farzeen Munir, Tomasz Piotr Kucner

The advancement of socially-aware autonomous vehicles hinges on precise modeling of human behavior. Within this broad paradigm, the specific challenge lies in accurately predicting pedestrian's trajectory and intention. Traditional methodologies have leaned heavily on historical trajectory data, frequently overlooking vital contextual cues such as pedestrian-specific traits and environmental factors. Furthermore, there's a notable knowledge gap as trajectory and intention prediction have largely been approached as separate problems, despite their mutual dependence. To bridge this gap, we introduce PTINet (Pedestrian Trajectory and Intention Prediction Network), which jointly learns the trajectory and intention prediction by combining past trajectory observations, local contextual features (individual pedestrian behaviors), and global features (signs, markings etc.). The efficacy of our approach is evaluated on widely used public datasets: JAAD and PIE, where it has demonstrated superior performance over existing state-of-the-art models in trajectory and intention prediction. The results from our experiments and ablation studies robustly validate PTINet's effectiveness in jointly exploring intention and trajectory prediction for pedestrian behaviour modelling. The experimental evaluation indicates the advantage of using global and local contextual features for pedestrian trajectory and intention prediction. The effectiveness of PTINet in predicting pedestrian behavior paves the way for the development of automated systems capable of seamlessly interacting with pedestrians in urban settings.

7/25/2024

Wavelet-based Temporal Attention Improves Traffic Forecasting

Yash Jakhmola, Nitish Kumar Mishra, Kripabandhu Ghosh, Tanujit Chakraborty

Spatio-temporal forecasting of traffic flow data represents a typical problem in the field of machine learning, impacting urban traffic management systems. Traditional statistical and machine learning methods cannot adequately handle both the temporal and spatial dependencies in these complex traffic flow datasets. A prevalent approach in the field is to combine graph convolutional networks and multi-head attention mechanisms for spatio-temporal processing. This paper proposes a wavelet-based temporal attention model, namely a wavelet-based dynamic spatio-temporal aware graph neural network (W-DSTAGNN), for tackling the traffic forecasting problem. Benchmark experiments using several statistical metrics confirm that our proposal efficiently captures spatio-temporal correlations and outperforms ten state-of-the-art models on three different real-world traffic datasets. Our proposed ensemble data-driven method can handle dynamic temporal and spatial dependencies and make long-term forecasts in an efficient manner.

7/8/2024