EGAT: Extended Graph Attention Network for Pedestrian Trajectory Prediction

To improve foresight and make correct judgment in advance, pedestrian trajectory prediction has a wide range of application values in autonomous driving, robot interaction, and safety monitoring. However, most of the existing methods only focus on the interaction of local pedestrians according to distance, ignoring the influence of far pedestrians; the range of network input (receptive field) is small. In this paper, an extended graph attention network (EGAT) is proposed to increase receptive field, which focuses not only on local pedestrians, but also on those who are far away, to further strengthen pedestrian interaction. In the temporal domain, TSG-LSTM (TS-LSTM and TG-LSTM) and P-LSTM are proposed based on LSTM to enhance information transmission by residual connection. Compared with state-of-the-art methods, the model EGAT achieves excellent performance on both ETH and UCY public datasets and generates more reliable trajectories.

Download Full-text

A Dynamic and Static Context-Aware Attention Network for Trajectory Prediction

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi10050336 ◽

2021 ◽

Vol 10 (5) ◽

pp. 336

Author(s):

Jian Yu ◽

Meng Zhou ◽

Xin Wang ◽

Guoliang Pu ◽

Chengqi Cheng ◽

...

Keyword(s):

Real World ◽

Autonomous Driving ◽

Attention Mechanism ◽

Context Aware ◽

Trajectory Prediction ◽

Attention Network ◽

Series Of Experiments ◽

Real World Datasets ◽

The Moment ◽

Autonomous Driving System

Forecasting the motion of surrounding vehicles is necessary for an autonomous driving system applied in complex traffic. Trajectory prediction helps vehicles make more sensible decisions, which provides vehicles with foresight. However, traditional models consider the trajectory prediction as a simple sequence prediction task. The ignorance of inter-vehicle interaction and environment influence degrades these models in real-world datasets. To address this issue, we propose a novel Dynamic and Static Context-aware Attention Network named DSCAN in this paper. The DSCAN utilizes an attention mechanism to dynamically decide which surrounding vehicles are more important at the moment. We also equip the DSCAN with a constraint network to consider the static environment information. We conducted a series of experiments on a real-world dataset, and the experimental results demonstrated the effectiveness of our model. Moreover, the present study suggests that the attention mechanism and static constraints enhance the prediction results.

Download Full-text

5G Mobile Services and Scenarios: Challenges and Solutions

Sustainability ◽

10.3390/su10103626 ◽

2018 ◽

Vol 10 (10) ◽

pp. 3626 ◽

Cited By ~ 18

Author(s):

Yousaf Zikria ◽

Sung Kim ◽

Muhammad Afzal ◽

Haoxiang Wang ◽

Mubashir Rehmani

Keyword(s):

Mobile Network ◽

Future Internet ◽

Autonomous Driving ◽

Mobile Services ◽

Data Traffic ◽

Fifth Generation ◽

Wide Range ◽

Industrial Iot ◽

5G Network ◽

Massive Number

The Fifth generation (5G) network is projected to support large amount of data traffic and massive number of wireless connections. Different data traffic has different Quality of Service (QoS) requirements. 5G mobile network aims to address the limitations of previous cellular standards (i.e., 2G/3G/4G) and be a prospective key enabler for future Internet of Things (IoT). 5G networks support a wide range of applications such as smart home, autonomous driving, drone operations, health and mission critical applications, Industrial IoT (IIoT), and entertainment and multimedia. Based on end users’ experience, several 5G services are categorized into immersive 5G services, intelligent 5G services, omnipresent 5G services, autonomous 5G services, and public 5G services. In this paper, we present a brief overview of 5G technical scenarios. We then provide a brief overview of accepted papers in our Special Issue on 5G mobile services and scenarios. Finally, we conclude this paper.

Download Full-text

FAC-Net: Feedback Attention Network Based on Context Encoder Network for Skin Lesion Segmentation

Sensors ◽

10.3390/s21155172 ◽

2021 ◽

Vol 21 (15) ◽

pp. 5172

Author(s):

Yuying Dong ◽

Liejun Wang ◽

Shuli Cheng ◽

Yongming Li

Keyword(s):

Skin Lesion ◽

Real Life ◽

Good Deal ◽

Experimental Tests ◽

Skin Lesions ◽

Lesion Segmentation ◽

Attention Network ◽

Effective Feedback ◽

Public Datasets ◽

Learning Architectures

Considerable research and surveys indicate that skin lesions are an early symptom of skin cancer. Segmentation of skin lesions is still a hot research topic. Dermatological datasets in skin lesion segmentation tasks generated a large number of parameters when data augmented, limiting the application of smart assisted medicine in real life. Hence, this paper proposes an effective feedback attention network (FAC-Net). The network is equipped with the feedback fusion block (FFB) and the attention mechanism block (AMB), through the combination of these two modules, we can obtain richer and more specific feature mapping without data enhancement. Numerous experimental tests were given by us on public datasets (ISIC2018, ISBI2017, ISBI2016), and a good deal of metrics like the Jaccard index (JA) and Dice coefficient (DC) were used to evaluate the results of segmentation. On the ISIC2018 dataset, we obtained results for DC equal to 91.19% and JA equal to 83.99%, compared with the based network. The results of these two main metrics were improved by more than 1%. In addition, the metrics were also improved in the other two datasets. It can be demonstrated through experiments that without any enhancements of the datasets, our lightweight model can achieve better segmentation performance than most deep learning architectures.

Download Full-text

The Impact of Mental States on Semi-autonomous Driving Takeover Performance: A Systematic Review

Proceedings of the Human Factors and Ergonomics Society Annual Meeting ◽

10.1177/1071181320641328 ◽

2020 ◽

Vol 64 (1) ◽

pp. 1372-1376

Author(s):

Gaojian Huang ◽

Christine Petersen ◽

Brandon J. Pitts

Keyword(s):

Systematic Review ◽

Autonomous Vehicles ◽

Response Times ◽

Autonomous Driving ◽

Mental States ◽

Manual Control ◽

Monitoring Systems ◽

Wide Range ◽

Different Types ◽

The Impact

Semi-autonomous vehicles still require drivers to occasionally resume manual control. However, drivers of these vehicles may have different mental states. For example, drivers may be engaged in non-driving related tasks or may exhibit mind wandering behavior. Also, monitoring monotonous driving environments can result in passive fatigue. Given the potential for different types of mental states to negatively affect takeover performance, it will be critical to highlight how mental states affect semi-autonomous takeover. A systematic review was conducted to synthesize the literature on mental states (such as distraction, fatigue, emotion) and takeover performance. This review focuses specifically on five fatigue studies. Overall, studies were too few to observe consistent findings, but some suggest that response times to takeover alerts and post-takeover performance may be affected by fatigue. Ultimately, this review may help researchers improve and develop real-time mental states monitoring systems for a wide range of application domains.

Download Full-text

Practical Model Selection for Prospective Virtual Screening

10.1101/337956 ◽

2018 ◽

Cited By ~ 1

Author(s):

Shengchao Liu ◽

Moayad Alnammi ◽

Spencer S. Ericksen ◽

Andrew F. Voter ◽

Gene E. Ananiev ◽

...

Keyword(s):

Random Forest ◽

Virtual Screening ◽

Protein Interactions ◽

High Throughput Screening ◽

Screening Methods ◽

Protein Protein Interactions ◽

Screening Algorithm ◽

Screening Performance ◽

Wide Range ◽

Public Datasets

AbstractVirtual (computational) high-throughput screening provides a strategy for prioritizing compounds for experimental screens, but the choice of virtual screening algorithm depends on the dataset and evaluation strategy. We consider a wide range of ligand-based machine learning and docking-based approaches for virtual screening on two protein-protein interactions, PriA-SSB and RMI-FANCM, and present a strategy for choosing which algorithm is best for prospective compound prioritization. Our workflow identifies a random forest as the best algorithm for these targets over more sophisticated neural network-based models. The top 250 predictions from our selected random forest recover 37 of the 54 active compounds from a library of 22,434 new molecules assayed on PriA-SSB. We show that virtual screening methods that perform well in public datasets and synthetic benchmarks, like multi-task neural networks, may not always translate to prospective screening performance on a specific assay of interest.

Download Full-text

Mosaic Super-resolution via Sequential Feature Pyramid Networks

10.36227/techrxiv.11402130 ◽

2019 ◽

Author(s):

Mehrdad Shoeiby ◽

Mohammad Ali Armin ◽

Sadegh Aliakbarian ◽

Saeed Anwar ◽

Lars petersson

Keyword(s):

State Of The Art ◽

Super Resolution ◽

Autonomous Driving ◽

Single Shot ◽

Current State ◽

Wide Range ◽

Feature Pyramid ◽

Novel Method ◽

Convolutional Lstm ◽

Mosaic Images

<div>Advances in the design of multi-spectral cameras have</div><div>led to great interests in a wide range of applications, from</div><div>astronomy to autonomous driving. However, such cameras</div><div>inherently suffer from a trade-off between the spatial and</div><div>spectral resolution. In this paper, we propose to address</div><div>this limitation by introducing a novel method to carry out</div><div>super-resolution on raw mosaic images, multi-spectral or</div><div>RGB Bayer, captured by modern real-time single-shot mo-</div><div>saic sensors. To this end, we design a deep super-resolution</div><div>architecture that benefits from a sequential feature pyramid</div><div>along the depth of the network. This, in fact, is achieved</div><div>by utilizing a convolutional LSTM (ConvLSTM) to learn the</div><div>inter-dependencies between features at different receptive</div><div>fields. Additionally, by investigating the effect of different</div><div>attention mechanisms in our framework, we show that a</div><div>ConvLSTM inspired module is able to provide superior at-</div><div>tention in our context. Our extensive experiments and anal-</div><div>yses evidence that our approach yields significant super-</div><div>resolution quality, outperforming current state-of-the-art</div><div>mosaic super-resolution methods on both Bayer and multi-</div><div>spectral images. Additionally, to the best of our knowledge,</div><div>our method is the first specialized method to super-resolve</div><div>mosaic images, whether it be multi-spectral or Bayer.</div><div><br></div>

Download Full-text

Deep Predictive Autonomous Driving Using Multi-Agent Joint Trajectory Prediction and Traffic Rules

2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) ◽

10.1109/iros40897.2019.8967708 ◽

2019 ◽

Cited By ~ 2

Author(s):

Kyunghoon Cho ◽

Timothy Ha ◽

Gunmin Lee ◽

Songhwai Oh

Keyword(s):

Autonomous Driving ◽

Trajectory Prediction ◽

Multi Agent ◽

Traffic Rules

Download Full-text

Road-Aware Trajectory Prediction for Autonomous Driving on Highways

Sensors ◽

10.3390/s20174703 ◽

2020 ◽

Vol 20 (17) ◽

pp. 4703

Author(s):

Yookhyun Yoon ◽

Taeyeon Kim ◽

Ho Lee ◽

Jahnghyon Park

Keyword(s):

Deep Learning ◽

Autonomous Vehicles ◽

Prediction Method ◽

Autonomous Driving ◽

Structural Constraints ◽

High Definition ◽

Trajectory Prediction ◽

The Road ◽

Road Geometry ◽

Efficient Learning

For driving safely and comfortably, the long-term trajectory prediction of surrounding vehicles is essential for autonomous vehicles. For handling the uncertain nature of trajectory prediction, deep-learning-based approaches have been proposed previously. An on-road vehicle must obey road geometry, i.e., it should run within the constraint of the road shape. Herein, we present a novel road-aware trajectory prediction method which leverages the use of high-definition maps with a deep learning network. We developed a data-efficient learning framework for the trajectory prediction network in the curvilinear coordinate system of the road and a lane assignment for the surrounding vehicles. Then, we proposed a novel output-constrained sequence-to-sequence trajectory prediction network to incorporate the structural constraints of the road. Our method uses these structural constraints as prior knowledge for the prediction network. It is not only used as an input to the trajectory prediction network, but is also included in the constrained loss function of the maneuver recognition network. Accordingly, the proposed method can predict a feasible and realistic intention of the driver and trajectory. Our method has been evaluated using a real traffic dataset, and the results thus obtained show that it is data-efficient and can predict reasonable trajectories at merging sections.

Download Full-text

Probabilistic Crowd GAN: Multimodal Pedestrian Trajectory Prediction Using a Graph Vehicle-Pedestrian Attention Network

IEEE Robotics and Automation Letters ◽

10.1109/lra.2020.3004324 ◽

2020 ◽

Vol 5 (4) ◽

pp. 5026-5033

Author(s):

Stuart Eiffert ◽

Kunming Li ◽

Mao Shan ◽

Stewart Worrall ◽

Salah Sukkarieh ◽

...

Keyword(s):

Trajectory Prediction ◽

Attention Network

Download Full-text

Phonetics and Ambiguity Comprehension Gated Attention Network for Humor Recognition

Complexity ◽

10.1155/2020/2509018 ◽

2020 ◽

Vol 2020 ◽

pp. 1-9 ◽

Cited By ~ 1

Author(s):

Xiaochao Fan ◽

Hongfei Lin ◽

Liang Yang ◽

Yufeng Diao ◽

Chen Shen ◽

...

Keyword(s):

Neural Network ◽

Semantic Information ◽

Semantic Representation ◽

Research Attention ◽

Attention Network ◽

Phonetic Information ◽

The Neural Network ◽

Ambiguous Words ◽

Public Datasets

Humor refers to the quality of being amusing. With the development of artificial intelligence, humor recognition is attracting a lot of research attention. Although phonetics and ambiguity have been introduced by previous studies, existing recognition methods still lack suitable feature design for neural networks. In this paper, we illustrate that phonetics structure and ambiguity associated with confusing words need to be learned for their own representations via the neural network. Then, we propose the Phonetics and Ambiguity Comprehension Gated Attention network (PACGA) to learn phonetic structures and semantic representation for humor recognition. The PACGA model can well represent phonetic information and semantic information with ambiguous words, which is of great benefit to humor recognition. Experimental results on two public datasets demonstrate the effectiveness of our model.

Download Full-text