FindingFeasibleRouteswithReinforcementLearningUsingMacro资源-CSDN文库

需积分: 5 38 浏览量 2024-08-15 03:19:48 上传评论收藏 680KB PDF 举报

资源推荐

资源详情

资源评论

Finding Feasible Routes with Reinforcement

Learning Using Macro-Level Traﬃc Measurements

Mustafa Can Ozkan

1

#

SpaceTimeLab, University College London, UK

Tao Cheng #

SpaceTimeLab, University College London, UK

Abstract

The quest for identifying feasible routes holds immense signiﬁcance in the realm of transportation,

spanning a diverse range of applications, from logistics and emergency systems to taxis and public

transport services. This research area oﬀers multifaceted beneﬁts, including optimising traﬃc

management, maximising traﬃc ﬂow, and reducing carbon emissions and fuel consumption. Extensive

studies have been conducted to address this critical issue, with a primary focus on ﬁnding the

shortest paths, while some of them incorporate various traﬃc conditions such as waiting times at

traﬃc lights and traﬃc speeds on road segments. In this study, we direct our attention towards

historical data sets that encapsulate individuals’ route preferences, assuming they encompass all

traﬃc conditions, real-time decisions and topological features. We acknowledge that the prevailing

preferences during the recorded period serve as a guide for feasible routes. The study’s noteworthy

contribution lies in our departure from analysing individual preferences and trajectory information,

instead focusing solely on macro-level measurements of each road segment, such as traﬃc ﬂow or

traﬃc speed. These types of macro-level measurements are easier to collect compared to individual

data sets. We propose an algorithm based on Q-learning, employing traﬃc measurements within a

road network as positive attractive rewards for an agent. In short, observations from macro-level

decisions will help us to determine optimal routes between any two points. Preliminary results

demonstrate the agent’s ability to accurately identify the most feasible routes within a short training

period.

2012 ACM Subject Classiﬁcation Computing methodologies → Q-learning

Keywords and phrases routing, reinforcement learning, q-learning, data mining, macro-level patterns

Digital Object Identiﬁer 10.4230/LIPIcs.GIScience.2023.58

Category Short Paper

1 Introduction

The topic of ﬁnding routes between two points has been studied in many diﬀerent ﬁelds, such

as computer systems, transportation systems and communication networks. The majority

of research concentrates on route optimisation, seeking to reduce travel time or distance

or to maximise operational eﬃciencies, such as the maximum number of taxi customers

or the maximum storage of a delivery truck. These studies, which employ mathematical

optimisation techniques, include optimisation constraints such as the truck’s maximum cargo

capacity and minimise/maximise the objective function of the main aim, such as travel time.

They often take into account the average travel time on a route depending on the length of

the road, the timing of the traﬃc lights, or occasionally the traﬃc situation, including actual

or historical traﬃc ﬂow and speeds. They also factor in user preferences from surveys or GPS

1

corresponding author

© Mustafa Can Ozkan and Tao Cheng;

licensed under Creative Commons License CC-BY 4.0

12th International Conference on Geographic Information Science (GIScience 2023).

Editors: Roger Beecham, Jed A. Long, Dianna Smith, Qunshan Zhao, and Sarah Wise; Article No. 58; pp. 58:1–58:6

Leibniz International Proceedings in Informatics

Schloss Dagstuhl – Leibniz-Zentrum für Informatik, Dagstuhl Publishing, Germany

本内容试读结束，登录后可阅读更多

下载后可阅读完整内容，剩余5页未读，立即下载

内容反馈

gllgool

粉丝: 183
资源: 7

最新资源

资源上传下载、课程学习等过程中有任何疑问或建议，欢迎提出宝贵意见哦~我们会及时处理！点击此处反馈

feedback-tip