Loughborough University
Leicestershire, UK
LE11 3TU
+44 (0)1509 263171
Loughborough University

Loughborough University Institutional Repository

Please use this identifier to cite or link to this item: https://dspace.lboro.ac.uk/2134/32720

Title: Agent-based modelling for driving policy learning in connected and autonomous vehicles
Authors: De Silva, Varuna
Kondoz, Ahmet
Keywords: Agent-based learning
Reinforcement learning
Driving policy
Data driven control
Imitation learning
Issue Date: 2018
Publisher: IEEE
Citation: DE SILVA, V. and KONDOZ, A., 2018. Agent-based modelling for driving policy learning in connected and autonomous vehicles. Presented at the Intelligent Systems Conference (IntelliSys 2018), London, 6-7th September.
Abstract: Due to the complexity of the natural world, a programmer cannot foresee all possible situations a connected and autonomous vehicle (CAV) will face during its operation, and hence, CAVs will need to learn to make decisions autonomously. Due to the sensing of its surroundings and information exchanged with other vehicles and road infrastructure a CAV will have access to large amounts of useful data. While different control algorithms have been proposed for CAVs, the benefits brought about by connectedness of autonomous vehicles to other vehicles and to the infrastructure, and its implications on policy learning has not been investigated in literature. This paper investigates a data driven driving policy learning framework through an agent-based modelling approaches. The contributions of the paper are twofold. A dynamic programming framework is proposed for in vehicle policy learning with and without connectivity to neighboring vehicles. The simulation results indicate that while a CAV can learn to make autonomous decisions, vehicle-to-vehicle (V2V) communication of information improves this capability. Furthermore, to overcome the limitations of sensing in a CAV, the paper proposes a novel concept for infrastructure-led policy learning and communication with autonomous vehicles. In infrastructure-led policy learning, road-side infrastructure senses and captures successful vehicle maneuvers and learns an optimal policy from those temporal sequences, and when a vehicle approaches the road-side unit, the policy is communicated to the CAV. Deep-imitation learning methodology is proposed to develop such an infrastructure-led policy learning framework.
Description: This paper is in closed access until it is published.
Version: Accepted for publication
URI: https://dspace.lboro.ac.uk/2134/32720
Publisher Link: https://ieeexplore.ieee.org/Xplore/home.jsp
Appears in Collections:Closed Access (Loughborough University London)

Files associated with this item:

File Description SizeFormat
CameraReady.pdfAccepted version591.05 kBAdobe PDFView/Open


SFX Query

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.