ENBIS-24 Leuven Conference

Name: ENBIS-24 Leuven Conference
Start: 2024-09-15T09:00:00+02:00
End: 2024-09-19T22:00:00+02:00
Location: Leuven, Belgium

15–19 Sept 2024

Leuven, Belgium

Europe/Berlin timezone

Chair of the Local Organising Committee

Optimising for average reward in a continuing environment: an application to industrial production planning

17 Sept 2024, 11:30

30m

Conference room 1

AI in Industry frEnbis invited session: Deep learning in industry

Paul Berhaut (Air Liquide)

Our research addresses the industrial challenge of minimising production costs in an undiscounted, continuing, partially observable setting. We argue that existing state-of-the-art reinforcement learning algorithms are unsuitable for this context. We introduce Clipped Horizon Average Reward (CHAR), a method tailored for undiscounted optimisation. CHAR is an extension applicable to any off-policy reinforcement learning algorithm which exploits known characteristic times of environments to simplify the problem. We apply CHAR to an industrial gas supplier case study and demonstrate its superior performance in the specific studied environment. Finally, we benchmark our results against the standard industry algorithm, presenting the merits and drawbacks of our approach.

Type of presentation	Talk
Classification	Mainly application
Keywords	reinforcement learning, production planning, industrial application

Paul Berhaut (Air Liquide)

Mr Patrick Sampaio dos Santos Brandao (Air Liquide)

There are no materials yet.

ENBIS-24 Leuven Conference

Chair of the Local Organising Committee

Optimising for average reward in a continuing environment: an application to industrial production planning

Conference room 1

Speaker

Description

Primary author

Co-author

Presentation materials

Choose timezone

ENBIS-24 Leuven Conference

Chair of the Local Organising Committee

Speaker

Description

Primary author

Co-author

Presentation materials