The French National Centre for Scientific Research

Post-doc

Post-Doc Position: Safe AI Planning and Reinforcement Learning using Formal Methods

접수중2025.10.11~2026.01.01

채용 정보

  • 접수 기간

    2025.10.11 00:00~2026.01.01 12:00

  • 접수 방법

    홈페이지지원더보기

  • 채용 구분

    경력 무관

  • 고용 형태

    계약직

  • 지원 자격

    박사

  • 모집 전공

    제어계측공학, 정보・통신공학, 전자공학, 전산학・컴퓨터공학, 전기공학, 의공학, 응용소프트웨어공학, 광학공학더보기

  • 기관 유형

    연구기관

  • 근무 지역

    해외(프랑스)더보기

  • 연봉 정보

The post-doc position is part of a collaboration between Inria and Mitsubishi Electric R&D Centre Europe (MERCE) within the FRAIME project on artificial intelligence and formal methods. The project explores, on the one hand, how Formal Methods can provide guarantees on AI systems, and on the other hand how AI can help Formal Methods to be more efficient and easier to use by practitioners. The vision is to intertwine Formal Methods and AI to efficiently design safe systems.

This is a postdoctoral position in the fields of AI planning, reinforcement learning (RL), and formal methods. The position is initially funded for 12 months, but it is further extensible to at least another year.

While this is an academic position based at Inria Rennes, the candidate will collaborate with researchers from both Inria and MERCE, thus benefiting from both academic and industrial research environments.
The work will be done in collaboration with Nathalie Bertrand and Ocan Sankur (Inria DEVINE team https://devine.inria.fr/) and Benoît Boyer (MERCE).

The main objective is to develop safe planning and reinforcement learning algorithms with various degrees of confidence for variants of Markov decision processes.
More precisely, we will develop algorithms for multi-environment MDPs, partially observable MDPs, and their variants and apply these in appropriate applications provided by MERCE.

We will focus on developing practical solutions for these formalisms. Some possibilities are to develop solutions based on dynamic programming over finite horizon, or using mathematical solvers, or adapting reinforcement learning algorithms to the desired context. Furthermore, the candidate can also study theoretical properties of the developed algorithms such as their complexity, optimality, and measures such as the regret.
These algorithms are expected to be validated experimentally on appropriate case studies.

The overall objective is to contribute to the state of the art of planning and RL algorithms with strong safety guarantees.

Some references:
- Sun et al. Online MDP with Prototypes Information: A Robust Adaptive Approach. AAAI 2025.
- Royer et al. Multiple-environment markov decision processes: Efficient analysis and applications. ICAPS 2020.
- Chatterjee et al. The Value Problem for Multiple-Environment MDPs with Parity Objective. ICALP 2025.

Advantages

  • Subsidized meals
  • Partial reimbursement of public transport costs
  • Leave: 7 weeks of annual leave + 10 extra days off due to RTT (statutory reduction in working hours) + possibility of exceptional leave (sick children, moving home, etc.)
  • Possibility of teleworking (after 6 months of employment) and flexible organization of working hours
  • Professional equipment available (videoconferencing, loan of computer equipment, etc.)
  • Social, cultural and sports events and activities
  • Access to vocational training

The candidate can start anytime during the 2025-2026 academic year.

Salary: Monthly gross salary of 2788 euros

근무 예정지

대표The French National Centre for Scientific Research(해외) : 3, rue Michel-Ange 75794 Paris cedex 16

해외(프랑스) : France, IRISA, Inria, CNRS, Université de Rennes, Rennes, 35000

관련 키워드

Computer science

기관 정보

The French National Centre for Scientific Research

닫기신규 공고 알림받기신규 공고 알림받기 관심 기관 설정으로 신규 공고를 누구보다
먼저 받아보세요.

  • 기관유형

    연구기관

  • 대표전화

    +33 1 44 96 40 00

  • 대표주소

    3, rue Michel-Ange 75794 Paris cedex 16

  • 홈페이지

    바로가기

이 공고와 함께
많이 본 공고

이런 공고는 어떠세요?