Hi, I’m Jérémie 👋

I’m a PhD candidate in ML security and privacy at École Polytechnique Paris, supervised by Prof. Sonia Vanier and Prof. Davide Buscaldi. I also collaborate with Crédit Agricole as part of the “Responsible and Trustworthy AI” partnership with École Polytechnique.

My research focuses on security and privacy issues in machine learning. I’m particularly interested in the robustness of Large Language Models against privacy attacks. I have developed some new adversarial attacks to reconstruct training data from multimodal models. I am now working on auditing tools to better understand and predict memorization, for practitioners to develop robust models.

I grew up in the south of France, near Aix-en-Provence. I followed the French engineering curriculum, with two years of “classe préparatoire” at Sainte Geneviève in Versailles, before attending the engineering curriculum at École Polytechnique, where I graduated in 2022. I also graduated from ENS Paris-Saclay “MVA” Master in 2023, specializing in “Mathematics, Vision, leArning”. I started my PhD at École Polytechnique Paris in 2023.

🔥 News

🔈 Invited talks

  • 2025.01.28: FIIA 2025, Paris. Measuring and understanding ivacy risks in Language Models. Sonia Vanier and Jérémie Dentan. [Slides] [LinkedIn Post]
  • 2024.10.02: Google Responsible AI Summit, Paris. Towards security and privacy in document understanding models. Sonia Vanier and Jérémie Dentan. [Slides] [LinkedIn Post]
  • 2024.10.02: Google Responsible AI Summit, Paris. Trust and Security in AI. Sonia Vanier and Jérémie Dentan. [Slides] [LinkedIn Post]

📝 Publications

Preprint
sym

Predicting and analyzing memorization within fine-tuned Large Language Models

Arxiv Preprint - September 2024

Jérémie Dentan, Davide Buscaldi, Aymen Shabou, Sonia Vanier

  • An auditing tool for practitioners to evaluate their models and predict vulnerable samples before they are memorized
  • Theoretical justification and strong empirical results
  • Easy to use with a low computational budget
  • [Code] [Pitch Slides]
Usenix Security 24
sym

Reconstructing Training Data From Document Understanding Models

Usenix Security Symposium - August 2024

Jérémie Dentan, Arnaud Paran, Aymen Shabou

  • Presents the first reconstruction attacks against document understanding models
  • Strong empirical results, with up to 4.1% of perfect reconstructions
  • Focus on the impact of multimodality on the performance of the attack

📖 Educations

  • 2022-2023, Master of Science in Mathematics, Vision, leArning (MVA). ENS Paris-Saclay, Gif-sur-Yvette, France.
  • 2019-2022, Master of Engineering in Applied Mathematics and Computer Science. École Polytechnique, Palaiseau, France.
  • 2017-2019, Classe préparatoire MPSI/MP*. Lycée Privé Ste Geneviève, Versailles, France.

💻 Internships

  • 2023.04 - 2023.09, Research Assistant. Crédit Agricole DataLab Groupe, Montrouge, France.
  • 2022.04 - 2022.09, Research Assistant. Oracle Labs, Zurich, Switzerland.
  • 2021.06 - 2021.08, Business Analyst. BearingPoint, Paris, France.