Hi, I’m Jérémie 👋

I’m a PhD candidate in ML security and privacy at École Polytechnique Paris, supervised by Prof. Sonia Vanier and Prof. Davide Buscaldi. I also collaborate with Crédit Agricole as part of the “Responsible and Trustworthy AI” partnership with École Polytechnique.

My research focuses on security and privacy issues in machine learning. I’m particularly interested in the robustness of Large Language Models against privacy attacks. I have developed new adversarial attacks that reconstruct training data from multimodal models. I am now working on auditing tools to better understand and predict memorization, so that practitioners can develop more robust models.

I grew up in the south of France, near Aix-en-Provence. I followed the French engineering curriculum, with two years of “classe préparatoire” at Sainte Geneviève in Versailles, followed by the engineering program at École Polytechnique, from which I graduated in 2022. I also graduated from the “MVA” (“Mathematics, Vision, leArning”) Master’s program at ENS Paris-Saclay in 2023. I started my PhD at École Polytechnique Paris in 2023.

🔥 News

2024.09: 🗣️ I attended the Google Responsible AI Summit in Paris to present some of my work, including our latest preprint
2024.09: 👨‍🏫 I started teaching labs for the “Machine Learning and Deep Learning” Master’s course at École Polytechnique
2024.08: 🗣️ I was at the Usenix Security Symposium in Philadelphia to present our paper Reconstructing Training Data From Document Understanding Models
2024.06: 🥳 Our paper was accepted to Usenix Security 24! See our preprint here!
2024.03: 🍾 The “Responsible and Trustworthy AI” partnership between Crédit Agricole and École Polytechnique has been signed! Check out this article here.

🔈 Invited talks

2025.01.28: FIIA 2025, Paris. Measuring and understanding privacy risks in Language Models. Sonia Vanier and Jérémie Dentan. [Slides] [LinkedIn Post]
2024.10.02: Google Responsible AI Summit, Paris. Towards security and privacy in document understanding models. Sonia Vanier and Jérémie Dentan. [Slides] [LinkedIn Post]
2024.10.02: Google Responsible AI Summit, Paris. Trust and Security in AI. Sonia Vanier and Jérémie Dentan. [Slides] [LinkedIn Post]

📝 Publications

Preprint

Predicting and analyzing memorization within fine-tuned Large Language Models

Arxiv Preprint - September 2024

Jérémie Dentan, Davide Buscaldi, Aymen Shabou, Sonia Vanier

An auditing tool for practitioners to evaluate their models and predict vulnerable samples before they are memorized
Theoretical justification and strong empirical results
Easy to use with a low computational budget
[Code] [Pitch Slides]

Usenix Security 24

Reconstructing Training Data From Document Understanding Models

Usenix Security Symposium - August 2024

Jérémie Dentan, Arnaud Paran, Aymen Shabou

Presents the first reconstruction attacks against document understanding models
Strong empirical results, with up to 4.1% of perfect reconstructions
Focus on the impact of multimodality on the performance of the attack

Using Error Level Analysis to remove Underspecification Jérémie Dentan. 2023.
Towards a reliable detection of forgeries based on demosaicing Jérémie Dentan. 2023.
Cellular Component Ontology Prediction Jérémie Dentan, Abdellah El Mrini, Meryem Jaaidan. 2023.

📖 Education

2022-2023, Master of Science in Mathematics, Vision, leArning (MVA). ENS Paris-Saclay, Gif-sur-Yvette, France.
2019-2022, Master of Engineering in Applied Mathematics and Computer Science. École Polytechnique, Palaiseau, France.
2017-2019, Classe préparatoire MPSI/MP*. Lycée Privé Ste Geneviève, Versailles, France.

💻 Internships

2023.04 - 2023.09, Research Assistant. Crédit Agricole DataLab Groupe, Montrouge, France.
2022.04 - 2022.09, Research Assistant. Oracle Labs, Zurich, Switzerland.
2021.06 - 2021.08, Business Analyst. BearingPoint, Paris, France.