Dans le cadre de l’axe transverse IA-Science des données, le LIRMM accueillera le Lundi 18 Novembre à 15h, le Professeur Preslav Nakov qui donnera un séminaire dans l’Amphi St Priest (JJ Moreau – Bât 2).
Vous êtes attendu à partir de 14h30 pour un accueil café. Séminaire gratuit et sans inscription.
We will discuss the risks, the challenges, and the opportunities that Large Language Models (LLMs) bring regarding factuality. We will then delve into our recent work on using LLMs for fact-checking, on detecting machine-generated text, and on fighting the ongoing misinformation pollution with LLMs. We will also discuss work on safeguarding LLMs, and the safety mechanisms we incorporated in Jais-chat, the world’s best open Arabic-centric foundation and instruction-tuned LLM, based on our Do-Not-Answer dataset. Finally, we will present a number of LLM fact-checking tools recently developed at MBZUAI: (i) LM-Polygraph, a tool to predict an LLM’s uncertainty in its output using cheap and fast uncertainty quantification techniques, (ii) Factcheck-Bench, a fine-grained evaluation benchmark and framework for fact-checking the output of LLMs, (iii) OpenFactVerification (Loki), an open-source tool for fact-checking the output of LLMs, developed based on Factcheck-Bench and optimized for speed and quality, and (iv) OpenFactCheck, a framework for building customized fact-checking systems and for benchmarking entire LLMs.
Professor and Department Chair for NLP
Natural Language Processing
Mohamed bin Zayed University of Artificial Intelligence
Dr. Preslav Nakov is a Professor and Chair of the Natural Language Processing department at Mohamed bin Zayed University of Artificial Intelligence (MBZUAI). His research focuses on computational linguistics, large language models, fact-checking, disinformation, propaganda, and detecting machine-generated text. He helped develop Jais, the leading open-source Arabic-centric LLM, and is part of MBZUAI’s LLM360 team. Nakov holds a PhD from UC Berkeley and has held roles at Qatar Computing Research Institute, the National University of Singapore, and Sofia University. He has authored multiple books and over 300 research papers,
receiving numerous awards for his work on fake news detection, propaganda, and machine-generated content. He is Chair-Elect of the European Chapter of the Association for Computational Linguistics (EACL) and serves on the editorial boards of several prestigious journals. His research has been featured in 100+ media outlets, including MIT Technology Review, Forbes, and CNN.