Doing More with Less while Not Losing Trust: How Should Evaluation Standards Evolve in an AI Augmented World?

Sobre o evento

AI has the potential to rapidly reshaping how evaluations are planned, conducted, synthesised, and communicated particularly in low resource and humanitarian contexts, where pressure to “do more with less” is acute. However, it simultaneously introduces significant ethical, methodological, and governance risks that directly challenge the profession’s credibility, independence, and trustworthiness.

AI adds the greatest value when it reduces duplication, lowers transaction costs, and frees human expertise for judgment, ethics, and contextual interpretation, not when it substitutes core evaluative functions or obscures accountability. This creates an urgent need for professional standards to evolve, not by endorsing AI wholesale but defining where it is appropriate, where it is not, and what competencies and safeguards are required.

Looking at their own use case, this roundtable will explore questions around how evaluation standards, competencies, and institutional frameworks adapt to ensure AI strengthen rather than undermine ethical practice, methodological rigor, and public trust. It will be particularly relevant for evaluators and commissioners wrestling with how to translate high-level AI ethics and governance principles into concrete evaluation standards and professional practice.

Orador/a

Nome	Título	Biography
Alexandra Priebe, PhD	Evaluation Officer, OEV WFP	Alexandra Priebe is an Evaluation Officer in WFP’s Office of Evaluation, Use Unit, supporting the development and adoption of an AI evidence mining tool. She brings 20+ years’ experience across humanitarian and development contexts.
Steven Jonckheere	Senior Evaluation Officer, IOE IFAD	Steven Jonckheere is Senior Evaluation Officer at IFAD’s Independent Office of Evaluation (IOE). With over 20 years of experience in agricultural and rural development across Africa, Asia, Europe, and Latin America, he specializes in social inclusion, gender, and methodological innovation. At IOE, Steven oversees and leads the Office’s work on AI and innovation in evaluation.
Fabrizio Felloni	Deputy Director, Independent Evaluation Office of the GEF	Fabrizio Felloni is Chief Evaluation Officer and Deputy Director at the Independent Evaluation Office of the Global Environment Facility (Washington DC), from September 2024. He was previously, Deputy Director at the Independent Office of Evaluation of IFAD (2016-2024), Lead Evaluation Officer in the same office (2010-2016), Evaluation Specialist at UNDP (2008-2010) and at IFAD (2001-2007). He has led project, country-level, sub-regional, thematic and corporate evaluations in over 25 countries (Africa, Asia, Eastern Europe, Latin America). He holds a Master’s Degree in Agricultural Economics from Washington State University and a Master-equivalent degree in Social and Economic Sciences from Bocconi University (Italy). He is the author / co-author of over a dozen publications in peer-reviewed journals. He is fluent in English, French and Spanish
Anupam Anand, PhD	Senior Evaluation Officer, GEF IEO	Dr. Anupam Anand is a Senior Evaluation Officer at the GEF IEO, where he serves as the program manager for biodiversity evaluations and methods, developing applied approaches that integrate LLMs, geospatial analysis, satellite data, drones and field methods into evaluative practice. With over 17 years of experience in academia, evaluation and international development, he designs and deploys these tools to generate stronger, field-grounded evaluative evidence, bridging technical execution with evaluation design. Previously, he led NASA-funded projects and conducted climate risk assessments for the World Bank. He holds a Ph.D. from the University of Maryland and a postgraduate diploma in environmental law.
Thanicha Ruangmas,PhD	Data Scientist, GEF IEO	Dr. Thanicha Ruangmas is a data scientist at the GEF IEO, where she develops LLMs to classify large document corpora. Her work supports implementation issue classification, activity classification, and policy coherence analysis. She holds a Ph.D. from the University of Wisconsin-Madison.
Carlos Tarazona	Senior Evaluation Officer, Food and Agricultural Organization.	Carlos Tarazona is a Senior Evaluation Officer at FAO with over 20 years of experience in the evaluation of agricultural and rural development programmes. He has led major evaluations and previously worked with the International Atomic Energy Agency. His work focuses on evaluation methods, learning, and strengthening evaluation practice.
Zhiqi Xu	Evaluation Specialist, Food and Agricultural Organization	Zhiqi Xu is an Evaluation Analyst at the FAO Office of Evaluation specializing in mixed-methods evaluation and methodological innovation. She promotes data-informed and AI-assisted approaches in evaluation practice. She is currently pursuing a PhD in Development Studies at Erasmus University Rotterdam on policy experimentation and co-production in China’s poverty alleviation.
Aiko Ward	Principal Evaluation Officer	Aiko is a Principal Evaluation Officer at the IEU. She comes with over 18 years of experience in monitoring, evaluation, and learning (MEL) and data management in the private and public sectors. Prior to joining the IEU, Aiko worked within the GCF Secretariat for 4 years leading the development and implementation of the GCF’s impact framework known as the integrated results management framework (IRMF). Prior to that, she held various roles in the areas of MEL and data management and analysis in an international NGO in London, UK, United Nations agencies in Sri Lanka, Bangladesh and at the headquarters in New York. Prior to that she also worked in the private sector as an investment banker in Tokyo monitoring financial risks as well as a development consultant for the Japan International Cooperation Agency (JICA) in countries such as Afghanistan and Cambodia. She has a BA in Economics from the University of Virginia in the US, an MA in International Relations from Waseda University in Tokyo, Japan, and MSc. in Social Research Methods from the University College London (UCL), UK

Moderators

Nome	Título	Biography
Anoop Sharma	Evaluation and AI Specialist	Anoop Sharma is Evaluation and AI Specialist at IFAD’s Independent Office of Evaluation (IOE). He has more than seven years of UN experience in evaluations, operational assessments, and AI and data driven analysis. At IOE, Anoop is leading the integration of the Office’s AI strategy into evaluation processes.
Innocent Chamisa	EvalforEarth CoP Coordinator	International development specialist with over 10 years of experience across food systems, land governance, digital innovation, and evaluation. FAO award recipient for policy coordination and sustainable agriculture. Currently serving as Global Coordinator of EvalforEarth, supporting evaluation for food security, environment, agriculture, and rural development.

Resumo

SUMMARY NOTE As part of the 2026 gLOCAL Evaluation Week 2026, an inter-agency roundtable discussion was convened under the title: “Doing More with Less While Not Losing Trust: How Should Evaluation Standards Evolve in an AI-Augmented World”. The discussion examined the integration of artificial intelligence (AI) into evaluation practice across organizations including Food and Agriculture Organization of the United Nations, World Food Programme, Green Climate Fund, Global Environment Facility and International Fund for Agricultural Development. The discussion focused on three interrelated areas: 1. AI integration in evaluation methodologies and processes; 2. Skills and competencies required for AI-assisted evaluation; and 3. Governance, ethics and institutional trust frameworks. Panelists highlighted that AI tools can support evaluators in processing and synthesizing large volumes of information through document classification, coding, retrieval augmented generation (RAG), and systematic analysis approaches. However, participants emphasized that AI outputs require robust validation mechanisms, including traceability to source evidence, human quality assurance, and structured review processes. The discussion underscored that AI should support, rather than replace, evaluator judgment. While evaluators are not expected to become data scientists, they require a minimum level of AI literacy, including understanding AI limitations, prompt design, verification techniques, and ethical data governance considerations. Several operational and governance challenges were discussed, including data privacy, accountability, transparency of AI-assisted processes, bias management, and unequal institutional capacities between developed and developing contexts. Panelists noted that AI can improve efficiency in data analysis and synthesis, but cannot compensate for weak or poor-quality underlying data. The discussion also highlighted the need for evaluation standards and institutional frameworks to evolve from broad principles toward operational guidance on AI-assisted evaluation practice. This includes clearer governance arrangements, documentation requirements, transparency protocols, and quality assurance procedures to maintain trust, credibility, and professional standards in evaluation. Key messages emerging from the discussion included: • AI can improve efficiency and support evidence synthesis in evaluation processes; • Human oversight and professional judgment remain central to evaluation credibility; • Validation, transparency, and traceability are essential in AI-assisted evaluation work; • Institutional capacities and evaluator competencies will need to adapt to AI-enabled approaches; and • Evaluation standards and governance frameworks should evolve to address the operational implications of AI use while safeguarding trust and accountability.

Follow-up Action Points • Develop and publish a follow-up blog capturing key reflections, lessons, and practical experiences shared during the discussion. • Explore the organisation of an online discussion forum through EvalforEarth to continue exchange on AI, evaluation standards, ethics, governance, and institutional practice. • Continue inter-agency dialogue and peer learning on responsible AI use in evaluation through webinars, blogs, and knowledge-sharing activities. • Document and share institutional experiences, lessons learned, tools, guidance materials, and training resources related to AI-assisted evaluation practices. • Promote continued exchange on practical approaches for validation, transparency, traceability, and quality assurance in AI-supported evaluation work. • Encourage sharing of emerging practices and institutional frameworks for responsible AI use across organizations and regions. • Facilitate continued discussion within the EvalforEarth Community of Practice on evolving evaluator competencies and capacity needs related to AI. • Explore opportunities for collaborative development of operational guidance and good practice approaches for AI-assisted evaluation.

Video Summary

Image Gallery

Tópicos e Temas

Decisores VOPEs / Redes de avaliação Mídia Acadêmicos Sociedade civil Estudantes Governança Servidor Público / Funcionário da Organização Internacional Tema anual: Avaliação, Evidências e Confiança na Era da IA

Back to events calendar

English

Doing More with Less while Not Losing Trust: How Should Evaluation Standards Evolve in an AI Augmented World?

Sobre o evento

Orador/a

Moderators

Resumo

Video Summary

Image Gallery

Links

Tópicos e Temas

Event Details

Você tem alguma pergunta?

Doing More with Less while Not Losing Trust: How Should Evaluation Standards Evolve in an AI Augmented World?

Sobre o evento

Orador/a

Moderators

Resumo

Conclusions

Follow-up Actions

Video Summary

Image Gallery

Links

Tópicos e Temas

Event Details

Você tem alguma pergunta?