Doing More with Less while Not Losing Trust: How Should Evaluation Standards Evolve in an AI Augmented World?
Roundtable | Online
-
Organizado por:
EvalforEarth
Sobre o evento
AI has the potential to rapidly reshaping how evaluations are planned, conducted, synthesised, and communicated particularly in low resource and humanitarian contexts, where pressure to “do more with less” is acute. However, it simultaneously introduces significant ethical, methodological, and governance risks that directly challenge the profession’s credibility, independence, and trustworthiness.
AI adds the greatest value when it reduces duplication, lowers transaction costs, and frees human expertise for judgment, ethics, and contextual interpretation, not when it substitutes core evaluative functions or obscures accountability. This creates an urgent need for professional standards to evolve, not by endorsing AI wholesale but defining where it is appropriate, where it is not, and what competencies and safeguards are required.
Looking at their own use case, this roundtable will explore questions around how evaluation standards, competencies, and institutional frameworks adapt to ensure AI strengthen rather than undermine ethical practice, methodological rigor, and public trust. It will be particularly relevant for evaluators and commissioners wrestling with how to translate high-level AI ethics and governance principles into concrete evaluation standards and professional practice.
AI adds the greatest value when it reduces duplication, lowers transaction costs, and frees human expertise for judgment, ethics, and contextual interpretation, not when it substitutes core evaluative functions or obscures accountability. This creates an urgent need for professional standards to evolve, not by endorsing AI wholesale but defining where it is appropriate, where it is not, and what competencies and safeguards are required.
Looking at their own use case, this roundtable will explore questions around how evaluation standards, competencies, and institutional frameworks adapt to ensure AI strengthen rather than undermine ethical practice, methodological rigor, and public trust. It will be particularly relevant for evaluators and commissioners wrestling with how to translate high-level AI ethics and governance principles into concrete evaluation standards and professional practice.
Orador/a
| Nome | Título | Biography |
|---|---|---|
| Alexandra Priebe, PhD | Evaluation Officer, OEV WFP | Alexandra Priebe is an Evaluation Officer in WFP’s Office of Evaluation, Use Unit, supporting the development and adoption of an AI evidence mining tool. She brings 20+ years’ experience across humanitarian and development contexts. |
| Steven Jonckheere | Senior Evaluation Officer, IOE IFAD | Steven Jonckheere is Senior Evaluation Officer at IFAD’s Independent Office of Evaluation (IOE). With over 20 years of experience in agricultural and rural development across Africa, Asia, Europe, and Latin America, he specializes in social inclusion, gender, and methodological innovation. At IOE, Steven oversees and leads the Office’s work on AI and innovation in evaluation. |
| Fabrizio Felloni | Deputy Director, Independent Evaluation Office of the GEF | Fabrizio Felloni is Chief Evaluation Officer and Deputy Director at the Independent Evaluation Office of the Global Environment Facility (Washington DC), from September 2024. He was previously, Deputy Director at the Independent Office of Evaluation of IFAD (2016-2024), Lead Evaluation Officer in the same office (2010-2016), Evaluation Specialist at UNDP (2008-2010) and at IFAD (2001-2007). He has led project, country-level, sub-regional, thematic and corporate evaluations in over 25 countries (Africa, Asia, Eastern Europe, Latin America). He holds a Master’s Degree in Agricultural Economics from Washington State University and a Master-equivalent degree in Social and Economic Sciences from Bocconi University (Italy). He is the author / co-author of over a dozen publications in peer-reviewed journals. He is fluent in English, French and Spanish |
| Anupam Anand, PhD | Senior Evaluation Officer, GEF IEO | Dr. Anupam Anand is a Senior Evaluation Officer at the GEF IEO, where he serves as the program manager for biodiversity evaluations and methods, developing applied approaches that integrate LLMs, geospatial analysis, satellite data, drones and field methods into evaluative practice. With over 17 years of experience in academia, evaluation and international development, he designs and deploys these tools to generate stronger, field-grounded evaluative evidence, bridging technical execution with evaluation design. Previously, he led NASA-funded projects and conducted climate risk assessments for the World Bank. He holds a Ph.D. from the University of Maryland and a postgraduate diploma in environmental law. |
| Thanicha Ruangmas,PhD | Data Scientist, GEF IEO | Dr. Thanicha Ruangmas is a data scientist at the GEF IEO, where she develops LLMs to classify large document corpora. Her work supports implementation issue classification, activity classification, and policy coherence analysis. She holds a Ph.D. from the University of Wisconsin-Madison. |
| Carlos Tarazona | Senior Evaluation Officer, Food and Agricultural Organization. | Carlos Tarazona is a Senior Evaluation Officer at FAO with over 20 years of experience in the evaluation of agricultural and rural development programmes. He has led major evaluations and previously worked with the International Atomic Energy Agency. His work focuses on evaluation methods, learning, and strengthening evaluation practice. |
| Zhiqi Xu | Evaluation Specialist, Food and Agricultural Organization | Zhiqi Xu is an Evaluation Analyst at the FAO Office of Evaluation specializing in mixed-methods evaluation and methodological innovation. She promotes data-informed and AI-assisted approaches in evaluation practice. She is currently pursuing a PhD in Development Studies at Erasmus University Rotterdam on policy experimentation and co-production in China’s poverty alleviation. |
| Aiko Ward | Principal Evaluation Officer | Aiko is a Principal Evaluation Officer at the IEU. She comes with over 18 years of experience in monitoring, evaluation, and learning (MEL) and data management in the private and public sectors. Prior to joining the IEU, Aiko worked within the GCF Secretariat for 4 years leading the development and implementation of the GCF’s impact framework known as the integrated results management framework (IRMF). Prior to that, she held various roles in the areas of MEL and data management and analysis in an international NGO in London, UK, United Nations agencies in Sri Lanka, Bangladesh and at the headquarters in New York. Prior to that she also worked in the private sector as an investment banker in Tokyo monitoring financial risks as well as a development consultant for the Japan International Cooperation Agency (JICA) in countries such as Afghanistan and Cambodia. She has a BA in Economics from the University of Virginia in the US, an MA in International Relations from Waseda University in Tokyo, Japan, and MSc. in Social Research Methods from the University College London (UCL), UK |
Moderators
| Nome | Título | Biography |
|---|---|---|
| Anoop Sharma | Evaluation and AI Specialist | Anoop Sharma is Evaluation and AI Specialist at IFAD’s Independent Office of Evaluation (IOE). He has more than seven years of UN experience in evaluations, operational assessments, and AI and data driven analysis. At IOE, Anoop is leading the integration of the Office’s AI strategy into evaluation processes. |
| Innocent Chamisa | EvalforEarth CoP Coordinator | International development specialist with over 10 years of experience across food systems, land governance, digital innovation, and evaluation. FAO award recipient for policy coordination and sustainable agriculture. Currently serving as Global Coordinator of EvalforEarth, supporting evaluation for food security, environment, agriculture, and rural development. |
Resumo
SUMMARY NOTE
As part of the 2026 gLOCAL Evaluation Week 2026, an inter-agency roundtable discussion was convened under the title: “Doing More with Less While Not Losing Trust: How Should Evaluation Standards Evolve in an AI-Augmented World”.
The discussion examined the integration of artificial intelligence (AI) into evaluation practice across organizations including Food and Agriculture Organization of the United Nations, World Food Programme, Green Climate Fund, Global Environment Facility and International Fund for Agricultural Development.
The discussion focused on three interrelated areas:
1. AI integration in evaluation methodologies and processes;
2. Skills and competencies required for AI-assisted evaluation; and
3. Governance, ethics and institutional trust frameworks.
Panelists highlighted that AI tools can support evaluators in processing and synthesizing large volumes of information through document classification, coding, retrieval augmented generation (RAG), and systematic analysis approaches. However, participants emphasized that AI outputs require robust validation mechanisms, including traceability to source evidence, human quality assurance, and structured review processes.
The discussion underscored that AI should support, rather than replace, evaluator judgment. While evaluators are not expected to become data scientists, they require a minimum level of AI literacy, including understanding AI limitations, prompt design, verification techniques, and ethical data governance considerations.
Several operational and governance challenges were discussed, including data privacy, accountability, transparency of AI-assisted processes, bias management, and unequal institutional capacities between developed and developing contexts. Panelists noted that AI can improve efficiency in data analysis and synthesis, but cannot compensate for weak or poor-quality underlying data.
The discussion also highlighted the need for evaluation standards and institutional frameworks to evolve from broad principles toward operational guidance on AI-assisted evaluation practice. This includes clearer governance arrangements, documentation requirements, transparency protocols, and quality assurance procedures to maintain trust, credibility, and professional standards in evaluation.
Key messages emerging from the discussion included:
• AI can improve efficiency and support evidence synthesis in evaluation processes;
• Human oversight and professional judgment remain central to evaluation credibility;
• Validation, transparency, and traceability are essential in AI-assisted evaluation work;
• Institutional capacities and evaluator competencies will need to adapt to AI-enabled approaches; and
• Evaluation standards and governance frameworks should evolve to address the operational implications of AI use while safeguarding trust and accountability.
Follow-up Action Points
• Develop and publish a follow-up blog capturing key reflections, lessons, and practical experiences shared during the discussion.
• Explore the organisation of an online discussion forum through EvalforEarth to continue exchange on AI, evaluation standards, ethics, governance, and institutional practice.
• Continue inter-agency dialogue and peer learning on responsible AI use in evaluation through webinars, blogs, and knowledge-sharing activities.
• Document and share institutional experiences, lessons learned, tools, guidance materials, and training resources related to AI-assisted evaluation practices.
• Promote continued exchange on practical approaches for validation, transparency, traceability, and quality assurance in AI-supported evaluation work.
• Encourage sharing of emerging practices and institutional frameworks for responsible AI use across organizations and regions.
• Facilitate continued discussion within the EvalforEarth Community of Practice on evolving evaluator competencies and capacity needs related to AI.
• Explore opportunities for collaborative development of operational guidance and good practice approaches for AI-assisted evaluation.