AI in Action: Human-in-the-Loop Evaluation for Rigorous Insights

Roundtable | Online
  • Organized by:
    Independent Evaluation Group (IEG), World Bank Group

About the event

This session explores two practical applications of AI in evaluation through a human-in-the-loop lens, embedding AI across the data analytics phase.

The first case demonstrates the classification of more than 1,000 project documents into interventions under a pre-defined taxonomy, using a five-stage pipeline that includes inductive taxonomy development, LLM-assisted multi-level classification, benchmarking against human validation, and iterative QA to ensure transparency, accuracy, and interpretability.

The second segment broadens this perspective by introducing geospatial AI as a complementary set of applications for generating evaluative evidence from spatial data. It presents three case studies that illustrate the use of AI-generated datasets and AI modelling approaches across different evaluative contexts.

Across these applications, the session highlights how AI not only supports analysis but actively shapes the data itself. This shift raises critical methodological and ethical considerations, including bias in training data, uneven geographic representation, opacity in model pipelines, and challenges to reproducibility. The session emphasizes the role of human-in-the-loop approaches in validating AI-generated outputs, ensuring contextual relevance, and maintaining accountability in evaluation practice.
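
As an illustration of the step described above as benchmarking against human validation, the following is a minimal sketch of how LLM-assigned labels might be compared with a human-validated sample. It is not the presenters' pipeline: the function name `agreement_report`, the category labels, and the choice of raw accuracy plus Cohen's kappa as agreement measures are all assumptions made for this example.

```python
from collections import Counter


def agreement_report(human, model):
    """Compare model labels with human-validated labels for the same documents.

    Hypothetical helper for illustration only; not taken from the session.
    """
    if len(human) != len(model) or not human:
        raise ValueError("label lists must be non-empty and of equal length")

    n = len(human)
    # Observed agreement: share of documents where both labels match.
    observed = sum(h == m for h, m in zip(human, model)) / n

    # Agreement expected by chance, from each rater's label frequencies.
    human_freq = Counter(human)
    model_freq = Counter(model)
    expected = sum(human_freq[c] * model_freq[c] for c in human_freq) / (n * n)

    # Cohen's kappa; if both raters used a single identical label, treat as 1.0.
    kappa = 1.0 if expected == 1.0 else (observed - expected) / (1 - expected)

    # Indices of disagreements, to be sent back to a human reviewer.
    review_queue = [i for i, (h, m) in enumerate(zip(human, model)) if h != m]
    return {"accuracy": observed, "kappa": kappa, "review_queue": review_queue}


# Hypothetical labels for six documents (assumed categories, not real data).
human_labels = ["nutrition", "health", "health", "emergency", "nutrition", "health"]
model_labels = ["nutrition", "health", "nutrition", "emergency", "nutrition", "health"]

report = agreement_report(human_labels, model_labels)
print(report)
```

In a human-in-the-loop workflow of this kind, the `review_queue` indices would point to the documents where a reviewer checks the model's label, which is one way an iterative QA round could be driven.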

Presenters

Name: Mercedes Vellez
Title: Senior Evaluation Officer
Biography: Mercedes Vellez is a Senior Evaluation Specialist at IEG, where she leads the design of complex, multi-level, mixed-method evaluations across human development using innovative, AI-enabled approaches. She applies rigorous methods, causal inference, and portfolio analytics to generate actionable insights in health, nutrition, and emergency preparedness, contributing to flagship evaluations and technical publications.

Name: Virginia Ziulu
Title: Data Scientist
Biography: Virginia Ziulu is a data scientist at IEG, where her work focuses on the application of complex geospatial methods in IEG evaluations. Her background and expertise lie at the intersection of remote sensing and computer vision. She is the author of numerous technical papers and of a book chapter on geospatial artificial intelligence in evaluation practice, featured in the upcoming book Geospatial Impact Evaluation in Practice.

Moderator

Name: TBD
Title: TBD
Biography: TBD

Topics

  • Evaluators
  • Evaluation Commissioners
  • Academics
  • Annual theme: Evaluation, Evidence and Trust in the Age of AI
  • Evaluation approaches and methods
  • Innovation in evaluation
