Evaluation-Working-Group

The OPEA Evaluation Working Group is chartered to identify standardized methodologies and frameworks for evaluating the RAG pipeline, to aid in the benchmarking of the individual components and the end-to-end solution.

The evaluation will comprise both quantitative and qualitative metrics in the domains of performance, safety, trustworthiness, and scalability.

Scope and Priority

  • Methodology and Eval Frameworks
  • Performance - Focus on metrics/KPIs for each component and for the end-to-end pipeline
  • Trustworthiness - Ability to guarantee quality, security, robustness, and relevance to government or other policies
  • Scalability / Enterprise Readiness - Ability to be used in production in enterprise environments