[BE] - AI 모델 성능 테스트를 위한 LangSmith 설치 및 API 추가 #35
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
📝작업 내용
QuizEvaluationService
QuizEvaluationController
추가createLargeDataset
- 다량의 데이터셋 생성 요청 메서드jsonSchemaEvaluator
- zod를 이용한 JSON 응답 검증 평가자 메서드relevanceEvaluator
- AI 기반 퀴즈 문맥 적절성 평가 메서드AI 모델 성능 테스트의 목적
AI 모델의 퀴즈 생성 기능을 한 번이 아닌 많은 양의 테스트를 진행하여 AI 모델 성능을 한 눈에 확인하는 데 있다.
AI 모델 성능 테스트 평가 기준
스크린샷