Commit Graph

5 Commits

Author SHA1 Message Date
pseco 56349184fb Add evaluation results for AVAP knowledge models and update evaluation notebook
- Created a new JSON file containing evaluation results for the AVAP knowledge models, including scores for faithfulness, answer relevancy, context recall, and context precision.
- Updated the evaluation notebook to use a new embedding model and fixed execution counts for code cells.
2026-04-06 11:55:33 +02:00
pseco 00a7cc727d Refactor code structure and remove redundant code blocks for improved readability and maintainability 2026-04-06 11:20:21 +02:00
pseco 8f501d3e52 Refactor code structure for improved readability and maintainability 2026-03-31 11:16:03 +02:00
pseco 344230c2cf Refactor code structure for improved readability and maintainability 2026-03-27 14:09:18 +01:00
pseco febf955a62 Add new JSON output files for candidate F reward statistics and MBPP tasks
- Created `candidate_F_reward_10_coverage_stats.json` with coverage statistics including total cells, filled cells, fill rate, and node type frequency.
- Added `mbpp_avap.json` containing 14 tasks with descriptions, code implementations, test inputs, and expected test results for various endpoints and functionalities.
2026-03-26 17:18:45 +01:00